Live Multi-Modal Language Translation System

Authors

  • Ms Sameera Begum Assistant Professor; Department Of Computer Science And Engineering (Ai&Ml) Bhoj Reddy Engineering College For Women Hyderabad India. Author
  • K Reena Smith, T Sri Nikitha, S Vaishnavi B.Tech Students; Department Of Computer Science And Engineering (Ai&Ml) Bhoj Reddy Engineering College For Women Hyderabad India. Author

DOI:

https://doi.org/10.62647/

Keywords:

Live Multimodal Translation, Computer Vision, Natural Language Processing, Speech Recognition, BERT, Tesseract OCR, Optical Character Recognition, MFCC, LSTM, Real-Time Language Translation, Multimodal AI.

Abstract

Live Multimodal Language Translation is an advanced software solution designed to bridge communication gaps across diverse linguistic and sensory formats. By integrating Computer Vision, Natural Language Processing (NLP), and Speech Recognition, the system provides a unified platform for real-time translation of text, speech, and images. The application leverages high-performance models like BERT for contextual understanding and Tesseract OCR for visual text extraction, ensuring high accuracy across various input types .Designed with a focus on seamless user experience, the system allows for instantaneous conversion between multiple global languages, catering to international travelers, students, and professionals. Key features include a Live Translator interface, automated speech normalization using MFCC and LSTMs, and a robust backend built on Python and Next.js15. This project represents a significant advancement in multimodal AI, offering a scalable, intuitive, and highly accessible tool that transforms how individuals interact across different languages and media formats.

Downloads

Download data is not yet available.

Downloads

Published

30-03-2026

How to Cite

Live Multi-Modal Language Translation System. (2026). International Journal of Information Technology and Computer Engineering, 14(1), 1040-1045. https://doi.org/10.62647/

Similar Articles

1-10 of 1189

You may also start an advanced similarity search for this article.