
INTERNATIONAL JOURNAL OF LATEST TECHNOLOGY IN ENGINEERING,
MANAGEMENT & APPLIED SCIENCE (IJLTEMAS)
ISSN 2278-2540 | DOI: 10.51583/IJLTEMAS | Volume XV, Issue V, May 2026
On-device voice personalization through few-shot speaker adaptation can make the system more relatable while
keeping data private. Comprehensive MOS-style intelligibility testing in Hindi/English and future languages
should accompany each release for credible benchmarks.
Strengthening privacy by keeping all inference on device and encrypting any optional logs will build trust for
use in schools and healthcare settings in India. Routine fairness audits across skin tones, hand sizes, and camera
conditions should track subgroup performance, triggering targeted data augmentation or reweighting where gaps
appear. Clear, in-app consent flows and on-device deletion controls will align with ethical deployment for
accessibility. Federated or split-learning pilots can explore privacy-preserving improvements without
centralizing sensitive data, if ever needed for aggregate model updates. Publishing audit summaries will
encourage community scrutiny and shared problem-solving for equitable SLR.
To accelerate community progress, releasing standardized subject-independent splits, mobile telemetry scripts,
and reference Android projects will enable fair comparisons and rapid replication. Vector diagrams and a
comprehensive flowchart mapping data, training, quantization, and runtime will lower barriers for adoption in
assistive programs. A lightweight evaluation app that logs latency, FPS, battery, and temperature across devices
can serve as a common harness for academic and industry tests. Hosting challenge tracks for dynamic words and
continuous signing with submission leaderboards will focus efforts on the hardest, most impactful tasks.
Collaboration with ISL linguists and educators can refine labels, expand grammar support, and validate
real-world usability beyond lab metrics. These also can support in future very large scale and help people for
their life.
ACKNOWLEDGEMENT
We express our heartfelt gratitude to the Department of Electronics and Telecommunication Engineering, Thakur
College of Engineering and Technology (TCET), for providing us with the opportunity and resources to work
on this research project. We are especially thankful to our project guide for their valuable insights, continuous
encouragement, and expert guidance throughout the course of this work. We also extend our appreciation to our
peers and faculty members who contributed their time and feedback during various stages of the project. Their
support played a crucial role in helping us shape and refine our ideas. Finally, we are grateful to our families and
friends for their unwavering motivation and support, which enabled us to complete this work successfully.
REFERENCE
1. F. Zhang, V. Bazarevsky, A. Vakunov, A. Tkachenka, G. Sung, C. L. Chang, and M. Grundmann,
“MediaPipe hands: On-device real-time hand tracking,” arXiv preprint arXiv:2006.10214, 2020.
2. P. Kumar and A. Sharma, “Indian Sign Language Recognition using Deep Convolutional Neural
Networks,” International Journal of Computer Applications, vol. 175, no. 8, pp. 23–29, 2021.
3. L. Zhang, X. Wang, Y. Li, and M. Chen, “CNN-RNN Hybrid Architecture for Dynamic Gesture
Recognition in Real-time Applications,” IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 44, no. 6, pp. 1234–1247, 2022.
4. J. Shen, R. Pang, R. J. Weiss, M. Schuster, N. Jaitly, Z. Yang,…and Y. Wu, “Natural TTS synthesis by
conditioning WaveNet on mel spectrogram predictions,” in Proc. IEEE ICASSP, pp. 4779–4783, 2018.
5. R. Patel, S. Gupta, and M. Jain, “Mobile-based Gesture Recognition System for Augmentative and
Alternative Communication,” Journal of Assistive Technologies, vol. 15, no. 2, pp. 89–102, 2021.
6. Census of India, “Data on Disability – India and States/UTs,” Office of the Registrar General & Census
Commissioner, Ministry of Home Affairs, Government of India, 2011.
7. Microsoft Research, “Kinect-based Sign Language Translation System for American Sign Language,”
Microsoft Technical Report MSR-TR-2020-15, 2020.
8. G. Bradski, “The OpenCV Library,” Dr. Dobb’s Journal of Software Tools, vol. 25, no. 11, pp. 120–125,
2000.
9. World Health Organization, Global Report on Assistive Technology, WHO Press, Geneva, Switzerland,
2023.
10. Google AI, “TensorFlow Lite: Machine Learning for Mobile and Edge Devices,” Google Developers
Documentation, 2022.