Academia.eduAcademia.edu

Speech to Sign Language

2020

https://doi.org/10.13140/RG.2.2.10331.72487

Abstract

Goal • To bridge the gap between hearing impaired people in India and others.

Key takeaways
sparkles

AI

  1. The system bridges communication gaps for hearing impaired individuals in India.
  2. The dataset comprises 3,268 English words to enhance recognition accuracy.
  3. Performance is evaluated based on accuracy and time complexity metrics.
  4. The methodology includes stages of voice recognition, text conversion, and 3D avatar animation generation.
  5. The final output utilizes SIGML for accurate sign language representation.
ES2ISL : English Speech to Indian Sign Language Translator Manthan Khanvilkar, Nidhi Patel, Bhavin Patel, Harshit Patel Lakehead University, Thunder Bay, ON, Canada Goal Model Qualitative Analysis : 1. Word-level parsing • To bridge the gap between hearing impaired Input : I see you there. people in India and others. ISL based output : I you there see. Hearing Impaired Sign Language “I” “you” “there” “see” Challenges • Building an efficient speech recognition system 2. Character-level parsing • Generate flawless 3D avatar animation output Input : Show card. • Building cost-effective system ISL based output : c, a, r, d, s, h, o, w. • User friendly GUI Results Methodology Stage 1: Evaluation T2SLT A2SLT ES2ISL Contributions • Voice recognition and text conversion using Google SR. Va (%) NA 60 75 • ES2ISL takes less processing time and saves memory. • Tokenization into words using Machine Learning. Accuracy Pa (%) NA 72 72 c a r d • Large corpus of words to support more sign motions. Stage 2: • Tokens are normalized using different techniques. Aa (%) 50 65 85 • Translate the input text with English to ISL grammar. Time Ts (s) 5.35 5.20 1.00 Complexity Stage 3: Tw (s) 2.35 2.20 0.70 • Words are matched with its respected SIGML files. • 3D avatar animation with matching SIGML is generated. Average accuracy - 77.33% Processing Time - 0.85s s h o w Performance Analysis Conclusion • The dataset is updated to corpus of 3268 English words. Interactive system for hearing impaired people for impactful communication. It transforms speech into a 3D avatar • Experimental study takes 101 natural speech inputs. animation that portrays signs rather than GIFs, pictures, or • The performance is measured based on Accuracy and videos for handling the memory effectively. It generates a Time complexity. realistic and vibrant appeal animation to the speech. Accuracy • Accuracy is measured in terms of recognizing voice (Va), grammar parsing (Pa) and action performed (Aa). References Time Complexity 1. Mobile translation system from speech language to hand motion language, Rekha and B. Latha. • Time Complexity is measured after voice recognition to 2. Increasing adaptability of a speech into sign language translation perform an action (Ts) and between two words/characters system, V. L ó pez-Lude ñ a, R. San-Segundo, C. G. Morcillo. to perform an action (Tw). T2SLT - Text to Sign Language Translator A2SLT - Audio to Sign Language Translator . Department of Computer Science Supervisor Lakehead University Dr. Thangarajah Akilan 955 Oliver Rd, Thunder Bay, ON P7B 5E1, Canada [email protected] View publication stats

References (2)

  1. Mobile translation system from speech language to hand motion language, Rekha and B. Latha.
  2. Increasing adaptability of a speech into sign language translation system, V. L ópez-Lude ña, R. San-Segundo, C. G. Morcillo. .

FAQs

sparkles

AI

What was the size of the updated corpus for English words?add

The dataset was updated to include a corpus of 3268 English words, enhancing speech recognition capability.

How many natural speech inputs were analyzed in the experimental study?add

The experimental study analyzed 101 natural speech inputs to evaluate system performance.

What metrics were used to measure performance in this study?add

Performance was measured based on Accuracy and Time Complexity following voice recognition procedures.

What roles do T s and T w metrics play in time complexity?add

Time Complexity involves T s, measuring time post-voice recognition, and T w, addressing time between word actions.

How is the speech-to-sign language transformation visualized in the results?add

The system generates a 3D avatar animation to portray signs, providing a realistic communication medium for the hearing impaired.

About the author

Computer Vision

Papers
60
Followers
6
View all papers from Thangarajah Akilanarrow_forward