Description

"Hey dude , I am born mute but I don't want to live as a mute . Can you be on my side ? " My friend's question struck me and it eventually led me to this innovation , as I call it Sign Language Translator (SLT) .

What's something much about a sign language translator. It's an idea being 10 or more years old . But it is much different by it's low cost . It only costs less than $50

A bit about project, we are using Computer Vision to pull of this project .When someone is in front of Camera , it will produce a corresponding image.The image is transferred to Raspberry Pi , the brain. Pi has pre-stored images with their corresponding translations. Accordingly Raspberry Pi can match the data which we get from camera and that on Raspberry Pi.When those images get matched Raspberry will send the corresponding sound and command for translation to Text To Speech (TTS) application , which utters the sound.

Details

The components which are needed to accomplish this project are

HardWare components

1 . Raspberry Pi 3

2. Camera module (compatible with Raspberry Pi)

3. Speaker

4.LCD module

5. Mic

SoftWare components

1. OpenCV

2. Text to Speech application (TTS)

3 . Speech to Text application

Working

1 . Camera

Camera acts as an input device for SLT which captures the sign and transfer corresponding images to Raspberry Pi , the brain .

2 . Microphone

Microphone also acts as an input device for SLT which captures the sound and transfer it to the brain .

3 . Raspberry Pi

Raspberry Pi act as the brain of the entire device and it perform different types of function depending on the three modes of operation of this device .

1 . SIGN to SPEECH Translation mode

This is the mode which is used when a speech impaired man communicates with a common man . In this mode the camera snap the gestures and the corresponding images are transferred into Raspberry Pi . The OpenCV library installed on the Raspberry Pi processes the image and produce a corresponding text output , which is made to speech using a Text to Speech application .

2 . SIGN to TEXT Translation mode

This is the mode which is used when a speech impaired man communicates with a hearing impaired man . In this mode the camera snap the gestures and the corresponding images are transferred into Raspberry Pi . The OpenCV library installed on the Raspberry Pi processes the image and produce a corresponding text output , which is displayed on the LCD module .

3 . SPEECH to TEXT Translation mode

This is the mode which is used when a common man communicates with a hearing impaired man . In this mode the microphone records the sound and is transferred into Raspberry Pi . Using a Speech to Text application the sound is converted into text and is displayed on the LCD module .

4 . Speaker

It acts as an output device which produce sound output according to the signals from the brain .

5 . LCD MODULE

It also acts as an output device which produce text output according to the signals from the brain .

6 . OpenCV

It is the core where actual function of translation takes place . OpenCV is a real time computer vision library with strong processing efficiency . OpenCV processes the image captured by camera on various approaches

1. Template Based Approaches

Unknown speech is compared against a set of pre-recorded words (templates) in order to find the best match.

2.Knowledge Based Approaches

An expert knowledge about variations in speech is hand coded into a system.

3.Statistical Based Approaches

In which variations in speech are modelled statistically, using automatic, statistical learning procedure.

4.The Artificial Intelligence Approach

The artificial intelligence approach attempts to mechanize the recognition procedure according to the way a person applies its intelligence in visualizing, analyzing, and finally making a decision on the measured acoustic features.

We are using template based approaches and in future versions another approaches may be used .

7 . Text to Speech application

It is used to convert the text output produced by OpenCV to speech .

8 . Speech to Text application

It is used to convert the speech input into text output .

The flow of data is as shown below

Through this project I want to empower all those who are in need of such a device and dedicate this to my dear friend. Also I hope this project...

Components

1 × Raspberry Pi 3

1 × Camera Module

1 × Speaker

1 × Microphone

1 × LCD module

Discussions

Robert Mateja wrote 04/19/2018 at 19:43

There are more than one "dialect" of sign language so maybe Google Cloud Compute AI app? They give 12 months trial with $300 to spent so integration with OpenCV might be a way, and you are not constraint with RPi compute power for quick recognition.

Are you sure? yes | no

Shebin Jose Jacob wrote 04/19/2018 at 23:05

Yeah . Due to the low processing speed we are also looking forward to integrate the recognition with cloud services. Anyway Thanks for your concern ☺

Are you sure? yes | no

Morning.Star wrote 04/16/2018 at 01:54

Great idea Jonathan, I was thinking of something similar as my daughter has no speech and I've tried several times over the last decade to help her. Computers are now pretty fast and I've even developed my own vision system that's a lot faster than OpenCV and can accurately track movement and detail.

The trouble I've had seems insurmountable, I'm using a 6GHz dual Athlon to develop on and that isnt really fast enough. Are you using an alphabet? From experience using OpenCV it will take around 2-3 seconds per letter to discriminate and your friend will have to be VERY accurate. It was that decided it for me, and I fell back to using pictorial information with an AI to present it meaningfully according to context.

You might want to look into this technique, it will cut down on the signs needed for the system to learn by combining them in sequences like a phone does with typed information. Mine takes an icon representing a word and then presents only icons representing things that can follow it, and it uses a variation of written Makaton to do it, backed up by photos. But using predictive text between the input and output will seriously cut down on the data your friend has to impart to get a spoken phrase.

The average English-speaking human gets by with around 5k words and can interact in society and perform non-technical work using them. A secretarial job requires over twice that many, and a lot of them are spelled differently but sound the same. (There Their and Theyre for example are a nuisance to a machine. On the Internet we ignore the spelling and concentrate on context alone, so to transmit a sentence containing one of those words doesnt have to have it spelt correctly and an AI can discriminate and spell it correctly at the other end. In transmission all 3 words are actually the same to the computer, its the words either side that spell it correctly.)

If you store the phrases contextually it will be a lot faster for your friend to access them with less signs, and may even be fast enough for him to have a conversation of sorts. Communicating is possible, conversing may be harder than you think this way.

Saying that, liked and followed, and I wish you and your friend the best of luck. Tentacles crossed, humans are hard to communicate with ;-)

Are you sure? yes | no

Shebin Jose Jacob wrote 04/16/2018 at 05:04

Immense thanks for your concern and helpful information ☺

Are you sure? yes | no

Morning.Star wrote 04/16/2018 at 06:40

No problem, hit me up on PM if you want to chat about it some. My work has gone unused because my Bea has severe learning difficulties as well, she cant read or write as well as speak but she does understand a fair bit. And I'm not allowed to help others because the UK government are so shortsighted they'd have me care for her unpaid where I cant work for myself, let alone help all disabled people.

I'm fighting back, but the system is broke. They made me the UK's official hacker in the Prize this year and told me to build a robot with 300 quid, and then they'll take any award off me because I'm in the social care system.

I know its no better in the States, and there are less concerned governments than that...

Happy to help anyone fix that sh*t tho. :-)

Are you sure? yes | no

Shebin Jose Jacob wrote 04/16/2018 at 09:10

Of course sir . I'm so happy to contact you and I'm seriously considering to make some suitable changes as per your comment and trying more to implement it as a neural network. Anyway I'm ready to take this opportunity to help my needy siblings and hope that your help will be available

Thanks☺

Are you sure? yes | no

Eric Hertz wrote 04/15/2018 at 20:34

Impressive!

Are you sure? yes | no

Shebin Jose Jacob wrote 04/15/2018 at 23:57

Thank You 😊

Are you sure? yes | no

ashiklal864 wrote 04/15/2018 at 17:32

nice work bro .Keep it up

Are you sure? yes | no

Shebin Jose Jacob wrote 04/15/2018 at 23:57

Thank You 😊

Are you sure? yes | no

Binit Shah wrote 04/11/2018 at 15:46

Cool project! Was wondering how you planned to deal with words where hands overlapped or covered one another. Words like cat or phrases where object interact.

Are you sure? yes | no

Shebin Jose Jacob wrote 04/11/2018 at 22:46

We are using a key image for different signs and the signs are differentiated using key images and since homologous images can be matched , I hope it will be possible too .

Are you sure? yes | no

Mike Szczys wrote 04/09/2018 at 19:29

Neat idea! Matching the pre-stored images sounds tricky. Lately I've been trying to think like a neural network engineer to look for applications of the technology. Running this through some machine learning to develop a new data set might be a good way to speed up this recognition task on minimal hardware.

Are you sure? yes | no

Shebin Jose Jacob wrote 04/10/2018 at 12:48

Thank you Mike :)

Are you sure? yes | no

SIGN LANGUAGE TRANSLATOR

Description

Details

Working

1 . Camera

2 . Microphone

3 . Raspberry Pi

4 . Speaker

7 . Text to Speech application

8 . Speech to Text application

The flow of data is as shown below

Through this project I want to empower all those who are in need of such a device and dedicate this to my dear friend. Also I hope this project...

Components

Discussions

Similar Projects

TextEye: Raspberry Pi (Zero) Mobile Textreader

SleePi

The Pi Guardian

Object Detection Doodle camera with Raspberry Pi

SIGN LANGUAGE TRANSLATOR

Become a Hackaday.io member

Just one more thing

Description

Details

Working

1 . Camera

2 . Microphone

3 . Raspberry Pi

4 . Speaker

7 . Text to Speech application

8 . Speech to Text application

The flow of data is as shown below

Through this project I want to empower all those who are in need of such a device and dedicate this to my dear friend. Also I hope this project...

Components

Enjoy this project?

Discussions

Become a Hackaday.io Member

Similar Projects

TextEye: Raspberry Pi (Zero) Mobile Textreader

SleePi

The Pi Guardian

Object Detection Doodle camera with Raspberry Pi

Does this project spark your interest?

Report project as inappropriate

Send message

Remove Member