Abstract:
The idea behind this project is to create an application that would assist people with visual
impairment in analyzing their surroundings. To analyze their surrounding all they have to do is
to take a picture with their mobile phone which will automatically learn the contents/objects in
the picture and will provide voice assistance, about the types of objects nearby. Well, all they have
to do is to open this application on their mobile devices with the help of any virtual assistance and
after that; they can point their camera into object. Then that captured picture will be input, to
pre-trained Convolutional Neural Network (CNN) and all the detected objects in that picture will
get labeled based on the pre trained objects. Those labels will then be passed to Text to Speech
engine which by analyzing and processing the text using Natural Language Processing (NLP)
converts the text into speech (TTS). This mobile application is really helpful for blind people to
identify objects.