DSpace Repository

Mem2Speech: An Intelligent Character Recognition-to-Speech Application Using Long Short-Term Memory Networks

Show simple item record

dc.contributor.author Gonzalvo, Jerome Patrick
dc.date.accessioned 2019-08-16T17:05:00Z
dc.date.available 2019-08-16T17:05:00Z
dc.date.issued 2018-06
dc.identifier.uri http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/452
dc.description.abstract Smartphones are no longer used solely for communication but also for photography, video recording, internet sur fing, etc. As technology continues to advance, it is possible to apply some techniques to perform tasks such as Optical Character Recognition (OCR) and Intelligent Character Recognition (ICR). Tess2Speech is one of the applications that perform these tasks and it uses the Tesseract OCR Engine. Each trained Tesseract model of Tess2Speech is able to recognize a handwriting style of a user but not that of other users. It is possible for each user to train the engine themselves to t their handwriting but this may be an issue if they don't have the technical background needed to train the engine. This special problem uses Long Short-Term Memory (LSTM) networks for handwritten text recognition. It provides a user-friendly trainer that will help AI experts in training handwriting recognition models. It also provides a mobile application for recognizing handwritten texts using the trained models. en_US
dc.language.iso en en_US
dc.subject Artificial Intelligence en_US
dc.subject Machine Learning en_US
dc.subject Deep Learning en_US
dc.subject Recurrent Neural Networks en_US
dc.subject Long Short-Term Memory Networks en_US
dc.subject Optical Character Recognition en_US
dc.subject Intelligent Character Recognition en_US
dc.title Mem2Speech: An Intelligent Character Recognition-to-Speech Application Using Long Short-Term Memory Networks en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account