dc.contributor.author |
Gonzalvo, Jerome Patrick |
|
dc.date.accessioned |
2019-08-16T17:05:00Z |
|
dc.date.available |
2019-08-16T17:05:00Z |
|
dc.date.issued |
2018-06 |
|
dc.identifier.uri |
http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/452 |
|
dc.description.abstract |
Smartphones are no longer used solely for communication but also for photography, video recording, internet sur fing, etc. As technology continues to advance, it is possible to apply some techniques to perform tasks such as Optical Character Recognition (OCR) and Intelligent Character Recognition (ICR). Tess2Speech is one of the applications that perform these tasks and it uses the Tesseract OCR Engine. Each trained Tesseract model of Tess2Speech is able to recognize a handwriting style of a user but not that of other users. It is possible for each user to train the engine themselves to t their handwriting but this may be an issue if they don't have the technical background needed to train the engine. This special problem uses Long Short-Term Memory (LSTM) networks for handwritten text recognition. It provides a user-friendly trainer that will help AI experts in
training handwriting recognition models. It also provides a mobile application for recognizing handwritten texts using the trained models. |
en_US |
dc.language.iso |
en |
en_US |
dc.subject |
Artificial Intelligence |
en_US |
dc.subject |
Machine Learning |
en_US |
dc.subject |
Deep Learning |
en_US |
dc.subject |
Recurrent Neural Networks |
en_US |
dc.subject |
Long Short-Term Memory Networks |
en_US |
dc.subject |
Optical Character Recognition |
en_US |
dc.subject |
Intelligent Character Recognition |
en_US |
dc.title |
Mem2Speech: An Intelligent Character Recognition-to-Speech Application Using Long Short-Term Memory Networks |
en_US |
dc.type |
Thesis |
en_US |