Please use this identifier to cite or link to this item:
http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/2678
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Agahan, Alfonso Luis R. | - |
dc.date.accessioned | 2024-05-06T05:07:22Z | - |
dc.date.available | 2024-05-06T05:07:22Z | - |
dc.date.issued | 2023-07 | - |
dc.identifier.uri | http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/2678 | - |
dc.description.abstract | Diabetes is a condition in one’s blood glucose that affects millions of people every day, and there exists no easier way in predicting this aside from using machine learning algorithms. These algorithms also lack the ability to explain their prediction results, thus being dubbed as “black box” models. Aside from these, past researches have not studied the effect of different changes that occur during the data selection process in the performance of machine learning models in diabetes prediction. The study aims to assess the effects of implementing SMOTE and feature selection techniques on the predictive power of machine learning models. Moreover, the study aims to create a web-based application that would function as a decision support tool to predict the risk of patients developing diabetes. The web application is also integrated with explainable artificial intelligence in order to highlight the characteristics leading towards the prediction decision of the machine learning model. Performance metrics of five supervised machine learning algorithms (Na¨ıve Bayes, Logistic Regression, K-Nearest Neighbors, Random Forests, and Support Vector Machines) were tested on a blood glucose dataset. Results show that implementing SMOTE provided an increase in values of the performance metrics (accuracy, AUROC score, precision, recall, f1 score), and that the best performing model was the Random Forest classifier with the implementation of SMOTE, heightening the use of machine learning algorithms in clinical practice for increased healthcare. The resulting web application was implemented with LIME, a Python library used for explainable artificial intelligence. | en_US |
dc.subject | Diabetes | en_US |
dc.subject | Machine learning | en_US |
dc.subject | Decision support tool | en_US |
dc.subject | Explainable artificial intelligence | en_US |
dc.subject | Random Forests | en_US |
dc.subject | LIME | en_US |
dc.title | A Machine Learning Based Web Application Focused on Predicting the Risk of Diabetes Using Explainable Artificial Intelligence | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | Computer Science SP |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
CD-CS102.pdf Until 9999-01-01 | 2.61 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.