Please use this identifier to cite or link to this item:
http://dspace.aiub.edu:8080/jspui/handle/123456789/926
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Moniruzzaman, Md | - |
dc.contributor.author | Zaman, A. G. M. | - |
dc.contributor.author | Tasnia, Rifah | - |
dc.contributor.author | Biswas, Sutopa | - |
dc.contributor.author | Khanam, Mehnur | - |
dc.date.accessioned | 2023-08-19T06:06:45Z | - |
dc.date.available | 2023-08-19T06:06:45Z | - |
dc.date.issued | 2022-08-11 | - |
dc.identifier.isbn | 978-1-4503-9734-6 | - |
dc.identifier.uri | http://dspace.aiub.edu:8080/jspui/handle/123456789/926 | - |
dc.description.abstract | Diabetes is a very common disease nowadays. If not treated early diabetes can pose a profoundly serious health threat. Much research has been conducted to find out the optimal solution for diabetes detection by applying different data mining algorithms, where the dataset consists of different medicinal attributes. In this study, our aim is to examine whether diabetes can be detected at early-stage by applying different data mining algorithms to the non-medicinal dataset; as well as to investigate whether data normalization techniques can improve the classifiers accuracy. Naive Bayes, K-Nearest Neighbor (KNN), Support Vector Machines (SVM), Decision Tree, Random Forest, and Gradient Boosting Classifier (GBC) algorithms are applied to the Early Stage Diabetes Risk Prediction Dataset in conjunction with Decimal Point Scaling, Z-Score Normalization, Pareto Scaling, Variable Stability Scaling, Min-Max normalization, Max normalization, Maximum Absolute Scaling, Mean Centered Scaling, Soft-max normalization, Power Transformer, Median and Median Absolute Deviation Normalization, Robust Scaling and Log Scaling normalization methods. In this experiment, we discovered that early-stage diabetes detection is possible without any medical diagnosis data. The result shows that GBC performs better compared to other classification algorithms in combination with data normalization and achieved an impressive 99.038% prediction accuracy. | en_US |
dc.language.iso | en | en_US |
dc.publisher | ACM Digital Library | en_US |
dc.subject | Diabetes, K-Nearest Neighbor (KNN) | en_US |
dc.subject | Support vector Machines (SVM) | en_US |
dc.subject | Decision Tree | en_US |
dc.subject | Random Forest | en_US |
dc.subject | Gradient Boosting Classifiers (GBC) | en_US |
dc.subject | Normalization | en_US |
dc.subject | Machine Learning | en_US |
dc.subject | Early-Stage Diabetes Prediction | en_US |
dc.title | Early-Stage Diabetes Prediction using Data Mining Algorithms | en_US |
dc.type | Article | en_US |
Appears in Collections: | Publications: Conference |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Early-Stage Diabetes Prediction using Data Mining Algorithms.docx | Early-Stage Diabetes Prediction using Data Mining Algorithms | 3.56 MB | Microsoft Word XML | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.