Fusion: Practice and Applications
FPA
2692-4048
2770-0070
10.54216/FPA
https://www.americaspg.com/journals/show/3659
2018
2018
Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches
Faculty of Computer Studies, Arab Open University, Riyadh, Saudi Arabia
Nada
Nada
Faculty of Business Studies, Arab Open University, Riyadh, Saudi Arabia
Eman Ben
Salah
One of the significant challenges in modern healthcare is the early and accurate detection of health anomalies, especially in the case of life-threatening diseases such as breast cancer. This paper investigates the comparative efficacy of ML models and statistical methods for the classification of breast tumors as benign or malignant using the Breast Cancer Wisconsin (Diagnostic) Dataset. The dataset, comprising various tumor cell attributes, was preprocessed with Principal Component Analysis (PCA) to enhance model training efficiency. The first 11 principal components retained 95% of the total variance, ensuring minimal information loss while reducing dimensionality. We compared the performance of several machine learning algorithms, including Logistic Regression, Support Vector Machines (SVM), Artificial Neural Networks (ANN), Decision Trees (DT), Random Forests (RF), Naïve Bayes (NB), and K-Nearest Neighbors (KNN). Among them, Logistic Regression, SVM, and ANN achieved near-perfect classification accuracy with balanced precision-recall metrics, where the accuracy rates were all more than 98%. XGBoost and Random Forest were also very impressive as advanced models, while simple models like Decision Trees and Naïve Bayes proved to be less potent and were unable to manage class imbalances and complex data patterns. Our main findings are essentially reflective of the transformative role machine learning would play in healthcare; for instance, enhancing the accuracy of diagnosis, optimizing clinical workflow, and promoting decision-making. These insights are made actionable for practitioners in healthcare to promote the adoption of reliable ML solutions for breast cancer detection. In the future, real-time data integration, external validation, and hybrid modeling approaches must be considered to further enhance the practical utility of these findings.
2025
2025
28
44
10.54216/FPA.190203
https://www.americaspg.com/articleinfo/3/show/3659