Fusion: Practice and Applications

Journal DOI

https://doi.org/10.54216/FPA

Submit Your Paper

2692-4048ISSN (Online) 2770-0070ISSN (Print)

Volume 19 , Issue 2 , PP: 28-44, 2025 | Cite this article as | XML | Html | PDF | Full Length Article

Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches

Nada M. Sallam 1 * , Eman Ben Salah 2

  • 1 Faculty of Computer Studies, Arab Open University, Riyadh, Saudi Arabia - (n.sallam@arabou.edu.sa)
  • 2 Faculty of Business Studies, Arab Open University, Riyadh, Saudi Arabia - (e.salah@arabou.edu.sa)
  • Doi: https://doi.org/10.54216/FPA.190203

    Received: January 07, 2025 Revised: February 17, 2025 Accepted: March 06, 2025
    Abstract

    One of the significant challenges in modern healthcare is the early and accurate detection of health anomalies, especially in the case of life-threatening diseases such as breast cancer. This paper investigates the comparative efficacy of ML models and statistical methods for the classification of breast tumors as benign or malignant using the Breast Cancer Wisconsin (Diagnostic) Dataset. The dataset, comprising various tumor cell attributes, was preprocessed with Principal Component Analysis (PCA) to enhance model training efficiency. The first 11 principal components retained 95% of the total variance, ensuring minimal information loss while reducing dimensionality. We compared the performance of several machine learning algorithms, including Logistic Regression, Support Vector Machines (SVM), Artificial Neural Networks (ANN), Decision Trees (DT), Random Forests (RF), Naïve Bayes (NB), and K-Nearest Neighbors (KNN). Among them, Logistic Regression, SVM, and ANN achieved near-perfect classification accuracy with balanced precision-recall metrics, where the accuracy rates were all more than 98%. XGBoost and Random Forest were also very impressive as advanced models, while simple models like Decision Trees and Naïve Bayes proved to be less potent and were unable to manage class imbalances and complex data patterns. Our main findings are essentially reflective of the transformative role machine learning would play in healthcare; for instance, enhancing the accuracy of diagnosis, optimizing clinical workflow, and promoting decision-making. These insights are made actionable for practitioners in healthcare to promote the adoption of reliable ML solutions for breast cancer detection. In the future, real-time data integration, external validation, and hybrid modeling approaches must be considered to further enhance the practical utility of these findings.

    Keywords :

    Breast Cancer Classification , Machine Learning , Artificial intelligence Statistical Methods , Dimensionality Reduction

    References

    [1] P. Esmaeilzadeh, “Challenges and strategies for wide-scale artificial intelligence (AI) deployment in healthcare practices: A perspective for healthcare organizations,” Artif. Intell. Med., vol. 151, p. 102861, 2024. 

    [2] R. G. L. da Silva, “The advancement of artificial intelligence in biomedical research and health innovation: challenges and opportunities in emerging economies,” Global Health, vol. 20, no. 1, p. 44, 2024. 

    [3] A. Alanazi, “Using machine learning for healthcare challenges and opportunities,” Inform. Med. Unlocked, vol. 30, p. 100924, 2022. 

    [4] A. A. Soomro et al., “Insights into modern machine learning approaches for bearing fault classification: a systematic literature review,” Results Eng., p. 102700, 2024. 

    [5] Z. Zong and Y. Guan, “AI-driven intelligent data analytics and predictive analysis in Industry 4.0: Transforming knowledge, innovation, and efficiency,” J. Knowl. Econ., pp. 1–40, 2024. 

    [6] M. Javaid et al., “Significance of machine learning in healthcare: Features, pillars and applications,” Int. J. Intell. Netw., vol. 3, pp. 58–73, 2022. 

    [7] R. Krishnamoorthi et al., “[Retracted] A novel diabetes healthcare disease prediction framework using machine learning techniques,” J. Healthc. Eng., vol. 2022, no. 1, p. 1684017, 2022. 

    [8] D. A. Debal and T. M. Sitote, “Chronic kidney disease prediction using machine learning techniques,” J. Big Data, vol. 9, no. 1, p. 109, 2022. 

    [9] H. Lu et al., “A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus,” Appl. Intell., vol. 52, no. 3, pp. 2411–2422, 2022. 

    [10] S. I. Ayon et al., “Coronary artery heart disease prediction: A comparative study of computational intelligence techniques,” IETE J. Res., vol. 68, no. 4, pp. 2488–2507, 2022. 

    [11] Ç. Oğuz and M. Yağanoğlu, “Detection of COVID-19 using deep learning techniques and classification methods,” Inf. Process. Manag., vol. 59, no. 5, p. 103025, 2022. 

    [12] J. Chung and J. Teo, “Mental health prediction using machine learning: taxonomy, applications, and challenges,” Appl. Comput. Intell. Soft Comput., vol. 2022, no. 1, p. 9970363, 2022. 

    [13] V. Chang et al., “An artificial intelligence model for heart disease detection using machine learning algorithms,” Healthc. Anal., vol. 2, p. 100016, 2022. 

    [14] K. Zeberga et al., “[Retracted] A novel text mining approach for mental health prediction using Bi-LSTM and BERT model,” Comput. Intell. Neurosci., vol. 2022, no. 1, p. 7893775, 2022. 

    [15] M. Rezapour and L. Hansen, “A machine learning analysis of COVID-19 mental health data,” Sci. Rep., vol. 12, no. 1, p. 14965, 2022. 

    [16] A. L. Yadav et al., “Heart diseases prediction using machine learning,” in Proc. 14th Int. Conf. Comput. Commun. Netw. Technol. (ICCCNT), 2023, pp. 1–7. 

    [17] A. S. Kwekha-Rashid et al., “Coronavirus disease (COVID-19) cases analysis using machine-learning applications,” Appl. Nanosci., vol. 13, no. 3, pp. 2013–2025, 2023. 

    [18] K. Arumugam et al., “Multiple disease prediction using machine learning algorithms,” Mater. Today Proc., vol. 80, pp. 3682–3685, 2023. 

    [19] C. M. Bhatt et al., “Effective heart disease prediction using machine learning techniques,” Algorithms, vol. 16, no. 2, p. 88, 2023. 

    [20] S. Goyal and R. Singh, “Detection and classification of lung diseases for pneumonia and COVID-19 using machine and deep learning techniques,” J. Ambient Intell. Humaniz. Comput., vol. 14, no. 4, pp. 3239–3259, 2023. 

    [21] M. Ajith et al., “A deep learning approach for mental health quality prediction using functional network connectivity and assessment data,” Brain Imaging Behav., pp. 1–16, 2024. 

    [22] S. Jayanthi et al., “Mental health status monitoring for people with autism spectrum disorder using machine learning,” Int. J. Inf. Technol., vol. 16, no. 1, pp. 43–51, 2024. 

    [23] A. Ahmad, “Breast cancer statistics: Recent trends,” in Breast Cancer Metastasis and Drug Resistance: Challenges and Progress, pp. 1–7, 2019. 

    [24] R. D. Joshi and C. K. Dhakal, “Predicting type 2 diabetes using logistic regression and machine learning approaches,” Int. J. Environ. Res. Public Health, vol. 18, no. 14, p. 7346, 2021. 

    [25] O. Kramer, “K-nearest neighbors,” in Dimensionality Reduction with Unsupervised Nearest Neighbors, pp. 13–23, 2013. 

    [26] S. Zhang, “Nearest neighbor selection for iteratively kNN imputation,” J. Syst. Softw., vol. 85, no. 11, pp. 2541–2552, 2012. 

    [27] M. Awad and R. Khanna, “Support vector machines for classification,” in Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, pp. 39–66, 2015. 

    [28] V. G. Costa and C. E. Pedreira, “Recent advances in decision trees: An updated survey,” Artif. Intell. Rev., vol. 56, no. 5, pp. 4765–4800, 2023. 

    [29] K. Fawagreh et al., “Random forests: From early developments to recent advancements,” Syst. Sci. Control Eng., vol. 2, no. 1, pp. 602–609, 2014. 

    [30] K. Larsen, “Generalized naive Bayes classifiers

    Cite This Article As :
    M., Nada. , Ben, Eman. Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications, vol. , no. , 2025, pp. 28-44. DOI: https://doi.org/10.54216/FPA.190203
    M., N. Ben, E. (2025). Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications, (), 28-44. DOI: https://doi.org/10.54216/FPA.190203
    M., Nada. Ben, Eman. Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications , no. (2025): 28-44. DOI: https://doi.org/10.54216/FPA.190203
    M., N. , Ben, E. (2025) . Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications , () , 28-44 . DOI: https://doi.org/10.54216/FPA.190203
    M. N. , Ben E. [2025]. Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches. Fusion: Practice and Applications. (): 28-44. DOI: https://doi.org/10.54216/FPA.190203
    M., N. Ben, E. "Innovations in Health Anomaly Detection: A Comparative Review of Machine Learning and Statistical Approaches," Fusion: Practice and Applications, vol. , no. , pp. 28-44, 2025. DOI: https://doi.org/10.54216/FPA.190203