Journal of Artificial Intelligence and Metaheuristics

Journal DOI

https://doi.org/10.54216/JAIM

Submit Your Paper

2833-5597ISSN (Online)

Volume 9 , Issue 1 , PP: 44-52, 2025 | Cite this article as | XML | Html | PDF | Full Length Article

Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models

Nada M. Sallam 1 * , P. K. Dutta 2

  • 1 Faculty of Computer Studies, Arab Open University, Riyadh, 11681, Saudi Arabia - (n.sallam@arabou.edu.sa)
  • 2 School of Engineering and Technology, Amity University Kolkata, India - (pkdutta@kol.amity.edu)
  • Doi: https://doi.org/10.54216/JAIM.090105

    Received: October 21, 2024 Revised: December 17, 2024 Accepted: January 19, 2025
    Abstract

    The incidence of lung cancer varies in males and females, which occurs due to the abnormal and uncontrolled growth of cells in the lungs. It has a greater predilection in males as compared to females. Smoking is the most important risk factor for lung cancer. It causes serious breathing issues and also affects other organs. It increases the mortality rate both in young adults as well as in the older age group. Therefore, there is improvement in medical technologies to facilitate specialized diagnosis and treatment, but the mortality has not been controlled to a satisfactory extent. It is important to take preventive measures and precautions at the initial stages. Machine learning brings various advancements to the medical sector due to which various diseases can be detected at an early stage. In this paper, we presented different machine learning classifier techniques used for the classification of the present lung cancer data in the UCI machine learning repository as benign and malignant. The dataset is divided into cancerous and non-cancerous by converting the input data into binary form and using the classifier technique in theWeka tool. This specifically includes classifiers used: Logistic Regression, Random Forest, Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), Decision Trees, and Na¨ıve Bayes. In addition, we study the effect of data preprocessing methods on our prediction accuracy, such as data normalization and feature selection. The study seeks to help develop various reliable resources for lung cancer identification, which are critical for diagnosing and treating patients in a timely manner and improving their outcomes.

    Keywords :

    Lung cancer prediction , Machine learning classifiers , Smoking risk factors , Medical data preprocessing , Early cancer diagnosis.

    References

    [1] R. Patra. Prediction of lung cancer using machine learning classifier. pages 132–142.

    [2] M. I. Faisal, S. Bashir, S. Khan, Z. S. Khan, and F. H. Khan. An evaluation of machine learning classifiers and ensembles for early stage prediction of lung cancer. In Proceedings of the 2018 3rd International Conference on Emerging Trends in Engineering, Sciences and Technology (ICEEST), pages 1–4, Thrissur, Kerala, India.

    [3] J. A. Bartholomai and H. B. Frieboes. Lung cancer survival prediction via machine learning regression, classification, and statistical techniques. pages 632–637.

    [4] Yasemin G¨ultepe. Performance of lung cancer prediction methods using different classification algorithms. Computers, Materials & Continua.

    [5] Joy Chakra Bortty, Proshanta Kumar Bhowmik, Syed Ali Reza, Irin Akter Liza, Mohammed Nazmul Islam Miah, Muhammad Shoyaibur Rahman Chowdhury, and Md Al Amin. Optimizing lung cancer risk prediction with advanced machine learning algorithms and techniques. Journal of Modern Health Sciences.

    [6] Qinglan Ma, Yulong Shen, Wei Guo, Kaiyan Feng, Tao Huang, and Yudong Cai. Machine learning reveals impacts of smoking on gene profiles of different cell types in lung. Life, 14(502).

    [7] R. K. Pathan, I. J. Shorna, M. S. Hossain, M. U. Khandaker, H. I. Almohammed, and Z. Y. Hamd. The efficacy of machine learning models in lung cancer risk prediction with explainability. PLoS ONE, 19(6):e0305035.

    [8] E. Nemlander, A. Rosenblad, E. Abedi, S. Ekman, J. Hasselstrom, and L. E. Eriksson. Lung cancer prediction using machine learning on data from a symptom e-questionnaire for never smokers, former smokers and current smokers. PLoS ONE, 17(10):e0276703.

    [9] J. C. Hsu, P.-A. Nguyen, P. T. Phuc, T.-C. Lo, M.-H. Hsu, M.-S. Hsieh, N. Q. K. Le, C.-T. Cheng, T. H. Chang, and C.-Y. Chen. Development and validation of novel deep-learning models using multiple data types for lung cancer survival. Cancers, 14(5562).

    [10] E. Dritsas and M. Trigka. Lung cancer risk prediction with machine learning models. Big Data Cogn. Comput., 6(139).

    [11] K. Balachandran and R. Anitha. Ensemble based optimal classification model for pre diagnosis of lung cancer. In IEEE, page 31661.

    [12] K. Kobyli´nska, T. Orłowski, M. Adamek, and P. Biecek. Explainable machine learning for lung cancer screening models. Applied Sciences, 12(1926).

    [13] Huu-Huy Ngo and Hung Linh Le. A prediction model for lung cancer levels based on machine learning. International Journal of Open Information Technologies, 10(5):202.

    [14] Saif Al Rumhi, Raza Hasan, Saqib Hussain, and Jitendra Pandey. Lung cancer prediction using machine learning techniques. In 6th Middle East College Student Research Conference Proceeding.

    [15] Radhanath Patra. Prediction of lung cancer using machine learning classifier. Electronics Science, Berhampur University, Berhampur, Odisha, India.

    [16] Md Nur Hossain, Murshida Alam, Nafis Anjum, Md Habibur Rahman, Md Siam Taluckder, Md Nad Vi Al Bony, SMShadul Islam Rishad, and Afrin Hoque Jui. Performance of machine learning algorithms for lung cancer prediction: a comparative study.

    [17] Masoud Aliramezani, Charles Robert Koch, and Mahdi Shahbakhti. Modeling, diagnostics, optimization, and control of internal combustion engines via modern machine learning techniques: A review and future directions. Progress in Energy and Combustion Science, 88:100967, January 2022.

    [18] Heng Chi, Yuyu Zhang, Tsz Ling Elaine Tang, Lucia Mirabella, Livio Dalloro, Le Song, and Glaucio H. Paulino. Universal machine learning for topology optimization. Computer Methods in Applied Mechanics and Engineering, 375:112739, March 2021.

    [19] Kerr Ding, Michael Chin, Yunlong Zhao, Wei Huang, Binh Khanh Mai, Huanan Wang, Peng Liu, Yang Yang, and Yunan Luo. Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering. Nature Communications, 15(1):6392, July 2024. Publisher: Nature Publishing Group.

    [20] Ponnarasan Krishnan. Ai-Driven Optimization In Healthcare: Machine Learning Models For Predictive Diagnostics And Personalized Treatment Strategies. Well Testing Journal, 33(S2):10–33, September 2024. Number: S2.

    [21] Shiqi Wang, Peng Xia, Keyu Chen, Fuyuan Gong, Hailong Wang, Qinghe Wang, Yuxi Zhao, and Weiliang Jin. Prediction and optimization model of sustainable concrete properties using machine learning, deep learning and swarm intelligence: A review. Journal of Building Engineering, 80:108065, December 2023.

    [22] Morteza Esfandyari, Amin Amiri Delouei, and Ali Jalai. Optimization of ultrasonic-excited doublepipe heat exchanger with machine learning and PSO. International Communications in Heat and Mass Transfer, 147:106985, October 2023.

    [23] Binayak Kar, Widhi Yahya, Ying-Dar Lin, and Asad Ali. Offloading Using Traditional Optimization and Machine Learning in Federated Cloud–Edge–Fog Systems: A Survey. IEEE Communications Surveys & Tutorials, 25(2):1199–1226, 2023. Conference Name: IEEE Communications Surveys & Tutorials.

    Cite This Article As :
    M., Nada. , K., P.. Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models. Journal of Artificial Intelligence and Metaheuristics, vol. , no. , 2025, pp. 44-52. DOI: https://doi.org/10.54216/JAIM.090105
    M., N. K., P. (2025). Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models. Journal of Artificial Intelligence and Metaheuristics, (), 44-52. DOI: https://doi.org/10.54216/JAIM.090105
    M., Nada. K., P.. Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models. Journal of Artificial Intelligence and Metaheuristics , no. (2025): 44-52. DOI: https://doi.org/10.54216/JAIM.090105
    M., N. , K., P. (2025) . Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models. Journal of Artificial Intelligence and Metaheuristics , () , 44-52 . DOI: https://doi.org/10.54216/JAIM.090105
    M. N. , K. P. [2025]. Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models. Journal of Artificial Intelligence and Metaheuristics. (): 44-52. DOI: https://doi.org/10.54216/JAIM.090105
    M., N. K., P. "Lung Cancer Prediction from Smoking Cause by Machine Learning Classification Models," Journal of Artificial Intelligence and Metaheuristics, vol. , no. , pp. 44-52, 2025. DOI: https://doi.org/10.54216/JAIM.090105