The Early Detection of Lung Cancer among Indian Patients using Machine Learning Algorithms
DOI:
https://doi.org/10.47611/jsr.v13i1.2383Keywords:
Indian Lung Cancer, Prediction, Machine Learning, DiagnosisAbstract
Lung cancer causes 1 in 5 cancer related deaths globally. Statistics show that less than 16% of lung cancer cases are detected at an initial stage, leading to a significant number of lung cancer deaths within a year of diagnosis. This emphasises a requirement of improved early diagnostic methodologies for lung cancer. For this primary research, we collected comprehensive data from a prominent Indian hospital for over 2500 Indian patients. Our objective was to employ machine learning(ML) models to identify individuals at a heightened risk of developing lung cancer and more specifically, to identify key symptoms that could guide early diagnosis within the Indian populace. Furthermore, we aimed to assess and compare the diagnostic effectiveness of various machine learning (ML) models using established metrics. By evaluating multiple ML models, our study sought to identify the most optimal approach for early lung cancer detection in the Indian population. Notably, among the evaluated models, the Random Forest model emerged as the most promising. It demonstrated remarkable performance across multiple metrics, ultimately achieving the highest diagnostic accuracy of 93.3%. The significant findings of this study provide a strong foundation for future advancements in lung cancer diagnosis in India.
Downloads
Metrics
References or Bibliography
Beresford, M.J., Wilson, G.D. and Makris, A. (2006) ‘Measuring proliferation in breast cancer: Practicalities and applications’, Breast Cancer Research, 8(6). doi:10.1186/bcr1618.
Brennan, P. and Davey-Smith, G. (2021) ‘Identifying novel causes of cancers to enhance cancer prevention: New strategies are needed’, JNCI: Journal of the National Cancer Institute, 114(3), pp. 353–360. doi:10.1093/jnci/djab204.
Cancer (2022) World Health Organization. Available at: https://www.who.int/news-room/fact-sheets/detail/cancer (Accessed: 24 July 2023).
Falzone, L., Salomone, S. and Libra, M. (2018) ‘Evolution of cancer pharmacological treatments at the turn of the Third Millennium’, Frontiers in Pharmacology, 9. doi:10.3389/fphar.2018.01300.
Fares, J. et al. (2020) Molecular principles of metastasis: A hallmark of cancer revisited, Nature News. Available at: https://www.nature.com/articles/s41392-020-0134-x (Accessed: 18 September 2023).
Mathur, P. et al. (2022) ‘A clinicoepidemiological profile of lung cancers in India – results from the National Cancer Registry Programme’, Indian Journal of Medical Research, 155(2), p. 264. doi:10.4103/ijmr.ijmr_1364_21.
Huang, S., Yang, J., Fong, S., & Zhao, Q. (2020). Artificial Intelligence in cancer diagnosis and prognosis: Opportunities and challenges. Cancer Letters, 471, 61-71. doi:10.1016/j.canlet.2019.12.007
Iorio, F., Knijnenburg, T. A., Vis, D. J., Bignell, G. R., Menden, M. P., Schubert, M., . . . Garnett, M. J. (2016). A landscape of pharmacogenomic interactions in cancer. Cell, 166(3), 740-754. doi:10.1016/j.cell.2016.06.017
P.R., R., Nair, R.A.S. and G., V. (2019) ‘A comparative study of lung cancer detection using machine learning algorithms’, 2019 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT) [Preprint]. doi:10.1109/icecct.2019.8869001.
de Groot, P.M. et al. (2018) ‘The epidemiology of Lung Cancer’, Translational Lung Cancer Research, 7(3), pp. 220–233. doi:10.21037/tlcr.2018.05.06.
Bates JHT, Hamlington KL, Garrison G, Kinsey CM. Prediction of lung cancer risk based on age and smoking history. Comput Methods Programs Biomed. 2022 Apr;216:106660. doi: 10.1016/j.cmpb.2022.106660. Epub 2022 Jan 25. PMID: 35114461; PMCID: PMC8920760.
Damani A, Ghoshal A, Salins N, Muckaden MA, Deodhar J. High Prevalence of Dyspnea in Lung Cancer: An Observational Study. Indian J Palliat Care. 2019 Jul-Sep;25(3):403-406. doi: 10.4103/IJPC.IJPC_64_19. PMID: 31413456; PMCID: PMC6659529.
Harle, A.S.M. et al. (2019) ‘Cough in patients with lung cancer’, Chest, 155(1), pp. 103–113. doi:10.1016/j.chest.2018.10.003.
Gorlova, O.Y. et al. (2007) ‘Aggregation of cancer among relatives of never-smoking lung cancer patients’, International Journal of Cancer, 121(1), pp. 111–118. doi:10.1002/ijc.22615.
Singh, N. et al. (2021) ‘Lung cancer in India’, Journal of Thoracic Oncology, 16(8), pp. 1250–1266. doi:10.1016/j.jtho.2021.02.004.
Published
How to Cite
Issue
Section
Copyright (c) 2024 Sarthak Gupta
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.