The Early Detection of Lung Cancer among Indian Patients using Machine Learning Algorithms

Authors

  • Sarthak Gupta Delhi Public School, Gurgaon

DOI:

https://doi.org/10.47611/jsr.v13i1.2383

Keywords:

Indian Lung Cancer, Prediction, Machine Learning, Diagnosis

Abstract

Lung cancer causes 1 in 5 cancer related deaths globally. Statistics show that less than 16% of lung cancer cases are detected at an initial stage, leading to a significant number of lung cancer deaths within a year of diagnosis. This emphasises a requirement of improved early diagnostic methodologies for lung cancer. For this primary research, we collected comprehensive data from a prominent Indian hospital for over 2500 Indian patients. Our objective was to employ machine learning(ML) models to identify individuals at a heightened risk of developing lung cancer and more specifically, to identify key symptoms that could guide early diagnosis within the Indian populace. Furthermore, we aimed to assess and compare the diagnostic effectiveness of various machine learning (ML) models using established metrics. By evaluating multiple ML models, our study sought to identify the most optimal approach for early lung cancer detection in the Indian population. Notably, among the evaluated models, the Random Forest model emerged as the most promising. It demonstrated remarkable performance across multiple metrics, ultimately achieving the highest diagnostic accuracy of 93.3%. The significant findings of this study provide a strong foundation for future advancements in lung cancer diagnosis in India.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References or Bibliography

Beresford, M.J., Wilson, G.D. and Makris, A. (2006) ‘Measuring proliferation in breast cancer: Practicalities and applications’, Breast Cancer Research, 8(6). doi:10.1186/bcr1618.

Brennan, P. and Davey-Smith, G. (2021) ‘Identifying novel causes of cancers to enhance cancer prevention: New strategies are needed’, JNCI: Journal of the National Cancer Institute, 114(3), pp. 353–360. doi:10.1093/jnci/djab204.

Cancer (2022) World Health Organization. Available at: https://www.who.int/news-room/fact-sheets/detail/cancer (Accessed: 24 July 2023).

Falzone, L., Salomone, S. and Libra, M. (2018) ‘Evolution of cancer pharmacological treatments at the turn of the Third Millennium’, Frontiers in Pharmacology, 9. doi:10.3389/fphar.2018.01300.

Fares, J. et al. (2020) Molecular principles of metastasis: A hallmark of cancer revisited, Nature News. Available at: https://www.nature.com/articles/s41392-020-0134-x (Accessed: 18 September 2023).

Mathur, P. et al. (2022) ‘A clinicoepidemiological profile of lung cancers in India – results from the National Cancer Registry Programme’, Indian Journal of Medical Research, 155(2), p. 264. doi:10.4103/ijmr.ijmr_1364_21.

Huang, S., Yang, J., Fong, S., & Zhao, Q. (2020). Artificial Intelligence in cancer diagnosis and prognosis: Opportunities and challenges. Cancer Letters, 471, 61-71. doi:10.1016/j.canlet.2019.12.007

Iorio, F., Knijnenburg, T. A., Vis, D. J., Bignell, G. R., Menden, M. P., Schubert, M., . . . Garnett, M. J. (2016). A landscape of pharmacogenomic interactions in cancer. Cell, 166(3), 740-754. doi:10.1016/j.cell.2016.06.017

P.R., R., Nair, R.A.S. and G., V. (2019) ‘A comparative study of lung cancer detection using machine learning algorithms’, 2019 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT) [Preprint]. doi:10.1109/icecct.2019.8869001.

de Groot, P.M. et al. (2018) ‘The epidemiology of Lung Cancer’, Translational Lung Cancer Research, 7(3), pp. 220–233. doi:10.21037/tlcr.2018.05.06.

Bates JHT, Hamlington KL, Garrison G, Kinsey CM. Prediction of lung cancer risk based on age and smoking history. Comput Methods Programs Biomed. 2022 Apr;216:106660. doi: 10.1016/j.cmpb.2022.106660. Epub 2022 Jan 25. PMID: 35114461; PMCID: PMC8920760.

Damani A, Ghoshal A, Salins N, Muckaden MA, Deodhar J. High Prevalence of Dyspnea in Lung Cancer: An Observational Study. Indian J Palliat Care. 2019 Jul-Sep;25(3):403-406. doi: 10.4103/IJPC.IJPC_64_19. PMID: 31413456; PMCID: PMC6659529.

Harle, A.S.M. et al. (2019) ‘Cough in patients with lung cancer’, Chest, 155(1), pp. 103–113. doi:10.1016/j.chest.2018.10.003.

Gorlova, O.Y. et al. (2007) ‘Aggregation of cancer among relatives of never-smoking lung cancer patients’, International Journal of Cancer, 121(1), pp. 111–118. doi:10.1002/ijc.22615.

Singh, N. et al. (2021) ‘Lung cancer in India’, Journal of Thoracic Oncology, 16(8), pp. 1250–1266. doi:10.1016/j.jtho.2021.02.004.

Published

02-29-2024

How to Cite

Gupta, S. (2024). The Early Detection of Lung Cancer among Indian Patients using Machine Learning Algorithms. Journal of Student Research, 13(1). https://doi.org/10.47611/jsr.v13i1.2383

Issue

Section

Research Articles