Using a Logistic Regression and K Nearest Neighbor Model to Accurately Diagnose Breast Cancer
DOI:
https://doi.org/10.47611/jsrhs.v11i4.3063
Keywords:
Breast Cancer, artificial intelligence
Abstract
Breast cancer is one of the most dangerous and fastest-growing diseases in the world, and diagnosing it is expensive, difficult, and time-consuming. Artificial intelligence and machine learning algorithms can help physicians diagnose breast cancer at an early stage, which can spare patients from exhaustive treatments. The objective of our research was to classify a breast tumor as malignant or benign. We used the Wisconsin Breast Cancer dataset, obtained from the UCI repository, to build models with supervised learning. We trained K Nearest Neighbors and Logistic Regression algorithms to obtain a model with high accuracy; both models reached an accuracy of 97%. This research can help others create models to predict various other cancers. In the future, we would like to make the model more accurate and more accessible by using other methods, such as image recognition, and by reducing the input required from the user.
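To illustrate how the two classifiers named in the abstract work, the following is a minimal sketch in plain Python. It is not the authors' implementation: the page does not state their preprocessing, train/test split, or hyperparameters, so the value of k, the learning rate, the number of epochs, and the 75/25 split are all assumptions. The data is a hypothetical two-feature toy set (label 1 standing in for malignant, 0 for benign), already centered as if the features had been standardized; it is not drawn from the Wisconsin dataset, so the printed accuracies say nothing about the 97% reported above.

```python
import math
import random


def make_toy_data(n_per_class=100, seed=0):
    """Hypothetical data: class 0 around (-1, -1), class 1 around (1, 1)."""
    rng = random.Random(seed)
    X, y = [], []
    for label, center in ((0, -1.0), (1, 1.0)):
        for _ in range(n_per_class):
            X.append([rng.gauss(center, 0.6), rng.gauss(center, 0.6)])
            y.append(label)
    return X, y


def split_data(X, y, test_fraction=0.25, seed=0):
    """Shuffle the indices and hold out a fraction of the rows for testing."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    cut = int(len(idx) * (1 - test_fraction))
    train, test = idx[:cut], idx[cut:]
    return ([X[i] for i in train], [y[i] for i in train],
            [X[i] for i in test], [y[i] for i in test])


def knn_predict(X_train, y_train, x, k=5):
    """Majority vote among the k training points closest to x (Euclidean)."""
    nearest = sorted(zip(X_train, y_train), key=lambda p: math.dist(p[0], x))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if votes * 2 > k else 0


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(X, y, lr=0.1, epochs=500):
    """Batch gradient descent on the mean log-loss; returns (weights, bias)."""
    w, b = [0.0] * len(X[0]), 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def logistic_predict(w, b, x):
    """Predict class 1 when the modeled probability is at least 0.5."""
    return 1 if sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b) >= 0.5 else 0


def accuracy(preds, truth):
    return sum(p == t for p, t in zip(preds, truth)) / len(truth)


X, y = make_toy_data()
X_tr, y_tr, X_te, y_te = split_data(X, y)

knn_acc = accuracy([knn_predict(X_tr, y_tr, x) for x in X_te], y_te)
w, b = fit_logistic(X_tr, y_tr)
logreg_acc = accuracy([logistic_predict(w, b, x) for x in X_te], y_te)

print(f"KNN accuracy: {knn_acc:.2f}")
print(f"Logistic regression accuracy: {logreg_acc:.2f}")
```

In practice one would more likely rely on a library: scikit-learn provides `KNeighborsClassifier` and `LogisticRegression`, and bundles the Wisconsin diagnostic data as `sklearn.datasets.load_breast_cancer`. Whether the authors used these tools is not stated on this page.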
References or Bibliography
UCI Machine Learning Repository: Breast Cancer Wisconsin (Diagnostic) Data Set. (n.d.). Retrieved June 25, 2022, from https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(diagnostic)
Supervised machine learning - javatpoint. www.javatpoint.com. (n.d.). Retrieved June 25, 2022, from https://www.javatpoint.com/supervised-machine-learning
Breast cancer early detection and diagnosis: How to detect breast cancer. American Cancer Society. (n.d.). Retrieved June 25, 2022, from https://www.cancer.org/cancer/breast-cancer/screening-tests-and-early-detection.html
Logistic regression. Logistic Regression - an overview | ScienceDirect Topics. (n.d.). Retrieved June 25, 2022, from https://www.sciencedirect.com/topics/computer-science/logistic-regression#:~:text=Logistic%20regression%20is%20a%20process,%2Fno%2C%20and%20so%20on.
Statistical Data Analysis Techniques in machine learning. Analytics Vidhya. (2021, June 24). Retrieved June 25, 2022, from https://www.analyticsvidhya.com/blog/2021/06/must-know-statistical-data-analysis-techniques-in-machine-learning/
Mayo Foundation for Medical Education and Research. (2022, April 27). Breast cancer. Mayo Clinic. Retrieved June 30, 2022, from https://www.mayoclinic.org/diseases-conditions/breast-cancer/diagnosis-treatment/drc-20352475#:~:text=Most%20women%20undergo%20surgery%20for,before%20surgery%20in%20certain%20situations.
Copyright (c) 2022 Shridhula Srinivasan, Mridula Srinivasan; Govind Tatachari
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distribute & display this article.