Using a Logistic Regression and K Nearest Neighbor Model to Accurately Diagnose Breast Cancer

Authors

  • Shridhula Srinivasan Monta Vista High School
  • Mridula Srinivasan
  • Govind Tatachari

DOI:

https://doi.org/10.47611/jsrhs.v11i4.3063

Keywords:

Breast Cancer, artificial intelligence

Abstract

Breast cancer is one of the most dangerous and rapidly growing diseases in the world. Diagnosing breast cancer is expensive, difficult, and time-consuming. However, artificial intelligence and machine learning algorithms can help physicians to diagnose people with breast cancer at an early stage which will help people to avoid exhaustive treatments. The objective of our research was to classify if someone has malignant or benign cancer. We used the Wisconsin Breast Cancer dataset which was obtained from the UCI repository to create models using supervised learning. We used K Nearest Neighbors, and Logistic Regression algorithms to obtain a model with high accuracy. Both the models had an accuracy of 97%. In the future, the model can be enhanced to be more accurate and accessible to people. This research can help others to create models to predict various other cancers. In the future, we would also like to improve the model by using other methods like image recognition and reducing the input from the user to make it more accurate and accessible. 

Downloads

Download data is not yet available.

References or Bibliography

UCI Machine Learning Repository: Breast Cancer wisconsin (diagnostic) data set. (n.d.). Retrieved June 25, 2022, from https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(diagnostic)

Supervised machine learning - javatpoint. www.javatpoint.com. (n.d.). Retrieved June 25, 2022, from https://www.javatpoint.com/supervised-machine-learning

Breast cancer early detection and diagnosis: How to detect breast cancer. American Cancer Society. (n.d.). Retrieved June 25, 2022, from https://www.cancer.org/cancer/breast-cancer/screening-tests-and-early-detection.html

Logistic regression. Logistic Regression - an overview | ScienceDirect Topics. (n.d.). Retrieved June 25, 2022, from https://www.sciencedirect.com/topics/computer-science/logistic-regression#:~:text=Logistic%20regression%20is%20a%20process,%2Fno%2C%20and%20so%20on.

Statistical Data Analysis Techniques in machine learning. Analytics Vidhya. (2021, June 24). Retrieved June 25, 2022, from https://www.analyticsvidhya.com/blog/2021/06/must-know-statistical-data-analysis-techniques-in-machine-learning/

Mayo Foundation for Medical Education and Research. (2022, April 27). Breast cancer. Mayo Clinic. Retrieved June 30, 2022, from https://www.mayoclinic.org/diseases-conditions/breast-cancer/diagnosis-treatment/drc-20352475#:~:text=Most%20women%20undergo%20surgery%20for,before%20surgery%20in%20certain%20situations.

Published

11-30-2022

How to Cite

Srinivasan, S., Srinivasan, M., & Tatachari, G. (2022). Using a Logistic Regression and K Nearest Neighbor Model to Accurately Diagnose Breast Cancer. Journal of Student Research, 11(4). https://doi.org/10.47611/jsrhs.v11i4.3063

Issue

Section

HS Research Projects