Applying Machine Learning to Heart Disease Risk Analysis

Authors

  • Sanaa Bhorkar The Harker School
  • Guillermo Goldsztein

DOI:

https://doi.org/10.47611/jsrhs.v13i1.6284

Keywords:

heart disease, artificial intellingence, machine learning, TensorFlow

Abstract

Heart disease is known to be deadly; about 700,000 Americans die from it annually. It is essential for young adults to calculate their risk early on, as many factors could lead to a high probability of heart disease. Some factors, such as diet or weight are obvious, but several others, such as mental health, hours of sleep, and even race, can be crucial indicators. This is where machine learning can help, as AI models can look at a large number of factors together to calculate risk by drawing upon real-world data. This paper discusses the implementation of one such model. The model described below is trained on data from about 320,000 patients in the United States. The model uses a Sequential Neural Network algorithm and was created using TensorFlow. It analyzes each patient's data and calculates their risk for heart disease, performing reasonably well with a 77% accuracy. This model can replace a linear-regression-based model, such as the ASCVD Risk Estimator, as it accounts for more factors in its analysis. Further testing with new and diverse datasets and different algorithms, like a RandomForestClassifier, could potentially improve this model.

Downloads

Download data is not yet available.

References or Bibliography

Alpaydin, E. (2010). Introduction to Machine Learning. MIT Press.

Centers for Disease Control and Prevention (CDC). (2020). Behavioral Risk Factor Surveillance System Survey Data.

U.S. Department of Health and Human Services, Centers for Disease Control and Prevention.

Guido, S. (2016). Introduction to Machine Learning with Python. O'Reilly Media.

Mohri, M., Rostamizadeh, A., & Talwalkar, A. (2012). Foundations of Machine Learning. The MIT Press.

Murphy, K. P. (2013). Machine Learning: A Probabilistic Perspective. MIT Press.

New York Department of Health. (2022, August). Heart Disease in the United States. Heart Disease and

Stroke Prevention. https://www.health.ny.gov/diseases/cardiovascular/heart_disease/

Published

02-28-2024

How to Cite

Bhorkar, S., & Goldsztein, G. (2024). Applying Machine Learning to Heart Disease Risk Analysis. Journal of Student Research, 13(1). https://doi.org/10.47611/jsrhs.v13i1.6284

Issue

Section

HS Research Projects