Skip to content

czephyr/heart_disease_classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Heart Disease Classification

This repository hosts an analysis performed on the Personal Key Indicators of Heart Disease dataset from the CDC annual USA health survey.

The objective of the analysis was prediction of incidence of Heart Disease in respondents based on lifestyle indicators such as BMI, Age, physical activity, diseases and mental health.

An AUC score of 0.759 and a Precision score of 0.78, in finding out if the respondent had Heart Disease, was achieved by using a Logistic Regression optimizing the threshold for Precision using a nested cross validation. The decision to optimize for Precision was made given the unbalanced nature of the data and the conclusion that, correctly identifying diseased was of more importance than creating false positives; a problem that could be eventually addressed by further work in creating categories of risk depending on the probability output by the model.

A comprehensive report on the results of the analysis can be found in report.pdf.

Presentation preview

Following are some slides taken from the powerpoint presentation used to present the study keypoints.

slide1

slide4

slide8

slide9

slide14

About

🫀 Prediction of incidence of Heart Disease in respondents based on lifestyle indicators such as BMI, Age, physical activity, diseases and mental health.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages