diagnosing_breast_cancer

An accurate diagnosis of breast cancer is critical to the well-being of the patient. The analysis of data from fine needle aspirate (FNA) images of cell nuclei sampled from benign and malignant breast tumors can be applied to develop a statistical learning model to correctly classify tumors as cancerous or benign, using measurements taken from similar FNA images. The data set used in this study is a cleaned version of the 1993 Street et al. data from the University of Wisconsin, and consists of 569 observations of women with breast tumors. The dependent variable is whether the tumor was malignant or benign, and the 30 features of the data are measures of the shape, size, and texture of the tumor cell nuclei derived from the FNA images.

Past models have achieved an estimated 97.5% accuracy rate for this data set, and the objective of this research is to improve this accuracy rate through the application of several classification techniques. One classification method will be selected as the best through repeated tests on a validation set randomly sampled from the data. Models to be investigated include the logistic regression model, tree methods such as random forests, support vector machines with linear kernels, and k nearest neighbors. Variable selection procedures will be implemented to refine these models and to discover the most important features. Health care professionals can implement the selected model in the R language to better diagnose breast cancer.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
cancer_analysis_files/figure-markdown_github		cancer_analysis_files/figure-markdown_github
code_md_files/figure-markdown_github		code_md_files/figure-markdown_github
.gitignore		.gitignore
README.md		README.md
breast_cancer.R		breast_cancer.R
cancer_analysis.Rmd		cancer_analysis.Rmd
cancer_analysis.md		cancer_analysis.md
code_md.Rmd		code_md.Rmd
code_md.md		code_md.md
data.csv		data.csv
diagnosing_breast_cancer.Rproj		diagnosing_breast_cancer.Rproj
fna_image_2.JPG		fna_image_2.JPG

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cancer_analysis_files/figure-markdown_github

cancer_analysis_files/figure-markdown_github

code_md_files/figure-markdown_github

code_md_files/figure-markdown_github

.gitignore

.gitignore

README.md

README.md

breast_cancer.R

breast_cancer.R

cancer_analysis.Rmd

cancer_analysis.Rmd

cancer_analysis.md

cancer_analysis.md

code_md.Rmd

code_md.Rmd

code_md.md

code_md.md

data.csv

data.csv

diagnosing_breast_cancer.Rproj

diagnosing_breast_cancer.Rproj

fna_image_2.JPG

fna_image_2.JPG

Repository files navigation

diagnosing_breast_cancer

About

Releases

Packages

Languages

akl21/diagnosing_breast_cancer

Folders and files

Latest commit

History

Repository files navigation

diagnosing_breast_cancer

About

Resources

Stars

Watchers

Forks

Languages