Skip to content

ryanquinnnelson/CMU-02718-Biomarker-Discovery-using-ML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CMU-02718-HW2

Fall 2020 - Computational Medicine course project - Biomarker Discovery on Immunological Data (HW 2)

Summary

This project uses feature selection to identify inflammatory biomarkers that can distinguish between one of three conditions in children:

  • SARS-CoV-2
  • Multi-system Inflammatory Syndrome in Children (MIS-C)
  • Kawasaki disease

The project involves experimenting with three major categories of feature selection (filter-based, wrapper-based, embedded), applying standard techniques for preprocessing and training (standardization, encoding, cross-validation), and using multiple machine learning techniques (Mutual Information, Recursive Feature Elimination, Random Forest Classifier, SVM) to identify biomarkers. The project also employs a permutation test strategy for identifying and ignoring spurious correlation.

Analysis was performed using Jupyter Notebook and Python.