oopDaniel/COEN281-Imbalanced-Data-Binary-Classification

Antibacterial Peptide Binary Classification

Predictive models that can determine, given an antibacterial peptide, whether it is also an antibiofilm peptide.

(NOTE: the dataset is imbalanced)

Model performance is measured by the Matthews correlation coefficient (MCC).
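
As a rough illustration of why MCC suits an imbalanced dataset, the sketch below computes it from confusion-matrix counts using the standard definition. The function name `mcc`, the 90/10 class split, and the choice to return 0 when the denominator is zero are assumptions made for this example, not details taken from the repository.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: report 0.0 when a row or column of the
    # confusion matrix is empty and the denominator vanishes.
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical 90/10 split: always predicting the majority (negative)
# class is 90% accurate, yet its MCC is 0.
always_negative = mcc(tp=0, tn=90, fp=0, fn=10)

# A classifier that gets every sample right scores 1.
perfect = mcc(tp=10, tn=90, fp=0, fn=0)
```

MCC ranges from -1 to 1, so a trivial majority-class predictor cannot score well the way it can with plain accuracy.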

Preprocess

  • 2-mers or 3-mers
  • Optionally apply PCA
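
The k-mer step can be sketched roughly as follows, assuming overlapping k-mers counted over the 20 standard amino-acid letters. The helper names `kmer_counts` and `kmer_vector` and the example sequence are hypothetical; the repository may tokenize or normalize differently.

```python
from collections import Counter
from itertools import product
from typing import List

# Assumed alphabet: the 20 standard amino-acid one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def kmer_counts(sequence: str, k: int) -> Counter:
    """Count overlapping k-mers in a peptide sequence."""
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def kmer_vector(sequence: str, k: int) -> List[int]:
    """Fixed-length count vector over every possible k-mer, in lexicographic order."""
    counts = kmer_counts(sequence, k)
    return [counts["".join(p)] for p in product(AMINO_ACIDS, repeat=k)]


example = "GIGKFLKK"  # hypothetical sequence, for illustration only
counts_2 = kmer_counts(example, 2)
vec_2 = kmer_vector(example, 2)
```

Under this alphabet a 2-mer vector has 400 entries and a 3-mer vector has 8000, most of them zero for a short peptide, which is one plausible reason to reduce dimensionality with PCA afterwards.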

Methods

  • Logistic Regression
  • Naive Bayes Classifier
  • Support Vector Machine
  • K-nearest Neighbor Classifier
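
To make one of the listed methods concrete, here is a minimal plain-Python sketch of a k-nearest neighbor classifier. It illustrates the technique only and is not the repository's implementation, which may well rely on a library. The function name `knn_predict`, the Euclidean distance, `k=3`, and the toy data are all assumptions.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    # Order the training points by Euclidean distance to the query.
    by_distance = sorted(
        zip(train_x, train_y),
        key=lambda pair: math.dist(pair[0], query),
    )
    # Majority vote among the k closest labels.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy feature vectors standing in for k-mer counts.
train_x = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
train_y = [0, 0, 0, 1, 1, 1]

near_first_cluster = knn_predict(train_x, train_y, [0.5, 0.5])
near_second_cluster = knn_predict(train_x, train_y, [5.5, 5.5])
```

With an odd `k` and two classes the vote cannot tie. On an imbalanced dataset the majority class tends to dominate the neighborhood, which is another reason to evaluate with MCC.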