Skip to content

Sampling algorithms and machine learning models to reduce bias and predict credit risk.

Notifications You must be signed in to change notification settings

mthalken/Credit_Risk_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Credit Risk Analysis

The purpose of this analysis was to use sampling algorithms and machine learning models to reduce bias and predict credit risk to help a Credit Card Lending company provide a more reliable loan experience and more accurate identification for loan candidates.

Project Overview:

  1. Use Resampling Models to Predict Credit Risk
    • Naive Random Oversampling
    • SMOTE Oversampling
    • Undersampling
    • SMOTEEN (Over and Under Sampling)
  2. Use Ensemble Models to Predict Credit Risk
    • Balanced Random Forest Classifier
    • Easy Ensemble AdaBoost Classifier
  3. Interpret results and determine which supervised learning algorithm is best used for this dataset

Resources

Results

From our 6 different machine learning tests that when looking at balanced accuracy score, precision, and recall the Easy Ensemble Adaboost Classifier would be the best to choose at the following: - 93% Balanced Accuracy Score - 7% Precision - 91% Recall

png

png

Even though the Easy Ensemble Adaboost Classifier does not have a very reliable precision score out of the 6 machine learning models used the Easy Ensemble Adaboost Classifier would be the best model for credit card analysis within this dataset.

About

Sampling algorithms and machine learning models to reduce bias and predict credit risk.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published