Skip to content

EricAsna/Loan-Dataset-Exploration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Loan Dataset Exploration

Loan dataset

For this study, I have chosen "loan data from prosper" dataset which is available on the Udacity server. The original loan dataset consists of 113,937 loans with 81 variables on each loan, including loan amount, borrower rate (or interest rate), current loan status, borrower income, and many others. After conducting the wrangling process, 97671 loans were remained and 12 variables were selected for this study. The focus here is on two main features of any loan; loan amount and the interest rate as well as some other important features that can greatly affect the two main variables. Insights created through the explanatory analysis using univariate, bivariate and multivariate visualizations. Features explored in this study are:

Main variables

  1. Loan amount
  2. Interest rate (borrower rate)

Supporting variables

  1. Income range
  2. Employment status
  3. Employment duration
  4. Loan term (duration)
  5. Loan status
  6. Lower & upper credit score
  7. Debt-to-income ratio
  8. Listing category

Source File

Cleaned source file is provided in "Source" folder.

Code

Python codes are available in Code folder.

Summary of Findings

Summary of findings is provided in findings_summary.md

Key Insights for Presentation

Key insights are provided in the key_insights.md

Results

Insights are presented visually in a presentation deck file.