Skip to content

Ansh3101/MachineLearning

Repository files navigation

Machine Learning

Machine Learning Modelling On Regression & Classification Problems

Linear Regression

Code - Training Notebook

Datasets - Medical Insurance Dataset

Model Weights - Linear Regression Weights For Medical Insurance Dataset


Logistic Regression

Code - Training Notebook

Datasets - Australia Weather Dataset

Model Weights - Logistic Regression Weights For Australia Weather Dataset


Decision Trees

Code - Training Notebook

Datasets - Australia Weather Dataset

Model Weights - Logistic Regression Weights For Australia Weather Dataset


Helper Functions

Functions

  • def dtype_select(dtypes) :

Returns Column Names With Specific Data Values (dtypes) From Dataset Defined When Assigning Variable To Class

  • def impute(numeric_cols, missing_values, strategy) :

Imputes missing_values Of numeric_cols Of Dataset With Specified strategy

  • def minMax(numeric_cols) :

Scales All Values In numeric_cols Of Dataset To A (0,1) Range


  • def one_hot(cat_cols) :

Encodes All Values In cat_cols Of Dataset By Each Unique Value

Code


Key Packages

scikit-learn : pip install sklearn / conda install sklearn

The Best Python Library For Statistical Anaylsis Techniques

Documentation

Pandas : pip install pandas / conda install pandas

A Go-To Tool For Tabular Data Handling & Manipulation

Documentation

NumPy : pip install numpy / conda install numpy

The Most Prominent Library For Scientific Numeric Computation

Documentation

Check Out Issue For Upcoming Implementations

R Implementations Coming Soon!