Build software better, together

RamnathKumar181 / Lending-Club-Analysis

We create a model using the gradient boosting algorithm to cut down on the noise and improve performance. This work was done during an informal project under Prof. Yaganti while studying at BITS.

gradient-boosting feature-importance credit-risk-assessment

Updated Aug 11, 2021
Python

marfappv / ML-dissertation

Star

This repository is a partial fulfilment of the requirements for the module of MSIN0114: Business Analytics Consulting Project/Dissertation for UCL School of Management.

engineering feature-importance classification-model recoverability aec profitability aec-industry aeci

Updated Mar 17, 2023
Jupyter Notebook

rrambhia22 / Crimes_Incarceration_Analysis

Star

Crime and Incarceration in the United States contain data on crimes that are committed, and the prisoner counts in every 50 states, for which the data is analyzed using various analytical methods.

python linear-regression exploratory-data-analysis jupyter-notebook statistical-analysis tableau model-building datavisualization dataanalysis correlation-matrix decision-tree-regression feature-importance datacleaning random-forest-regression datacollection machinelearningalgorithms datapreparation labelencoding pre-modeling-steps

Updated Jul 5, 2022
Jupyter Notebook

acurioussid / Kidney-Disease-Classification

Star

Develop a classification model that can accurately diagnose the presence of kidney disease in a person based on their medical test results. The model will then identify which factors are the most influential in determining a person's chances of developing kidney disease.

machine-learning healthcare-application classification-algorithm feature-importance

Updated May 6, 2023
Jupyter Notebook

LadaRudnitckaia / telemarketing-optimization

Star

Develoment of a machine learning model optimizing telemarketing through prediction of marketing calls that don't lead to customer conversion

random-forest feature-importance post-request ml-pipeline rest-service

Updated Oct 12, 2023
Jupyter Notebook

parantapa / integrated-directional-gradients

Star

Implementation of the Integrated Directional Gradients method for Deep Neural Network model explanations.

interpretability feature-importance interpretable-ai interpretable-ml feature-attribution

Updated Aug 25, 2021
Python

MichaelAlexanderBryant / vehicle-price-prediction

Star

An end-to-end project to analyze and model vehicle sale price data then productionize the best model to help people select a price to sell their vehicle.

python sales data-science machine-learning backend regression feature-engineering vehicles data-cleaning support-vector-regression feature-importance

Updated Dec 4, 2022
Python

abhmalik / categorical-feature-importances-without-one-hot-encoding-dummies

Star

Feature Importance of categorical variables by converting them into dummy variables (One-hot-encoding) can skewed or hard to interpret results. Here I present a method to get around this problem using H2O.

h2oai categorical-variables feature-importance one-hot-encode categorical-features

Updated Jun 10, 2019
Jupyter Notebook

lennartwallentin / feature_selection_functions

Star

Feature selection is widely used in nearly all data science pipelines. Hence I have created functions that do a form of backward stepwise selection based on the XGBoost classifier feature importance and a set of other input values with the goal to return the number of features to keep in regard to a prefered AUC-score.

python machine-learning automation pipeline feature-selection xgboost feature-engineering machine-learning-pipelines interpretability feature-importance functions-python pipelines-supervised-learning

Updated Oct 5, 2021
Jupyter Notebook

toshitorihara / group-project-4

Star

Breast Cancer Identifier

python sql neural-network random-forest scikit-learn pandas seaborn pca feature-importance

Updated Nov 28, 2021
Jupyter Notebook

smulage / Isolation_Forests

Star

Generating feature importances for outliers identified through Isolation Forests

anomaly-detection isolation-forest feature-importance sklearn-tree-export-text

Updated May 7, 2022
Python

fredyyyya / AB-Testing-and-Experiment-Design-Project

Star

MSBA Big Data course project

python bigdata pyspark ab-testing tableau feature-importance

Updated Jul 12, 2023
Jupyter Notebook

NishadKhudabux / Data-Science-in-Golf-Strokes-Gained-vs-Traditional-Metrics

Star

Unleashed the power of data science to analyze the performance of golfers from the PGA tour. Built ML models and compared Strokes Gained to traditional metrics, resulting in insightful findings and actionable recommendations for golfers at all levels. Showcased advanced data analysis, decision trees, and visualizations in this comprehensive project

python data-science machine-learning random-forest eda classification decision-trees data-cleaning feature-importance data-driven-decisions

Updated Feb 9, 2023
Jupyter Notebook

oakhamis / Bank_Data_Mining

Star

Predicting bank term deposits using classification ML algorithms.

machine-learning r exploratory-data-analysis banking datamining feature-importance modelcomparision

Updated Oct 20, 2023
R

janasatvika / Optimizing-Classification-Models-using-Permutation-Feature-Importance-Method

Star

High data dimensionality and irrelevant features can negatively impact the performance of machine learning algorithms. This repository implements the Permutation feature importance method to enhance the performance of some machine learning models by identifying the contribution of each feature used.