Skip to content

serengil/h2o-ai-101

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 

Repository files navigation

h2o-ai-101

This repository includes h2o.ai based machine learning project implementations and documentations.

1- h2o frame Tutorial

Pandas is de facto standard for data manipulation operations among data scientist. It is fast but it runs on single cpu core. Herein, h2o frame is a powerful alternative to pandas. It supports multi-core calculations whereas it covers almost same functions with Pandas.

2- h2o GBM Code, Tutorial

GBM dominates tabular data based kaggle competitions nowadays. Herein, h2o covers both XGBoost and its own GBM. This is a gentle introduction to h2o GBM.

3- h2o AutoML Code, Tutorial, Video

The hottest topic in machine learning is AutoML. Even though model design is accepted as state-of-the-art, today AutoML can design better models than us. h2o AutoML covers linear models, tree-based models including random forest and gradient boosting (XGBoost and h2o GBM) and deep learning (regular fully connected neural networks).

4- Explaining h2o models with Lime Code, Tutorial

Interpretability and accuracy are inversely proportional concepts. You cannot deploy unexplainable models to production even if they have high accuracy. Here, lime offers to explain custom predictions of your built models.

5- Interpratable Machine Learning with H2O and SHAP Code, Tutorial

SHAP offers very deeply explanations for built models against LIME. Still, it comes with a time cost. You should use SHAP if you have enough time to analysis your model.

6- Why You Should Build XGBoost Models Within H2O Tutorial, XGBoost within h2o CPU Code, Regular XGBoost CPU Code, XGBoost within h2o GPU Code

h2o offers faster XGBoost models than regular XGBoost. We will compare these two XGBoost distribution performances.

Requirements

I have tested this repository on the following environment configurations. Confirm your environment is same as below to avoid environmental issues.

>>> !python --version
Python 3.6.3
>>> import h2o
>>> h2o.__version__
3.26.0.3

Support

There are many ways to support a project - starring the GitHub repos is one.

Licence

This repository is licensed under MIT license - see LICENSE for more details