
Nested Cross-Validation for Bayesian Optimized Linear Regularization

Description

A Python implementation that unifies Nested K-Fold Cross-Validation, Bayesian Hyperparameter Optimization, and Linear Regularization. Designed for rapid prototyping on small to mid-sized data sets that fit in memory. It quickly obtains high quality prediction results by abstracting away tedious hyperparameter tuning and implementation details in favor of usability and implementation speed. Bayesian Hyperparameter Optimization utilizes Tree-structured Parzen Estimators (TPE) from the Hyperopt package. Linear Regularization can be conducted in one of three ways: select between Ridge, Lasso, and Elastic-Net. Useful wherever linear regression is applicable.
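For intuition, the sketch below shows the general structure that the package automates: an outer cross-validation loop that estimates generalization error, and an inner loop in which Hyperopt's TPE tunes the regularization strength. This is a minimal illustration using scikit-learn's Lasso directly; the objective function and search bounds are illustrative and are not the package's internals.

## nested cross-validation with TPE, illustrative sketch only
import numpy as np
from hyperopt import fmin, tpe, hp
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso
from sklearn.model_selection import KFold, cross_val_score

X, y = make_regression(n_samples = 200, n_features = 10, noise = 1.0, random_state = 0)

outer_scores = []
for train_idx, test_idx in KFold(n_splits = 3, shuffle = True, random_state = 0).split(X):
    X_train, y_train = X[train_idx], y[train_idx]

    ## inner loop: TPE searches the regularization strength on the training folds
    def objective(params):
        model = Lasso(alpha = params['alpha'])
        return -cross_val_score(
            model, X_train, y_train, cv = 3,
            scoring = 'neg_mean_squared_error'
        ).mean()

    best = fmin(
        fn = objective,
        space = {'alpha': hp.loguniform('alpha', -5, 2)},
        algo = tpe.suggest,
        max_evals = 25
    )

    ## outer loop: score the tuned model on the held-out fold
    model = Lasso(alpha = best['alpha']).fit(X_train, y_train)
    preds = model.predict(X[test_idx])
    outer_scores.append(np.mean((preds - y[test_idx]) ** 2))

print(np.mean(outer_scores))  ## estimate of out-of-sample mean squared error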

Features

  1. Consistent syntax across all Linear Regularization methods.
  2. Supported Linear Regularization methods: Ridge, Lasso, Elastic-Net.
  3. Returns custom object that includes performance metrics and plots.
  4. Developed for readability, maintainability, and future improvement.

Requirements

  1. Python 3
  2. NumPy
  3. Pandas
  4. Matplotlib
  5. Seaborn
  6. Scikit-Learn
  7. Hyperopt

Installation

## install pypi release
pip install nestedhyperline

## install developer version
pip install git+https://github.com/nickkunz/nestedhyperline.git

Usage

## load libraries
from nestedhyperline import regressors
from sklearn import datasets
import pandas

## load data
## note: load_boston was deprecated in scikit-learn 1.0 and removed in 1.2,
## so this example requires scikit-learn < 1.2 (see the alternative below)
housing_sklearn = datasets.load_boston()
housing = pandas.DataFrame(housing_sklearn.data, columns = housing_sklearn.feature_names)
housing['target'] = pandas.Series(housing_sklearn.target)

## conduct lasso regression
results = regressors.lasso_ncv_regressor(
    data = housing,   ## pandas dataframe of features and response
    y = 'target',     ## name of the response column
    loss = 'mse',     ## loss function minimized during tuning
    k_inner = 3,      ## number of inner folds (hyperparameter tuning)
    k_outer = 3,      ## number of outer folds (error estimation)
    n_evals = 300     ## number of bayesian optimization evaluations
)

## preview performance
results.error_mean()

## preview plots
results.plot_error_mean()
results.plot_lambda()
results.plot_regular()
results.plot_coef()
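With newer scikit-learn releases (1.2 and later), where load_boston is no longer available, an equivalent setup can be built from the California housing data instead. This assumes only that the regressor accepts any numeric pandas DataFrame with a named response column.

## load alternative data (scikit-learn >= 1.2)
housing_sklearn = datasets.fetch_california_housing()
housing = pandas.DataFrame(housing_sklearn.data, columns = housing_sklearn.feature_names)
housing['target'] = pandas.Series(housing_sklearn.target)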

Examples

https://github.com/nickkunz/nestedhyperline/blob/master/examples/nestedhyperline_example_ridge.ipynb

License

© Nick Kunz, 2019. Licensed under the GNU General Public License v3.0 (GPLv3).

Contributions

NestedHyperLine is open to improvements and maintenance. Contributions that make the package better for everyone are welcome.

References

Bergstra, J., Bardenet, R., Bengio, Y., Kégl, B. (2011). Algorithms for Hyper-Parameter Optimization. Advances in Neural Information Processing Systems 24. https://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization.pdf.

Bergstra, J., Yamins, D., Cox, D. D. (2013). Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. Proceedings of the 30th International Conference on International Conference on Machine Learning. 28:I115–I123. http://proceedings.mlr.press/v28/bergstra13.pdf.

Hoerl, A. E., Kennard, R. W. (1970). Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics. 12(1):55-67. https://doi.org/10.1080/00401706.1970.10488634.

Tibshirani, R. (1996). Regression Shrinkage and Selection Via the Lasso. Journal of the Royal Statistical Society: Series B (Methodological). 58(1):267-288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x.

Zou, H., Hastie, T. (2005). Regularization and Variable Selection via the Elastic Net. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 67(2):301-320. https://doi.org/10.1111/j.1467-9868.2005.00503.x.