Skip to content

Releases: DistrictDataLabs/yellowbrick

Hotfix 0.3.1

13 Oct 13:02
Compare
Choose a tag to compare
Hotfix 0.3.1 Pre-release
Pre-release

Hotfix to solve pip install issues with Yellowbrick.

Deployed: Monday, October 10, 2016
Contributors: Benjamin Bengfort

Changes

  • Modified packaging and wheel for Python 2.7 and 3.5 compatibility
  • Modified deployment to PyPI and pip install ability
  • Fixed Travis-CI tests with the backend failures.

Version 0.3

13 Oct 12:59
Compare
Choose a tag to compare
Version 0.3 Pre-release
Pre-release

This release marks a major change from the previous MVP releases as Yellowbrick moves towards direct integration with Scikit-Learn for visual diagnostics and steering of machine learning and could therefore be considered the first alpha release of the library. To that end we have created a Visualizer model which extends sklearn.base.BaseEstimator and can be used directly in the ML Pipeline. There are a number of visualizers that can be used throughout the model selection process:

  • Feature Analysis
  • Model Selection
  • Hyperparameter Tuning

In this release specifically we focused on visualizers in the data space for feature analysis and visualizers in the model space for scoring and evaluating models. Future releases will extend these base classes and add more functionality.

Deployed: Sunday, October 9, 2016
Contributors: Benjamin Bengfort, Rebecca Bilbro, Marius van Niekerk

Enhancements

  • Created an API for visualization with machine learning: Visualizers that are BaseEstimators.
  • Created a class hierarchy for Visualizers throughout the ML process particularly feature analysis and model evaluation
  • Visualizer interface is draw method which can be called multiple times on data or model spaces and a poof method to finalize the figure and display or save to disk.
  • ScoreVisualizers wrap Scikit-Learn estimators and implement fit and predict (pass-throughs to the estimator) and also score which calls draw in order to visually score the estimator. If the estimator isn't appropriate for the scoring method an exception is raised.
  • ROCAUC is a ScoreVisualizer that plots the receiver operating characteristic curve and displays the area under the curve score.
  • ClassificationReport is a ScoreVisualizer that renders the confusion matrix of a classifier as a heatmap.
  • PredictionError is a ScoreVisualizer that plots the actual vs. predicted values and the 45 degree accuracy line for regressors.
  • ResidualPlot is a ScoreVisualizer that plots the residuals (y - yhat) across the actual values (y) with the zero accuracy line for both train and test sets.
  • ClassBalance is a ScoreVisualizer that displays the support for each class as a bar plot.
  • FeatureVisualizers are Scikit-Learn Transformers that implement fit and transform and operate on the data space, calling draw to display instances.
  • ParallelCoordinates plots instances with class across each feature dimension as line segments across a horizontal space.
  • RadViz plots instances with class in a circular space where each feature dimension is an arc around the circumference and points are plotted relative to the weight of the feature.
  • Rank2D plots pairwise scores of features as a heatmap in the space [-1, 1] to show relative importance of features. Currently implemented ranking functions are Pearson correlation and covariance.
  • Coordinated and added palettes in the bgrmyck space and implemented a version of the Seaborn set_palette and set_color_codes functions as well as the ColorPalette object and other matplotlib.rc modifications.
  • Inherited Seaborn's notebook context and whitegrid axes style but make them the default, don't allow user to modify (if they'd like to, they'll have to import Seaborn). This gives Yellowbrick a consistent look and feel without giving too much work to the user and prepares us for Matplotlib 2.0.
  • Jupyter Notebook with Examples of all Visualizers and usage.

Bug Fixes

  • Fixed Travis-CI test failures with matplotlib.use('Agg').
  • Fixed broken link to Quickstart on README
  • Refactor of the original API to the Scikit-Learn Visualizer API

Version 0.2

04 Sep 17:17
Compare
Choose a tag to compare
Version 0.2 Pre-release
Pre-release

Intermediate steps towards a complete API for visualization. Preparatory stages for Scikit-Learn visual pipelines.

Deployed: Sunday, September 4, 2016
Contributors: Benjamin Bengfort, Rebecca Bilbro, Patrick O'Melveny, Ellen Lowy, Laura Lorenz

Changes

  • Continued attempts to fix the Travis-CI Scipy install failure (broken tests)
  • Utility function: get the name of the model
  • Specified a class based API and the basic interface (render, draw, fit, predict, score)
  • Added more documentation, converted to Sphinx, autodoc, docstrings for viz methods, and a quickstart
  • How to contribute documentation, repo images etc.
  • Prediction error plot for regressors (mvp)
  • Residuals plot for regressors (mvp)
  • Basic style settings a la seaborn
  • ROC/AUC plot for classifiers (mvp)
  • Best fit functions for "select best", linear, quadratic
  • Several Jupyter notebooks for examples and demonstrations

Version 0.1

18 May 19:06
Compare
Choose a tag to compare
Version 0.1 Pre-release
Pre-release

Created the yellowbrick library MVP with two primary operations: a classification report heat map and a ROC/AUC curve model analysis for classifiers. This is the base package deployment for continuing yellowbrick development.

Deployed: Wednesday, May 18, 2016
Contributors: Benjamin Bengfort, Rebecca Bilbro

Changes

  • Created the anscombe quartet visualization example
  • Added DDL specific color maps and a stub for more style handling
  • Created crplot which visualizes the confusion matrix of a classifier
  • Created rocplot_compare which compares two classifiers using ROC/AUC metrics
  • Stub tests/stub documentation