How can I set class weights in a multiclass classification with an imbalanced dataset? #183

Open · alegarbed opened this issue Aug 21, 2019 · 1 comment

@alegarbed

I had difficulty implementing different class weights in a multiclass classification. The proper way to set class weights is with a dictionary, but I can only use the Real, Integer, and Categorical parameters. Is there any solution? Can you provide a simple example?
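
For reference, this is the kind of dictionary I mean (a minimal plain-SKLearn sketch, mapping class labels to weights):

from sklearn.ensemble import RandomForestClassifier

# Plain SKLearn usage: class 2 is weighted five times as heavily as class 0
clf = RandomForestClassifier(n_estimators=10, class_weight={0: 1, 1: 2, 2: 5})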
Thank you in advance.

@HunterMcGushion (Owner)

Thanks for opening this, @alegarbed! Yes, you can optimize class_weight values! Here's a basic example with SKLearn's RandomForestClassifier and the Iris dataset.

from hyperparameter_hunter import Environment, CVExperiment
from hyperparameter_hunter import BayesianOptPro, Integer, Categorical
from hyperparameter_hunter.utils.learning_utils import get_iris_data
from sklearn.ensemble import RandomForestClassifier

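# Creating the Environment registers it as active; the Experiment and
# OptPro below find and use it automatically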
env = Environment(
    train_dataset=get_iris_data(),
    results_path="HyperparameterHunterAssets",
    target_column="species",
    metrics=["hamming_loss"],
    cv_params=dict(n_splits=5, random_state=32),
)

# Just a reference for normal `class_weight` usage outside of optimization
exp = CVExperiment(
    RandomForestClassifier, {"n_estimators": 10, "class_weight": {0: 1, 1: 1, 2: 1}}
)

opt = BayesianOptPro(iterations=10, random_state=32)
opt.forge_experiment(
    model_initializer=RandomForestClassifier,
    model_init_params=dict(
        #################### LOOK DOWN ####################
        class_weight={
            0: Categorical([1, 3]),
            1: Categorical([1, 4]),
            2: Integer(1, 9),  # You can also use `Integer` for low/high ranges
        },
        #################### LOOK UP ####################
        criterion=Categorical(["gini", "entropy"]),
        n_estimators=Integer(5, 100),
    ),
)
opt.go()
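
Each value in the class_weight dict above gets its own search dimension, so the optimizer tunes the three weights independently. Untested side thought: if you also want to try SKLearn's built-in "balanced" heuristic, treating the whole argument as a single Categorical might work as well:

# Untested sketch: choose between the "balanced" heuristic and the default (None)
opt_2 = BayesianOptPro(iterations=10, random_state=32)
opt_2.forge_experiment(
    model_initializer=RandomForestClassifier,
    model_init_params=dict(
        class_weight=Categorical(["balanced", None]),
        n_estimators=Integer(5, 100),
    ),
)
opt_2.go()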

This should definitely be included in one of our examples, or at least documented, so thanks again for asking!

Side note: I just noticed that the automatic Experiment matching during optimization isn't working for this, which is a bug, so I'll look into that and update you.
