Using different feature set for each model #7

RaduStoicescu · 2017-06-10T10:31:48Z

It is often advised to use different feature subsets across models to increase diversity.

Is this possible with heamy?

rushter · 2017-06-10T10:39:41Z

Yes, it's possible.
You can either select the columns inside your custom model function, as in the example below, or create a separate dataset for each feature subset.

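import numpy as np
import xgboost as xgb

def xgboost_model(X_train, y_train, X_test, y_test=None, random_state=9999):
    params = {
        'objective': 'reg:linear',  # renamed 'reg:squarederror' in newer XGBoost releases
        'learning_rate': 0.02,
        'max_depth': 20,
        'subsample': 0.8,
        'colsample_bytree': 0.8,
        'seed': random_state,
        'silent': 1,  # replaced by 'verbosity' in newer XGBoost releases
        'tree_method': 'exact',
    }
    # The number of boosting rounds is an argument of xgb.train, not a booster parameter
    num_boost_round = 100
    na_value = np.nan

    # Keep only the columns this model should see (X_train and X_test are pandas DataFrames)
    subset_of_columns = ['a', 'b', 'c']
    X_train = X_train[subset_of_columns]
    X_test = X_test[subset_of_columns]

    dtrain = xgb.DMatrix(X_train, label=y_train, missing=na_value)
    # maximize=True is dropped: it only matters with early stopping, which is not used here
    model = xgb.train(params, dtrain, num_boost_round)
    return model.predict(xgb.DMatrix(X_test, missing=na_value))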

RaduStoicescu · 2017-06-10T17:28:36Z

Thanks!

Adding new datasets is a painfully obvious solution.
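
For anyone landing here later, the "add new datasets" route could look roughly like the sketch below. It only shows the slicing step with pandas; the helper name `make_feature_views`, the toy column names, and the model labels are made up for illustration and are not part of heamy.

```python
import numpy as np
import pandas as pd


def make_feature_views(X_train, X_test, feature_sets):
    """Return {name: (train_view, test_view)}, one column subset per model.

    Hypothetical helper for illustration; not part of heamy.
    """
    views = {}
    for name, columns in feature_sets.items():
        # Fail early if a subset names a column that does not exist
        missing = [c for c in columns if c not in X_train.columns]
        if missing:
            raise KeyError(f"{name}: columns not in X_train: {missing}")
        views[name] = (X_train[columns], X_test[columns])
    return views


# Toy data; column names and subsets are assumptions for the example
rng = np.random.default_rng(0)
X_train = pd.DataFrame(rng.normal(size=(6, 4)), columns=["a", "b", "c", "d"])
X_test = pd.DataFrame(rng.normal(size=(3, 4)), columns=["a", "b", "c", "d"])

feature_sets = {"xgb": ["a", "b", "c"], "linear": ["b", "d"]}
views = make_feature_views(X_train, X_test, feature_sets)

for name, (train_view, test_view) in views.items():
    print(name, list(train_view.columns), train_view.shape, test_view.shape)
```

Each (train, test) pair would then presumably be wrapped in its own heamy dataset together with the shared `y_train` and handed to the matching model, so every model in the pipeline sees a different feature subset while the targets stay the same.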
