pipeline.get_feature_names() #16807

jackwellsxyz · 2020-03-30T16:36:05Z

Describe the workflow you want to enable

This is possibly a duplicate of #6424. I'd love to call pipeline.fit(X) on a pandas DataFrame with named columns, then pass the result of pipeline.get_feature_names() to an eli5 explainer, with scikit-learn being smart enough to call get_feature_names() on those transformers where it makes sense (OneHotEncoder, SelectFromModel, etc.).

Describe your proposed solution

I'm not sure what a good solution might be. One start might be to implement get_feature_names() for all transformers, returning just the input column names when a transformer doesn't change them, as would be the case for a Binarizer, for example.
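To make the idea concrete, here is a rough sketch of how names could be threaded through the steps. Everything in it is hypothetical: the toy classes, the `pipeline_feature_names` helper, and the assumption that `get_feature_names()` accepts the incoming names are stand-ins for illustration, not scikit-learn's actual API.

```python
# Hypothetical sketch only: these toy classes are NOT scikit-learn's.
# Assumption: each step exposes get_feature_names(input_features).


class PassThroughStep:
    """Stands in for a transformer (e.g. a Binarizer) that keeps its columns."""

    def get_feature_names(self, input_features):
        return list(input_features)


class ToyOneHot:
    """Stands in for an encoder that expands a column into one per category."""

    def __init__(self, categories):
        # Maps a column name to the list of categories seen for it.
        self.categories = categories

    def get_feature_names(self, input_features):
        names = []
        for col in input_features:
            cats = self.categories.get(col)
            if cats is None:
                names.append(col)  # column is not encoded, keep its name
            else:
                names.extend(f"{col}_{cat}" for cat in cats)
        return names


class ToySelector:
    """Stands in for a selector that drops some columns."""

    def __init__(self, keep):
        self.keep = set(keep)

    def get_feature_names(self, input_features):
        return [name for name in input_features if name in self.keep]


def pipeline_feature_names(steps, input_features):
    """Feed the names through each step in order (hypothetical helper)."""
    names = list(input_features)
    for step in steps:
        getter = getattr(step, "get_feature_names", None)
        if getter is not None:
            names = getter(names)
        # Steps without the method are assumed to leave names unchanged.
    return names


steps = [
    ToyOneHot({"color": ["red", "blue"]}),
    PassThroughStep(),
    ToySelector(["age", "color_red"]),
]
result = pipeline_feature_names(steps, ["age", "color"])
print(result)
```

With something like this, the names handed to an explainer would reflect what each step did to the original DataFrame columns.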

Describe alternatives you've considered, if relevant

Additional context

rth · 2020-03-30T17:31:38Z

Thanks! The end goal of SLEP14 and the proposed implementation in #16772 is to allow this.

Closing this issue to avoid duplicates. Please comment in one of those existing ones.

jnothman · 2020-03-31T20:03:27Z

eli5 already handles this case. But you shouldn't be passing the output of pipeline.get_feature_names(), as these are the names of the features output by the Pipeline's transform method.

jackwellsxyz added the New Feature label Mar 30, 2020

rth closed this as completed Mar 30, 2020
