You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe alternatives you've considered, if relevant
pdpipe provides an excellent way: features will be one-hot encoded and columns will be replaced with one-hot encoded features. Great and simple labeling for understanding, integers are easier to read than floats, small and easy-to-read syntax. Hope to see this in sklearn's OneHotEncoder directly.
Additional context
Tested with sklearn's newest version 0.23.2 and older
The text was updated successfully, but these errors were encountered:
one-hot encoded features are replacing input columns in original dataframe
Unlikely to happen: transform doesn't change the data inplace. We're thinking about outputting dataframe-like objects (see other SLEPs), but that's slightly different
that values of one-hot encoded features should always be 0 and 1 (easier to read; currently, they are floats 0.0 and 1.0)
You can select a dtype with the dtype parameter. However, if you use the OHE in a ColumnTransformer and the final array has a mix of real-valued data and integer data, it will be upcasted to floats.
Note that for now, numpy array are first class citizens of scikit-learn, not dataframe.
Hi,
I would like to ask if it would be possible to add a feature to sklearn's OneHotEncoder:
Describe the workflow you want to enable
I would like to have an more comfortable way to applying one-hot enconding as implemented in pdpipe (using sklearn OHE):
Describe your proposed solution
Output should look like this:
Describe alternatives you've considered, if relevant
pdpipe provides an excellent way: features will be one-hot encoded and columns will be replaced with one-hot encoded features. Great and simple labeling for understanding, integers are easier to read than floats, small and easy-to-read syntax. Hope to see this in sklearn's OneHotEncoder directly.
Additional context
Tested with sklearn's newest version 0.23.2 and older
The text was updated successfully, but these errors were encountered: