node_stats for nodes with features including string values. #276

parrt · 2023-03-02T17:16:00Z

When a pandas DataFrame is created from a NumPy array that mixes dtypes (float and str in our case), the array has dtype object, so by default every column of the DataFrame comes out as object too and the numeric features lose their numeric dtype.
https://stackoverflow.com/questions/61346021/create-a-mixed-type-pandas-dataframe-using-an-numpy-array-of-type-object

Solution:

df = pd.DataFrame(self.shadow_tree.X_train, columns=self.shadow_tree.feature_names).convert_dtypes()
return df.iloc[node_samples[node_id]].describe(include='all')
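A minimal sketch of why `convert_dtypes()` helps, assuming a small made-up array in place of `self.shadow_tree.X_train`; the names `X`, `feature_names`, and `sample_idx` are hypothetical stand-ins and not taken from the dtreeviz code:

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for shadow_tree.X_train: one float feature, one string feature.
X = np.array([[1.5, "a"], [2.5, "b"], [3.5, "a"]], dtype=object)
feature_names = ["x", "kind"]  # hypothetical feature names

# Built straight from the mixed array, the columns are not given numeric dtypes.
raw = pd.DataFrame(X, columns=feature_names)

# convert_dtypes() infers a better dtype per column (floating for x, string for kind).
converted = raw.convert_dtypes()

# Hypothetical stand-in for node_samples[node_id]: row positions that reach one node.
sample_idx = [0, 2]

# include='all' reports numeric stats (mean, std, ...) and
# categorical stats (unique, top, freq) in the same table.
stats = converted.iloc[sample_idx].describe(include="all")
print(raw.dtypes)
print(converted.dtypes)
print(stats)
```

Comparing the two `dtypes` printouts should show the effect of the conversion; the `describe` table then has numeric rows filled for `x` and categorical rows filled for `kind`.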


tlapusan · 2023-03-03T06:59:10Z

I thought that you already made the fix :D

parrt · 2023-03-03T17:12:51Z

hahah. nope was working on the tutorial and now they want a blog post haha

parrt added the bug label Mar 2, 2023

parrt assigned tlapusan Mar 2, 2023

tlapusan mentioned this issue Mar 6, 2023

Include string columns in node stats #277

Merged

parrt added this to the 2.2.1 milestone Mar 18, 2023
