Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need data-type for each column in create_report function. #953

Open
anthng opened this issue Dec 23, 2022 · 2 comments
Open

Need data-type for each column in create_report function. #953

anthng opened this issue Dec 23, 2022 · 2 comments
Assignees
Labels
type: enhancement New feature or request

Comments

@anthng
Copy link

anthng commented Dec 23, 2022

Hi all,
Currently, I need to add a data-type (type) param in creat_report() like as plot() function. This data type can help me generate report with numerical/categorical features without affecting "Distinct Count".

This image below was automatically generated by creat_report. However, my expected output is numerical stats and visualization.
image

My expected feature:

dttype = {c: "Continuous" for c in dataframe.columns}
creat_report(dataframe, dtype=dttype)

Any solution to my problem, please support me. Thanks

@anthng anthng added the type: enhancement New feature or request label Dec 23, 2022
@dovahcrow
Copy link
Member

I see. So it seems DataPrep automatically identified your columns as categorical. May I ask what is the output of dataframe.dtypes?

@anthng
Copy link
Author

anthng commented Jan 5, 2023

I see. So it seems DataPrep automatically identified your columns as categorical. May I ask what is the output of dataframe.dtypes?

I cast all dtype of dataframe.dtypes to float before creating report. In e.g above, I attempted to cast "Continous", but it does not work
I guess that DataPrep automatically identifies a feature is numerical or categorical based on "distinct count" and "data type". I am not sure about this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants