Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fetch metafeatures option in fetch_data #12

Open
lacava opened this issue Oct 12, 2017 · 5 comments
Open

fetch metafeatures option in fetch_data #12

lacava opened this issue Oct 12, 2017 · 5 comments

Comments

@lacava
Copy link
Collaborator

lacava commented Oct 12, 2017

It would be nice to add an option to fetch feature types from fetch_data.

@rhiever
Copy link
Contributor

rhiever commented Oct 12, 2017

I believe that's captured in #3.

@lacava
Copy link
Collaborator Author

lacava commented Oct 12, 2017

not as a criteria though... an actual list of the types of each feature.

@rhiever
Copy link
Contributor

rhiever commented Oct 12, 2017

Oh, I see. How would we accomplish that? Scrape from the README?

@rrunix
Copy link

rrunix commented Oct 25, 2021

Any progress on this? It shouldn't be much work using the metadata file of each dataset. I can create a draft pull request, something like:

dataset, metadata = fetch_data('adult', return_medadata=True)

However, I'm not sure what information should be included in the metadata... I can think of three possible options:

  • the whole metadata.yaml parsed into a dictionary
  • a dictionary feature -> feature_type (e.g., {"age": "continuous", "education_type": "categorical", ....})
  • a list of the feature types (e.g., ["continuous", "categorical", ....])

@trangdata
Copy link
Collaborator

Thanks for this note @rrunix. 🙏🏽 🙌🏽 @JDRomano2 would be the contact at this point, but if I may chime in: yes, a PR would be most welcome. My suggestion would be that the argument return_medadata could take 'all' (metadata.yaml parsed into a dictionary), 'features' (dictionary of features), or NA (no metadata).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants