Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation of listing data.frames #304

Open
HeidiSeibold opened this issue Jan 27, 2017 · 6 comments
Open

Documentation of listing data.frames #304

HeidiSeibold opened this issue Jan 27, 2017 · 6 comments

Comments

@HeidiSeibold
Copy link
Member

E.g. for listOMLTasks the Value documentation is quite scarce

Value
[data.frame].

It would be nice to know what the collumns of the data.frame actually are or at least a link to where I can find this.

Since this is an issue that I just ran into myself, can anyone tell me what max.nominal.att.distinct.values means:question:

@berndbischl
Copy link
Contributor

@joaquinvanschoren
it seems a bit stupid to doc this in R, redundantly.

are the docs for the meta features online so we can link to them in the R docs?

@berndbischl
Copy link
Contributor

it is apparently under "measures" (what i dislike)

https://www.openml.org/search?type=measure

also for heidi's example it simply says:

MaxNominalAttDistinctValues
DataQuality extracted from Fantail Library

:(

@HeidiSeibold
It will be the max number of levels for a categorical input feature.

but in general it does not help us if i have to answer that manually. can we at least link to the fantail docs then?

@HeidiSeibold
Copy link
Member Author

Thanks @berndbischl

I agree, a link should be good, but then the documentation on the website needs to be informative.

@joaquinvanschoren
Copy link
Sponsor Contributor

joaquinvanschoren commented Jan 27, 2017 via email

@berndbischl
Copy link
Contributor

The data quality list is indeed under measures. I can add a shortcut link
if you want. Where do you want it?

i would like to have the performance metrics and the data qualities simply in 2 different sections, also in the navigation menu.

I could also split up the measures index into data qualities, evaluation
measures, and estimation procedures, but that would be at least a day of
work. If you think it really helps I can try to make time.

i do think that the current state is a bit confusing and such a split-up would help

There is no documentation on the Fantail meta-features, and thus nothing to
link to. Even Quan Sun's thesis only has a list of them without
description. They are quite straightforward, but someone has to go over
them and add a good description. Shall we open up a Google Doc for that?

without a description they borderline useless IMHO ....
shall someone work on this during the workshop?
and how do you do this without fantail docs....?
i mean, are you sure that you know all of the definitions PRECISELY?

@joaquinvanschoren
Copy link
Sponsor Contributor

joaquinvanschoren commented Jan 28, 2017 via email

@giuseppec giuseppec added this to the 1.4 milestone Apr 4, 2017
@giuseppec giuseppec removed this from the 1.4 milestone Feb 25, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants