Allow GP data to be held in a Pandas Data Frame #185

Open
edaub opened this issue Jul 23, 2021 · 0 comments


edaub commented Jul 23, 2021

R users typically provide their data to a regression model in a data frame. Similarly, most statistical Python packages use data frames when building regression models. This has the advantage of removing the need for a separate input dictionary when parsing a formula, since the column names can identify the variables directly.

This should probably be the preferred mechanism for feeding data into the GP class, though we may still want to support inputs/targets as before.
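As a rough sketch of how a data frame could feed the existing inputs/targets mechanism, something like the following might work. The helper `frame_to_arrays` and the column names are hypothetical and only for illustration; nothing here is part of the current GP class.

```python
import pandas as pd


def frame_to_arrays(data, target):
    """Split a DataFrame into an input array and a target array.

    Hypothetical helper: every column other than `target` is treated as an input.
    """
    if target not in data.columns:
        raise KeyError(f"target column {target!r} not found")
    # Drop the target so it cannot leak into the inputs
    inputs = data.drop(columns=[target]).to_numpy(dtype=float)
    targets = data[target].to_numpy(dtype=float)
    return inputs, targets


# Made-up example data: two inputs and one target
df = pd.DataFrame({"x1": [0.0, 1.0, 2.0], "x2": [1.0, 0.5, 0.0], "y": [0.1, 0.9, 2.1]})
inputs, targets = frame_to_arrays(df, "y")
print(inputs.shape, targets.shape)
```

Keeping the conversion in one place like this would let the array-based interface continue to work unchanged underneath.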

One challenge: supporting multiple outputs from a single data frame is probably more difficult. We would certainly need a way of pulling out all of the target columns so that none of them are used as inputs. Mean functions would need to be given as a list of string formulae, each identifying which target is to be fit. There are probably a few other things to worry about on this front.
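To make the multiple-output case concrete, here is a minimal sketch of pulling the targets out based on a list of formulae. It assumes each formula has the form `target ~ terms` and only reads the left-hand side; the function names and columns are hypothetical, and real formula parsing would need more care.

```python
import pandas as pd


def formula_target(formula):
    """Return the left-hand side name of a formula such as 'y1 ~ x1 + x2'."""
    lhs, sep, _ = formula.partition("~")
    if not sep or not lhs.strip():
        raise ValueError(f"formula {formula!r} has no target on the left-hand side")
    return lhs.strip()


def split_multi_output(data, formulae):
    """Separate target columns (named by the formulae) from input columns."""
    targets = [formula_target(f) for f in formulae]
    missing = [t for t in targets if t not in data.columns]
    if missing:
        raise KeyError(f"targets not found in data frame: {missing}")
    # Any column that is a target for some formula is excluded from the inputs
    input_cols = [c for c in data.columns if c not in targets]
    return data[input_cols], data[targets]


# Made-up example data: two inputs and two targets
df = pd.DataFrame({
    "x1": [0.0, 1.0, 2.0],
    "x2": [1.0, 0.5, 0.0],
    "y1": [0.1, 0.9, 2.1],
    "y2": [1.0, 0.4, 0.2],
})
formulae = ["y1 ~ x1 + x2", "y2 ~ x1"]
input_frame, target_frame = split_multi_output(df, formulae)
print(list(input_frame.columns))
print(list(target_frame.columns))
```

Removing every target from the inputs up front, rather than per formula, avoids one output accidentally being used as a predictor for another.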

I'm currently building a prototype to test several specific changes at once to be sure they work nicely in concert. See the associated project board, which collates these issues together.

edaub added this to Features for Testing in New GP Prototype on Jul 23, 2021