Skip to content

GroupKFold #381

Closed Answered by thibaultcordier
lukegriffiths asked this question in Q&A
Dec 4, 2023 · 1 comments · 1 reply
Discussion options

You must be logged in to vote

Hello @lukegriffiths,

This is indeed a suggestion that was put forward in issue #202 but has not yet been addressed. I agree with you that a customised cross-validation scheme would be welcome. In practice, this requires that parameters to the split method (such as group=...) can be passed from the fit method.

When you say "I need to split up my data by group because data points within a group can be very similar", are you referring to a paper that uses this splitting technique? Because if you group similar points together, aren't you afraid of having models that overfit?

I'm curious to read your answer.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@lukegriffiths
Comment options

Answer selected by thibaultcordier
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants