Skip to content

Difficulty estimator with categorical dtype support #27

Discussion options

You must be logged in to vote

Hi,

This is a good question, as the construction of good difficulty estimators is a central challenge. In addition to what you propose, one possible (but not necessarily good) option is to employ one-hot encoding, which would allow you to use the k-NN approach for the difficulty estimator, which relies on the Euclidean distance. This may however not scale very well with the number of unique categorical values. Another option would be to compute the difficulty outside the DifficultyEstimator and implement a tailored distance function, e.g., which allows for combining categorical and numerical features, such as Gower's distance.

Best regards,
Henrik

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@FlyingCurryMonster
Comment options

Answer selected by henrikbostrom
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants