Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Models converging after few iterations #279

Open
juhopaak opened this issue Feb 17, 2023 · 0 comments
Open

Models converging after few iterations #279

juhopaak opened this issue Feb 17, 2023 · 0 comments

Comments

@juhopaak
Copy link

Hi, and thanks for an excellent package!

I'm trying to run the searchK function on a dataset of around 400k social media messages from various platforms (including short Twitter tweets but also longer discussion forum posts). I've tried to find the optimal model between the range k=10-300. However when k is close to or over 200, models begin converging after just a couple of iterations, which produces results that are suboptimal in comparison to models that run longer. I've tried using different random seeds for generating the heldout set, and this seems to influence the issue, i.e. under some random splits e.g. the k=200 model would converge in 3 iterations, whereas under others it would take over 200 iterations.

Would you have any idea what might be causing this issue, and whether this is appropriate model behavior? I'm trying to figure out how to assess the reliability of such results, possibly through e.g. doing a 10-fold validation with different random seeds.

Many thanks for help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant