This repository has been archived by the owner on Jun 10, 2021. It is now read-only.
It's really weird.
Of the 4 files in the corpus, 1 seems to be causing the issue.
However, if I trim that file by removing some "very long words" (e.g. words > 40 characters), the file is fine.
BUT if I run learn_bpe on the 4 files together, I still get an error:
[01/03/18 11:03:04 INFO] Getting pair statistics from vocabulary
[01/03/18 11:07:08 INFO] Generating merge operations to output
PANIC: unprotected error in call to Lua API (not enough memory)
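For anyone who wants to try the same trimming, here is a rough sketch of dropping words longer than 40 characters from a file. It is only an illustration, not the script actually used: the function names `drop_long_words` and `trim_file` are made up, and it assumes a whitespace-tokenized, UTF-8 corpus with one sentence per line.

```python
def drop_long_words(line, max_len=40):
    """Return the line without whitespace-separated tokens longer than max_len."""
    return " ".join(tok for tok in line.split() if len(tok) <= max_len)


def trim_file(src_path, dst_path, max_len=40):
    """Copy src_path to dst_path, dropping over-long tokens from every line.

    Hypothetical helper: both paths and the 40-character default are
    placeholders taken from the description above.
    """
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(drop_long_words(line, max_len) + "\n")
```

Calling `trim_file` on each corpus file before running learn_bpe would apply the same filter to all 4 files rather than only the one that seemed to be the problem.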
Hi,
After commit ccd7e03, I had no issue learning a BPE model with many millions of sentences, even on LuaJIT.
On master the memory issue is back:
I am going back through the commit history to find out when it was reintroduced.