You can create the resources for a new language with https://github.com/kermitt2/grisp
The readme describes the process. It's a Hadoop process that takes a few hours.
Loading markupFull is the time-consuming part: this is the DB that stores the text content of all the articles.
If I remember correctly, you don't need to create embeddings; it should work without them. However, they improve the disambiguation a bit. Creating them is also quite time-consuming (it should take half a day for Arabic, given the number of articles).
There are 1,080,907 articles in Arabic, which is a pretty big number, so it should be doable and provide decent results.
Hello,
Is there any document or guide on how to train on Arabic?
Is this possible? If yes, what are the requirements?
Thanks in advance,