How to add new category? #19
Comments
From what I understand, we need a new dataset in .jsonl with text and labels.
Here you go: https://github.com/NyanNyanovich/nyan/releases/download/can_annot/cat_markup.tar.gz
@NyanNyanovich Thanks. I had already found train_clf.py and tried training it with a single category, but then the classifier failed in send.sh, probably because "not_news" was missing. I took a dataset from a Ukrainian news website that tags its articles, kept only the corruption-related ones, and got about 700 entries, which I merged with categories_train.jsonl. After training, the results became much worse: many war/politics posts now trigger corruption and end up as "unknown". So I have a few questions about how to build a dataset for a new category:
What is the procedure for adding a new category?
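For reference, the merge step described above can be sketched roughly as follows. This is only an illustration, not the project's own tooling: the `category` field name, the helper names, and the idea of checking for required labels such as "not_news" before training are all assumptions, so adjust them to whatever categories_train.jsonl actually contains.

```python
import json
from collections import Counter


def read_jsonl(path):
    """Read one JSON object per non-empty line."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def write_jsonl(records, path):
    """Write records back out, one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")


def merge_datasets(base, extra, label_key="category"):
    """Concatenate two record lists and count records per label.

    `label_key` is a hypothetical field name; use the one from the real data.
    """
    merged = base + extra
    counts = Counter(record[label_key] for record in merged)
    return merged, counts


def missing_labels(counts, required):
    """Return the required labels that have no examples at all."""
    return sorted(set(required) - set(counts))
```

Printing the counts before training shows how the roughly 700 new entries compare with the existing categories, and `missing_labels(counts, ["not_news"])` flags a label that has dropped out of the training set.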