Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to add new tags? #1087

Open
dlculver opened this issue Feb 26, 2024 · 1 comment
Open

How to add new tags? #1087

dlculver opened this issue Feb 26, 2024 · 1 comment

Comments

@dlculver
Copy link

Hello,

I am interested in training my own Grobid to work on documents in a different domain from scientific papers. At the moment, I want to train a header model to identify particular parties in my documents. I am a bit confused as to what this process is. As I understand it, I am supposed to take some pdfs, I use Grobid's batch mode to generate training and evaluating data, I then annotate this manually, and then train the model. However, I am very confused about how to add new tags to TEI schemas. Where, in particular, do I need to add new tags in order to train a header model.

Thanks!

@lfoppiano
Copy link
Collaborator

Dear @dlculver,
thanks for your interest in Grobid. Modifying the training data is a complex process at first.

Could you please explain a bit more in detail what you want to do?
With "add new tags" do you mean to extend the existing tagset? or to just use the existing tags for additional objects in the TEI?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants