Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GROBID not able to extract the header numbers when they're in Roman or in alphabets #1115

Open
alwaysaditi opened this issue May 13, 2024 · 0 comments

Comments

@alwaysaditi
Copy link

Hi I am new to Grobid and really need help

I am trying to extract the section headers and while they do appear normally in the tag, it does not give information about the section number in the tag, when the section IDs are Roman Numerals or alphabets

image

image

In contrast, when the numbers are in decimal notation/english integers, I am able to properly find the section number inside the tag

image

Please help me with a workaround so I am able to extract the section numbers inside the header tags.

Thanks a lot !

@lfoppiano lfoppiano changed the title Please help! : GROBID not able to extract the header numbers when they're in Roman or in alphabets GROBID not able to extract the header numbers when they're in Roman or in alphabets May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant