Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to annotate our own custom dataset ? #7

Open
bvy007 opened this issue Jul 9, 2019 · 1 comment
Open

How to annotate our own custom dataset ? #7

bvy007 opened this issue Jul 9, 2019 · 1 comment

Comments

@bvy007
Copy link

bvy007 commented Jul 9, 2019

Hi,

Could you please suggest me some ideas of creating a Questioning and Answering Dataset just like SQuAD but with common set of questions for every paragraph and a specific answer from the paragraph?? Any leads would be appreciated.

@ayushjain1144
Copy link

ayushjain1144 commented Jul 9, 2019

You can checkout https://github.com/samdash/QuestionAnswers

It provides interface to enter question answers and converts into BERT format. There are some bugs in this, but served the purpose in my case. It does not athutomatically calculates the start index though. I wrote a small script to calculate and feed it. Also it won't be very difficult to just modify it to automatically take the start index

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants