Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evidence sources for dataset? #1

Open
dwadden opened this issue Apr 5, 2021 · 1 comment
Open

Evidence sources for dataset? #1

dwadden opened this issue Apr 5, 2021 · 1 comment

Comments

@dwadden
Copy link

dwadden commented Apr 5, 2021

Hi, thanks for creating this dataset!

In sec. 4.1 of your paper, it looks like you use a pipeline where you select relevant sentences from an evidence document, and then use BERT to predict the relation between the selected sentences and the claim. Does the main_text field in the data you make available for download correspond to the input evidence document?

What exactly is the relationship between the main_text and the sources? Is the main_text just the concatenation of the text from all the sources - and if so, what's going on in the cases where there is no source listed?

Thanks for the clarification!

Dave

@yfqiu98
Copy link

yfqiu98 commented Nov 6, 2021

Any updates here? Seems even there are some cases the sentences of main_text are not from the sources, e.g., 100th cases in testing split,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants