search: index additional_resources and data_abstract #557

Closed
GraemeWatt opened this issue Nov 8, 2022 · 0 comments · Fixed by #766

Feedback from Sabine Kraml (@sabinekraml):

One significant development is the ample additional material with non-tabular structure that is now uploaded to HEPData.
Formerly these were often just SLHA files, but now datasets are increasingly supplemented with code snippets, HistFactory models (aka JSON likelihoods), ML models (ONNX) and more. Some discussion of how this could be standardized, or at least supplemented with appropriate metadata so that these additional resources become searchable and findable in automatic queries, would in my opinion be quite helpful.

We should index the description of the additional_resources to enable free-text searches to return relevant results.

Also, it looks like we only index the publication abstract (from INSPIRE) and not the user-submitted data_abstract (i.e. the comment from the submission.yaml file). The latter might contain additional information specific to the HEPData record and therefore it should also be indexed.
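
As a rough illustration of the two changes proposed above, the sketch below builds a flat search document from the first document of a `submission.yaml` file (assumed here to be already parsed into a dict with a `comment` string and an `additional_resources` list of `location`/`description` entries). The function names and the output keys `data_abstract` and `resource_descriptions` are hypothetical and are not taken from the HEPData code base or its actual index mapping; the substring match merely stands in for a real free-text query.

```python
def build_search_document(submission_header, inspire_abstract):
    """Collect the free-text fields of a record into one dict for indexing.

    `submission_header` is assumed to be the parsed first document of
    submission.yaml; the output key names are hypothetical.
    """
    resources = submission_header.get("additional_resources") or []
    descriptions = [
        r["description"].strip()
        for r in resources
        if isinstance(r, dict) and r.get("description")
    ]
    return {
        # Publication abstract from INSPIRE (already indexed today).
        "abstract": inspire_abstract or "",
        # User-submitted comment from submission.yaml.
        "data_abstract": (submission_header.get("comment") or "").strip(),
        # One entry per additional resource that has a description.
        "resource_descriptions": descriptions,
    }


def matches(document, query):
    """Naive case-insensitive substring search over all text fields."""
    needle = query.lower()
    texts = [
        document["abstract"],
        document["data_abstract"],
        *document["resource_descriptions"],
    ]
    return any(needle in text.lower() for text in texts)


header = {
    "comment": "Includes SLHA files for the benchmark points.",
    "additional_resources": [
        {"location": "likelihoods.tar.gz",
         "description": "HistFactory JSON likelihoods"},
        {"location": "plot.png"},  # no description, so nothing to index
    ],
}
doc = build_search_document(header, "Search for supersymmetry.")
print(matches(doc, "histfactory"))  # found via a resource description
print(matches(doc, "slha"))         # found via the data abstract
```

With only the INSPIRE abstract indexed, neither query in this example would return the record; with the two extra fields, both do.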

Status: Done