Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

user doesn't know which version/date of CORD19 a carrel was created from #122

Open
nkmeyers opened this issue Jun 28, 2020 · 3 comments
Open
Assignees
Labels
carrel Something to do with the generated carrel pages enhancement New feature or request index This issue is related to indexing

Comments

@nkmeyers
Copy link
Collaborator

Problem: user either a carrel creator or user reading a carrel doesn't know which version/date of CORD19 dataset a carrel was created from

Question(s):

  1. How do we know what the last known date was for when the cord19 dataset was updated and indexed in Distant Reader? @artunit @ralphlevan @ericleasemorgan ?
  2. @ericleasemorgan How can we best write that last cord19dataset-updatetoDR info, and last cord19dataset-indexed-byDRdate info into the visilble body content of /export/reader/www/internal/index.cgi?
  3. @ericleasemorgan How can we best write that last cord19dataset-updatetoDR info, and last cord19dataset-indexed-byDRdate info into the provenance.tsv file for a carrel?

Possible solution(s): Once we can populate that date info as a data element in the provenance file can we then Write the entire contents of provenance file into a new variable derived section in the MANIFEST.htm and index.htm files for ea carrel?

Anyone have other ideas for how to get this info in front of the user(s) at carrel creation time? And at carrel "reading" time?

@nkmeyers nkmeyers added enhancement New feature or request index This issue is related to indexing labels Jun 28, 2020
@nkmeyers nkmeyers added this to Triage in The Reader Meets COVID-19 via automation Jun 28, 2020
@ericleasemorgan ericleasemorgan moved this from to do to in progress in Project CORD #7 (done) Jul 9, 2020
@ralphlevan
Copy link
Collaborator

A little late, but I have a way to find out when my database was last updated. http://solr-01:8983/solr/cord/admin/file?wt=json&_=1594418120634&file=dataimport.properties&contentType=text%2Fplain%3Bcharset%3Dutf-8

That will return a file that include the date of the last indexing

@ralphlevan
Copy link
Collaborator

There's also this: http://solr-01:8983/solr/admin/metrics/history?action=status&name=solr.collection.cord

That will return a JSON file. status.lastModified is timestamp in seconds. Usual tools will convert that into date and time

@ericleasemorgan
Copy link
Owner

ericleasemorgan commented Jul 27, 2020 via email

@dbrower dbrower added the carrel Something to do with the generated carrel pages label Jul 8, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
carrel Something to do with the generated carrel pages enhancement New feature or request index This issue is related to indexing
Projects
No open projects
Development

No branches or pull requests

5 participants