Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial feedback user story 1 - datacenter-publication-citations #2

Open
mfenner opened this issue May 11, 2020 · 2 comments
Open

Initial feedback user story 1 - datacenter-publication-citations #2

mfenner opened this issue May 11, 2020 · 2 comments

Comments

@mfenner
Copy link

mfenner commented May 11, 2020

  • Pangaea is not yet submitting views and downloads to DataCite, so this information could be left out. dryad.dryad has usage stats, and gbif.gbif has lots of citations (the occurrences that you also have in your output). Maybe show two or three repositories to show different patterns? You could have them all in one GraphQL query, but need to alias them, e.g.
pangaea: repository(id: $repository) {
  id
  name
}
gbif: repository(id: $repository) {
  id
  name
}
  • fetch the aggregate citation count over the whole repository and/or a specific query. There are issues with generating the numbers, but that is up to me.
  • Generate a downloadable bibtex file (just contact the individual bibtex entries into a single file with extension .bib).
  • consider a query term with more hits, e.g. polarstern (a research vessel from the same institution that also hosts Pangaea). Or show aggregated numbers for the whole repository.
datasome pushed a commit that referenced this issue May 13, 2020
@datasome
Copy link

@mfenner, I have addressed all the issues - please see below:

Item Status Comment
show two or three repositories to show different patterns? You could have them all in one GraphQL query, but need to alias them Done Included all of pangaea, dryad.dryad and gbif.gbif in the query, and for all of the generated: 1. bar plots; 2. html tables of works by query and 3. Allow user to download BibTeX files (see below)
fetch the aggregate citation count over the whole repository and/or a specific query. There are issues with generating the numbers, but that is up to me. Done As you say, they are currently 0 for each of the three repositories
consider a query term with more hits, e.g. polarstern Done Now use 'polarstern' for pangaea, and other queries with lots of hits for dryad.dryad and gbif.gbif respectively.
Generate a downloadable bibtex file Done As suggested elswhere, the user can now download one BibTeX file of works per repository; and a BibTeX file of all citations of a given work. I have tested the format of the downloaded BibTeX files via bibtool, see below:
# Parses the file correctly and displays all formatted entries
bibtool dryad.dryad_butterfly.bib
# Does not display any rouge characters (and also checks for duplicates and returns none):
bibtool -- 'unique.field {doi}' -o '' dryad.dryad_butterfly.bib 

@datasome
Copy link

@mfenner, I have now documented in Markdown the notebook for user story 1. Please note that the top Markdown table appears borderless in Jupyter lab and mybinder.org, but somehow not in github.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants