Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How many times does the term "github.com" appear in the Springer Nature article database #2

Open
KirstieJane opened this issue Nov 29, 2017 · 3 comments
Assignees
Labels
Projects

Comments

@KirstieJane
Copy link
Member

KirstieJane commented Nov 29, 2017

This database is not openly available, but as we're at the hackday we have access to it!! Woooo! ✨

Answer is 13,811 of 10,817,565 records (0.12%).

@KirstieJane KirstieJane added this to Backlog in Code cite via automation Nov 29, 2017
@KirstieJane
Copy link
Member Author

KirstieJane commented Nov 29, 2017

Here's a paper that is a true positive: DOI: 10.1038/sdata.2016.44

This paper is not in the Springer full test database. A true positive that is present in the Springer database is https://doi.org/10.1186/2049-2618-2-6.

@KirstieJane KirstieJane moved this from Backlog to Doing in Code cite Nov 29, 2017
@martintoreilly
Copy link
Collaborator

The query below will search for github.com anywhere in the text.
http://api.springer.com/xmldata/app?q=github.com&api_key=<key>

Running this gives the 13,811 records (summary block from results below).

   <result>
      <total>13811</total>
      <start>1</start>
      <pageLength>10</pageLength>
      <recordsDisplayed>10</recordsDisplayed>
   </result>

@martintoreilly
Copy link
Collaborator

Running a query with no search terms gives 10,817,565 records, so it seems only 0.12% of publications in Springer's full text have github.com links.

Query: http://api.springer.com/xmldata/app?q=&api_key=
Result summary:

   <result>
      <total>10817565</total>
      <start>1</start>
      <pageLength>10</pageLength>
      <recordsDisplayed>10</recordsDisplayed>
   </result>

martintoreilly added a commit that referenced this issue Nov 29, 2017
…ext-database

Count records with search terms in Springer full text API (issue #2)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Code cite
  
Doing
Development

No branches or pull requests

2 participants