Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use case: parse DOIs into their pieces #1

Open
sckott opened this issue Jan 5, 2018 · 3 comments
Open

use case: parse DOIs into their pieces #1

sckott opened this issue Jan 5, 2018 · 3 comments

Comments

@sckott
Copy link
Owner

sckott commented Jan 5, 2018

parse DOI into:

prefix/suffix == (prefix, suffix)

e.g., 10.1007/978-3-319-68041-5_5 == c("10.1007", "978-3-319-68041-5_5")

@noamross does this capture what you want?

@noamross
Copy link

noamross commented Jan 6, 2018

I think so. I actually don't know enough about DOI formats to know if the suffix can be broken up into further components. I guess that's specific to the publisher?

@sckott
Copy link
Owner Author

sckott commented Jan 7, 2018

Prefix can be further broken up in to 10 + XXXX. the 10 at the beginning is always 10 for DOIs.

For suffixes, yeah I think germane to the publisher so not sure there's one way to parse those, there could be publisher specific parsing, but that would be a huge can of worms

@sckott
Copy link
Owner Author

sckott commented Jan 18, 2018

now there's fxns for getting out prefixes and suffixes

still working on a fxn to split by separator / - easy in theory, but getting used to PEG stuff

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants