Skip to content
This repository has been archived by the owner on Mar 27, 2023. It is now read-only.

extract more structured data from ESAC registry #254

Open
6 tasks
maxheld83 opened this issue Jul 29, 2020 · 0 comments
Open
6 tasks

extract more structured data from ESAC registry #254

maxheld83 opened this issue Jul 29, 2020 · 0 comments
Assignees
Labels
esac our use of http://esac-initiative.org

Comments

@maxheld83
Copy link
Contributor

currently, a lot of the ESAC registry data appears to be somewhat unstructured.
To better make use of this data in #240 #243 et al, it would be good to clean this data and expose it as an R object in {hoad}.

In addition:

  • we should get the data programmatically from ESAC scrape esac data #244.

  • clean some fields (ie. yes, no should be logical, etc.).

  • publisher is an open text field, and would have to be checked against some definitive list

  • agreement_url is an open text field

  • consortia/institution is an open text field

  • access_costs should be ordinal

  • worfklow_assessment are 3 separate vars!

  • article_types can be parsed down to a few types (that need not be an open field)

  • ...

@maxheld83 maxheld83 added the esac our use of http://esac-initiative.org label Jul 29, 2020
@maxheld83 maxheld83 self-assigned this Jul 29, 2020
maxheld83 added a commit that referenced this issue Jul 29, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
esac our use of http://esac-initiative.org
Projects
None yet
Development

No branches or pull requests

1 participant