Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write archiver to regularly run and archive updated CEMS crosswalk #102

Open
2 tasks
e-belfer opened this issue Apr 26, 2023 · 1 comment
Open
2 tasks

Comments

@e-belfer
Copy link
Member

e-belfer commented Apr 26, 2023

This is a continuation of issue #2505 in PUDL, which sets out to update the EPA-EIA crosswalk with 2021 data. The script written to do the updated archiving is written in PR #1 in the forked Catalyst repository.

While creating a static manually compiled output is a good start, it would probably be good to have a more reproducible programmatic process that will incorporate any data updates, and any updates to the crosswalk repo (this could be process changes or manual mapping additions), and that archives these outputs in a manner consistent to our other data sources.

  • Configure archiver to work with environment variable
  • Configure archiver to run render.r in the catalyst-cooperative/camd-eia-crosswalk-2021 repo and upload outputs to Zenodo, or configure the repo itself to run regularly (whichever is easier).
@zschira
Copy link
Member

zschira commented May 8, 2023

I've updated the archiver to create a 2021 archive that just archives the zip of our fork that basically looks exactly like the 2018 archive. I think the functionality to dynamically generate the crosswalk would still be valuable, so I'm going to leave this issue open, but it probably won't be high priority for awhile, so I'm moving to the icebox.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Icebox
Development

No branches or pull requests

2 participants