Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Validate primary fuel methodology that should be used for assigning a plant's fuel category #281

Open
grgmiller opened this issue Feb 15, 2023 · 1 comment
Labels
generation data accuracy/completeness of generation data methodology Improve methodology validation Validating data outputs

Comments

@grgmiller
Copy link
Collaborator

The OGE pipeline currently can use several methods to identify a plant's primary fuel: based on the fuel consumption, based on net generation, or based on nameplate capacity (see docs). The plant primary fuel identification is important because that is what is used to categorize each plant into the fuel categories that we use to match plant data up with the hourly EIA-930 data, and is also what we use when we calculate fleet-specific emission rates. Thus, ideally, we should be trying to categorize each plant into the fuel category that is used by EIA-930.

We currently default to identifying the primary fuel based on fuel consumption, but there is a chance that this approach, while technically accurate, may not match up with the fuel category used by EIA-930. For example, if there is a plant with a 100 MW coal generator that runs at 50% capacity and a 90 MW natural gas generator that runs at 90% capacity, our pipeline would likely identify this plant as a natural gas generator, because it is likely consuming more natural gas fuel (by mmbtu) than it is coal. However, just looking at the nameplate capacity of the plant, it has more coal generating capacity than gas capacity, so it might be categorized as a coal plant by the ISO and thus in EIA-930 (even if in a specific year, it did burn more gas than coal).

We have never validated whether one primary fuel identification method generally performs better than another. There are a few ways we could consider trying to validate which method we should use: We should try grouping plants into fuel categories based on different primary fuel identification methods, then:

  1. Compare the error between total reported net generation for that fuel category in EIA-930 and total net generation in that category from our data.
  2. Compare the error between total emissions calculated in OGE to total emissions calculated using the fleet emissions factors multiplied by EIA-930 net generation.

We should also generally look at how much each of these different primary fuel identification methods actually impacts the categorization of plants - how many plants does this affect? Are these plants large or small?

Also, we should start exporting the plant_primary_fuel table as part of our intermediate outputs.

@grgmiller grgmiller added methodology Improve methodology validation Validating data outputs generation data accuracy/completeness of generation data labels Feb 15, 2023
@grgmiller
Copy link
Collaborator Author

See #296 for some work that:

  • Exports the primary_fuel_table with all intermediate columns to outputs to help with validation.
  • Adds a new validation check to flag when the plant primary fuel assigned by the pipeline does not match the capacity-based primary fuel assignment.

We still need to work on comparing whether one primary fuel assignment method better aligns with total generation reported in EIA-930.

@grgmiller grgmiller mentioned this issue Dec 17, 2023
20 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
generation data accuracy/completeness of generation data methodology Improve methodology validation Validating data outputs
Projects
None yet
Development

No branches or pull requests

1 participant