Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

change default num_buildings_represented and downselect criteria #185

Merged
merged 13 commits into from Oct 13, 2020

Conversation

vtnate
Copy link
Contributor

@vtnate vtnate commented Sep 24, 2020

Pull Request Description

Based off Slack conversation this changes the default num_buildings_represented and the default downselect criteria in some example yml files. This way we capture the single family detached housing stock in a manner consistent with the rest of BuildStock.

Checklist

  • All other unit tests passing
  • Update existing documentation
  • Run a small batch run to make sure it all works (local is fine, unless an Eagle specific feature)

@vtnate vtnate requested a review from nmerket September 24, 2020 17:19
@vtnate vtnate self-assigned this Sep 24, 2020
Copy link
Member

@nmerket nmerket left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems reasonable. I often just want to run SFD when doing testing because they're faster. Do we want to be limiting the example files to SFD @afontani @ejhw?

@vtnate
Copy link
Contributor Author

vtnate commented Sep 25, 2020

We could instead just comment that line selecting SFD but still leave it in as an example of how to structure the filtering logic.

Changing from ~80 million to ~130 million buildings represented feels like a pretty big change (is that true?). I'm glad our example file now matches the BuildStock data.

@afontani
Copy link
Collaborator

afontani commented Sep 25, 2020

So the number of units represented is the correct number from ACS 5-yr 2016 (aggregates from census track dwelling unit counts). Since we now have multifamily and vacant units, I feel like the project_resstock_national.yml file should be renamed. The YAML file isn't really the full national project but occupied SFD only.

In general, I would support a more exhaustive set of example yaml instead of just commenting out rows.

@ejhw
Copy link
Contributor

ejhw commented Sep 25, 2020

Thanks @vtnate ! We've been talking about moving all ResStock yamls to the OS-BS repo anyway (they'd have to reference the compatible bsb version). I think @TobiAdekanye was going to work on this.

I'd prefer to just keep example yamls in this repo. Does that sound okay @nmerket ?

I agree with @afontani that project_resstock_national.yml should not be downselected and we could have a separate yaml pre-downselected to SFD if we want that.

@vtnate
Copy link
Contributor Author

vtnate commented Sep 29, 2020

Thanks guys. I removed the downselection to SFD in project_resstock_national.yml, though I kept it in aws_demo_project.yml for the slightly different use case @ejhw suggested.

@afontani Are you suggesting we host a bunch of yml files showing details of different operations, or a single yml with everything in it that can be edited by the user?

@afontani
Copy link
Collaborator

@vtnate: I am suggesting a set of YAML files showing different example use cases. However, where these YAML files live is still a question for the group.

n_datapoints: 4 # Comment this line out if using a custom buildstock csv file
n_buildings_represented: 80000000
n_buildings_represented: 133172057 # Total number of residential dwelling units in CONUS. Downselected later as desired.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add reference for this number here too please

n_datapoints: 4
n_buildings_represented: 80000000
n_datapoints: 4
n_buildings_represented: 133172057 # Total number of residential dwelling units in CONUS. Downselected later as desired.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add reference for this number here too please

# logic:
# - Geometry Building Type ACS|Single-Family Detached
downselect:
resample: false
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does resample: false mean that downselect is disabled? If so, add comment "# downselect is disabled"

downselect:
resample: false
logic:
- Vacancy Status|Occupied
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this just here as an example? If so, add comment saying so.

I guess I'd prefer to keep "Geometry Building Type ACS|Single-Family Detached" as the example logic; seems like it would be more common than "Vacancy Status|Occupied".

baseline:
n_datapoints: 4 # Comment this line out if using a custom buildstock csv file
n_buildings_represented: 133172057 # Total number of residential dwelling units in CONUS. Downselected later as desired.
n_buildings_represented: 133172057 # Total number of residential dwelling units in contiguous United States as defined by
# 2016 American Community Survey by census region. Data available at data.census.gov.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it 5-yr ACS (2012-2016)? Does the count include all dwelling units including high-rise residential and vacant units? cc @afontani

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe both are true. @afontani can you confirm?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@vtnate, @ejhw: Correct.

project_resstock_national.yml Outdated Show resolved Hide resolved
project_resstock_national.yml Outdated Show resolved Hide resolved
@nmerket nmerket requested a review from ejhw October 9, 2020 16:27
Copy link
Member

@nmerket nmerket left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. Let's wait until we hear from @ejhw before merging.

@ejhw
Copy link
Contributor

ejhw commented Oct 9, 2020

Looks good to me. Let's wait until we hear from @ejhw before merging.

Waiting for @afontani to confirm the exact definition of the dwelling unit count so we can document it here: https://github.com/NREL/buildstockbatch/pull/185/files#r498970703

  • ACS or PUMS? Which year(s)?
  • Includes all residential dwelling units, including high-rise multifamily and unoccupied units
  • Also would be nice to document where the number came from (e.g., what tsv maker script?)

@vtnate did you test running the ymls to make sure they run as is?

@afontani
Copy link
Collaborator

afontani commented Oct 9, 2020

Looks good to me. Let's wait until we hear from @ejhw before merging.

Waiting for @afontani to confirm the exact definition of the dwelling unit count so we can document it here: https://github.com/NREL/buildstockbatch/pull/185/files#r498970703

  • ACS or PUMS? Which year(s)?
  • Includes all residential dwelling units, including high-rise multifamily and unoccupied units
  • Also would be nice to document where the number came from (e.g., what tsv maker script?)

@vtnate did you test running the ymls to make sure they run as is?

@ejhw: The number comes from a census tract level query of ACS 5-yr 2016 (i.e. 2012-2016).

@vtnate
Copy link
Contributor Author

vtnate commented Oct 12, 2020

  • Also would be nice to document where the number came from (e.g., what tsv maker script?)

@afontani Can you share this path or filename?

@afontani
Copy link
Collaborator

  • Also would be nice to document where the number came from (e.g., what tsv maker script?)

@afontani Can you share this path or filename?

@vtnate: https://github.com/NREL/resstock-estimation/blob/master/sources/spatial/tsv_maker.py

Copy link
Contributor

@ejhw ejhw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@vtnate
Copy link
Contributor Author

vtnate commented Oct 13, 2020

@ejhw Yes, both eagle and aws have been tested successfully with these yml files as they are currently written. Note that (unrelated to this PR) sampling takes a loooooooong time when downselecting. @nmerket has been working on an improved sampling script that may be able to help reduce the sampling time.

@nmerket nmerket merged commit 6439a50 into develop Oct 13, 2020
@nmerket nmerket deleted the num-buildings branch October 13, 2020 17:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants