Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Onboard BLS - CPSAAT 2020 dataset #105

Merged
merged 7 commits into from Jun 24, 2021
Merged

feat: Onboard BLS - CPSAAT 2020 dataset #105

merged 7 commits into from Jun 24, 2021

Conversation

adlersantos
Copy link
Member

@adlersantos adlersantos commented Jun 24, 2021

Description

As requested by some teams to be able to join hiring data with labour statistics.

https://www.bls.gov/cps/cpsaat18.htm

Checklist

  • Please merge this PR for me once it is approved.
  • If this PR adds or edits a dataset or pipeline, it was reviewed and approved by the Google Cloud Public Datasets team beforehand.
  • If this PR adds or edits a dataset or pipeline, I put all my code inside datasets/<YOUR-DATASET> and nothing outside of that directory.
  • This PR is appropriately labeled.

@adlersantos adlersantos added the data onboarding Onboard a dataset or submit a pipeline label Jun 24, 2021
depends_on_past: False
start_date: '2021-06-23'
max_active_runs: 1
schedule_interval: "@once"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should be yearly

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done! f309da6

args:
task_id: "load_csv_to_bq"
bucket: "{{ var.json.bls.source_bucket }}"
source_objects: ["cpsaat18/2020.csv"]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

csv is 2020 so if we switch to yearly this doesn't work

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done! 3c83177

datasets/bls/cpsaat18/pipeline.yaml Show resolved Hide resolved
resources:
- type: bigquery_dataset
dataset_id: bls
description: "Overview: This dataset includes economic statistics on inflation, prices, unemployment, and pay & benefits provided by the Bureau of Labor Statistics (BLS).\n\nUpdate frequency: Monthly\n\nDataset source: U.S. Bureau of Labor Statistics\n\nTerms of use: This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided \"AS IS\" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.\n\nSee the GCP Marketplace listing for more details and sample queries: https://console.cloud.google.com/marketplace/details/bls-public-data/bureau-of-labor-statistics"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

now that #103 is in we can get rid of the newlines :)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here you go! Looks nicer now on the YAML config: e0b82a4

@leahecole leahecole merged commit 61f4394 into main Jun 24, 2021
@leahecole leahecole deleted the bls-dataset branch June 24, 2021 21:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data onboarding Onboard a dataset or submit a pipeline
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants