GitHub - clarkbk/birth-names: Load annual baby name records from the Social Security Administration (US) and Office for National Statistics (UK) into a PostgreSQL database

Instructions

1. Install

$ git clone <this repo>
$ cd birth-names
$ mkvirtualenv birth-names
$ pip install -r requirements.txt

2. Start a PostgreSQL server and save the access credentials

For local use, you can spin up a database in Postgres.app. Then, using the example provided in .env.sample as a template, save your access credentials in a new file named .env.

Afterward, don't forget to:

$ source .env

3. Download source data from web and save to local folder

$ python3 download.py

The result should be two folders, data/us and data/uk. Each should contain many files with annual birth records for the respective country plus one summary file of total births with a name like us_births_by_year.csv (or uk_…).

4. Create the database tables

$ python3 database.py

The result should be three new empty tables in your database: year, birth_record, and name.

5. Load the database tables

$ python3 process.py

Creates a data model for birth records and initializes a database schema using Peewee ORM. Loops through all U.S. and U.K. data files in /data/ and loads the records into a PostgreSQL database. This could take as long as a half hour to complete.

6. Run the charting interface

$ streamlit run analysis.py

Starts a local Streamlit server and opens a browser tab with an interactive time series chart rendered using Plotly Express. It should look like this:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

.env.sample

.env.sample

.gitignore

.gitignore

README.md

README.md

analysis.py

analysis.py

database.py

database.py

download.py

download.py

favorites.py.sample

favorites.py.sample

logger.py

logger.py

process.py

process.py

requirements.txt

requirements.txt

Repository files navigation

Instructions

1. Install

2. Start a PostgreSQL server and save the access credentials

3. Download source data from web and save to local folder

4. Create the database tables

5. Load the database tables

6. Run the charting interface

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data		data
.env.sample		.env.sample
.gitignore		.gitignore
README.md		README.md
analysis.py		analysis.py
database.py		database.py
download.py		download.py
favorites.py.sample		favorites.py.sample
logger.py		logger.py
process.py		process.py
requirements.txt		requirements.txt

clarkbk/birth-names

Folders and files

Latest commit

History

Repository files navigation

Instructions

1. Install

2. Start a PostgreSQL server and save the access credentials

3. Download source data from web and save to local folder

4. Create the database tables

5. Load the database tables

6. Run the charting interface

About

Resources

Stars

Watchers

Forks

Languages