UK census data - teaching file #139

Open
gmingas opened this issue Nov 25, 2020 · 0 comments
Labels
dataset Particular datasets that we are using or investigating

Comments

@gmingas
Contributor

gmingas commented Nov 25, 2020

This is a small non-disclosive sample of the census, made publicly available by the ONS here and designed for teaching purposes. It contains 18 census characteristics (sex, age, region, ethnic group, religion, etc.) for 1% of the census population (~570,000 individuals). Personal identifiers (name, address, date of birth) have been removed. Potentially disclosive variables (e.g. geographic information) have been either removed completely or aggregated. The data come under an Open Government Licence (OGL), which requires source accreditation when the data are reproduced: link. This is now part of the QUIPP pipeline here. Note that a 5% sample of the census data is also available from the UK Data Service; see #57.

For microsimulation synthesis we can combine this with aggregated UK census data made publicly available by the UK Data Service here. We are using a particular dataset containing the numbers of males and females per region in England (only 9 rows). We might need other versions of this dataset that include different variables (e.g. religion, age) in the same aggregated format. These come under an OGL and also a EULA from the UK Data Service, which prohibits attempting to identify individuals, households or organisations: link. This is also now part of the QUIPP pipeline here.
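
To make the idea of combining the two sources concrete, here is a minimal sketch of one simple approach: reweighting individual records so that their weighted counts match aggregated male/female totals per region. This is only an illustration and not necessarily what the QUIPP pipeline does. The column names (`region`, `sex`, `age_band`), the function names and all the numbers are made up for the example; they are not taken from the actual files.

```python
from collections import Counter, defaultdict

# Toy stand-in for the individual-level teaching file.
# Column names and values are hypothetical, not the real ONS ones.
sample = [
    {"region": "North East", "sex": "male", "age_band": "16-24"},
    {"region": "North East", "sex": "male", "age_band": "25-34"},
    {"region": "North East", "sex": "female", "age_band": "35-44"},
    {"region": "London", "sex": "male", "age_band": "25-34"},
    {"region": "London", "sex": "female", "age_band": "16-24"},
    {"region": "London", "sex": "female", "age_band": "45-54"},
    {"region": "London", "sex": "female", "age_band": "65-74"},
]

# Toy stand-in for the aggregated table: people per (region, sex).
# The counts are invented for illustration.
targets = {
    ("North East", "male"): 300,
    ("North East", "female"): 320,
    ("London", "male"): 500,
    ("London", "female"): 480,
}


def reweight(records, cell_targets):
    """Give each record a weight so weighted (region, sex) counts match the targets."""
    cell_counts = Counter((r["region"], r["sex"]) for r in records)
    weighted = []
    for r in records:
        key = (r["region"], r["sex"])
        if key not in cell_targets:
            raise KeyError(f"no aggregated target for cell {key}")
        # Every record in a cell shares that cell's target equally.
        weighted.append({**r, "weight": cell_targets[key] / cell_counts[key]})
    return weighted


def weighted_totals(records):
    """Sum the weights per (region, sex) cell."""
    totals = defaultdict(float)
    for r in records:
        totals[(r["region"], r["sex"])] += r["weight"]
    return dict(totals)


weighted = reweight(sample, targets)
for cell, total in sorted(weighted_totals(weighted).items()):
    print(cell, total)
```

With a single region-by-sex table this is a one-step adjustment. If we later add further aggregated tables (e.g. religion, age), matching several margins at once would need an iterative method such as iterative proportional fitting.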

@gmingas gmingas added the dataset Particular datasets that we are using or investigating label Nov 25, 2020
@gmingas gmingas changed the title UK census data UK census data - teaching file Nov 25, 2020