Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Weekly Standup 2021/11/15 #3

Open
github-actions bot opened this issue Nov 15, 2021 · 3 comments
Open

Weekly Standup 2021/11/15 #3

github-actions bot opened this issue Nov 15, 2021 · 3 comments
Assignees

Comments

@github-actions
Copy link

Please post any useful updates from your project

@callummole
Copy link
Collaborator

callummole commented Nov 15, 2021

In the ONS project we are recreating datasets from publicly available tables to assess the privacy risk of releasing those tables.

More detail: we are producing synthetic data by reconstructing distributions based on a set of two-way marginal tables (snapshots of how many individuals in a dataset are present in each combined category of two variables e.g. for the two variables marital status and sex a single cell could be single + male). We use a method called iterative proportional fitting for this, which iteratively adjusts a distribution so that the marginal counts are correct (e.g. it matches how many single/married people there are and also matches how many male/females etc). We have begun analysing the extent that individuals present in the 1% census data teaching file are also present in the recreated dataset (based on two-way tables of the 1% census data), thereby assessing the privacy risk of releasing tables.

code

@triangle-man
Copy link
Member

I intend to start https://github.com/alan-turing-institute/Hut23/issues/1013 Synthetic Data, Federated Learning, and Privacy Trade-Offs by writing a backbrief. Will create a repo and link to it from here.

@crangelsmith
Copy link
Collaborator

I've been trying to understand the current state of QUIPP. From conversations with the QUIPP team to attempting to run the pipeline. For the next week or so I'm focusing on finishing the RDS course but once that is done I plan to write up a report that describes what QUIPP is and what it can do.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants