Data examination results for pension contributions #82

Open
martinholmer opened this issue May 11, 2024 · 2 comments
Labels: data generation, targeting


martinholmer commented May 11, 2024

The amount of defined-contribution (DC) pension contributions (pencon_p for the tax unit head and pencon_s for the tax unit spouse when married filing jointly) seems too low, primarily because very few people in our most recent dataset have a positive value for these two variables.

Here is a tabulation of the tmd.csv file (for 2021) that is being used to generate the most recent examination results:

% awk -F, 'NR==1{for(i=1;i<=NF;i++)print i,$i}' tmd.csv | grep -e pencon -e s006
11 s006
50 pencon_p
51 pencon_s

% awk -F, 'NR==1{next}{t++}$50>0{n++}END{print t,n,n/t}' tmd.csv 
233412   6789   0.0290859                 <--- UNWEIGHTED HEADS (#)

% awk -F, 'NR==1{next}{w=$11;t+=w}$50>0{n+=w}END{print t*1e-6,n*1e-6,n/t}' tmd.csv
219.594   15.171   0.0690865              <--- WEIGHTED HEADS (#M)

% awk -F, 'NR==1{next}{w=$11;t+=w}$50>0{n+=w}$51>0{n+=w}END{print t*1e-6,n*1e-6,n/t}' tmd.csv
219.594   16.0148   0.0729291             <--- WEIGHTED PEOPLE (#M)

% awk -F, 'NR==1{next}{w=$11;t+=w;c+=w*($50+$51)}END{print t*1e-6,c*1e-9}' tmd.csv
219.594   48.7986                         <--- WEIGHTED DOLLAR CONTRIBUTIONS ($B)
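The same four tabulations can be sketched in Python, looking the columns up by name (s006, pencon_p, pencon_s) rather than by position, so the code keeps working if the column order in tmd.csv changes. This is only an illustration: the function name `tabulate_pencon` and the three-row `SAMPLE` are made up for the example and are not part of the repository.

```python
import csv
import io

# Made-up three-row sample, for illustration only (not real tmd.csv data).
SAMPLE = """s006,pencon_p,pencon_s
100,0,0
200,5000,0
50,3000,2000
"""


def tabulate_pencon(lines):
    """Tabulate pencon_p and pencon_s from CSV text with a header row."""
    units = 0           # unweighted number of tax units
    heads = 0           # unweighted units whose head has pencon_p > 0
    wght_units = 0.0    # weighted number of tax units
    wght_heads = 0.0    # weighted units whose head has pencon_p > 0
    wght_people = 0.0   # weighted heads plus spouses with a positive amount
    wght_dollars = 0.0  # weighted sum of pencon_p + pencon_s
    for row in csv.DictReader(lines):
        weight = float(row["s006"])
        pencon_p = float(row["pencon_p"])
        pencon_s = float(row["pencon_s"])
        units += 1
        wght_units += weight
        if pencon_p > 0:
            heads += 1
            wght_heads += weight
        # Each comparison is True (1) or False (0), so this counts 0, 1, or 2 people.
        wght_people += weight * ((pencon_p > 0) + (pencon_s > 0))
        wght_dollars += weight * (pencon_p + pencon_s)
    return {
        "units": units,
        "heads": heads,
        "wght_units": wght_units,
        "wght_heads": wght_heads,
        "wght_people": wght_people,
        "wght_dollars": wght_dollars,
    }


result = tabulate_pencon(io.StringIO(SAMPLE))
for name, value in result.items():
    print(name, value)
```

To run it on the real file, pass an open file handle instead of the sample, for example `with open("tmd.csv", newline="") as f: tabulate_pencon(f)`, then scale the weighted counts by 1e-6 and the weighted dollars by 1e-9 to match the units above.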

The 16.0 million people with positive DC contributions compare with nearly 85.3 million "active participants" in the USDOL Form 5500 results for 2020:

[Screenshot: USDOL Form 5500 results for 2020]

And the $48.8 billion tabulation compares with the USDOL Form 5500 results for 2020 of almost $586 billion shown above. So the tmd.csv file has less than ten percent of the DOL total for DC pension contributions. Even if the DOL contribution total includes both employee and employer DC contributions, the employee DC contribution amounts in the tmd.csv file seem too low.
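The size of the gap can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in this comment; the DOL values are the rounded "nearly 85.3 million" and "almost $586 billion", so the shares are approximate, and the two sources refer to different years (2021 versus 2020). The variable names are just for illustration.

```python
# tmd.csv tabulations for 2021 (from the awk output above)
TMD_PEOPLE_M = 16.0148     # weighted people with positive contributions (millions)
TMD_DOLLARS_B = 48.7986    # weighted contributions ($ billions)

# USDOL Form 5500 results for 2020, rounded as quoted in the text
DOL_PARTICIPANTS_M = 85.3  # active participants (millions)
DOL_DOLLARS_B = 586.0      # contributions ($ billions), may include employer amounts

people_share = TMD_PEOPLE_M / DOL_PARTICIPANTS_M
dollars_share = TMD_DOLLARS_B / DOL_DOLLARS_B

print(f"people share of DOL participants:   {people_share:.1%}")
print(f"dollar share of DOL contributions:  {dollars_share:.1%}")
```

The people share comes out a little under one fifth and the dollar share a little over eight percent, which is where the "less than ten percent" statement comes from.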

@martinholmer

Better targets for years up through 2018 are from IRS-SOI tabulations of W-2 forms.

The 2018 tabulations of taxpayers with employee pension contributions are:

Number of taxpayers (#M)               60.353
Gross (Medicare) earnings ($B)       5062.371
Employee pension contributions ($B)   332.520

So, the 16.0 million taxpayers in 2021 tabulated using the tmd.csv file is clearly too low.
Also, the $48.8 billion in 2021 employee pension contributions tabulated using the tmd.csv file is way below the actual 2018 value of almost $333 billion.
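A rough way to see where the shortfall comes from is to split it into a count gap and an average-amount gap. The sketch below uses only the figures quoted above; it compares 2021 tmd.csv tabulations with 2018 IRS-SOI values, so it is indicative rather than exact, and the variable names are just for illustration.

```python
# tmd.csv tabulations for 2021
TMD_PEOPLE_M = 16.0148      # people with positive contributions (millions)
TMD_DOLLARS_B = 48.7986     # contributions ($ billions)

# IRS-SOI W-2 tabulations for 2018
SOI_TAXPAYERS_M = 60.353    # taxpayers with employee pension contributions (millions)
SOI_EARNINGS_B = 5062.371   # gross (Medicare) earnings ($ billions)
SOI_PENCON_B = 332.520      # employee pension contributions ($ billions)

# Ratios of tmd.csv values to the SOI targets
count_ratio = TMD_PEOPLE_M / SOI_TAXPAYERS_M
dollar_ratio = TMD_DOLLARS_B / SOI_PENCON_B

# Average contribution per contributor: $ billions / millions = $ thousands
tmd_avg = TMD_DOLLARS_B / TMD_PEOPLE_M * 1e3
soi_avg = SOI_PENCON_B / SOI_TAXPAYERS_M * 1e3

# SOI contributions as a share of contributors' gross earnings
soi_rate = SOI_PENCON_B / SOI_EARNINGS_B

print(f"count ratio (tmd/SOI):   {count_ratio:.1%}")
print(f"dollar ratio (tmd/SOI):  {dollar_ratio:.1%}")
print(f"average contribution:    tmd ${tmd_avg:,.0f} vs SOI ${soi_avg:,.0f}")
print(f"SOI contribution rate:   {soi_rate:.1%}")
```

On these numbers both pieces fall short: the tmd.csv file has roughly a quarter of the SOI count of contributors, and the average contribution per contributor is also well below the SOI average.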

It does appear that @donboyd5 was correct to highlight this issue in #8.

The 2011 IRS-SOI W-2 tabulations were used in the taxdata repository's impute_pencon.py module to impute pencon_p and pencon_s values to 2011 PUF data. The 2015 IRS-SOI tabulations could be used to impute more accurate pension contributions to the 2015 PUF.


donboyd5 commented May 12, 2024 via email

@martinholmer martinholmer changed the title Pension contributions seem too low Data examination results for pension contributions May 27, 2024