Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updated v7 PUF--PolicyEngine comparisons to Tax-Calculator--taxdata default, and to IRS tables #46

Open
donboyd5 opened this issue Apr 22, 2024 · 1 comment

Comments

@donboyd5
Copy link
Collaborator

@nikhilwoodruff @martinholmer

I've updated the ad-hoc analyses. They can be found here. They mostly compare the v7 PUF-PE (pufpe) 2021 file to the Tax-Calculator--taxdata (tctd) default 2021 file. There are also some comparisons to IRS published aggregates, which are more labor-intensive because they require determining proper correspondence between PUF/TC variable names and IRS spreadsheet columns and concepts; so I'll do more of those in Phase 3.

The pufpe-tctd comparisons are in section 4 and the limited pufpe-tctd-irs comparisons are in section 6. For now, they focus on taxpayers, defined as max(c09200 - refund, 0) > 0. This is close to the IRS definition but imperfect; we'll perfect it in Phase 3. I do not believe the imperfection in defining taxpayers is a significant concern.

I do not pretend that the tctd file is the gold standard, but a lot of people have been looking at it for a lot of years, so significant differences require investigation.

In the interest of speed, I am highlighting here a few areas of concern rather than analyzing in detail. I focus on dollar amounts ($ billions) rather than percentage differences. I focus more on input variables than output variables right now.

  • pufpe AGI is much higher than tctd in the $15k-100k range; lower than tctd in the $100k-$1m range; and then higher than tctd at the high ends. I think the pufpe distribution is probably reasonable because it was targeted, but that does not mean the components of income are correct; you can see that by looking at section 6, which compares to IRS

  • pufpe AGI is $1.2 trillion higher than tctd AGI; again comparing to IRS (section 6, this seems right)

  • pufpe wages are $1.8 trillion higher than tctd, driven by the $15k-200k AGI ranges; that seems impossible to believe

  • pufpe taxable pensions and Social Security are near zero, which is not believable

  • pufpe QBID variable is zero -that must be obviously incorrect

  • pufpe medical is near zero; that can't be right

  • pufpe SALT is triple the tctd SALT

There are lots of other questions but I think for now we need to focus on the input variables.

@donboyd5
Copy link
Collaborator Author

donboyd5 commented Apr 23, 2024

@nikhilwoodruff @martinholmer

I moved updated tables to here. Section 4 has aggregates for most variables in 2021, compared to taxdata for 2021.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant