Tax function estimation #96

jdebacker · 2024-03-11T14:56:15Z

Beginning with PR #73, which updated the default calibration of OG-USA, we have observed some odd results related to the estimated tax functions. This issue will document what we've noticed in the hopes that we can address any issues with the tax function estimation routines or with the microsimulation model used to calibrate OG-USA (or both).

Things that haven't seemed quite right:

In PR #73 (Update calibration), I noted that DEP tax functions estimated with the most recent Tax-Calculator at the time (v3.4.1) produced parameters that, when used in OG-USA, left the model unable to solve for the steady state (SS).
Also noted in PR #73 (Update calibration): attempts to estimate the mono and mono2D functional forms for the tax functions failed (e.g., no minimum found), again using Tax-Calculator 3.4.1.
In OG-USA simulations since October 2023, we've used the GS functional form for the tax functions (with these, the model solves), but we've noticed significant garbage collection and reduced computational performance when solving the model (noted in OG-USA Discussions, Analysis of Dask distributed workloads #83). The time to solve the model SS has gone up from about 45 seconds to 15 minutes. Note that when using the tax function parameters in ogusa_default_parameters.json, the warnings and performance reductions pretty much disappear.


jdebacker · 2024-03-11T16:14:05Z

Some plots:

DEP functions estimated on Tax-Calculator 3.4.1 (each line is a different age: blue for younger, red for older):

GS functions estimated on Tax-Calculator 3.5.1:

jdebacker · 2024-03-11T16:16:23Z

Some key questions:

1. Are these odd functions an artifact of the microsimulation model output or of txfunc.py (there have been changes to both)?
2. Are these functions "correct" (i.e., do they fit the data best)?
3. What is preventing estimation of the mono and mono2D functions?

jdebacker · 2024-03-11T16:20:18Z

Re (2) above, I don't see how these could be the best fit (albeit, the scatter plot dots do not reflect sampling weights):

ETRs for 40 year olds, DEP functions (tax year 2024):

MTR on labor income for 40 year olds, DEP functions (tax year 2024):
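
Since the scatter plot dots are unweighted, one way to judge fit by eye is to overlay weighted bin averages instead of raw points. Below is a minimal sketch of that idea; it is not code from txfunc.py. The function name `weighted_bin_means` and the toy arrays are hypothetical, and the real inputs would be the income, tax rate, and sampling weight columns from the microsimulation output.

```python
import numpy as np


def weighted_bin_means(income, rates, weights, n_bins=20):
    """Weighted mean income and weighted mean rate within income quantile bins.

    Hypothetical helper for illustration; assumes every bin ends up non-empty.
    """
    income = np.asarray(income, dtype=float)
    rates = np.asarray(rates, dtype=float)
    weights = np.asarray(weights, dtype=float)
    # Bin edges at (unweighted) income quantiles
    edges = np.quantile(income, np.linspace(0.0, 1.0, n_bins + 1))
    # Bin index for each observation; the maximum income goes in the last bin
    idx = np.clip(np.searchsorted(edges, income, side="right") - 1, 0, n_bins - 1)
    w_sum = np.bincount(idx, weights=weights, minlength=n_bins)
    mean_income = np.bincount(idx, weights=weights * income, minlength=n_bins) / w_sum
    mean_rate = np.bincount(idx, weights=weights * rates, minlength=n_bins) / w_sum
    return mean_income, mean_rate


# Toy data (made up): the second observation carries three times the weight
toy_income = np.array([1.0, 2.0, 3.0, 4.0])
toy_rates = np.array([0.1, 0.3, 0.2, 0.4])
toy_weights = np.array([1.0, 3.0, 1.0, 1.0])
bin_income, bin_rate = weighted_bin_means(toy_income, toy_rates, toy_weights, n_bins=2)
print(bin_income, bin_rate)
```

In the toy data, the first bin's weighted mean rate (0.25) differs from its unweighted mean (0.2), which is the kind of gap that could make an unweighted scatter plot misleading about what the estimator is targeting.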

jdebacker · 2024-05-17T22:27:38Z

I've started looking into the estimation of the tax functions. Some questions I have:

Does the numerical optimization method used to minimize the nonlinear least squares objective matter for our estimates? In particular, as seen above, it is not uncommon to look at the fitted functions and see clear room for a different parameterization to fit better than what the optimizer returns.
Are we using good starting values in our optimization? Do "better" starting values help reduce variation across ages?
Is it particularly difficult to estimate MTRs, since they display much more variation than ETRs? If so, is it better to infer the MTRs from the ETRs? But then, how much of the variation in the data are we missing?
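
One way to probe the starting-value question is a multi-start check: run the same optimizer from several random initial points and look at the spread of the resulting objective values and parameters. The sketch below is an illustration under stated assumptions, not the routine in txfunc.py. The function `gs_etr` is an illustrative Gouveia-Strauss-style curve whose parameterization may differ from the one OG-USA uses, `wsse` and `multistart` are hypothetical helpers, and the data are synthetic.

```python
import numpy as np
from scipy.optimize import minimize


def gs_etr(income, phi0, phi1, phi2):
    """Illustrative Gouveia-Strauss-style average tax rate (assumed form)."""
    return phi0 - phi0 * (phi1 * income**phi2 + 1.0) ** (-1.0 / phi2)


def wsse(params, income, rates, weights):
    """Weighted sum of squared errors between fitted and observed rates."""
    resid = gs_etr(income, *params) - rates
    return np.sum(weights * resid**2)


def multistart(objective, bounds, args, n_starts=10, seed=0):
    """Minimize from several random starts; return (value, params) sorted by value."""
    rng = np.random.default_rng(seed)
    lower, upper = np.array(bounds).T
    fits = []
    for _ in range(n_starts):
        x0 = rng.uniform(lower, upper)  # random start inside the bounds
        res = minimize(objective, x0, args=args, method="L-BFGS-B", bounds=bounds)
        fits.append((float(res.fun), res.x))
    fits.sort(key=lambda fit: fit[0])
    return fits


# Synthetic data (made up): income in arbitrary units, noisy rates
rng = np.random.default_rng(42)
income = rng.lognormal(mean=0.0, sigma=1.0, size=1000)
rates = gs_etr(income, 0.35, 0.5, 0.8) + rng.normal(0.0, 0.03, size=income.size)
weights = rng.uniform(0.5, 1.5, size=income.size)

bounds = [(0.01, 1.0), (0.01, 10.0), (0.05, 5.0)]
fits = multistart(wsse, bounds, args=(income, rates, weights), n_starts=8)
for value, params in fits:
    print(round(value, 4), np.round(params, 3))
```

If the objective values are nearly identical across starts but the parameter vectors differ a lot, that would point to weak identification rather than optimizer failure.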

jdebacker · 2024-05-17T22:55:32Z

Re the method of numerical optimization, I'm seeing significant differences across the numerical algorithms used to minimize the nonlinear least squares objective. Here are the tax functions for each age, estimated using a few different algorithms:

DEP functional form:

L-BFGS-B method

CPS data

PUF data

SLSQP method

CPS data

PUF data

Nelder-Mead method

CPS data

PUF data

GS functional form:

L-BFGS-B method

CPS data

PUF data

SLSQP

CPS data

PUF data

Nelder-Mead

CPS data

PUF data

Summary:

Quite a bit of variation across datasets (CPS vs. PUF) and across ages. Both suggest that the estimates are very sensitive to the underlying data, because there shouldn't be that much variation in tax rates across the two data files (but we can confirm this).
The variation across the methods used to minimize the statistical objective function also suggests that the parameter estimates are very sensitive to initial values and algorithms, and are therefore probably not precisely estimated.
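
For anyone who wants to reproduce this kind of comparison outside of OG-USA, here is a minimal sketch that fits one curve to one synthetic dataset with the three algorithms named above, from the same starting point. It is not the code that generated the plots. The `gs_etr` form is an illustrative Gouveia-Strauss-style curve (the parameterization in txfunc.py may differ), `wsse` is a hypothetical objective, and the data are made up. Passing `bounds` to Nelder-Mead assumes SciPy 1.7 or later.

```python
import numpy as np
from scipy.optimize import minimize


def gs_etr(income, phi0, phi1, phi2):
    """Illustrative Gouveia-Strauss-style average tax rate (assumed form)."""
    return phi0 - phi0 * (phi1 * income**phi2 + 1.0) ** (-1.0 / phi2)


def wsse(params, income, rates, weights):
    """Weighted sum of squared errors between fitted and observed rates."""
    resid = gs_etr(income, *params) - rates
    return np.sum(weights * resid**2)


# Synthetic data (made up): income in arbitrary units, noisy rates
rng = np.random.default_rng(0)
income = rng.lognormal(mean=0.0, sigma=1.0, size=2000)
rates = gs_etr(income, 0.35, 0.5, 0.8) + rng.normal(0.0, 0.03, size=income.size)
weights = rng.uniform(0.5, 1.5, size=income.size)

bounds = [(0.01, 1.0), (0.01, 10.0), (0.05, 5.0)]
x0 = np.array([0.2, 1.0, 1.0])  # same starting point for every method
start_value = wsse(x0, income, rates, weights)

results = {}
for method in ("L-BFGS-B", "SLSQP", "Nelder-Mead"):
    res = minimize(
        wsse, x0, args=(income, rates, weights), method=method, bounds=bounds
    )
    results[method] = res
    print(method, np.round(res.x, 3), round(float(res.fun), 4), res.success)
```

Comparing `res.fun` across methods separates two cases: different parameters with similar objective values (a flat objective) versus one method stopping at a clearly worse value (a convergence problem).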

jdebacker · 2024-05-18T01:00:30Z

ETR function estimation

The above plots are of MTRs on labor income. ETRs seem to be more consistently estimated:

DEP

CPS

PUF

GS

CPS

PUF

GS, Nelder-Mead, PUF
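
Given that the ETR fits look more stable, the earlier question of inferring MTRs from ETRs can be made concrete. With a single income variable I, total tax is T(I) = ETR(I) * I, so the implied marginal rate is MTR(I) = dT/dI = ETR(I) + I * dETR/dI. The sketch below checks this numerically. It assumes a single income variable (the OG-USA tax functions take both labor and capital income, where the MTR on labor income would be the partial derivative with respect to labor income), and it uses an illustrative Gouveia-Strauss-style `gs_etr` that may not match the parameterization in txfunc.py. The helper `mtr_from_etr` and the parameter values are hypothetical.

```python
import numpy as np


def gs_etr(income, phi0, phi1, phi2):
    """Illustrative Gouveia-Strauss-style average tax rate (assumed form)."""
    return phi0 - phi0 * (phi1 * income**phi2 + 1.0) ** (-1.0 / phi2)


def mtr_from_etr(etr_func, income, params, rel_step=1e-6):
    """MTR implied by an ETR function: central difference of T(I) = ETR(I) * I."""
    income = np.asarray(income, dtype=float)
    step = rel_step * np.maximum(income, 1.0)
    tax_up = etr_func(income + step, *params) * (income + step)
    tax_dn = etr_func(income - step, *params) * (income - step)
    return (tax_up - tax_dn) / (2.0 * step)


params = (0.35, 0.5, 0.8)  # made-up parameter values
income = np.array([0.5, 1.0, 2.0, 5.0])  # arbitrary units

etr = gs_etr(income, *params)
mtr = mtr_from_etr(gs_etr, income, params)

# Closed-form derivative of ETR(I) * I for this particular functional form
phi0, phi1, phi2 = params
mtr_exact = phi0 - phi0 * (phi1 * income**phi2 + 1.0) ** (-(1.0 + phi2) / phi2)

print(np.round(etr, 4))
print(np.round(mtr, 4))
print(np.max(np.abs(mtr - mtr_exact)))
```

MTRs inferred this way inherit the smoothness of the ETR function, so they cannot capture variation in the MTR data that the ETR functional form rules out. That is the trade-off raised in the question above.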

jdebacker mentioned this issue May 16, 2024

Updates to example scripts #111

Merged
