[WIP] add test implementation of carrillo and rosenbaum #208

Open · wants to merge 1 commit into main
Conversation

@ljwolf (Member) commented May 31, 2022

This is a test implementation of the Carrillo and Rosenbaum (2016) counterfactual spatial distribution estimator. I'm not sure of the precise mathematical relationship between this and Cortes et al. (2021)'s CDF estimator...

since this is general-purpose resampling, should it live in esda? I suppose that the cdf counterfactualizer is similarly generic... @renanxcortes @knaaptime @sjsrey would you rather this kind of thing live in segregation, too?

@ljwolf ljwolf changed the title add test implementation of carrillo and rosenbaum counterfactual esti… [WIP] add test implementation of carrillo and rosenbaum counterfactual esti… May 31, 2022
@ljwolf ljwolf changed the title [WIP] add test implementation of carrillo and rosenbaum counterfactual esti… [WIP] add test implementation of carrillo and rosenbaum May 31, 2022
    self.actual_ = y
    self.counterfactual_ = self.tau_ * self.actual_
@ljwolf (Member Author) commented:

I'm not yet certain that this does the "right" thing. C&R say you need to apply tau against P(y1, y2 | tau), and the empirical distribution of that is actual_ (y). But I'm not sure where the kernel density re-weighting needs to come in; I need to continue working on it.
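For reference, here is a minimal numpy sketch of the odds-ratio re-weighting step as I understand it from this thread. The names (`p_score`, `tau`, `cf_mean`) and the marginal-share correction are my own assumptions for illustration, not the PR's API, and the propensity scores are placeholders:

```python
import numpy as np

rng = np.random.default_rng(0)

# synthetic two-period data: y is the outcome, t marks period 1
n = 1000
t = rng.integers(0, 2, size=n)
y = rng.normal(loc=t, scale=1.0, size=n)

# hypothetical propensity scores P(t=1 | x_i); drawn at random here,
# since the point is only the re-weighting step, not the model
p_score = rng.uniform(0.2, 0.8, size=n)

# odds-ratio weight: tau_i = [p_i / (1 - p_i)] * [(1 - pi) / pi],
# where pi is the marginal share of period-1 observations
pi = t.mean()
tau = (p_score / (1 - p_score)) * ((1 - pi) / pi)

# re-weighted ("counterfactual") mean of the period-0 outcomes
mask = t == 0
cf_mean = np.average(y[mask], weights=tau[mask])
```

The open question above is whether `tau` should multiply the observations themselves (as in the diff) or enter as weights in a kernel density estimate of the counterfactual distribution, as sketched here.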

@knaaptime (Member) commented:

hm, maybe the opposite, actually. I think of esda as the second-most central layer in the pysal stack, so it might be preferable to move some of the counterfactual generators from segregation over here instead. I rewrote them all for parallelization maybe a year ago, so they all live in one spot; that would make them easy to port into esda if they're useful elsewhere in the ecosystem.

@renanxcortes commented May 31, 2022

Hi @ljwolf and @knaaptime, long time no see! I remember this was one of the first things I developed when I arrived at the CGS; I built an R framework with a plotly implementation of Carrillo and Rosenbaum (2016). The main hint was that the binary dependent-variable model for the propensity score matching should "separate" the groups well; that is why the logistic regression used has some non-linear terms (the authors explained this to me by e-mail). I'll look through my historical files and share them with you, ok?

Did you receive the e-mail with the files attached (MATLAB and R code), @ljwolf and @knaaptime?

@renanxcortes commented May 31, 2022

I'm not sure of the precise mathematical relationship between this and Cortes et al. (2021)'s cdf estimator...

So, technically, I believe they are quite different, since their approach relies on matching using covariates, whereas our approach is not modeled with covariates...

@ljwolf (Member Author) commented Jun 1, 2022

long time no see!

Yes! good to see you (virtually) 😄

Did you receive the e-mail?

Yes! Thank you very much, @renanxcortes!! That's super helpful.

binary dependent model for the psm should "separate" well the groups

Yes, this definitely makes sense, because the "power" of the method is based on that odds-ratio weight, tau. In theory (and in this implementation), you could use any estimator that provides predicted probabilities for observation i to be in time t, given its traits x_i (the method could use trees, XGBoost, ANNs, etc.).
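To sketch that plug-in idea: any model exposing a scikit-learn-style `predict_proba` could supply the propensity scores that feed tau. The toy logistic classifier and the helper `odds_weights` below are stand-ins of my own, not the PR's code:

```python
import numpy as np

def odds_weights(prob_t1, share_t1):
    """Odds-ratio weights tau_i from predicted P(t=1 | x_i)."""
    return (prob_t1 / (1 - prob_t1)) * ((1 - share_t1) / share_t1)

class ToyLogit:
    """Stand-in for any classifier exposing predict_proba."""
    def fit(self, X, t, steps=500, lr=0.1):
        X1 = np.column_stack([np.ones(len(X)), X])
        self.beta = np.zeros(X1.shape[1])
        for _ in range(steps):
            p = 1 / (1 + np.exp(-X1 @ self.beta))
            # gradient ascent on the logistic log-likelihood
            self.beta += lr * X1.T @ (t - p) / len(X)
        return self

    def predict_proba(self, X):
        X1 = np.column_stack([np.ones(len(X)), X])
        p = 1 / (1 + np.exp(-X1 @ self.beta))
        return np.column_stack([1 - p, p])

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 2))
t = (rng.uniform(size=500) < 1 / (1 + np.exp(-X[:, 0]))).astype(int)

model = ToyLogit().fit(X, t)   # could be XGBoost, a tree, an ANN, ...
p1 = model.predict_proba(X)[:, 1]
tau = odds_weights(np.clip(p1, 1e-6, 1 - 1e-6), t.mean())
```

Anything with the same two-column `predict_proba` contract could be dropped in for `ToyLogit` without touching the re-weighting step.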

I believe they are quite different...

I know it looks that way on the surface. But the C&R approach can simply "ignore" X and still construct tau and re-weight... so I wonder whether there may be a way to relate them formally, via the "pooled" ECDF, in the case of no exogenous information.
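One way to see that no-covariate limit (my own illustration, not from either paper): with no X, the fitted propensity for every observation is just the marginal share pi, so the odds-ratio weight collapses to 1 and the "counterfactual" distribution is just the empirical one:

```python
import numpy as np

rng = np.random.default_rng(2)
y = rng.normal(size=200)
t = rng.integers(0, 2, size=200)

pi = t.mean()    # with no covariates, P(t=1 | x_i) = pi for all i
tau = (pi / (1 - pi)) * ((1 - pi) / pi)   # the odds ratios cancel to 1

# with constant weights, the re-weighted mean is just the plain mean
weights = np.full((t == 0).sum(), tau)
cf_mean = np.average(y[t == 0], weights=weights)
assert np.isclose(cf_mean, y[t == 0].mean())
```

So in the no-information case the C&R estimator seems to reduce to comparing raw empirical distributions, which is where a formal link to a pooled-ECDF construction might live.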

Regardless, I'll adapt and extend the R script you sent along to correct this test implementation! And I agree with @knaaptime that it makes sense to have them in the same place if there's more than one... I agree esda makes sense, but I really don't mind wherever these end up!

@renanxcortes

Awesome!
