States division #24

gnopik · 2024-02-16T08:42:45Z

The default procedure does not return states with an equal number of observations. A screenshot (tested in the dashboard) and the data are attached.

case1_data.csv

tupui · 2024-03-12T16:40:44Z

I actually know what is happening: NaN...

If I load the dataset, run the decomposition, and fill the NaNs in the bins, I get an equal count for all scenarios.

I need to dig more to understand why we have NaNs. I don't remember the details there.
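For context, here is a minimal sketch of one documented way NaNs can appear with `binned_statistic_dd`: with `statistic="mean"`, a bin that receives no observation is reported as NaN. Whether this is what happens with the attached data is not confirmed in this thread; the toy inputs below are made up for illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Two strongly correlated toy inputs (assumption, not the attached data):
# the off-diagonal cells of the 2-D grid receive no observations.
x1 = rng.normal(size=200)
x2 = x1 + 0.01 * rng.normal(size=200)
sample = np.column_stack([x1, x2])
y = x1 + x2

# Default equal-width bins, 4 per axis.
mean_stat, _, _ = stats.binned_statistic_dd(sample, y, statistic="mean", bins=4)
count_stat, _, _ = stats.binned_statistic_dd(sample, y, statistic="count", bins=4)

# Cells with zero observations have an undefined mean, reported as NaN.
empty = count_stat == 0
print("empty cells:", int(empty.sum()))
print("all empty cells are NaN:", bool(np.isnan(mean_stat[empty]).all()))
```

Filling those NaNs only hides the empty cells; it does not change how the observations are distributed over the bins.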

I have the feeling `binned_statistic_dd` is not doing exactly what I think it does 🤔 I know, for a SciPy maintainer... 😅

Maybe I need to compute the bin edges for each axis beforehand instead. That way I can be sure the binning is based on the number of samples and not on the values. Need to check that hypothesis 😮‍💨
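A minimal sketch of that hypothesis, assuming quantile-based edges per axis are what is wanted. The toy data and `n_states` are placeholders, and this is not necessarily what the eventual fix does.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
sample = rng.lognormal(size=(1000, 2))  # skewed toy inputs, not the attached data
n_states = 4  # hypothetical number of states per input

# Edges taken at evenly spaced quantiles, so each state along an axis
# holds (almost) the same number of samples, whatever the value spacing.
probs = np.linspace(0.0, 1.0, n_states + 1)
edges = [np.quantile(sample[:, j], probs) for j in range(sample.shape[1])]

# Explicit edges are passed as a sequence of arrays, one per dimension.
count, _, _ = stats.binned_statistic_dd(
    sample, np.ones(len(sample)), statistic="count", bins=edges
)

print(count.sum(axis=1))  # observations per state along the first input
print(count.sum(axis=0))  # observations per state along the second input
```

Note that this only equalizes the counts per axis. The joint cells of the grid can still hold different numbers of observations, or none at all, when the inputs are correlated.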

gnopik · 2024-03-13T10:13:35Z

NaNs in bins - what do you mean, like this?

This is the way to communicate that we want particular boundaries between states (== bins), in this case just for the second and third input variables out of four.
If the boundaries are not supplied at all, the state boundaries are defined automatically (at least in the MATLAB package):

- by categories, if the variable has 5 or fewer unique values, or
- by an equal number of observations per state (highlighted)
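The rule above could be sketched per input variable as follows. This is one reading of the description, not the MATLAB package's code: `state_edges`, `n_states`, and `max_categories` are hypothetical names, and placing categorical edges at midpoints between unique values is an assumption.

```python
import numpy as np


def state_edges(column, n_states=3, max_categories=5):
    """Hypothetical helper: bin edges for one input variable."""
    uniques = np.unique(column)
    if uniques.size <= max_categories:
        # Categorical case: one state per unique value, with edges at the
        # midpoints between consecutive values (assumed convention).
        mids = (uniques[:-1] + uniques[1:]) / 2
        return np.concatenate(([uniques[0]], mids, [uniques[-1]]))
    # Continuous case: edges at evenly spaced quantiles (equal-count states).
    return np.quantile(column, np.linspace(0.0, 1.0, n_states + 1))


rng = np.random.default_rng(0)
categorical = np.tile([0.0, 1.0, 2.0], 100)  # 3 unique values
continuous = rng.normal(size=300)

cat_edges = state_edges(categorical)
cont_edges = state_edges(continuous, n_states=3)

cat_counts, _ = np.histogram(categorical, bins=cat_edges)
cont_counts, _ = np.histogram(continuous, bins=cont_edges)
print(cat_edges, cat_counts)
print(cont_counts)
```

User-supplied boundaries for selected variables would simply replace the output of this helper for those variables.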

tupui · 2024-03-13T11:33:27Z

Yep, we can provide bounds for the bins. I just thought that was the normal behavior. I have to check that in SciPy's code and do some poking around.

So worst case, I can do as you do and construct my own bounds; it's not hard 👍

tupui · 2024-03-13T11:39:26Z

For the NaNs I don't remember why we have them, need to check as well.

tupui · 2024-05-18T14:57:46Z

Should be fixed in a81bf18

gnopik · 2024-05-20T08:57:40Z

> For the NaNs I don't remember why we have them, need to check as well.

Easier to discuss over a call.

tupui added the bug label (Something isn't working) Feb 16, 2024

tupui closed this as completed May 18, 2024
