
How to map correlations to Stokes parameters without looping the correlations? #213

Open
miguelcarcamov opened this issue Apr 30, 2022 · 4 comments

Comments

@miguelcarcamov

  • dask-ms version: 0.2.6
  • Python version: 3.9.7
  • Operating System: Manjaro

Hello, I am trying to compute the PSF/dirty beam analytically with dask-ms, and also to grid the data. However, to compute the PSF or the dirty image analytically for each Stokes parameter, I have had to loop over the correlations in order to match each correlation's data to its Stokes parameter. This extra loop, inside the loop over the list of sub-MSs, makes the code considerably slower. Has anyone found a way to map the correlations to each Stokes parameter without looping over the correlations? Is there a way to do this with dask?

If what I just wrote above does not make sense to you, please ask :).
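For context, the correlation-to-Stokes mapping is a fixed linear transform over the correlation axis, so in principle the per-correlation loop can be replaced by one matrix product. A minimal sketch, assuming linear feeds and the convention I = (XX+YY)/2, Q = (XX−YY)/2, U = (XY+YX)/2, V = −i(XY−YX)/2 (the matrix `L` and array names here are illustrative, not dask-ms API):

```python
import numpy as np

# Rows of L are [I, Q, U, V]; columns are [XX, XY, YX, YY].
# Assumed linear-feed convention; adjust signs/factors to your definition.
L = 0.5 * np.array([
    [1,   0,   0,  1],   # I = (XX + YY)/2
    [1,   0,   0, -1],   # Q = (XX - YY)/2
    [0,   1,   1,  0],   # U = (XY + YX)/2
    [0, -1j,  1j,  0],   # V = -i(XY - YX)/2
], dtype=np.complex128)

# vis has shape (..., 4) with the correlation axis last.
vis = np.random.default_rng(0).normal(size=(10, 4)) + 0j

# One matmul over the correlation axis replaces the loop.
stokes = vis @ L.T   # shape (10, 4): [I, Q, U, V] per row
```

The same expression works unchanged on a dask array, where it only builds the task graph rather than computing anything.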

@JSKenyon
Collaborator

JSKenyon commented May 3, 2022

Hi @miguelcarcamov! Apologies for the delay - I was on vacation. I am not entirely sure what you are trying to accomplish. Could you possibly provide more details/a code snippet?

Are you trying to map [XX, XY, YX, YY] to [I, Q, U, V] on the xarray datasets?

@miguelcarcamov
Author

Hi @JSKenyon, well, it depends on the feed, really. If you look at this code, which builds a dirty map from the data using dask, you can see that at line 120 I loop over the correlations in order to map them to I, Q, U, V depending on the feed. In the code, gridded_data and gridded_weights have shape (m, n, ncorrs), and summing them into the I, Q, U, V uv-grids for a given feed costs me a loop over all the correlations for each one of the sub-MSs. I want to get rid of that for loop, but I'm not yet sure how. This might be hard to follow, but let me know if you have questions :)
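One way to drop that loop (a sketch, not the actual code from the repository): precompute a (nstokes, ncorrs) conversion matrix for the feed in question, then contract it against the correlation axis of the whole grid with a single einsum. dask's `da.einsum` accepts the same subscripts, so the lazy version is a drop-in. The matrix below assumes linear feeds with I = (XX+YY)/2, etc.; `corr_to_stokes` is a hypothetical name.

```python
import numpy as np

# Assumed (nstokes, ncorrs) conversion matrix for linear feeds;
# chosen per feed type (linear/circular) outside this snippet.
corr_to_stokes = 0.5 * np.array([
    [1,   0,   0,  1],   # I
    [1,   0,   0, -1],   # Q
    [0,   1,   1,  0],   # U
    [0, -1j,  1j,  0],   # V
])

m, n, ncorrs = 8, 8, 4
gridded_data = np.ones((m, n, ncorrs), dtype=np.complex128)

# One contraction over the correlation axis replaces the per-correlation
# loop; with dask arrays, da.einsum builds the same contraction lazily.
stokes_grids = np.einsum("sc,mnc->mns", corr_to_stokes, gridded_data)
```

The same pattern applies to gridded_weights, and the matrix can live next to the feed-detection logic so nothing is branched per correlation.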

@JSKenyon
Collaborator

JSKenyon commented May 9, 2022

If you are doing all your operations on dask arrays, I am not sure why the loop itself would be slow (unless you have a huge number of datasets). You can likely simplify the code by just having a mapping stored somewhere so that you don't have to check so many conditions.
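A minimal sketch of such a stored mapping, with illustrative Stokes conventions (`STOKES_MAPS` and `to_stokes` are hypothetical names, not dask-ms API; correlation order is assumed [XX, XY, YX, YY] for linear and [RR, RL, LR, LL] for circular):

```python
import numpy as np

# One conversion matrix per feed type, instead of branching on
# many conditions; check the conventions against your own definitions.
STOKES_MAPS = {
    "linear": 0.5 * np.array([
        [1,   0,   0,  1],   # I = (XX + YY)/2
        [1,   0,   0, -1],   # Q = (XX - YY)/2
        [0,   1,   1,  0],   # U = (XY + YX)/2
        [0, -1j,  1j,  0],   # V = -i(XY - YX)/2
    ]),
    "circular": 0.5 * np.array([
        [1,   0,   0,  1],   # I = (RR + LL)/2
        [0,   1,   1,  0],   # Q = (RL + LR)/2
        [0, -1j,  1j,  0],   # U = -i(RL - LR)/2
        [1,   0,   0, -1],   # V = (RR - LL)/2
    ]),
}

def to_stokes(vis, feed="linear"):
    """Map a (..., 4) correlation array to (..., 4) Stokes values."""
    return np.einsum("sc,...c->...s", STOKES_MAPS[feed], vis)
```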

If your code is pure dask, the loop over correlations isn't doing any real work - it is just setting up a graph. If, however, your arrays have already been reified to NumPy at that point, I can imagine it being slow.

Would you be willing to run line_profiler on the function? That may make it a bit clearer to me.
