High Content Screening #630

s-n-i · 2022-08-19T19:08:04Z

Background

I am opening a draft PR to see if there is interest in having viv support loading multiple datasets for high content screening. This is accomplished by passing in an array of loaders and information on how to position the images next to each other into the PictureInPictureViewer. This change is backwards-compatible, so Avivator works as before. I marked the PR as draft because it is not yet ready to be merged, and I am looking for feedback. It would be great to have this functionality merged into viv, rather than maintaining a fork of viv.

Change List

Support for multiple loaders and position data

Checklist

[✓] Update JSdoc types if there is any API change.
[✓] Make sure Avivator works as expected with your change.

…aders if no loader has been passed in

ilan-gold · 2022-08-23T15:54:42Z

@s-n-i As a start, I would want a layer that does this, not a viewer. @manzt has expressed opposition to having this in the core of viv, and I completely understand his point, so I to would lean towards "no" (see #287) EDIT: maybe he hasn't? I seem to remember agreeing that the core of Viv was not meant for this but maybe not. ~~Laying out MultiscaleImageLayers how one would like probably should not constitute a core contribution.~~

That being said, how long does the above screenshot take to load? If you had some contribution that allowed super fast loading or interaction (both of which have been challenges in the past from what I remember), I think then we might be more interested in that since others (i.e vizarr) would benefit from this too and so having this committed would be broadly helpful.

s-n-i · 2022-08-24T02:53:09Z

@ilan-gold I tested how long loading 16 plates takes. The application starts with only one plate in the viewport. It takes about 3 seconds to load it. It takes 2 seconds to zoom out and load the remaining 15 plates. Only plates that are within the viewport get loaded. I am not serving the datasets locally, so network speed has an effect.

With this implementation, when loading 16 plates, I am not noticing any rendering performance problems. When displaying around 100 plates, the performance is noticeably slower, but still usable.

Our approach to fast loading and interaction with hundreds of plates involves generating small top levels for these pyramids, with the highest level being 1 pixel in size, discussed in #620.

ilan-gold · 2022-08-30T16:34:17Z

Relatedly @s-n-i - why are you maintaining a fork? Just curious, are you using Avivator as your viewer? @manzt maybe it would be worth releasing Avivator as a package too i.e a giant React component? And allow people to pass in a high level prop for layers or something? Maybe we need more use-cases for this...

manzt · 2022-08-30T17:34:57Z

maybe it would be worth releasing Avivator as a package too i.e a giant React component?

I think this is beyond the scope of Viv. The React components are already intended to be extensible, and this would open up lots of additional development burden with having to maintain and document an additional API.

ilan-gold · 2022-08-31T17:38:00Z

So @s-n-i I am not sure how @manzt feels, but a very generic GridLayer could be a nice addition. I think generally, the goal would be to pare this down to the minimum of what is needed to get your feature working. So there's a few things:

How exactly are you providing the loader URL? Is the OME-Zarr HCS spec?
How are you using/deploying your version of Avivator?
Is it possible to do what you are asking by just creating a layer that orchestrates all of this? One issue is that standardizing the relationship between loaders, layers, and layout maybe be too complex for Viv. That being said, we do have this wonderful monorepo now, so maybe we could make a experimental-layers package @manzt? Something where community members could iterate on things until they become stable, so perhaps moving vizarr's implementation of a GridLayer there too?
Relatedly, @s-n-i, why not use vizarr for showing HCS?

Thanks @s-n-i !

s-n-i · 2022-09-06T17:44:12Z

I am maintaining a viv fork because it appeared to be the most efficient way to implement the functionality to load multiple plates. Please let me know, if an alternative approach could be better.

I am using a modified version of Avivator as the viewer, so I would have a simpler code base if Avivator were to be added to the viv library on npm. Actually, it does get packaged into the viv library locally if I run pnpm run build && pnpm pack.

I have an array of URLs and I call Avivator's createLoader function for each of them. This creates an array of loaders, which I pass into the modified PictureInPictureViewer.
I have a custom modification of Avivator consume our fork of the viv library.
Yes, creating a new layer would work as well. This might even improve performance, if the layer has logic to only load datasets that are within the viewport.
I am not very familiar withvizarr. My understanding is that it runs inside of a Jupyter Notebook and we would like to have a web application.

ilan-gold · 2022-09-06T18:04:33Z

@s-n-i Let's see what @manzt has to say about a layer. I'm not opposed since it seems like a common use-case, even outside of HCS. There are also loads of nice performance improvements, I imagine, to be made that everyone would benefit from.

That being said, here is vizarr in a web app. This previous example is just one image but here are some HCS examples

If vizarr does not work for you, I'm not sure what we can do. I don't think we're in a position right now to maintain an Avivator API one could customize (nor am I sure this is really something we want to do given the complexity of it being a full page web application).

What parts of Avivator have you changed? If it is just a few lines, perhaps exporting just the controller would be a nice middle ground (if that is not changing for you), although this too is a bit fraught because of its shared state with the actual viewer.

s-n-i · 2022-09-06T18:46:06Z

Thank you for sharing the vizarr web app, I can see it potentially being suitable for our goals. We would just need to evaluate it in more detail.

The motivation behind our approach is allowing the user to perform high-content screening by visualizing a grid of multiple datasets, which are not in the HCS format. This simplifies the data pipeline because data scientists would not need to convert existing datasets into HCS format.

In Avivator I have changed Controller.jsx, hooks.js, and Viewer.jsx. My approach does not require any changes to Avivator. Exporting it "as is" from the viv library would be sufficient.

Also, I am not 100% sure about this, but it looks like creating a custom layer would also require modifying the PictureInPictureViewer, as well as other viewers, so that they can load this new layer.

ilan-gold · 2022-09-10T11:38:08Z

@s-n-i If you have altered those files, what would an Avivator component API look like should we release it? I guess one route I could see here that might make everyone happy:

Implement a general purpose grid layer and update upstream API's to allow for it
Allow users to pass in comma-separated lists of URLS to Avivator
Optionally: If the images have different channel lists, make the Avivator controller flexible enough to handle this.

The main thing I am worried about is how the controller would work - if all these images have different channels for example, you would need different controllers for each? How does this work? All that being said, I would feel comfortable with the above three changes. It will be a bit of work and we would want to get the API just right here, but this seems feasible.

s-n-i · 2022-09-10T20:48:54Z

Here is how I have it currently implemented.

<PictureInPictureViewer
      gridLoaders={{
        loaders,
        spacingX: 12345,
        spacingY: 12345,
        numberOfColumns: 12
      }}

the loaders are set like this:

Promise.all(urls.map((url) => createLoader(url))).then((values) => setLoaders(values.map((value) => value.data)));

For the channels we have a few options:

Only allow datasets with the same channels and display an error when loading datasets with different channels.
Show a slider for each unique channel. Moving this slider affects all datasets that have this channel.
Show sliders for for the currently selected dataset.

manzt · 2022-09-12T19:32:35Z

Catching up on this thread.

I think I would be supportive of iterating on a generalized gridlayer. We could use the vizarr implementation as a reference point as well as what you have been working on @s-n-i.

I am not very familiar with vizarr. My understanding is that it runs inside of a Jupyter Notebook and we would like to have a web application.

To clarify, vizarr is a general web-based image viewer for zarr-based images. It is intended to be used as a standalone web-app like Avivator (see embedded use in the OME Blog), and additionally has optional features for running within Jupyter Notebooks (and loading multiple image layers). Vizarr's key feature is support for OME-NGFF metadata. For example, rather than defining non-standard patterns for loading multiple images (i.e., comma separated URLs), plate layouts are expressed within the metadata. This allows Vizarr to be compatible with other NGFF-compatible viewers.

Allow users to pass in comma-separated lists of URLS to Avivator

hmm, with the multi-tiff loader comma-separated lists of URLS already have "special" meaning in Avivator. Why not have a new route for Avivator (and actually make use of the BrowserRouter) (i.e., https://avivator.gehlenborglab.org/grid?image_urls=)

s-n-i · 2022-09-12T19:45:59Z

I looked up the length limits on URLs in different browsers:

https://www.geeksforgeeks.org/maximum-length-of-a-url-in-different-browsers/

Looks like Microsoft Edge might not be able to fit URLs for hundreds of different datasets in the address bar.

ilan-gold · 2022-09-13T07:44:26Z

hmm, with the multi-tiff loader comma-separated lists of URLS already have "special" meaning in Avivator. Why not have a new route for Avivator (and actually make use of the BrowserRouter) (i.e., https://avivator.gehlenborglab.org/grid?image_urls=)

This is what I meant, not a(nother) file (format).

As for the "hundreds of data sets" issue, I think we'll need to think about this a bit...one option if you all have the capacity would be a URL shortener.

ilan-gold · 2022-09-13T07:48:09Z

I also think I may have misunderstood - you are considering non-HCS datasets that you wish to compare that were acquired separately and have some value when looked at side-by-side? Or you have non-HCS format HCS-acquired datasets?

manzt · 2022-09-13T14:02:34Z

As for the "hundreds of data sets" issue, I think we'll need to think about this a bit...one option if you all have the capacity would be a URL shortener.

At some point, pointing to many images URLs in the query parameters is just a poor choice. URL shortening means that the URLs are human readable, and there are likely better alternatives to expressing this information in a structured manner. Hundreds of URLs is unwieldy and the "comma separated list" is essentially a new format of its own. Also there is so much implicit information with a comma separated list of URLs.

How many rows? how many columns? are the images all the same size? In this case some type of manifest JSON file is probably most appropriate which contains all this metadata as well as links to the individual images, but this is essentially OME-NGFF plate specification and I'd rather not create any sort of manifest that is Avivator-specific.

Something like kerchunk could be used to create a "virtual" OME-NGFF plate from many TIFFs. Vizarr supports this kerchunk-reference based stores. Here is an example of reading and OME-TIFF as Zarr with a chunk reference: https://observablehq.com/@manzt/ome-tiff-as-filesystemreference

In vizarr https://hms-dbmi.github.io/vizarr/?source=https://gist.githubusercontent.com/manzt/436fc2966c484205a2c60824f659b412/raw/cdc69f2ce645d953185f10d7552501bfd459dd12/Vanderbilt-Spraggins-Kidney-MxIF.ome.tif.json&channel_axis=0

s-n-i · 2022-09-13T20:28:43Z

you are considering non-HCS datasets that you wish to compare that were acquired separately and have some value when looked at side-by-side?

Yes.

I don't fully understand the distinction between datasets "acquired separately" and "HCS-acquired".

s-n-i · 2022-09-14T17:35:00Z

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

High Content Screening #630

High Content Screening #630

s-n-i commented Aug 19, 2022 •

edited

ilan-gold commented Aug 23, 2022 •

edited

s-n-i commented Aug 24, 2022 •

edited

ilan-gold commented Aug 30, 2022 •

edited

manzt commented Aug 30, 2022 •

edited

ilan-gold commented Aug 31, 2022 •

edited

s-n-i commented Sep 6, 2022 •

edited

ilan-gold commented Sep 6, 2022

s-n-i commented Sep 6, 2022 •

edited

ilan-gold commented Sep 10, 2022 •

edited

s-n-i commented Sep 10, 2022 •

edited

manzt commented Sep 12, 2022 •

edited

s-n-i commented Sep 12, 2022

ilan-gold commented Sep 13, 2022 •

edited

ilan-gold commented Sep 13, 2022

manzt commented Sep 13, 2022 •

edited

s-n-i commented Sep 13, 2022 •

edited

s-n-i commented Sep 14, 2022 •

edited

ilan-gold commented Sep 19, 2022

s-n-i commented Sep 19, 2022 •

edited

High Content Screening #630

Are you sure you want to change the base?

High Content Screening #630

Conversation

s-n-i commented Aug 19, 2022 • edited

Background

Change List

Checklist

ilan-gold commented Aug 23, 2022 • edited

s-n-i commented Aug 24, 2022 • edited

ilan-gold commented Aug 30, 2022 • edited

manzt commented Aug 30, 2022 • edited

ilan-gold commented Aug 31, 2022 • edited

s-n-i commented Sep 6, 2022 • edited

ilan-gold commented Sep 6, 2022

s-n-i commented Sep 6, 2022 • edited

ilan-gold commented Sep 10, 2022 • edited

s-n-i commented Sep 10, 2022 • edited

manzt commented Sep 12, 2022 • edited

s-n-i commented Sep 12, 2022

ilan-gold commented Sep 13, 2022 • edited

ilan-gold commented Sep 13, 2022

manzt commented Sep 13, 2022 • edited

s-n-i commented Sep 13, 2022 • edited

s-n-i commented Sep 14, 2022 • edited

ilan-gold commented Sep 19, 2022

s-n-i commented Sep 19, 2022 • edited

s-n-i commented Aug 19, 2022 •

edited

ilan-gold commented Aug 23, 2022 •

edited

s-n-i commented Aug 24, 2022 •

edited

ilan-gold commented Aug 30, 2022 •

edited

manzt commented Aug 30, 2022 •

edited

ilan-gold commented Aug 31, 2022 •

edited

s-n-i commented Sep 6, 2022 •

edited

s-n-i commented Sep 6, 2022 •

edited

ilan-gold commented Sep 10, 2022 •

edited

s-n-i commented Sep 10, 2022 •

edited

manzt commented Sep 12, 2022 •

edited

ilan-gold commented Sep 13, 2022 •

edited

manzt commented Sep 13, 2022 •

edited

s-n-i commented Sep 13, 2022 •

edited

s-n-i commented Sep 14, 2022 •

edited

s-n-i commented Sep 19, 2022 •

edited