Real or technical artifacts due to errors detected in UMAP following run.flowsom()? #161

denvercal1234GitHub · 2023-04-27T11:12:28Z

Hi there,

Thanks for the package.

When I used run.flowsom even with meta.k=20 (automatic meta.k resolved 11 clusters), the cluster UMAP doesn't look "right" -- they clumped together on 1 side of the UMAP, and there two tiny clusters that are so separated from everyone.

Do you think these 2 clusters (6 and 20) are simply artifacts? If so, would you mind suggesting some ways to assess that?

Thank you for your help!

Related to #154.

The text was updated successfully, but these errors were encountered:

tomashhurst · 2023-06-05T09:39:35Z

Hi @denvercal1234GitHub hmm I get this on occasion -- the issue is to do with the UMAP calculations, which should be independent from FlowSOM clustering (i.e., even if you change the FlowSOM clustering, you will still get the same UMAP arrangement).

You might find that those cells on the far right are some kind of cells that are stacked on the maximum value across multiple channels, perhaps some kind of antibody aggregate bound to some cells or something similar. It would be worth doing some plotting to find out. If it is something technical like this then you could filter out those cells prior to analysis?

Tom

denvercal1234GitHub · 2023-06-05T09:42:32Z

thanks @tomashhurst ! This is very useful. Would you mind elaborating a bit more on which plotting would you be doing to investigate whether these are cells of technical artifact? These 2 populations are actually what we expect to detect in our data based on RNAseq of the same samples.

tomashhurst · 2024-05-24T15:21:27Z

@denvercal1234GitHub sorry for the delayed reply here -- essentially an nxn plot -- so CD3 vs CD4, CD3 vs CD19, CD3 vs N etc. In each, plotting the metacluster. You could also create a heatmap to look at what markers are expressed most highly using make.pheatmap.

There will likely be some group of cells that may have ultra-high expression on a set of markers, possibly as these might be some kind of aggregate. You could also plot them on tSNE of FItSNE which should squish them into the 2D plot a bit better which might allow you to see how they look compared to the remaining cells in the dataset.

I'll close this for now, but let us know how you get on.

denvercal1234GitHub mentioned this issue May 18, 2023

Advice on how to sub-cluster? #164

Open

tomashhurst closed this as completed May 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Real or technical artifacts due to errors detected in UMAP following run.flowsom()? #161

Real or technical artifacts due to errors detected in UMAP following run.flowsom()? #161

denvercal1234GitHub commented Apr 27, 2023 •

edited

tomashhurst commented Jun 5, 2023

denvercal1234GitHub commented Jun 5, 2023

tomashhurst commented May 24, 2024

Real or technical artifacts due to errors detected in UMAP following run.flowsom()? #161

Real or technical artifacts due to errors detected in UMAP following run.flowsom()? #161

Comments

denvercal1234GitHub commented Apr 27, 2023 • edited

tomashhurst commented Jun 5, 2023

denvercal1234GitHub commented Jun 5, 2023

tomashhurst commented May 24, 2024

denvercal1234GitHub commented Apr 27, 2023 •

edited