Liana on pseudobulk sc data #96
Comments
Hi @Marwansha, I assume you are referring to the DE analysis vignette. To my knowledge, the current best practice is to perform differential testing between single-cell samples at the pseudobulk level. A couple of references on the topic: https://www.nature.com/articles/s41576-023-00586-w I hope this helps.
Hi @Marwansha, Sorry, but I'm not sure I follow exactly. Can you elaborate on in what sense I run liana on pseudobulks? One could also say that the average expression per cluster is a "pseudobulk" (which is how the vast majority of CCC methods approach it). In the DE tutorial, you can think of the
Sorry if I wasn't clear again. My question is about generating the ligand-receptor interactions df: what happens if I run the rank aggregate on the pseudobulk AnnData object rather than on the single-cell object?
I ran liana on the pseudobulk-aggregated AnnData object, and the results make more sense for my data than those from the single-cell object, as they are much less noisy. However, I was not sure whether this has been tested before, or which of the two is best practice. Thanks
Hi @Marwansha, Sorry for the delay, I was away. Hmm. This is a really interesting approach, though not a standard one. It only changes the interpretation a bit, since instead of comparing means across cells, you are comparing means across sample pseudobulks. Just to share my intuition here, think of CellPhoneDB. You get a mean between the averaged ligand and receptor expression per cluster ( In short, at a glance, I like it as an idea, and it can make sense depending on your data. You are also avoiding over-inflated permuted p-values due to pseudoreplication. :)
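To make the "means across cells" versus "means across sample pseudobulks" distinction concrete, here is a minimal NumPy sketch. It does not use the liana API. The toy values, the sample names and the helper functions (`lr_mean`, `pool_cells`, `per_sample_profile`) are all hypothetical, and each pseudobulk is simplified to a per-sample mean, whereas in practice pseudobulks are often built by summing counts and then normalizing.

```python
import numpy as np

# Hypothetical toy data: expression of one ligand (in the source cluster) and
# one receptor (in the target cluster), split by sample. Sample sizes differ.
ligand_by_sample = {
    "s1": np.array([1.0, 1.0, 1.0, 1.0]),  # 4 cells
    "s2": np.array([3.0, 3.0]),            # 2 cells
    "s3": np.array([5.0]),                 # 1 cell
}
receptor_by_sample = {
    "s1": np.array([2.0, 2.0, 2.0, 2.0]),
    "s2": np.array([2.0, 2.0]),
    "s3": np.array([2.0]),
}


def lr_mean(ligand_values, receptor_values):
    """CellPhoneDB-style magnitude: mean of the averaged ligand and receptor."""
    return (np.mean(ligand_values) + np.mean(receptor_values)) / 2


def pool_cells(by_sample):
    """Cell-level view: every cell is one observation, regardless of sample."""
    return np.concatenate(list(by_sample.values()))


def per_sample_profile(by_sample):
    """Pseudobulk-level view (simplified): one averaged value per sample."""
    return np.array([values.mean() for values in by_sample.values()])


# Means across cells: samples with more cells dominate the average.
lr_cells = lr_mean(pool_cells(ligand_by_sample), pool_cells(receptor_by_sample))

# Means across sample pseudobulks: every sample contributes equally.
lr_pseudobulk = lr_mean(
    per_sample_profile(ligand_by_sample),
    per_sample_profile(receptor_by_sample),
)

print(f"cell-level LR mean:       {lr_cells:.4f}")
print(f"pseudobulk-level LR mean: {lr_pseudobulk:.4f}")
```

In this toy case the two scores differ only because the samples contribute different numbers of cells. The same change of observation unit also means that a permutation test would shuffle a handful of samples rather than many correlated cells, which is where the pseudoreplication point comes from.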
PS. A major motivation of mine when writing liana-py was to make it flexible, so I'm glad to see it used in ways beyond the tutorials.
Thank you very much for your response. In fact, I am trying to benchmark and compare the results that come from computing CCC (cell-cell communication) on single-cell versus pseudobulk objects. From my perspective, and from a ground-truth point of view (considering some ligand-receptor interactions that exist in one group and not in the other, which I know from the literature and previous work), using the pseudobulk data seems to make the results cleaner and easier to discern. Would you be interested in having a short meeting? Maybe I can show you my data (I can share it too), so I can get some insights from your point of view on which approach makes more sense. Thanks
Hey,
I want to know whether it's best practice to always use single-cell data with liana to compute the L-R CCC results. In the differential expression tutorial, I saw that after differential expression was computed on pseudobulk data, liana was run on the adata object. Is that the best practice, and is it OK to run liana on the pseudobulk data instead?
Thanks
Marwan