[Feature request] one vs. all others #168

acoteataltius · 2023-08-23T15:48:27Z

I'd like to be able to input a contrast design (or otherwise choose design factors), to do a one vs all comparison within in a particular "condition" that has more than two levels. If my column "condition" has levels A, B, C, and D, do a comparison of A vs B, C, D.

Something like these options in R deseq2:
design <- ~0 + condition
contrast = c(1, -1/3, -1/3, -1/3)
contrast=list(c("conditionA"),
c("conditionB","conditionC","conditionD"))

Would it be possible to do something where if you leave the second option blank in contrast, like:
contrast = ['condition', 'A', '']
it compares A with all other samples?

BorisMuzellec · 2023-09-01T10:04:45Z

Hi @acoteataltius, that would be a convenient feature to have indeed.

It's not available in pydeseq2 yet, but I'm adding it to our feature wishlist. I'll give it a go when I have time, but I'm also happy to help anyone opening a PR. Not sure what would be the best way to implement it from a user perspective (maybe a one_vs_all boolean argument?).

In the meantime it seems that it would be possible to obtain the same results by manually setting the contrast_vector attribute after initializing the DeseqStats object, but I'm not 100% sure about this either.

GalaMichal · 2023-12-14T11:57:58Z

Hi @BorisMuzellec I'd like to ask it is even possible to compare all vs all? Basically, treating each level of the condition factor as a separate group and not setting any of them as a reference (e.g. healthy).

Something like in R Deseq2:
design <- ~0 + condition

BorisMuzellec · 2023-12-15T08:53:28Z

@GalaMichal there is unfortunately no direct way to do this as of yet. This relates to #213.

However I think it is possible to obtain the same design matrix using pydeseq2.utils.build_design_matrix with no intercept but an expanded design, and use it in your DeseqDataSet like this:

dds = ds.DeseqDataSet(counts=counts, metadata=metadata, design_factors="condition")

# This is where you replace the design matrix
dds.obsm["design_matrix"] =  build_design_matrix(
            metadata=dds.obs,
            design_factors=dds.design_factors,
            expanded=True,
            intercept=False,
        )

# And then you should be able to carry on as usual

dds.deseq2() # etc.

Let me know if this works!

GalaMichal · 2023-12-19T12:35:13Z

@BorisMuzellec thank you for quick response. Unfortunately, it doesn't work.
dds.deseq2() is calculated but stat_res = DeseqStats(dds) shows: KeyError: 'Condition_1_vs_Condition_1

The same situation occurs when I try, for example, stat_res = DeseqStats(dds, contrast =("Conditions", "Condition_1", "Condition_2"))

'

acoteataltius changed the title ~~one vs. all others~~ [Feature request] one vs. all others Aug 23, 2023

BorisMuzellec added the enhancement New feature or request label Aug 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature request] one vs. all others #168

[Feature request] one vs. all others #168

acoteataltius commented Aug 23, 2023

BorisMuzellec commented Sep 1, 2023

GalaMichal commented Dec 14, 2023

BorisMuzellec commented Dec 15, 2023 •

edited

GalaMichal commented Dec 19, 2023

[Feature request] one vs. all others #168

[Feature request] one vs. all others #168

Comments

acoteataltius commented Aug 23, 2023

BorisMuzellec commented Sep 1, 2023

GalaMichal commented Dec 14, 2023

BorisMuzellec commented Dec 15, 2023 • edited

GalaMichal commented Dec 19, 2023

BorisMuzellec commented Dec 15, 2023 •

edited