DE section issues #212

le-ander · 2023-06-13T14:51:46Z

I'm just looking at the DE section of the best practice book and noticed two potential issues with the aggregate_and_filter() function:

The text says "For each patient we create 1 pseudobulk sample per cell type by aggregating the cell from each subpopulation and taking the mean gene expression within that subpopulation." but in the function, counts are summed rather than mean aggregated. Additionally, it might be helpful to add any required columns (eg. "donor_key", "condition_key" etc) to the 'obs_to_keep' list to prevent an error where the donor_key is not found in the donor_df

alitinet · 2023-06-14T08:11:15Z

Hi, should be fixed after we move to decoupler (#141).

le-ander added the bug Something isn't working label Jun 13, 2023

Zethson assigned alitinet Jun 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DE section issues #212

DE section issues #212

le-ander commented Jun 13, 2023

alitinet commented Jun 14, 2023

DE section issues #212

DE section issues #212

Comments

le-ander commented Jun 13, 2023

alitinet commented Jun 14, 2023