You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm just looking at the DE section of the best practice book and noticed two potential issues with the aggregate_and_filter() function:
The text says "For each patient we create 1 pseudobulk sample per cell type by aggregating the cell from each subpopulation and taking the mean gene expression within that subpopulation." but in the function, counts are summed rather than mean aggregated. Additionally, it might be helpful to add any required columns (eg. "donor_key", "condition_key" etc) to the 'obs_to_keep' list to prevent an error where the donor_key is not found in the donor_df
The text was updated successfully, but these errors were encountered:
I'm just looking at the DE section of the best practice book and noticed two potential issues with the aggregate_and_filter() function:
The text says "For each patient we create 1 pseudobulk sample per cell type by aggregating the cell from each subpopulation and taking the mean gene expression within that subpopulation." but in the function, counts are summed rather than mean aggregated. Additionally, it might be helpful to add any required columns (eg. "donor_key", "condition_key" etc) to the 'obs_to_keep' list to prevent an error where the donor_key is not found in the donor_df
The text was updated successfully, but these errors were encountered: