SuperCellCyto-analysis

This repository contains the code to reproduce all the analysis done for our paper introducing the SuperCellCyto R package: https://github.com/phipsonlab/SuperCellCyto.

SuperCellCyto is an adaptation of the SuperCell R package. Initially developed for scRNAseq data, the SuperCell package aggregates cells with similar transcriptomic profiles into "supercells" (also known as “metacells” in the scRNAseq literature).

The preprint of the paper is available on bioRxiv:

Putri, G. H., Howitt, G., Marsh-Wakefield, F., Ashhurst, T. M., & Phipson, B. (2023). SuperCellCyto: enabling efficient analysis of large scale cytometry datasets. bioRxiv; DOI: https://doi.org/10.1101/2023.08.14.553168

Contents

To reproduce all the figures in the paper, refer to the Rmd files in the analysis folder:

  • explore_supercell_purity_clustering for Supercells Preserve Biological Heterogeneity and Facilitate Efficient Cell Type Identification
  • b_cells_identification for Identifying Rare B Cells Subsets by Clustering Supercells
  • batch_correction for Mitigating Batch Effects in the Integration of Multi-Batch Cytometry Data at the Supercell Level
  • de_test for Recovery of Differentially Expressed Cell State Markers Across Stimulated and Unstimulated Human Peripheral Blood Cells
  • da_test for Identification of Differentially Abundant Rare Monocyte Subsets in Melanoma Patients
  • label_transfer for Efficient Cell Type Label Transfer Between CITEseq and Cytometry Data
  • run_time for measuring the run time of SuperCellCyto and the clustering steps used in the first three items above.

The code folder contains the scripts used to generate the results that are processed in the Rmd files in the analysis folder. Please note that some of these scripts take a long time to run; they are kept as separate R scripts so that each rebuild of the workflowr website does not take hours.

The data and output folders are meant for storing the raw data and the processed data generated by the scripts in the code folder, respectively. The contents of these folders are deliberately not committed to the repository because they are very large (over 40 GB in total). If you would like to reproduce our analysis, please download the contents of the data and output folders from Zenodo: DOI.

Instructions after downloading the files (an R sketch of these steps follows the list):

  1. Uncompress data_20232308.tar.gz (using tar -zxvf <filename>.tar.gz). You should get a single data folder; this is the data folder for the workflowr website.
  2. Uncompress each of the tar.gz files starting with the word output. Each file should uncompress into one folder.
  3. Create a new folder called output and place all the folders uncompressed in step 2 into it.
  4. Run wflow_build().
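
The steps above can also be scripted. Below is a minimal sketch in R, assuming the downloaded tarballs sit in the repository root and that every output tarball's file name starts with "output" (both are assumptions about your download, not guarantees from the Zenodo record):

    # Step 1: uncompress the data tarball; this should produce the data/ folder.
    untar("data_20232308.tar.gz")

    # Steps 2-3: uncompress each output*.tar.gz directly into a new output/
    # folder, so each tarball's folder ends up inside output/.
    dir.create("output", showWarnings = FALSE)
    for (f in list.files(pattern = "^output.*\\.tar\\.gz$")) {
      untar(f, exdir = "output")
    }

    # Step 4: rebuild the workflowr website.
    workflowr::wflow_build()
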