Skip to content

satijalab/PASTA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

image

PASTA (PolyA Site analysis using relative Transcript Abundance)

Overview

PASTA is a toolkit for analysis, visualization, and exploration of cellular heterogeneity in alternative polyadenylation (APA) from scRNA-seq datasets. While scRNA-seq has been widely applied to characterize cellular heterogeneity in gene expression levels, these datasets can also be leveraged to explore variation in transcript structure as well. For example, reads from 3’ scRNA-seq datasets can be leveraged to quantify the relative usage of multiple polyadenylation (polyA) sites within the same gene.

In Kowalski*, Wessels*, Linder* et al, we introduce a statistical framework, based on the Dirichlet multinomial distribution, to characterize heterogeneity in the relative use of polyA sites at single-cell resolution. This framework enables upervised differential analysis (differential polyadenylation between groups of cells), but also unsupervised analysis and visualization.

We have implemented these methods in an R package PASTA (PolyA Site analysis using relative Transcript Abundance) that interfaces directly with Seurat. This repository represents an initial release of PASTA, we will be adding significant functionality and documentation in future releases.

Installation

To install PASTA, please run:

#install remotes and BiocManager if necessary 
if (!requireNamespace("remotes", quietly = TRUE))
    install.packages("remotes")
if (!requireNamespace("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

#additionally install several Bioconductor dependencies
BiocManager::install(c("BSgenome.Hsapiens.UCSC.hg38", "EnsDb.Hsapiens.v86", 
                       "rtracklayer","GenomicFeatures","plyranges"))

remotes::install_github("satijalab/PASTA")

Vignette

Check out the PASTA vignette to perform supervised and unsupervised analysis on an scRNA-seq dataset of human PBMC

Links

Seurat: R package for the analysis, integration, and exploration of scRNA-seq data

CPA-Perturb-seq: Preprint describing analytical methods for characterizing APA in single-cell datasets

About

No description, website, or topics provided.

Resources

License

Unknown, MIT licenses found

Licenses found

Unknown
LICENSE
MIT
LICENSE.md

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages