Skip to content

tallulandrews/scRNASeqPipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a collection of scripts I use (or have used in the past) to process scRNASeq data. They are free to use by anyone else for any purpose, but come with no assurances or guarantees of correctness or functionality. The general workflow is as follows:

0 : Create the appropriate genome for the dataset, and obtain the read files & initial QC
	- Building mapping indexes generally requires ~30Gb of memory for a mouse-sized genome
1 : Split the files by well (cell), Trim reads as appropriate based on QC
2 : Map the reads to the genome
3 : Clean up mapping output & remove duplicates
4 : Mapping QC
5 : Quantify expression
6 : Assemble expression matrix

Finished Pipelines:
00_Kallisto_For_SmartSeq.readme = Smartseq2 + Kallisto (no UMIs) 


Brief Descriptions of Useful files:
0_Extract_barcodes_from_BAM.sh : open the first line of each BAM file and find the barcode (tagged with BC:) - for matching up metadata.

Releases

No releases published

Packages

No packages published