Skip to content

Tool for fast counting of reads from BAM file into genomic ranges (GTF, BED, VCF)

License

Notifications You must be signed in to change notification settings

DeplanckeLab/FastReadCounter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FastReadCounter

Tool for fast counting of reads from BAM file into different formatted genomic ranges (GTF, BED, VCF)

Download software

FastReadCounter (FRC) command-line tools are provided as a single executable jar file. The .jar file contains all required materials and can be run on any terminal.

Dependencies

Java version

For the tools to run properly, you must have Java >=1.8 installed.

To check your java version, open your terminal application and run the following command:

java -version

If the output looks something like java version "1.8.x" or above, you are good to go. If not, you may need to update your version; see the Oracle Java website to download the latest JRE (for users) or JDK (for developers).

htsjdk Java library

The software relies on the htsjdk library for reading SAM/BAM files, but the JAR is already embedded in the released JAR, so no need to install it yourself.

How to run

To check that FastReadCounter is working properly, run the following command:

java -jar FastReadCounter.jar

As shown in the output of this command, FastReadCounter has the following options

Options:
--bam %s         [Required] Path of BAM file [do not need to be sorted or indexed].
--gtf %s         [Required (or -bed or -vcf)]Path of GTF file.
--bed %s         [Required (or -gtf or -vcf)]Path of BED file.
--vcf %s         [Required (or -gtf or -bed)]Path of VCF file.
-s %s           Do you want to count only reads falling on same strand than feature? [no, yes, reverse] [default = no].
-q %i           Minimum quality required for a read to be counted [default = 10].
-o %s           Output folder [default = folder of BAM file]

Comparison with existing tools

FastReadCounter can be compared with a variety of tools that all perform the same task on only one file format (bed or gtf, or vcf) such as:

  • htseq-count for counting reads in GTF file. FastReadCounter has stricly identical output than the union parameter in htseq-count. It runs ~20-100 times faster.
  • featureCounts for counting reads in GTF file. FastReadCounter has identical output and runs only slightly slower. However FastReadCounter can read more file formats (BED, VCF) and can read multiplexed BAM files (single-cell RNA-seq or multiplexed bulk RNA-seq experiments).
  • bedtools for counting reads in BED files (multicov tool). FastReadCounter has very similar output than HOMER. It does not require to transform the bam into a tag folder.
  • homer for counting reads in BED files (annotatePeaks.pl tool). FastReadCounter has very similar output than HOMER. It does not require to transform the bam into a tag folder.
  • freebayes for counting reads in VCF files with the --variant-input option. However freebayes current version fails with this option, and only calculate chr1.

Directory content

  • releases: jar releases of FastReadCounter
  • src and lib folders will come soon

Author

Vincent Gardeux - vincent.gardeux@epfl.ch

About

Tool for fast counting of reads from BAM file into genomic ranges (GTF, BED, VCF)

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages