Skip to content

Releases: pinellolab/CRISPResso2

Basepairing Bern

22 Oct 01:25
Compare
Choose a tag to compare

Various improvements and bug fixes:

  • Add param --plot_center to allow custom plots centered at a given point in plotCustomAllelePlot script (ecf23ef)
  • All sub-CRISPResso runs are run with 1 process in Batch, WGS, Pooled, etc. Because we added multiprocessing capabilities to CRISPResso (the plotting part) we thought it would be slick for CRISPRessoPooled to run sqrt(n_processes) CRISPResso processes with sqrt(n_processes) processes each. Unfortunately, sqrt(n) isn't a really useful number for the number of processes people usually run (e.g. 2 or 3 or even 8), and a lot of the CRISPResso processing isn't enabled to take advantage of multiprocessing (e.g. the alignment step), so processes were being wasted. So we reverted back to having n_processes CRISPResso processes, each with 1 process. (a923a7c)
  • Remove version checks for numpy and seaborn (90b43ea)
  • Fix allele plotting error for plot 5 referring to uninitialized y_max variable (53197e6)
  • Fix unicode errors for bam read/write (8196b6a)
  • Convert columns in nucleotide count and modification tables to numeric for PE analysis (cabebbe)
  • Make loggers module-specific so matplotlib debug doesn't get spewn out in the CRISPResso log (c2bdd96)
  • Add unit tests (3e6c281)

Full Changelog: v2.2.5...v2.2.6

Immunity from Bonneville

22 Sep 06:10
Compare
Choose a tag to compare

Fixes bug when sequencing reads are much longer than the given reference sequence.

Mutant Maricopa

10 Sep 04:43
Compare
Choose a tag to compare

This release adds an additional parameter --assign_ambiguous_alignements_to_first_allele. For ambiguous alignments, setting this flag will force them to be assigned to the first (as provided by the references -a first and then -e second) amplicon. Thus, no reads will be discarded as 'ambiguous' and all reads will be counted once in the analysis.

In addition, batch summaries are produced for amplicons present in only one sample.

Collateral Cardston

30 Aug 18:46
Compare
Choose a tag to compare

Fixes database_id bug

Large Honolulu

21 Aug 04:19
Compare
Choose a tag to compare

For some reason some of the previous commits resolving problems with filterFastqs weren't picked up in v2.2.1. So I'm hoping they'll be included here.

Sequence Length Salt Lake

20 Aug 17:37
Compare
Choose a tag to compare
  • More unicode bug fixes for filtering fastqs
  • CRISPRessBatch now outputs summary of splicing/frameshift mutation status

Matches Sanpete

13 Aug 17:46
Compare
Choose a tag to compare

Python 3 release

  • Incorporates updates and changes from python2 up to this point
  • Adds multiprocessing to CRISPRessoBase to parallelize image generation to speed up results
    • CRISPRessoBatch, CRISPRessoPooled, and CRISPRessoWGS allocate processes to sub-CRISPResso commands so that sqrt(n_processes) sub-commands are run, each with sqrt(n_processes) unless plotting is turned off (via --suppress_report or --suppress_plots in which case n_processes are run, each with 1 process.
  • The crispresso_info dictionary containing run information is saved as json so it can be read across versions of python and by other programs (e.g. R). The dict structure of the object has also been changed to be more navigable and hierarchical.

Because of changes to crispresso_info, this version of CRISPResso will not be able to finish incomplete runs (e.g. checkpointing of CRISPRessoPooled) that were started by previous python2 versions of CRISPResso, or aggregate or access information from previous runs (e.g. using CRISPRessoAggregate, CRISPRessoCompare, or the custom python plotting scripts in scripts). However, even if we were to have stuck with pickle format for crispresso_info, the python2 and python3 versions were incompatible anyway. So we figured this was a good time to move toward a better format.

Lentiviral Cache

29 Jun 18:35
Compare
Choose a tag to compare

Fixes bug for CRISPRessoPooled analyzes of many amplicons where samtools sort writes status updates that can't be parsed as reads.

Single Guide Washington

24 Jun 03:03
Compare
Choose a tag to compare

This release includes:

  • Updates to CRISPRessoPooledWGSCompare, used for comparing multiple amplicons in CRISPRessoWGS or CRISPRessoPooled experiments 48d6c87
    • CRISPRessoPooledWGSCompare now produces html report linking to sub-CRISPRessoCompare reports
  • Addition of a script for custom allele plotting 947fbab

Nicking Hancock

23 May 04:58
Compare
Choose a tag to compare

This release incorporates changes to make bowtie2 alignment in CRISPRessoPooled more permissive (44dc9e7), and remove duplicate rows in the Alleles_frequency_table.txt due to reads being in the forward or reverse direction (0e08cd0).

When given a genome file, CRISPRessoPooled aligns reads to the genome using the Bowtie2 aligner. The legacy parameters were somewhat strict. The new parameters reflect the 'default_min_aln_score' parameter in allowing for substantially more indels and mismatches than previous.
The parameter --use_legacy_bowtie2_options_string has been added to use the legacy settings. Otherwise, the bowtie2 alignment settings will be calculated as follows:
--end-to-end - no clipping, match bonus -ma is set to 0
-N 0 number of mismatches allowed in seed alignment
--np 0 where read (or ref have ambiguous character (N)) penalty is 0
--mp 3,2 mismatch penalty - set max mismatch to -3 to coincide with the gap extension penalty (2 is the default min mismatch penalty)
--score-min L,-5,-3*(1-H) For a given homology score, we allow up to (1-H) mismatches (-3) or gap extensions (-3) and one gap open (-5). This score translates to -5 + -3(1-H)L where L is the sequence length