Skip to content
This repository has been archived by the owner on Mar 16, 2022. It is now read-only.
Jason Chin edited this page Jul 9, 2015 · 45 revisions

Falcon v0.3 Release Note

  • Integrate latest (Apr. 2015) Gene Myers' daligner code
  • Supporting read length up to 100kb from current 64kb
  • New consensus code which process reads from diploid genomes better
  • Logging for tracking job submission

Important changes

  • it uses up-to-date Gene’s daligner code. It helps to support read > 64kb and helps for tracking read identifiers through the pipeline (which is important for diploid assembly in the future). The cost is that the new code use twice more memory if the daligner parameters is the same before. One needs to use the “-M” option to set limit on the memory usage. For human assembly, we tested with -M 32 for using 32G RAM for each daligner. Other possibilities are under investigation.

  • You will need a new option --output_dformat for the configuration variable falcon_sense_option. It is used in LA4Falcon and fc_consensus.py to generate p-read fasta files that are compatible to DAZZ_DB's fasta2DB for creating p-read database.

Getting started

Falcon is not meant to be a "push-button" assembler (yet) except for very small genomes. To get best results, one might need some knowledge about various distributed computation system and some basic genome assembly theory. I will collect some question and answer in the FAQ page which might help you.