Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix: added missing dependencies and improved docs (#450)
* Added mising dependencies and improved docs * Reformat * Small tweaks * Improved description
- Loading branch information
Showing
3 changed files
with
32 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,3 +4,5 @@ channels: | |
- defaults | ||
dependencies: | ||
- mapdamage2 =2.2 | ||
- python >=3.9 | ||
- pysam >=0.17 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,30 @@ | ||
name: mapDamage2 | ||
description: tracking and quantifying damage patterns in ancient DNA sequences. For more information about MapDamage2 see `MapDamage2 documentation <https://ginolhac.github.io/mapDamage/>`_. | ||
description: mapDamage2 is a computational framework written in Python and R, which tracks and quantifies DNA damage patterns among ancient DNA sequencing reads generated by Next-Generation Sequencing platforms. | ||
authors: | ||
- Filipe G. Vieira | ||
input: | ||
- reference genome | ||
- SAM/BAM/CRAM alignemnt | ||
output: | ||
- Runtime_log.txt: log file with a summary of command lines used and timestamps. | ||
# If plotting | ||
- Fragmisincorporation_plot.pdf, a pdf file that displays both fragmentation and misincorporation patterns. | ||
- Length_plot.pdf, a pdf file that displays length distribution of singleton reads per strand and cumulative frequencies of C->T at 5'-end and G->A at 3'-end are also displayed per strand. | ||
- misincorporation.txt, contains a table with occurrences for each type of mutations and relative positions from the reads ends. | ||
- 5pCtoT_freq.txt, contains frequencies of Cytosine to Thymine mutations per position from the 5'-ends. | ||
- 3pGtoA_freq.txt, contains frequencies of Guanine to Adenine mutations per position from the 3'-ends. | ||
- dnacomp.txt, contains a table of the reference genome base composition per position, inside reads and adjacent regions. | ||
- lgdistribution.txt, contains a table with read length distributions per strand. | ||
# If stats output | ||
- Stats_out_MCMC_hist.pdf, MCMC histogram for the damage parameters and log likelihood. | ||
- Stats_out_MCMC_iter.csv, values for the damage parameters and log likelihood in each MCMC iteration. | ||
- Stats_out_MCMC_trace.pdf, a MCMC trace plot for the damage parameters and log likelihood. | ||
- Stats_out_MCMC_iter_summ_stat.csv, summary statistics for the damage parameters estimated posterior distributions. | ||
- Stats_out_post_pred.pdf, empirical misincorporation frequency and posterior predictive intervals from the fitted model. | ||
- Stats_out_MCMC_correct_prob.csv, position specific probability of a C->T and G->A misincorporation is due to damage. | ||
- dnacomp_genome.txt, contains the global reference genome base composition (computed by seqtk). | ||
# If rescaled BAM output | ||
- Rescaled BAM file, where likely post-mortem damaged bases have downscaled quality scores. | ||
notes: | | ||
* The `extra` param allows for additional program arguments. | ||
* For more information see, https://ginolhac.github.io/mapDamage/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters