Skip to content

Analysis of Caudoviricetes phages with genome terminal repeats in fecal metagenomes.

Notifications You must be signed in to change notification settings

aag1/NL_vir_analysis

Repository files navigation

Analysis of Caudoviricetes phages with genome terminal repeats in four Dutch cohorts

Folders

  • test_cenote, test_code_prediction, benchmark_markers - test procedures

  • run_cenote - run Cenote-Taker 2 for the 4 cohorts

  • rrna_NL_vir - detect rRNA genes in the virus-like contigs from the 4 cohorts

  • predict_proteomes - predict proteomes for the virus-like contigs from the 4 cohorts

  • taxo_NL_vir - assign taxonomy to the virus-like contigs from the 4 cohorts

  • get_databases_caudo - detect Caudoviricetes genomes with terminal repeats in 6 databases

  • cluster_genomes - dereplicate virus-like contigs from the 4 cohorts and genomes from the databases together (build DB0, DB1)

  • map_reads - map reads from the 4 cohorts to DB1

  • DEVoC_reads - map reads from the 254 Danish fecal viromes to DB1

  • DB2_info, DB3_DB4_info - build DB2, DB3, DB4

  • DB3_vConTACT2 - DB3 gene-sharing with TerL-encoding viruses

  • DB3_TerL_tree - build & plot phylogenetic tree

  • DB3_abundance_stability - % sample reads mapped to DB3 & Bray-Curtis dissimilarity

  • DB3_tRNA_scan - tRNA detection in DB3 vOTU representatives

  • host_prediction - prophage-, CRISPR- and co-abundance-based host prediction

  • DB4_in_GenBank, DB4_in_BanfieldLab - look for phages similar to DB4 vOTU representatives

  • DB4_genome_info - nt content & GC skew for extended DB4

  • DB4_proteome_annot - functional annotation of proteins encoded by DB4 vOTU representatives

  • DB4_genome_maps - plot genome maps of DB4 vOTU representatives

  • DB4_pheno_assoc - associations between DB4 vOTUs prevalence and human phenotypes

  • workflow_diagram - plot analysis workflow

  • blastn_crAss_sensu_stricto - compare MGV-GENOME-0359371 to the first discovered crAssphage

  • DEVoC_LoVEphage - LoVEphage in the current study?

Designations

  • DB0 - 100,060 genomes & contigs pooled together

  • DB1 - 30,461 vOTU representatives, result of DB0 dereplication

  • DB2 - 15,196 detected vOTU representatives, result of read mapping to DB1

  • DB3 - 1,899 DB2 vOTUs represented by Caudoviricetes genomes with terminal repeats

  • DB4 - 54 DB4 vOTUs detected in > 5% samples in a Dutch cohort

About

Analysis of Caudoviricetes phages with genome terminal repeats in fecal metagenomes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published