T2T-MFA8, is pronounced T2T-macfallite.
We generated a parthenogenesis cell line, MFA582-1, from a crab-eating macaque, from which we produced 53× PacBio HiFi, 77× ONT, 32× Illumina WGS, and 156× Hi-C data. Then, we constructed a telomere-to-telomere assembly of the crab-eating macaque (T2T-MFA8). All resources are available on this site.
- T2T-MFA8v1.0: GCA_037993035.1
All tracks are available on the UCSC Genome Browser.
- T2T-MFA8v1.0: unmasked, 20 autosomes + chrX + chrMT
- T2T-MFA8v1.0: soft-masked (segmental duplications, tandem repeats and RepeatMasker), 20 autosomes + chrX + chrMT
- PacBio HiFi: SRX22719293
- Oxford Nanopore Technology: SRX22719294
- Illumina WGS: SRX22702646
- Hi-C: SRX22702647
- Curated gene annotation: LiftOff result from Mmul _10 RefSeq and lso-Seq transcripts annotated by GeneMarkS-T, with manual curation
- GeneMarkS-T: annotated Iso-Seq FLNC transcripts
- LiftOff from Mmul_10 RefSeq
- LiftOff from MFA1912RKSv2 RefSeq
- 24-mer with no error: produced by GenMap
- 36-mer with no error: produced by GenMap
- Centromere
- CenSat: the 500-kbp extended regions of centromeres
- Segmental duplications: native bed format or merged bed format
- Tandem repeats
- RepeatMasker: native out format or bed format
- WindowMasker (with SDust)
- Centromere suprachromosomal family annotation
- rDNA models
- CpG islands
- CpG methylation from ONT: identified by Nanopolish v0.14.0
- Crab-eating macaque assemblies
- Rhesus macaque assemblies
We would appreciate if you would acknowledge and cite our paper:
Zhang, S. et al. Comparative genomics of macaques and integrated insights into genetic variation and population history. bioRxiv (2024).
All data is released to the public domain (CC0).