Daniel Standage, 2018
https://osf.io/anr56/
Noble is a collection of data sets for evaluating and benchmarking de novo variant discovery methods. You can freely and anonymously download files containing the associated simulated genetic data under the terms of the CC BY 4.0 license: see DATALICENSE.txt for details. You are also free to use, copy, modify, and distribute source code from this project with attribution under the terms of the MIT license: see CODELICENSE.txt for details.
Scientific ethics dictate that you credit this resource if you use it in any research publication: see CITATION.md for more details.
I want to use the data to benchmark my new method.
All of the data files are stored on Amazon S3 and can be downloaded from your web browser or (preferably) using shell tools like wget
or curl
.
See DOWNLOADS.md for more details.
I want to recreate the data sets from scratch.
The build/ directory contains information on the workflow used to create the data files, instructions on how to invoke the workflow, and a description of the software prerequisites and configuration.
I want to reproduce a published analysis.
The eval/ directory contains details on our accuracy assessment of kevlar on Noble.
I want borrow code from this project and create my own.
You are free to create a fork of this project and adapt it for your own needs, as long as you attribute the original work.
I'm having trouble with Noble.
Please use the GitHub tracker to report bugs or submit support requests.