ADAM

Version 0.22.0

Closed issues:

Realign all reads at target site, not just reads with no mismatches #1469
Parallel file merger fails if the output file is smaller than the HDFS block size #1467
Add new realigner arguments to docs #1465
Recalibrate method misspelled as recalibateBaseQualities #1463
FASTQ may try to split GZIPed files #1459
Update to Hadoop-BAM 7.8.0 #1455
Publish Markdown and Scaladoc to the interwebs #1453
Make VariantContextConverter public #1451
Apply method in FragmentRDD is package private #1445
Thread pool will block inside of pipe command for streams too large to buffer #1442
FeatureRDD.apply() does not allow addition of other parameters with defaults in the case class #1439
Question : Why the number of paired sequence in adam-0.21.0 less than adam-0.19.0? #1424
loadCoverage missing from Java API #1420
Estimate contig lengths in SequenceDictionary for BED, GFF3, GTF, and NarrowPeak feature formats #1410
loadIntervalList FeatureRDD has empty SequenceDictionary #1409
problem using transform command #1406
Add coveralls #1403
INDEL realigner binary search conditional is flipped #1402
Delete adam-scripts/R #1398
Data missing when transforming FASTQ to Adam #1393
java.io.FileNotFoundException when file exists #1385
Off-by-1 error in FASTQ InputFormat start positioning code #1383
Set the wrong value for end for symbolic alts #1381
RecordGroupDictionary should support isEmpty #1380
Add pipe API in and out formatters for Features #1374
Increase visibility for SupportedHeaderLines.allHeaderLines #1372
Bits of VariantContextConverter don't get ValidationStringencied #1371
Add Markdown docs for Pipe API #1368
Array[Consensus] not registered #1367
ValidationStringency in MDTagging should apply to reads on unknown references #1365
When doing a release, the SNAPSHOT should bump by 0.1.0, not 0.0.1 #1364
FromKnowns consensus generator fails if no reads overlap a consensus #1362
Performance tune-up in BQSR #1358
Increase visibility for ADAMContext.sc and/or getFs... methods #1356
Pipe API formatters need to be public #1354
Version 0.21.0: VariantContextConverter fails for 1000G VCF data #1353
ConsensusModel's can't really be instantiated #1352
Runtime conflicts in transitive versions of Guava dependency #1350
Transcript Effects ignored if more than 1 #1347
Remove "fork" tag from releases #1344
Refactor isSorted boolean parameters to sorted #1341
Loading GZipped VCF returns an empty RDD #1333
Follow up on error messages in build scripts #1331
Bump Spark 2 build to Spark 2.1.0 #1330
FeatureRDD instantiation tries to cache the RDD #1321
Load queryname sorted BAMs as Fragments #1303
Run Duplicate Marking on Fragments #1302
GenomicRDD.pipe may hang on failure error codes #1282
IllegalArgumentException Wrong FS for vcf_head files on HDFS #1272
java.io.NotSerializableException: org.bdgenomics.formats.avro.AlignmentRecord #1240
Investigate sorted join in dataset api #1223
Support looser validation stringency for loading some VCF Integer fields #1213
Add new feature-overlap command to demonstrate new region joins #1194
What should our API at the command line look like? #1178
Split apart partition and join in ShuffleRegionJoin #1175
Merging files should be multithreaded #1164
File _rgdict.avro does not exist #1150
how to collect the .adam files from Spark cluster multiple nodes and some questions about avocado #1140
JFYI: tiny forked adam-core "0.20.0" release #1139
Samtools (htslib) integration testing #1120
AlignmentRecordRDD does not extend GenomicRDD per javac #1092
Release ADAM version 0.21.0 #1088
Difference running markdups with and without projection #1014
ADAM to BAM conversion fails using relative path #1012
Refactor SequenceDictionary to use Contig instead of SequenceRecord #997
Customize adam-main cli from configuration file #918
genotypeType for genotypes with multiple OtherAlt alleles? #897
How to convert genotype DataFrame to VariantContext DataFrame / RDD #886
Ensure Java API is up-to-date with Scala API #855
Improve parallelism during FASTA output #842
Explicitly validate user args passed to transform enhancement #841
BroadcastRegionJoin fails with unmapped reads #821
Resolve Fragment vs. SingleReadBucket #789
Add profile for skipping test compilation/resolution #713
Next on empty iterator in BroadcastRegionJoin #661
Cleanup code smell in sort work balancing code #635
Remove reliance on MD tags #622
Provide low-impact alternative to transform -repartition for reducing partition size #594
Clean up Rich records #577
Create standardized, interpretable exceptions for error reporting #420
Create ADAM Benchmarking suite #120

Merged and closed pull requests:

[ADAM-1469] Don't filter on whether reads have mismatches during realignment #1470 (fnothaft)
[ADAM-1467] Skip concat call if there is only one shard. #1468 (fnothaft)
[ADAM-1465] Updating realigner CLI docs. #1466 (fnothaft)
[ADAM-1463] Rename recalibateBaseQualities method as recalibrateBaseQualities #1464 (heuermh)
[ADAM-1453] Add hooks to publish ADAM docs from CI flow. #1461 (fnothaft)
[ADAM-1459] Don't split FASTQ when compressed. #1459 (fnothaft)
[ADAM-1451] Make VariantContextConverter class and convert methods public #1452 (fnothaft)
Moving API overview from building apps doc to new source file. #1450 (heuermh)
[ADAM-1424] Adding test for reads dropped in 0.21.0. #1448 (heuermh)
[ADAM-1439] Add inferSequenceDictionary ctr to FeatureRDD. #1447 (heuermh)
[ADAM-1445] Make apply method for FragmentRDD public. #1446 (fnothaft)
[ADAM-1442] Fix thread pool deadlock in GenomicRDD.pipe #1443 (fnothaft)
[ADAM-1164] Add parallel file merger. #1441 (fnothaft)
Dependency version bump + BroadcastRegionJoin fix #1440 (fnothaft)
added JavaApi for loadCoverage #1437 (akmorrow13)
Update versions, etc. in build docs #1435 (heuermh)
Add test sample(verify number of reads in loadAlignments function) and ADAM SNAPSHOT document #1433 (xubo245)
Add cache argument to loadFeatures, additional Feature timers #1427 (heuermh)
feat: speed up 2bit file extract #1426 (Blaok)
BQSR refactor for perf improvements #1423 (fnothaft)
Add ADAMContext/GenomicRDD/pipe docs #1422 (fnothaft)
INDEL realigner cleanup #1412 (fnothaft)
Estimate contig lengths in SequenceDictionary for BED, GFF3, GTF, and NarrowPeak feature formats #1411 (heuermh)
Add coveralls badge to README.md. #1408 (fnothaft)
[ADAM-1403] Push coverage reports to Coveralls. #1404 (fnothaft)
Added instrumentation timers around joins. #1401 (fnothaft)
Add Apache Spark version to --version text #1400 (heuermh)
[ADAM-1398] Delete adam-scripts/R. #1399 (fnothaft)
[ADAM-1383] Use gt instead of gteq in FASTQ input format line size checks #1396 (fnothaft)
Maint spark2 2.11 0.21.0 #1395 (A-Tsai)
[ADAM-1393] fix missing reads when transforming fastq to adam #1394 (A-Tsai)
[ADAM-1380] Adds isEmpty method to RecordGroupDictionary. #1392 (fnothaft)
[ADAM-1381] Fix Variant end position. #1389 (fnothaft)
Make javac see that AlignmentRecordRDD extends GenomicRDD #1386 (fnothaft)
Added ShuffleRegionJoin usage docs #1384 (devin-petersohn)
Misc. INDEL realigner bugfixes #1382 (fnothaft)
Add pipe API in and out formatters for Features #1378 (heuermh)
[ADAM-1356] Make ADAMContext.getFsAndFiles and related protected visibility #1376 (heuermh)
[ADAM-1372] Increase visibility for DefaultHeaderLines.allHeaderLines #1375 (heuermh)
[ADAM-1371] Wrap ADAM->htsjdk VariantContext conversion with validation stringency. #1373 (fnothaft)
[ADAM-1367] Register Consensus array for serialization. #1369 (fnothaft)
[ADAM-1365] Apply validation stringency to reads on missing contigs when MD tagging #1366 (fnothaft)
[ADAM-1362] Fixing issue where FromKnowns consensus model fails if no reads hit a target. #1363 (fnothaft)
[ADAM-1352] Clean up consensus model usage. #1357 (fnothaft)
Increase visibility for InFormatter case classes from package private to public #1355 (heuermh)
Use htsjdk getAttributeAsList for VCF INFO ANN key #1348 (heuermh)
Fixes parsing variant annotations for multi-allelic rows #1346 (majkiw)
Sort pull requests by id #1345 (heuermh)
HBase genotypes backend -revised #1335 (jpdna)
[ADAM-1330] Move to Spark 2.1.0. #1332 (fnothaft)
Support deduping fragments #1309 (fnothaft)
[ADAM-1280] Silence CRAM logging in tests. #1294 (fnothaft)
Added test to try and repro #1282. #1292 (fnothaft)

Version 0.21.0

Closed issues:

Update Markdown docs with ValidationStringency in VCF<->ADAM CLI #1342
Variant VCFHeaderLine metadata does not handle wildcards properly #1339
Close called multiple times on VCF header stream #1337
BroadcastRegionJoin has serialization failures #1334
adam-cli uses git-commit-id-plugin which breaks release? #1322
move_to_xyz scripts should have interlocks... #1317
Lineage for partitionAndJoin in ShuffleRegionJoin causes StackOverflow Errors #1308
Add move_to_spark_1.sh script and update README to mention #1307
adam-submit transform fails with Exception in thread "main" java.lang.IncompatibleClassChangeError: Implementing class #1306
private ADAMContext constructor? #1296
AlignmentRecord.mateAlignmentEnd never set #1290
how to submit my own driver class via adam-submit? #1289
ReferenceRegion on Genotype seems busted? #1286
Clarify strandedness in ReferenceRegion apply methods #1285
Parquet and CRAM debug logging during unit tests #1280
Add more ANN field parsing unit tests #1273
loadVariantAnnotations returns empty RDD #1271
Implement joinVariantAnnotations with region join #1259
Count how many chromosome in the range of the kmer #1249
ADAM minor release to support htsjdk 2.7.0? #1248
how to config kryo.registrator programmatically #1245
Does the nested record Flattener drop Maps/Arrays? #1244
Dead-ish code cleanup in org.bdgenomics.adam.utils #1242
java.io.FileNotFoundException for old adam file after upgrade to adam0.20 #1240
please add maven-source-plugin into the pom file #1239
Assembly jar doesn't get rebuilt on CLI changes #1238
how to compare with the last the column for the same chromosome name? #1237
Need a way for users to add VCF header lines #1233
Enhancements to VCF save #1232
Must we split multi-allelic sites in our Genotype model? #1231
Can't override default -collapse in reads2coverage #1228
Reads2coverage NPEs on unmapped reads #1227
Strand bias doesn't get exported #1226
Move ADAMFunSuite helper functions upstream to SparkFunSuite #1225
broadcast join using interval tree #1224
Instrumentation is lost in ShuffleRegionJoin #1222
Bump Spark, Scala, Hadoop dependency versions #1221
GenomicRDD shuffle region join passes partition count to partition size #1220
Scala compile errors downstream of Spark 2 Scala 2.11 artifacts #1218
Javac error: incompatible types: SparkContext cannot be converted to ADAMContext #1217
Release 0.20.0 artifacts failed Sonatype Nexus validation #1212
Release script failed for 0.20.0 release #1211
gVCF - can't load multi-allelic sites #1202
Allow open-ended intervals in loadIndexedBam #1196
Interval tree join in ADAM #1171
spark-submit throw exception in spark-standalone using .adam which transformed from .vcf #1121
BroadcastRegionJoin is not a broadcast join #1110
Improve test coverage of VariantContextConverter #1107
Variant dbsnp rs id tracking in vcf2adam and ADAM2Vcf #1103
Document core ADAM transform methods #1085
Document deploying ADAM on Toil #1084
Clean up packages #1083
VariantCallingAnnotations is getting populated with INFO fields #1063
How to load DatabaseVariantAnnotation information ? #1049
Release ADAM version 0.20.0 #1048
Support VCF annotation ANN field in vcf2adam and adam2vcf #1044
How to create a rich(er) VariantContext RDD? Reconstruct VCF INFO fields. #878
Add biologist targeted section to the README #497
Update usage docs running for EC2 and CDH #493
Add docs about building downstream apps on top of ADAM #291
Variant filter representation #194

Merged and closed pull requests:

[ADAM-1342] Update CLI docs after #1288 merged. #1343 (fnothaft)
[ADAM-1339] Use glob-safe method to load VCF header metadata for Parquet #1340 (fnothaft)
[ADAM-1337] Remove os.{flush,close} calls after writing VCF header. #1338 (fnothaft)
[ADAM-1334] Clean up serialization issues in Broadcast region join. #1336 (fnothaft)
[ADAM-1307] move_to_spark_2 fails after moving to scala 2.11. #1329 (fnothaft)
unroll/optimize some JavaConversions #1326 (ryan-williams)
clean up *Join type-params/scaladocs #1325 (ryan-williams)
[ADAM-1322] Skip git commit plugin if .git is missing. #1323 (fnothaft)
Supports access to indexed fa and fasta files #1320 (akmorrow13)
Add interlocks for move_to_xyz scripts. #1319 (fnothaft)
[ADAM-1307] Add script for moving to Spark 1. #1318 (fnothaft)
Update move_to_spark_2.sh #1316 (creggian)
[ADAM-1308] Fix stack overflow in join with custom iterator impl. #1315 (fnothaft)
Why Adam? section added to README.md #1310 (tverbeiren)
Add docs about using ADAM's Kryo registrator from another Kryo registrator. #1305 (fnothaft)
Add docs about building downstream applications #1304 (heuermh)
[ADAM-493] Add ADAM-on-Spark-on-YARN docs. #1301 (fnothaft)
Code style fixes #1299 (heuermh)
Make ADAMContext and JavaADAMContext constructors public #1298 (heuermh)
Remove back reference between VariantAnnotation and Variant #1297 (fnothaft)
[ADAM-1280] Silence CRAM logging in tests. #1294 (fnothaft)
HBase as a separate repo #1293 (jpdna)
Reference region cleanup #1291 (fnothaft)
Clean rewrite of VariantContextConverter #1288 (fnothaft)
add function:filterByOverlappingRegions #1287 (liamlee)
Populate fields on VariantAnnotation #1283 (heuermh)
Add VCF headers for fields in Variant and VariantAnnotation records #1281 (heuermh)
CGCloud deploy docs #1279 (jpdna)
some style nits #1278 (ryan-williams)
use ParsedLoci in loadIndexedBam #1277 (ryan-williams)
Increasing unit test coverage for VariantContextConverter #1276 (heuermh)
Expose FeatureRDD to public #1275 (Georgehe4)
Clean up CLI operation categories and names, and add documentation for CLI #1274 (fnothaft)
Rename org.bdgenomics.adam.rdd.variation package to o.b.a.rdd.variant #1270 (heuermh)
use testFile in some tests #1268 (ryan-williams)
[ADAM-1083] Cleaning up org.bdgenomics.adam.models. #1267 (fnothaft)
make py file py3-forward-compatible #1266 (ryan-williams)
rm accidentally-added file #1265 (fnothaft)
Finishing up the cleanup on org.bdgenomics.adam.rdd. #1264 (fnothaft)
Clean up org.bdgenomics.adam.rich package. #1263 (fnothaft)
Add docs for transform pipeline, ADAM-on-Toil #1262 (fnothaft)
updates for bdg utils 0.2.9-SNAPSHOT #1261 (akmorrow13)
[ADAM-1233] Expose header lines in Variant-related GenomicRDDs #1260 (fnothaft)
[ADAM-1221] Bump Spark/Hadoop versions. #1258 (fnothaft)
Rename org.bdgenomics.adam.rdd.features package to o.b.a.rdd.feature #1256 (heuermh)
Clean up documentation in org.bdgenomics.adam.projection. #1255 (fnothaft)
[ADAM-1221] Bump Spark/Hadoop versions. #1254 (fnothaft)
Misc shuffle join fixes. #1253 (fnothaft)
[ADAM-1196] Add support for open ReferenceRegions. #1252 (fnothaft)
[ADAM-1225] Move helper functions from ADAMFunSuite to SparkFunSuite. #1251 (fnothaft)
Merge VariantAnnotation and DatabaseVariantAnnotation records #1250 (heuermh)
Miscellaneous VCF fixes #1247 (fnothaft)
HBase backend for Genotypes #1246 (jpdna)
[ADAM-1242] Clean up dead code in org.bdgenomics.adam.util. #1243 (fnothaft)
Small cleanup of "replacing uses of deprecated class SAMFileReader" #1236 (fnothaft)
replacing uses of deprecated class SAMFileReader #1235 (lbergelson)
[ADAM-1224] Replace BroadcastRegionJoin with tree based algo. #1234 (fnothaft)
Fix reads2coverage issues #1230 (fnothaft)
[ADAM-1212] Add empty assembly object, allows Maven build to create sources and javadoc artifacts #1215 (heuermh)
[ADAM-1211] Fix call to move_to_scala_2.sh, reorder Spark 2.x Scala 2.10 and 2.10 sections #1214 (heuermh)
demonstrate multi-allelic gVCF failure - test added #1205 (jpdna)
Merge VariantAnnotation and DatabaseVariantAnnotation records #1144 (heuermh)
Upgrade to bdg-formats-0.10.0 #1135 (fnothaft)

Version 0.20.0

Closed issues:

Sorting by reference index seems doesn't work or sorted by DESC order? #1204
master won't compile #1200
VCF format tag SB field parse error in loading #1199
Publish sources JAR with snapshots #1195
Type SparkFunSuite in package org.bdgenomics.utils.misc is not available #1193
MDTagging fails on GRCh38 #1192
Fix stack overflow in IndelRealigner serialization #1190
Delete ./scripts/commit-pr.sh #1188
Hadoop globStatus returns null if no glob matches #1186
Swapping out IntervalRDD under GenomicRDDs #1184
How to get "SO coordinate" instead of "SO unsorted"? #1182
How to read glob of multiple parquet Genotype #1179
Update command line doc and examples in README.md #1176
FastqRecordConverter needs cleanup and tests #1172
TransformFormats write to .gff3 file path incorrectly writes as parquet #1168
Should be able to merge shards across two different file systems #1165
RG ID gets written as the index, not the record group name #1162
Users should be able to save files as -single without merging them #1161
Users should be able to set size of buffer used for merging files #1160
Bump Hadoop-BAM to 7.7.0 #1158
adam-shell prints command trace to stdout #1154
Map IntervalList format column four to feature name or attributes? #1152
Parquet storage of VariantContext #1151
vcf2adam unparsable vcf record #1149
Reorder kryo.register statements in ADAMKryoRegistrator #1146
Make region joins public again #1143
Support CRAM input/output #1141
Transform should run with spark.kryo.requireRegistration=true #1136
adam-shell not handling bash args correctly #1132
Remove Gene and related models and parsing code #1129
Generate Scoverage reports when running CI #1124
Remove PairingRDD #1122
SAMRecordConverter.convert takes unused arguments #1113
Add Pipe API #1112
Improve coverage in Feature unit tests #1106
K-mer.scala code #1105
add -single file output option to ADAM2Vcf #1102
adam2vcf Fails with Sample not serializable #1100
ReferenceRegion.apply(AlignmentRecord) should not NPE on unmapped reads #1099
Add outer region join implementations #1098
VariantContextConverter never returns DatabaseVariantAnnotation #1097
loadvcf: conflicting require statement #1094
ADAM version 0.19.0 will not run on Spark version 2.0.0 #1093
Be more rigorous with FileSystem.get #1087
Remove network-connected and default test-related Maven profiles #1073
Releases should get pushed to Spark Packages #1067
Invalid POM for cli on 0.19.0 #1066
scala.MatchError RegExp does not catch colons in value part properly #1061
Support writing IntervalList header for features #1059
Add -single support when writing features in native formats #1058
Remove workaround for gzip/BGZF compressed VCF headers #1057
Clean up if clauses in Transform #1053
Adam-0.18.2 can not load Adam-0.14.0 adamSave function data (sam) #1050
filterByOverlappingRegion Incorrect for Genotypes #1042
Move Interval trait to utils, added in #75 #1041
Remove implicit GenomicRDD to RDD conversion #1040
VCF sample metadata - proposal for a GenotypedSampleMetadata object #1039
[build system] ADAM test builds pollute /tmp, leaving lots of cruft... #1038
adamMarkDuplicates function in AlignmentRecordRDDFunctions class can not mark the same read? #1037
test MarkDuplicatesSuite with two similar read in ref and start position and different avgPhredScore, error! #1035
Explore protocol buffers vs Avro #1031
Increase Avro dependency version to 1.8.0 #1029
ADAM specific logging #1024
Reenable Travis CI for pull request builds #1023
Bump Apache Spark version to 1.6.1 in Jenkins #1022
ADAM compatibility with Spark 2.0 #1021
ADAM to BAM conversion failing on 1000G file #1013
Factor out *RDDFunctions classes #1011
Port single file BAM and header code to VCF #1009
Roll Jenkins JDK 8 changes into ./scripts/jenkins-test #1008
Support GFF3 format #1007
Separate fat jar build from adam-cli to new maven module #1006
adam-cli POM invalid: maven.build.timestamp #1004
Sub-partitioning of Parquet file for ADAM #1003
Flattening the Genotype schema #1002
install adam 0.19 error! #1001
How to solve it please? #1000
Has the project realized alignment reads to reference genome algorithm? #996
All file-based input methods should support running on directories, compressed files, and wildcards #993
Contig to ContigName Change not reflected in AlignmentRecordField #991
Add homebrew guidelines to release checklist or automate PR generation #987
fix deprecation warnings #985
rename fragments package #984
Explore if SeqDict data can be factored out more aggressively #983
Make "Adam" all caps in filename Adam2Fastq.scala #981
Adam2Fastq should output reverse complement when 0x10 flag is set for read #980
Allow lowercase letters in jar/version names #974
Add stringency parameter to flagstat #973
Arg-array parsing problem in adam-submit #971
Pass recordGroup parameter to loadPairedFastq #969
Send a number of partitions to sc.textFile calls #968
adamGetReferenceString doesn't reduce pairs correctly #967
Update ADAM formula in homebrew-science to version 0.19.0 #963
BAM output in ADAM appears to be corrupt #962
Remove code workarounds necessary for Spark 1.2.1/Hadoop 1.0.x support #959
Issue with version 18.0.2 #957
Expose sorting by reference index #952
.rgdict and .seqdict files are not placed in the adam directory #945
Why does count_kmers not return k-mers that are split between two records? #930
Load legacy file formats to Spark SQL Dataframes #912
Clean up RDD method names #910
Load/store sequence dictionaries alongside Genotype RDDs #909
vcf2adam -print_metrics throws IllegalStateException on Spark 1.5.2 or later #902
error: no reads in first split: bad BAM file or tiny split size? #896
FastaConverter.FastaDescriptionLine not kryo-registered #893
Work With ADAM fasta2adam in a distributed mode #881
vcf2adam -> Exception in thread "main" java.lang.NoSuchMethodError: scala.Predef$.$conforms()Lscala/Predef$$less$colon$less; #871
Code coverage profile is broken #849
Building Adam on OS X 10.10.5 with Java 1.8 #835
Normalize AlignmentRecord.recordGroup* fields onto a separate record type #828
Gracefully handle missing Spark- and Hadoop-versions in jenkins-test; document how to set them. #827
Use Adam File with Hive #820
How do we handle reads that don't have original quality scores when converting to FASTQ with original qualities? #818
SAMFileHeader "sort order" attribute being un-set during file-save job #800
Use same sort order as Samtools #796
RNAME and RNEXT fields jumbled on transform BAM->ADAM->BAM #795
Support loading multiple indexed read files #787
Duplicate OUTPUT command line argument metaVar in adam2fastq #776
Allow Variant to ReferenceRegion conversion #768
Spark Errors References Deprecated SPARK_CLASSPATH #767
Spark Errors References Deprecated SPARK_CLASSPATH #766
adam2vcf fails with -coalesce #735
Writing to a BAM file with adamSAMSave consistently fails #721
BQSR on C835.HCC1143_BL.4 uses excessive amount of driver memory #714
Support writing RDD[Feature] to various file formats #710
adamParquetSave has a menacing false error message about *.adam extension #681
BAMHeader not set when running on a cluster #676
spark 1.3.1 upgrade to hortonworks HDP 2.2.4.2-2? #675
Symbol case class is nucleotide-centric #672
xAssembler cannot be build using mvn #658
adam-submit VerifyError #642
vcf2adam : Unsupported type ENUM #638
Update CDH documentation #615
Remove and generalize plugin code #602
Fix record oriented shuffle #599
Migrate preprocessing stages out of ADAM #598
Publish/socialize a roadmap #591
Eliminate format detection and extension checks for loading data #587
Improve error message when we can't find a ReferenceRegion for a contig #582
Do reference partitioners restrict a partition to contain keys from a single contig? #573
Connection refused errors when transforming BAM file with BQSR #516
ReferenceRegion shouldn't extend Ordered #511
Documentation for common usecases #491
Improve handling of "*" sequences during BQSR #484
Original qualities are parsed out, but left in attribute fields #483
Need a FileLocator that mirrors the use of Path in HDFS #477
FileLocator should support finding "child" locators. #476
Add S3 based Parquet directory loader #463
Should FASTQ output use reads' "original qualities"? #436
VcfStringUtils unused? #428
We should be able to filter genotypes that overlap a region #422
Create a simplified vocabulary for naming projections. #419
Update documentation #406
Bake off different region join implementations #395
Handle no-ops more intelligently when creating MD tags #392
Remove all the commands in the "CONVERSION OPERATIONS" CommandGroup #373
Fail to Write RDD into HDFS with Parquet Format #344
Refactor ReferencePositionWithOrientation #317
Add docs about SPARK_LOCAL_IP #305
PartitionAndJoin should throw an exception if it sees an unmapped read #297
Add insert size calculation #296
Newbie questions - learning resources? Reading a range of records from Adam? #281
Add variant effect ontology #261
Don't flatten optional SAM tags into a string #240
Characterize impact of partition size on pileup creation #163
Need to support BCF output format #153
Allow list of commands to be injected into adam-cli AdamMain #132
Parse out common annotations stored in VCF format #118
Update normalization code to enable normalization of sequences with more than two indels #64
Add clipping heuristic to indel realigner #63
BQSR should support recalibration across multiple ADAM files #58

Merged and closed pull requests:

fix SB tag parsing #1209 (fnothaft)
Fastq record converter #1208 (fnothaft)
Doc suggested partitionSize in ShuffleRegionJoin #1207 (jpdna)
Test demonstrating region join failure #1206 (jpdna)
fix SB tag parsing #1203 (jpdna)
fix build #1201 (ryan-williams)
[ADAM-1192] Correctly handle other whitespace in FASTA description. #1198 (fnothaft)
[ADAM-1190] Manually (un)pack IndelRealignmentTarget set. #1191 (fnothaft)
[ADAM-1188] Delete scripts/commit-pr.sh #1189 (fnothaft)
[ADAM-1186] Mask null from fs.globStatus. #1187 (fnothaft)
Fastq record converter #1185 (zyxue)
[ADAM-1182] isSorted=true should write SO:coordinate in SAM/BAM/CRAM header. #1183 (fnothaft)
Add scoverage aggregator and fail on low coverage. #1181 (fnothaft)
[ADAM-1179] Improve error message when globbing a parquet file fails. #1180 (fnothaft)
[ADAM-1176] Update command line doc and examples in README.md #1177 (heuermh)
Refactor CLIs for merging sharded files #1167 (fnothaft)
Update Hadoop-BAM to version 7.7.0 #1166 (heuermh)
[ADAM-1162] Write record group string name. #1163 (fnothaft)
Map IntervalList format column four to feature name #1159 (heuermh)
Make AlignmentRecordConverter public so that it can be used from other projects #1157 (tomwhite)
added predicate option to loadCoverage #1156 (akmorrow13)
[ADAM-1154] Change set -x to set -e in ./bin/adam-shell. #1155 (fnothaft)
Remove Gene and related models and parsing code #1153 (heuermh)
Reorder kryo.register statements in ADAMKryoRegistrator #1148 (heuermh)
Updated GenomicPartitioners to accept additional key. #1147 (akmorrow13)
[ADAM-1141] Add support for saving/loading AlignmentRecords to/from CRAM. #1145 (fnothaft)
misc pom/test/resource improvements #1142 (ryan-williams)
[ADAM-1136] Transform runs successfully with kryo registration required #1138 (fnothaft)
[ADAM-1132] Fix improper quoting of bash args in adam-shell. #1133 (fnothaft)
Remove StructuralVariant and StructuralVariantType, add names field to Variant #1131 (heuermh)
Remove StructuralVariant and StructuralVariantType, add names field to Variant #1130 (heuermh)
PR #1108 with issue #1122 #1128 (fnothaft)
[ADAM-1038] Eliminate writing to /tmp during CI builds. #1127 (fnothaft)
Update for bdg-formats code style changes #1126 (heuermh)
[ADAM-1124] Add Scoverage and generate coverage reports in Jenkins. #1125 (fnothaft)
[ADAM-1093] Move to support Spark 2.0.0. #1123 (fnothaft)
remove duplicated dependency #1119 (ryan-williams)
Clean up ADAMContext #1118 (fnothaft)
[ADAM-993] Support loading files using globs and from directory paths. #1117 (fnothaft)
[ADAM-1087] Migrate away from FileSystem.get #1116 (fnothaft)
[ADAM-1099] Make reference region not throw NPE. #1115 (fnothaft)
Add pipes API #1114 (fnothaft)
[ADAM-1105] Use assembly jar in adam-shell. #1111 (fnothaft)
Add outer joins #1109 (fnothaft)
Modified CalculateDepth to calculate coverage from alignment files #1108 (akmorrow13)
Resolves various single file save/header issues #1104 (fnothaft)
[ADAM-1100] Resolve Sample Not Serializable exception #1101 (fnothaft)
added loadIndexedVcf and loadIndexedBam for multiple ReferenceRegions #1096 (akmorrow13)
Added support for Indexed VCF files #1095 (akmorrow13)
[ADAM-582] Eliminate .get on option in FragmentConverter. #1091 (fnothaft)
[ADAM-776] Rename duplicate OUTPUT metaVar in ADAM2Fastq. #1090 (fnothaft)
refactored ReferenceFile to require SequenceDictionary #1086 (akmorrow13)
[ADAM-1073] Remove network-connected and default test-related Maven profiles #1082 (heuermh)
[ADAM-1053] Clean up Transform #1081 (fnothaft)
[ADAM-1061] Clean up attributes regex and denormalized fields #1080 (fnothaft)
Extended TwoBitFile and NucleotideContigFragmentRDDFunctions to behave more similar #1079 (akmorrow13)
Refactor variant and genotype annotations #1078 (heuermh)
[ADAM-1039] Add basic support for Sample record. #1077 (fnothaft)
Remove code workarounds necessary for Spark 1.2.1/Hadoop 1.0.x support #1076 (heuermh)
[ADAM-194] Use separate filtersFailed and filtersPassed arrays for variant quality filters #1075 (heuermh)
Whitespace code style fixes #1074 (heuermh)
[ADAM-1006] Split überjar out to adam-assembly submodule. #1072 (fnothaft)
Remove code coverage profile #1071 (heuermh)
[ADAM-768] ReferenceRegion from variant/genotypes #1070 (fnothaft)
[ADAM-1044] Support VCF annotation ANN field #1069 (heuermh)
[ADAM-1067] Add release documentation and scripting for Spark Packages. #1068 (fnothaft)
[ADAM-602] Remove plugin code. #1065 (fnothaft)
Refactoring org.bdgenomics.adam.io package. #1064 (fnothaft)
Cleanup in org.bdgenomics.adam.converters package. #1062 (fnothaft)
[ADAM-1057] Remove workaround for gzip/BGZF compressed VCF headers #1057 (heuermh)
Cleanup on org.bdgenomics.adam.algorithms.smithwaterman package. #1056 (fnothaft)
Documentation cleanup and minor refactor on the consensus package. #1055 (fnothaft)
Add KEYS with public code signing keys #1054 (heuermh)
Adding GA4GH 0.5.1 converter for reads. #1052 (fnothaft)
[ADAM-1011] Refactor to add GenomicRDDs for all Avro types #1051 (fnothaft)
removed interval trait and redirected to interval in utils-intervalrdd #1046 (akmorrow13)
[ADAM-952] Expose sorting by reference index. #1045 (fnothaft)
overlap query reflects new formats #1043 (erictu)
Changed loadIndexedBam to use hadoop-bam InputFormat #1036 (fnothaft)
Increase Avro dependency version to 1.8.0 #1034 (heuermh)
Improved README fix using feedback from other approach review. #1034 (InvisibleTech)
Error in the README.md for kmer.scala example, need to get rdd first. #1032 (InvisibleTech)
Add fragmentEndPosition to NucleotideContigFragment #1030 (heuermh)
Logging to be done by ADAM utils code rather than Spark #1028 (jpdna)
add maxScore #1027 (xubo245)
[ADAM-1008] Modify jenkins-test script to support Java 8 build. #1026 (fnothaft)
whitespace change, do not merge #1025 (shaneknapp)
require kryo registration in tests #1020 (ryan-williams)
print full stack traces on test failures #1019 (ryan-williams)
bump commons-io version #1017 (ryan-williams)
exclude javadoc jar in adam-shell #1016 (ryan-williams)
[ADAM-909] Refactoring variation RDDs. #1015 (fnothaft)
Modified CalculateDepth to get coverage on whole alignment adam files #1010 (akmorrow13)
[ADAM-1004] Remove recursive maven.build.timestamp declaration #1005 (heuermh)
Maint 2.11 0.19.0 #999 (tushu1232)
[ADAM-710] Add saveAs methods for feature formats GTF, BED, IntervalList, and NarrowPeak #998 (heuermh)
Moving Adam2Fastq to ADAM2Fastq #995 (heuermh)
Update release doc for CHANGES.md and homebrew #994 (heuermh)
Update to AlignmentRecordField and its usages as contig changed to co… #992 (jpdna)
[ADAM-974] Short term fix for multiple ADAM cli assembly jars check #990 (heuermh)
Update hadoop-bam dependency version to 7.5.0 #989 (heuermh)
Replaced Contig with ContigName in AlignmentRecord and related changes #988 (jpdna)
fix some deprecation/style things and rename a pkg #986 (ryan-williams)
Fix Adam2fastq in case of read with both reverse and unmapped flags #982 (jpdna)
[ADAM-510] Refactoring RDD function names #979 (heuermh)
Use .adam/_{seq,rg}dict.avro paths for Avro-formatted dictionaries #978 (heuermh)
Remove unused file VcfHeaderUtils.scala #977 (heuermh)
add validation stringency to bam parsing, flagstat #976 (ryan-williams)
more permissible jar regex in adam-submit #975 (ryan-williams)
fix bash arg array processing in adam-submit #972 (ryan-williams)
adamGetReferenceString reduces pairs correctly, fixes #967 #970 (erictu)
A few improvements #966 (ryan-williams)
improve SW performance by replacing functional reductions with imperative ones #965 (noamBarkai)
[ADAM-962] Fix corrupt single-file BAM output. #964 (fnothaft)
[ADAM-960] Updating bdg-utils dependency version to 0.2.4 #961 (heuermh)
[ADAM-946] Fixes to FlagStat for Samtools concordance issue #954 (jpdna)
Use hadoop-bam BAMInputFormat to do loadIndexedBam #953 (andrewmchen)
Add -print_metrics option to Jenkins build #947 (heuermh)
adam2vcf doesn't have info fields #939 (andrewmchen)
[ADAM-893] Register missing serializers. #933 (fnothaft)

Version 0.19.0

Closed issues:

Update bdg-utils dependency version to 0.2.4 #960
Drop support for Spark version 1.2.1, Hadoop version 1.0.x #958
Exception occurs when running tests on master #956
Flagstat results still don't match samtools flagstat #946
readInFragment value is not properly read from parquet file into RDD[AlignmentRecord] #942
adam2vcf -sort_on_save flag broken #940
Transform -limit_projection requires .sam.seqdict file #937
MarkDuplicates fails if library name is not set #934
fastqtobam or sam #928
Vcf2Adam uses SB field instead of FS field for fisher exact test for strand bias #923
Add back limit_projection on Transform #920
BAM header is not getting set on partition 0 with headerless BAM output format #916
Add numParts apply method to GenomicRegionPartitioner #914
Add Spark version 1.6.x to Jenkins build matrix #913
Target Spark 1.5.2 as default Spark version #911
Move to bdg-formats 0.7.0 #905
secondOfPair and firstOfPair flag is missing in the newest 0.18 adam transformed results from BAM #903
Future pull request #900
error in vcf2adam #899
Importing directory of VCFs seems to fail #898
How to filter genotypeRDD on sample names? org.apache.spark.SparkException: Task not serializable? #891
Add Spark version 1.5.x to Jenkins build matrix #889
Transform DAG causes stages to recompute #883
adam-submit buildinfo is confused #880
move_to_scala_2.11 and maven-javadoc-plugin #863
NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable #837
Fix record oriented shuffle #599
Avro.GenericData error with ADAM 0.12.0 on reading from ADAM file #290

Merged and closed pull requests:

[ADAM-960] Updating bdg-utils dependency version to 0.2.4 #961 (heuermh)
[ADAM-946] Fixes to FlagStat for Samtools concordance issue #954 (jpdna)
Fix for travis build, replace reads2ref with reads2fragments #950 (heuermh)
[ADAM-940] Fix adam2vcf -sort_on_save flag #949 (massie)
Remove BuildInformation and extraneous git-commit-id-plugin configuration #948 (heuermh)
Update readme for spark 1.5.2 and hadoop 2.6.0 #944 (heuermh)
[ADAM-942] Replace first/secondInRead with readInFragment #943 (heuermh)
[ADAM-937] Adding check for aligned read predicate or limit projection flags and non-parquet input path #938 (heuermh)
[ADAM-934] Properly handle unset library name during duplicate marking #935 (fnothaft)
[ADAM-911] Move to Spark 1.5.2 and Hadoop 2.6.0 as default versions. #932 (fnothaft)
added start and end values to Interval Trait. Used for IntervalRDD #931 (akmorrow13)
Removing buildinfo command #929 (heuermh)
Removing symbolic test resource links, read from test classpath instead #927 (heuermh)
Changed fisher strand bias field for VCF2Adam from SB to FS #924 (andrewmchen)
[ADAM-920] Limit tag/orig qual flags in Transform. #921 (fnothaft)
Change the README to use adam-shell -i instead of pasting #919 (andrewmchen)
[ADAM-916] New strategy for writing header. #917 (fnothaft)
[ADAM-914] Create a GenomicRegionPartitioner given a partition count. #915 (fnothaft)
Squashed #907 and ran format-sources #908 (fnothaft)
Various small fixes #907 (huitseeker)
ADAM-599, 905: Move to bdg-formats:0.7.0 and migrate metadata #906 (fnothaft)
Rewrote the getType method to handle all ploidy levels #904 (NeillGibson)
Single file save from #733, rebased #901 (fnothaft)
Added is* genotype methods from HTS-JDK Genotype to RichGenotype #895 (NeillGibson)
[ADAM-891] Mark SparkContext as @transient. #894 (fnothaft)
Update README URLs based on HTTP redirects #892 (ReadmeCritic)
adding --version command line option #888 (heuermh)
Add exception in move_to_scala_2.11.sh for maven-javadoc-plugin #887 (heuermh)
Fix tightlist bug in Pandoc #885 (massie)
[ADAM-883] Add caching to Transform pipeline. #884 (fnothaft)

Version 0.18.2

ISSUE 877: Minor fix to commit script to support https.
ISSUE 876: Separate command line argument words by underscores
ISSUE 875: P Operator parsing for MDTag
ISSUE 873: [ADAM-872] Modify regex to capture release and SNAPSHOT jars but not javadoc or sources jars
ISSUE 866: [ADAM-864] Don't force shuffle if reducing partition count.
ISSUE 856: export valid fastq
ISSUE 847: Updating build dependency versions to latest minor versions

Version 0.18.1

ISSUE 870: [ADAM-867] add pull requests missing from 0.18.0 release to CHANGES.md
ISSUE 869: [ADAM-868] make release branch and tag names consistent
ISSUE 862: [ADAM-861] use -d to check for repo assembly dir

Version 0.18.0

ISSUE 860: New release and pr-commit scripts
ISSUE 859: [ADAM-857] Corrected handling of env vars in bin scripts
ISSUE 854: [ADAM-853] allow main class in adam-submit to be specified
ISSUE 852: [ADAM-851] Silenced Parquet logging.
ISSUE 850: [ADAM-848] TwoBitFile now supports nBlocks and maskBlocks
ISSUE 846: Updating maven build plugin dependency versions
ISSUE 845: [ADAM-780] Make DecadentRead package private.
ISSUE 844: [ADAM-843] Aggressively project out metadata fields.
ISSUE 840: fix flagstat output file encoding
ISSUE 839: let flagstat write to file
ISSUE 831: Support loading paired fastqs
ISSUE 830: better validation when saving paired fastqs
ISSUE 829: fix Long != null warnings
ISSUE 819: Implement custom ReferenceRegion hashcode
ISSUE 816: [ADAM-793] adding command to convert ADAM nucleotide contig fragments to FASTA files
ISSUE 815: Upgrade to bdg-formats:0.6.0, add Fragment datatype converters
ISSUE 814: [ADAM-812] fix for javadoc errors on JDK8
ISSUE 813: [ADAM-808] build an assembly cli jar with maven shade plugin
ISSUE 810: [ADAM-807] workaround for git-commit-id/git-commit-id-maven-plugin#61
ISSUE 809: [ADAM-785] Add support for all numeric array (TYPE=B) tags
ISSUE 806: [ADAM-755] updating utils dependency version to 0.2.3
ISSUE 805: Better transform error when file doesn't exist
ISSUE 803: fix unmapped-read sorting
ISSUE 802: stop writing contig names as md5 sums
ISSUE 798: fix SAM-attr conversion bug; int[]'s not byte[]'s
ISSUE 790: optionally add MDTags to reads with transform
ISSUE 782: Fix SAM Attribute parser for numeric array tags
ISSUE 773: [ADAM-772] fix some bash var quoting
ISSUE 765: [ADAM-752] Build for many combos of Spark/Hadoop versions.
ISSUE 764: More involved README restructuring
ISSUE 762: [ADAM-132] allowing list of commands to be injected into adam-cli ADAMMain

Version 0.17.1

ISSUE 784: [ADAM-783] Write @SQ header lines in sorted order.
ISSUE 792: [ADAM-791] Add repartition parameter to Fasta2ADAM.
ISSUE 781: [ADAM-777] Add validation stringency flag for BQSR.
ISSUE 757: We should print a warning message if the user has ADAM_OPTS set.
ISSUE 770: [ADAM-769] Fix serialization issue in known indel consensus model.
ISSUE 763: Clean up README links, other nits
ISSUE 749: Remove adam-cli jar from classpath during adam-submit
ISSUE 754: Bump ADAM to Spark 1.4
ISSUE 753: Bump Spark to 1.4
ISSUE 748: Fix for mdtag issues with insertions
ISSUE 746: Upgrade to Parquet 1.8.1.
ISSUE 744: [ADAM-743] exclude conflicting jackson dependencies
ISSUE 737: Reverse complement negative strand reads in fastq output
ISSUE 731: Fixed bug preventing use of TLEN attribute
ISSUE 730: [ADAM-729] Stuff TLEN into attributes.
ISSUE 728: [ADAM-709] Remove FeatureHierarchy and FeatureHierarchySuite
ISSUE 719: [ADAM-718] Use filesystem path to get underlying file system.
ISSUE 712: unify header-setting between BAM/SAM and VCF
ISSUE 696: include SequenceRecords from second-in-pair reads
ISSUE 698: class-ify ShuffleRegionJoin, force setting seqdict
ISSUE 706: restore clause guarding pruneCache check
ISSUE 705: GeneFeatureRDDFunctions → FeatureRDDFunctions

Version 0.17.0

ISSUE 691: fix BAM/SAM header setting when writing on cluster
ISSUE 688: make adamLoad public
ISSUE 694: Fix parent reference in distribution module
ISSUE 684: a few region-join nits
ISSUE 682: [ADAM-681] Remove menacing error message about reqd .adam extension
ISSUE 680: [ADAM-674] Delete Bam2ADAM.
ISSUE 678: upgrade to bdg utils 0.2.1
ISSUE 668: [ADAM-597] Move correction out of ADAM and into a downstream project.
ISSUE 671: Bug fix in ReferenceUtils.unionReferenceSet
ISSUE 667: [ADAM-666] Clean up key not found error in partitioner code.
ISSUE 656: Update Vcf2ADAM.scala
ISSUE 652: added filterByOverlappingRegion in GeneFeatureRDDFunctions
ISSUE 650: [ADAM-649] Support transform of all BAM/SAM files in a directory.
ISSUE 647: [ADAM-646] Special case reads with '*' quality during BQSR.
ISSUE 645: [ADAM-634] Create a local ParquetLister for testing purposes.
ISSUE 633: [Adam] Tests for SAMRecordConverter.scala
ISSUE 641: [ADAM-640] Fix incorrect exclusion for org.seqdoop.htsjdk.
ISSUE 632: [ADAM-631] Allow VCF conversion to sort on output after coalescing.
ISSUE 628: [ADAM-627] Makes ReferenceFile trait extend Serializable.
ISSUE 637: check for mac brew alternate spark install structure
ISSUE 624: Conceptual fix for duplicate marking and sorting stragglers
ISSUE 629: [ADAM-604] Remove normalization code.
ISSUE 630: Add flatten command.
ISSUE 619: [ADAM-540] Move to new HTSJDK release; should support Java 8.
ISSUE 626: [ADAM-625] Enable globbing for BAM.
ISSUE 621: Removes the predicates package.
ISSUE 620: [ADAM-600] Adding RegionJoin trait.
ISSUE 616: [ADAM-565] Upgrade to Parquet filter2 API.
ISSUE 613: [ADAM-612] Point to proper k-mer counters.
ISSUE 588: [ADAM-587] Clean up loading checks.
ISSUE 592: [ADAM-513] Remove ReferenceMappable trait.
ISSUE 606: [ADAM-605] Remove visualization code.
ISSUE 596: [ADAM-595] Delete the 'comparisons' code.
ISSUE 590: [ADAM-589] Removed pileup code.
ISSUE 586: [ADAM-452] Fixes SM attribute on ADAM to BAM conversion.
ISSUE 584: [ADAM-583] Add k-mer counting functionality for nucleotide contig fragments

Version 0.16.0

ISSUE 570: A few small conversion fixes
ISSUE 579: [ADAM-578] Update end of read when trimming.
ISSUE 564: [ADAM-563] Add warning message when saving Parquet files with incorrect extension
ISSUE 576: Changed hashCode implementations to improve performance of BQSR
ISSUE 569: Typo in the narrowPeak parser
ISSUE 568: Moved the Timers object from bdg-utils back to ADAM
ISSUE 478: Move non-genomics code
ISSUE 550: [ADAM-549] Added documentation for testing and CI for ADAM.
ISSUE 555: Makes maybeLoadVCF private.
ISSUE 558: Makes Features2ADAMSuite use SparkFunSuite
ISSUE 557: Randomize ports and turn off Spark UI to reduce bind exceptions in tests
ISSUE 552: Create test suite for FlagStat
ISSUE 554: privatize ADAMContext.maybeLoad{Bam,Fastq}
ISSUE 551: [ADAM-386] Multiline FASTQ input
ISSUE 542: Variants Visualization
ISSUE 545: [ADAM-543][ADAM-544] Fix issues with ADAM scripts and classpath
ISSUE 535: [ADAM-441] put a check in for Nothing. Throws an IAE if no return type is provided
ISSUE 546: [ADAM-532] Fix wigFix intermittent test failure
ISSUE 534: [ADAM-528][ADAM-533] Adds new RegionJoin impl that is shuffle-based
ISSUE 531: [ADAM-529] Attaching scaladoc to released distribution.
ISSUE 413: [ADAM-409][ADAM-520] Added local wigfix2bed tool
ISSUE 527: [ADAM-526] VcfAnnotation2ADAM only counts once
ISSUE 523: don't open non-.adam-extension files as ADAM files
ISSUE 521: quieting wget output
ISSUE 482: [ADAM-462] Coverage region calculation
ISSUE 515: [ADAM-510] fix for bash syntax error; add ADDL_JARS check to adam-submit

Version 0.15.0

ISSUE 509: Add a 'distribution' module to create assemblies
ISSUE 508: Upgrade from Parquet 1.4.3 to 1.6.0rc4
ISSUE 498: [ADAM-496] Changes VCF to flat ADAM command name and usage
ISSUE 500: [ADAM-495] Require SPARK_HOME for adam-submit
ISSUE 501: [ADAM-499] Add -onlyvariants option to vcf2adam
ISSUE 507: [ADAM-505] Removed adam-local from docs
ISSUE 504: [ADAM-502] Add missing Long implicit to ColumnReaderInput
ISSUE 503: [ADAM-473] Make RecordCondition and FieldCondition public
ISSUE 494: Fix foreach block for vcf ingest
ISSUE 492: Documentation cleanup and style improvements
ISSUE 481: [ADAM-480] Switch assembly to single goal.
ISSUE 487: [ADAM-486] Add port option to viz command.
ISSUE 469: [ADAM-461] Fix ReferenceRegion and ReferencePosition impl
ISSUE 440: [ADAM-439] Fix ADAM to account for BDG-FORMATS-35: Avro uses Strings
ISSUE 470: added ReferenceMapping for Genotype, filterByOverlappingRegion for GenotypeRDDFunctions
ISSUE 468: refactor RDD loading; explicitly load alignments
ISSUE 474: Consolidate documentation into a single location in source.
ISSUE 471: Fixed typo on MAVEN_OPTS quotation mark
ISSUE 467: [ADAM-436] Optionally output original qualities to fastq
ISSUE 451: add adam view command, analogous to samtools view
ISSUE 466: working examples on .sam included in repo
ISSUE 458: Remove unused val from Reads2Ref
ISSUE 438: Add ability to save paired-FASTQ files
ISSUE 457: A few random Predicate-related cleanups
ISSUE 459: a few tweaks to scripts/jenkins-test
ISSUE 460: Project only the sequence when kmer/qmer counting
ISSUE 450: Refactor some file writing and reading logic
ISSUE 455: [ADAM-454] Add serializers for Avro objects which don't have serializers
ISSUE 447: Update the contribution guidelines
ISSUE 453: Better null handling for isSameContig utility
ISSUE 417: Stores original position and original cigar during realignment.
ISSUE 449: read “OQ” attr from structured SAMRecord field
ISSUE 446: Revert "[ADAM-237] Migrate to Chill serialization libraries."
ISSUE 437: random nits
ISSUE 434: Few transform tweaks
ISSUE 435: [ADAM-403] Remove seqDict from RegionJoin
ISSUE 431: A few tweaks, typo corrections, and random cleanups
ISSUE 430: [ADAM-429] adam-submit now handles args correctly.
ISSUE 427: Fixes for indel realigner issues
ISSUE 418: [ADAM-416] Removing 'ADAM' prefix
ISSUE 404: [ADAM-327] Adding gene, transcript, and exon models.
ISSUE 414: Fix error in adam-local alias
ISSUE 415: Update README.md to reflect Spark 1.1
ISSUE 412: [ADAM-411] Updated usage aliases in README. Fixes #411.
ISSUE 408: [ADAM-405] Add FASTQ output.
ISSUE 385: [ADAM-384] Adds import from FASTQ.
ISSUE 400: [ADAM-399] Fix link to schemas.
ISSUE 396: [ADAM-388] Sets Kryo serialization with --conf args
ISSUE 394: [ADAM-393] Adds knobs to SparkContext creation in SparkFunSuite
ISSUE 391: [ADAM-237] Migrate to Chill serialization libraries.
ISSUE 380: Rewrite of MarkDuplicates which seems to improve performance
ISSUE 387: fix some deprecation warnings

Version 0.14.0

ISSUE 376: [ADAM-375] Upgrade to Hadoop-BAM 7.0.0.
ISSUE 378: [ADAM-360] Upgrade to Spark 1.1.0.
ISSUE 379: Fix the position of the jar path in the submit.
ISSUE 383: Make Mdtags handle '=' and 'X' cigar operators
ISSUE 369: [ADAM-369] Improve debug output for indel realigner
ISSUE 377: [ADAM-377] Update to Jenkins scripts and README.
ISSUE 374: [ADAM-372][ADAM-371][ADAM-365] Refactoring CLI to simplify and integrate with Spark model better
ISSUE 370: [ADAM-367] Updated alias in README.md
ISSUE 368: erasure, nonexhaustive-match, deprecation warnings
ISSUE 354: [ADAM-353] Fixing issue with SAM/BAM/VCF header attachment when running distributed
ISSUE 357: [ADAM-357] Added Java Plugin hook for ADAM.
ISSUE 352: Fix failing MD tag
ISSUE 363: Adding maven assembly plugin configuration to create tarballs
ISSUE 364: [ADAM-364] Fixing remaining cs.berkeley.edu URLs.
ISSUE 362: Remove mention of uberjar from README

Version 0.13.0

ISSUE 343: Allow retrying on failure for HTTPRangedByteAccess
ISSUE 349: Fix for a NullPointerException when hostname is null in Task Metrics
ISSUE 347: Bug fix for genome browser
ISSUE 346: Genome visualization
ISSUE 342: [ADAM-309] Update to bdg-formats 0.2.0
ISSUE 333: [ADAM-332] Upgrades ADAM to Spark 1.0.1.
ISSUE 341: [ADAM-340] Adding the TrackedLayout trait and implementation.
ISSUE 337: [ADAM-335] Updated README.md to reflect migration to appassembler.
ISSUE 311: Adding several simple normalizations.
ISSUE 330: Make mismatch and deletes positions accessible
ISSUE 334: Moving code coverage into a profile
ISSUE 329: Add count of mismatches to mdtag
ISSUE 328: [ADAM-326] Adding a 5-second retry on the HttpRangedByteAccess test.
ISSUE 325: Adding documentation for commit/issue nomenclature and rebasing

Version 0.12.1

ISSUE 308: Fixing the 'index 0' bug in features2adam
ISSUE 306: Adding code for lifting over between sequences and the reference genome.
ISSUE 320: Remove extraneous implicit methods in ReferenceMappingContext
ISSUE 314: Updates to indel realigner to improve performance and accuracy.
ISSUE 319: Adding scripts for publishing scaladoc.
ISSUE 315: Added table of (wall-clock) stage durations when print_metrics is used
ISSUE 312: Fixing sources jar
ISSUE 313: Making the CredentialsProperties file optional
ISSUE 267: Parquet and indexed Parquet RDD implementations, and indices.
ISSUE 301: Add Beacon's AlleleCount
ISSUE 293: Add aggregation and display of metrics obtained from Spark
ISSUE 295: Fix broken link to ADAM specification for storing reads.
ISSUE 292: Cleaning up scaladoc generation warnings.
ISSUE 289: Modifying interleaved fastq format to be hadoop version independent.
ISSUE 288: Add ADAMFeature to Kryo registrator
ISSUE 286: Removing some debug printout that was left in.
ISSUE 287: Cleaning hadoop dependencies
ISSUE 285: Refactoring read groups to increase the amount of data stored.
ISSUE 284: Cleaning up build warnings.
ISSUE 280: Move to bdg-formats
ISSUE 283: Fix reference name comment
ISSUE 282: Minor cleanup on interleaved FASTQ input format.
ISSUE 277: Implemented HTTPRangedByteAccess.
ISSUE 274: Added clarifying note to ADAMVariantContext
ISSUE 279: Simplify format-source
ISSUE 278: Use maven license plugin to ensure source has correct license
ISSUE 268: Adding fixed depth prefix trie implementation
ISSUE 273: Fixes issue in reference models where strings are not sanitized on collection from avro.
ISSUE 272: Created command categories
ISSUE 269: Adding k-mer and q-mer counting.
ISSUE 271: Consolidate Parquet logging configuration

Version 0.12.0

ISSUE 264: Parquet-related Utility Classes
ISSUE 259: ADAMFlatGenotype is a smaller, flat version of a genotype schema
ISSUE 266: Removed extra command 'BuildInformation'
ISSUE 263: Added AdamContext.referenceLengthFromCigar
ISSUE 260: Modifying conversion code to resolve #112.
ISSUE 258: Adding an 'args' parameter to the plugin framework.
ISSUE 262: Adding reference assembly name to ADAMContig.
ISSUE 256: Upgrading to Spark 1.0
ISSUE 257: Adds toString method for sequence dictionary.
ISSUE 255: Add equals, canEqual, and hashCode methods to MdTag class

Version 0.11.0

ISSUE 254: Cleanup import statements
ISSUE 250: Adding ADAM to SAM conversion.
ISSUE 248: Adding utilities for read trimming.
ISSUE 252: Added a note about rebasing-off-master to CONTRIBUTING.md
ISSUE 249: Cosmetic changes to FastaConverter and FastaConverterSuite.
ISSUE 251: CHANGES.md is updated at release instead of per pull request
ISSUE 247: For #244, Fragments were incorrect order and incomplete
ISSUE 246: Making sample ID field in genotype nullable.
ISSUE 245: Adding ADAMContig back to ADAMVariant.
ISSUE 243: Rebase PR#238 onto master

Version 0.10.0

ISSUE 242: Upgrade to Parquet 1.4.3
ISSUE 241: Fixes to FASTA code to properly handle indices.
ISSUE 239: Make ADAMVCFOutputFormat public
ISSUE 233: Build up reference information during cigar processing
ISSUE 234: Predicate to filter conversion
ISSUE 235: Remove unused contiglength field
ISSUE 232: Add -pretty and -o to the print command
ISSUE 230: Remove duplicate mdtag field
ISSUE 231: Helper scripts to run an ADAM Console.
ISSUE 226: Fix ReferenceRegion from ADAMRecord
ISSUE 225: Change Some to Option to check for unmapped reads
ISSUE 223: Use SparkConf object to configure SparkContext
ISSUE 217: Stop using reference IDs and use reference names instead
ISSUE 220: Update SAM to ADAM conversion
ISSUE 213: BQSR updates

Version 0.9.0

ISSUE 214: Upgrade to Spark 0.9.1
ISSUE 211: FastaConverter Refactor
ISSUE 212: Cleanup build warnings
ISSUE 210: Remove Scalariform from process-sources phase
ISSUE 209: Fix Scalariform issues and Maven warnings
ISSUE 207: Change from deprecated manifest erasure to runtimeClass
ISSUE 206: Add Scalariform settings to pom
ISSUE 204: Update Avro code gen to not mark fields as deprecated.

Version 0.8.0

ISSUE 203: Move package from edu.berkeley.cs.amplab to org.bdgenomics
ISSUE 199: Updating pileup conversion code to convert sequences that use the X and = (EQ) CIGAR operators
ISSUE 191: Add repartition parameter
ISSUE 183: Fixing Job.getInstance call that breaks hadoop 1 compatibility.
ISSUE 192: Add docs and scripts for creating a release
ISSUE 193: Issue #137, clarify role of CHANGES.{md,txt}

Version 0.7.2

ISSUE 187: Add summarize_genotypes command
ISSUE 178: Upgraded to Hadoop-BAM 0.6.2/Picard 1.107.
ISSUE 173: Parse annotations out of vcf files
ISSUE 162: Refactored SequenceDictionary
ISSUE 180: BQSR using vcf loader
ISSUE 179: Update maven-surefire-plugin dependency version to 2.17, also create an ...
ISSUE 175: VariantContext converter refactor
ISSUE 169: Cleaning up mpileup command
ISSUE 170: Adding variant field enumerations

Version 0.7.1

Version 0.7.3

Version 0.7.2

ISSUE 166: Pair-wise genotype concordance of genotype RDDs, with CLI tool

Version 0.7.0

ISSUE 171: Add back in allele dosage for genotypes.

Version 0.7.0

ISSUE 167: Fix for Hadoop 1.0.x support
ISSUE 165: call PluginExecutor in apply method, fixes issue 164
ISSUE 160: Refactoring FASTA work to break contig sizes.
ISSUE 78: Upgrade to Spark 0.9 and Scala 2.10
ISSUE 138: Display Git commit info on command line
ISSUE 161: Added switches to spark context creation code
ISSUE 117: Add a "range join" method.
ISSUE 151: Vcf work concordance and genotype
ISSUE 150: Remaining variant changes for adam2vcf, unit tests, and CLI modifications
ISSUE 147: Resurrect VCF conversion code
ISSUE 148: Moving createSparkContext into core
ISSUE 142: Enforce Maven and Java versions
ISSUE 144: Merge of last few days of work on master into this branch
ISSUE 124: Vcf work rdd master merge
ISSUE 143: Changing package declaration to match test file location and removing un...
ISSUE 140: Update README.md
ISSUE 139: Update README.md
ISSUE 129: Modified pileup transforms to improve performance + to add options
ISSUE 116: add fastq interleaver script
ISSUE 125: Add design doc to CONTRIBUTING document
ISSUE 114: Changes to RDD utility files for new variant schema
ISSUE 122: Add IRC Channel to readme
ISSUE 100: CLI component changes for new variant schema
ISSUE 108: Adding new PluginExecutor command
ISSUE 98: Vcf work remove old variant
ISSUE 104: Added the port erasure to SparkFunSuite's cleanup.
ISSUE 107: Cleaning up change documentation.
ISSUE 99: Encoding tag types in the ADAMRecord attributes, adding the 'tags' command
ISSUE 105: Add initial documentation on contributing
ISSUE 97: New schema, variant context converter changes, and removal of old genoty...
ISSUE 79: Adding ability to convert reference FASTA files for nucleotide sequences
ISSUE 91: Minor change, increase adam-cli usage width to 150 characters
ISSUE 86: Fixes to pileup code
ISSUE 88: Added function for building variant context from genotypes.
ISSUE 81: Update README and cleanup top-level cli help text
ISSUE 76: Changing hadoop fs call to be compatible with Hadoop 1.
ISSUE 74: Updated CHANGES.txt to include note about the recursive-load branch.
ISSUE 73: Support for loading/combining multiple ADAM files into a single RDD.
ISSUE 72: Added ability to create regions from reads, and to merge adjacent regions
ISSUE 71: Change RecalTable to use optimized phred calculations
ISSUE 68: sonatype-nexus-snapshots repository is already in parent oss-parent-7 pom
ISSUE 67: fix for wildcard exclusion maven warnings
ISSUE 65: Create a cache for phred -> double values instead of recalculating
ISSUE 60: Bugfix for BQSR: Offset into qualityScore list was wrong
ISSUE 66: add pluginDependency section and remove versions in plugin sections
ISSUE 61: Filter utility for inverse of Projection
ISSUE 48: Fix read groups mapping and add Y as base type
ISSUE 36: Adding reads to rods transformation.
ISSUE 56: Adding Yy as base in MdTag

Version 0.6.0

ISSUE 53: Fix Hadoop 2.2.0 support, upgrade to Spark 0.8.1
ISSUE 52: Attributes: Use '\t' instead of ',', as ',' is a valid character
ISSUE 47: Adding containsRefName to SequenceDictionary
ISSUE 46: Reduce logging for the actual adamSave job
ISSUE 45: Make MdTag immutable
ISSUE 38: Small bugfixes and cleanups to BQSR
ISSUE 40: Fixing reference position from offset implementation
ISSUE 31: Fixing a few issues in the ADAM2VCF2ADAM pipeline.
ISSUE 30: Suppress parquet logging in FieldEnumerationSuite
ISSUE 28: Fix build warnings
ISSUE 24: Add unit tests for marking duplicates
ISSUE 26: Fix unmapped reads in sequence dictionary
ISSUE 23: Generalizing the Projection class
ISSUE 25: Adding support for before, after clauses to SparkFunSuite.
ISSUE 22: Add a unit test for sorting reads
ISSUE 21: Adding rod functionality: a specialized grouping of pileup data.
ISSUE 13: Cleaning up VCF<->ADAM pipeline
ISSUE 20: Added Apache License 2.0 boilerplate to tops of all the GB-(c) files
ISSUE 19: Allow the Hadoop version to be specified
ISSUE 17: Fix transform -sort_reads partitioning. Add -coalesce option to transform.
ISSUE 16: Fixing an issue in pileup generation and in the MdTag util.
ISSUE 15: Tweaks 1
ISSUE 12: Subclass testing bug in AdamContext.adamLoad
ISSUE 11: Missing brackets in VcfConverter.getType
ISSUE 10: Moved record field name enum over to the projections package.
ISSUE 8: Fixes to sorting in ReferencePosition
ISSUE 4: New SparkFunSuite test support class, logging util and new BQSR test.
ISSUE 1: Fix scalatest configuration and fix unit tests
ISSUE 14: Converting some of the Option() calls to Some()
ISSUE 9: Adding support for a Sequence Dictionary from BAM files
ISSUE 7: ADAM variant and genotype formats; and a VCF->ADAM converter
ISSUE 3: Adding in implicit conversion functions for going between Java and Scala...
ISSUE 2: Update from Spark 0.7.3 to 0.8.0-incubating
