- 2020-06-01 (available until commit
f1d6464
)- Remove
DeduplicateByDocumentId
transform, which was intended for use with the backfill fromheka
data, but did not perform well and was never used in production. - Remove
PublishBundleMetrics
which was temporarily useful for investigation into potential batch refactoring, but we were able to use that data to determine that average bundle size is so small that it would not be reasonable to publish individual GCS objects per bundle; see #501.
- Remove
- 2020-02-26 (available until commit
6551bc7
)- Remove support for schema aliases, including the
--schemaAliasesLocation
parameter defined inSinkOptions
.
- Remove support for schema aliases, including the
- 2020-02-04 (available until commit
7e60dfc
)- Remove patched
WritePartition.java
that limits the maximum number of bytes in a single BigQuery load job; Beam 2.19 exposes a configuration parameter we now use for the same effect.
- Remove patched
- 2020-01-24 (available until commit
16d7702
)HekaReader
and theheka
input type were removed- The
sanitizedLandfill
input format was removed along with AWS dependencies