M3: Improving the analysis pipeline

No due date 16% complete

The BigQuery datasets are reorganized to support partitioning and clustering.
The summary datasets are generated from the HAR files in Dataflow.
Content pre-processing (Rework CSS parsing) is done by Dataflow and written to BigQuery.
2021 Web Almanac queries run monthly and their results are stored. (Cloud SQL + BQ?)
Web Almanac metrics are well-documented and easily extensible.
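
To make the second goal concrete, here is a minimal sketch of how summary rows might be derived from a HAR file. It relies only on the standard HAR 1.2 layout (`log.entries[].request` and `log.entries[].response`). The function name `summarize_har`, the output column names, and the sample data are hypothetical illustrations, not the project's actual schema or pipeline code.

```python
import json
from typing import Any, Dict, Iterator


def summarize_har(har: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield one flat summary row per request entry in a HAR document.

    The output column names are assumptions chosen for illustration.
    """
    for entry in har.get("log", {}).get("entries", []):
        request = entry.get("request", {})
        response = entry.get("response", {})
        content = response.get("content", {})
        yield {
            "url": request.get("url"),
            "method": request.get("method"),
            "status": response.get("status"),
            "mime_type": content.get("mimeType"),
            "body_size": response.get("bodySize"),
            "time_ms": entry.get("time"),
        }


# Hypothetical single-request HAR used only to exercise the function.
SAMPLE_HAR = json.loads("""
{
  "log": {
    "entries": [
      {
        "time": 120,
        "request": {"method": "GET", "url": "https://example.com/"},
        "response": {
          "status": 200,
          "bodySize": 1024,
          "content": {"mimeType": "text/html", "size": 2048}
        }
      }
    ]
  }
}
""")

rows = list(summarize_har(SAMPLE_HAR))
print(rows)
```

In a Dataflow job, a function of this shape could plausibly be applied once per HAR file (for example through Apache Beam's `FlatMap`), with the resulting rows written to a BigQuery table. How the real pipeline is structured is not stated here.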