Bug 1685769 - Add cloud monitoring export script and jobs #1679
Conversation
Yes, that's an existing pattern and this looks like a reasonable approach.

I won't have time for a full review today, so I can take a look early next week. Otherwise, feel free to ask someone else for a full review.

Ok, no rush on this since it's still blocked by the grpc issue and also permissions. Thanks
@@ -331,3 +331,14 @@ bqetl_desktop_platform:
       ]
       retries: 2
       retry_delay: 30m

 bqetl_cloud_monitoring_export:
   schedule_interval: 0 * * * *
I believe this is equivalent to @daily, and we should prefer that shorthand if it's what we use consistently elsewhere in this repo.
This is hourly, but yes, @hourly would work.
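For reference, a minimal sketch of how the DAG entry could use the shorthand (the retry settings here are copied from the neighboring bqetl_desktop_platform entry purely for illustration, not taken from this PR):

```yaml
bqetl_cloud_monitoring_export:
  # "@hourly" is equivalent to "0 * * * *"
  schedule_interval: "@hourly"
  retries: 2
  retry_delay: 30m
```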
    # get existing timestamps in destination table to avoid overlap
    if not overwrite and len(time_series_data) > 0:
        time_series_data = filter_existing_data(
            time_series_data,
            bq_client,
            target_table,
            start_time,
            end_time,
        )
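The de-duplication step above could look roughly like this pure-logic sketch. The real `filter_existing_data` presumably queries the destination table for timestamps between `start_time` and `end_time` using the BigQuery client; here the already-present timestamps are passed in as a set so the filtering itself is visible in isolation, and the `point["timestamp"]` record shape is an assumption, not the script's actual structure:

```python
from datetime import datetime


def filter_existing_data(time_series_data, existing_timestamps):
    """Drop points whose timestamp already exists in the destination table.

    `existing_timestamps` stands in for the result of a BigQuery query over
    [start_time, end_time); the real helper takes a client and a target
    table and runs that query itself.
    """
    return [
        point
        for point in time_series_data
        if point["timestamp"] not in existing_timestamps
    ]
```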
My inclination would be to create these monitoring tables with hourly partitioning, and then have these scripts atomically overwrite the target partition (specifying the destination table as mytable$2021011201, etc.), avoiding the need for filtering logic like this.
We could achieve some simplification by having all this machinery assume that it's operating on one whole hour at a time. I may well be missing some nuance, though, so definitely open to pushback.
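The partition-decorator idea can be sketched as follows: build a `table$YYYYMMDDHH` destination string and write to it with `WRITE_TRUNCATE` so BigQuery atomically replaces exactly that hour's partition. The helper name below is mine, not from the PR:

```python
from datetime import datetime


def hourly_partition_decorator(table: str, hour: datetime) -> str:
    """Return a BigQuery hourly partition decorator, e.g. 'mytable$2021011201'.

    A load or query job whose destination is this decorated table, run with
    write_disposition=WRITE_TRUNCATE, replaces only that hour's partition,
    so no overlap-filtering logic is needed.
    """
    return f"{table}${hour:%Y%m%d%H}"
```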
Going to discuss this with SRE before continuing.

Going with influxdb instead. See https://bugzilla.mozilla.org/show_bug.cgi?id=1619406
https://bugzilla.mozilla.org/show_bug.cgi?id=1685769
This is blocked by the grpc size limit in the monitoring library (googleapis/python-monitoring#62). Some intervals for some metrics can't be exported with the current `google-cloud-monitoring==2.0.0`, and I don't think it's possible to pip install from git with `pip-compile --generate-hashes`.

@jklukas does this structure of a `query.py` in the `project/dataset/table` directory that uses a file in the `bigquery-etl` module fit into the bigquery-etl pattern?
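For concreteness, the layout being asked about would look something like this; the placeholder names are made up for illustration and not taken from the PR:

```
sql/<project>/<dataset>/<table>/
    query.py    # script-based query that imports helpers from the bigquery_etl package
```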