This issue was moved to a discussion.

Store metrics using explicit schema #165

Closed · ants opened this issue Apr 5, 2023 · 2 comments
Assignees
Labels
enhancement New feature or request refactoring Something done as it should've been done from the start sinks Where and how to store monitored data

Comments

@ants commented Apr 5, 2023

In pgw2, data and tags are stored as jsonb. This duplicates the schema information in every row, which hurts storage efficiency for uncompressed data and defeats timeseries compression mechanisms for compressed data. Additionally, any access to the data has to decompress the whole document instead of reading just the needed column, which matters a lot for columnar storage.

Storing series metadata (e.g. full query text) out-of-line in a separate table might also be a good idea.

TBD: measurements on real world data
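The per-row key duplication described above can be illustrated with a small sketch (the column names are hypothetical, not pgwatch's actual schema; `struct` packing stands in for fixed-width relational columns):

```python
import json
import struct

# Hypothetical metric row -- illustrative names, not pgwatch's actual schema.
row = {"queryid": 123, "calls": 10, "total_time": 1.5, "rows": 42}

# jsonb-style storage: every row carries its own copy of all key names.
jsonb_payload = json.dumps(row).encode()

# Relational storage: key names live once in the table definition;
# each row holds only the values (here: 3 x int64 + 1 x float64 = 32 bytes).
relational_payload = struct.pack(
    "<qqdq", row["queryid"], row["calls"], row["total_time"], row["rows"]
)

print(len(jsonb_payload), len(relational_payload))
```

The relational row is a fixed 32 bytes, while the jsonb document re-encodes the key names in every row, and this per-row overhead is also exactly what gets in the way of columnar compression.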

@pashagolub pashagolub self-assigned this Apr 5, 2023
@pashagolub pashagolub added enhancement New feature or request metrics Metrics related issues refactoring Something done as it should've been done from the start labels Apr 5, 2023
@pashagolub pashagolub added this to the Metrics format milestone Apr 5, 2023
@ants (author) commented Jun 14, 2023

Measurement results on 45M rows of real-world stat_statements data (6 databases × 3 months), plus a query fetching one month of data for a single queryid over 4 data columns.

| Schema     | Size uncompressed | Size compressed | Compression ratio | Query time | Buffers accessed |
|------------|-------------------|-----------------|-------------------|------------|------------------|
| jsonb      | 70 GB             | 8050 MB         | 8.9x              | 1600 ms    | 143,596          |
| relational | 11 GB             | 763 MB          | 14.5x             | 4 ms       | 522              |
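For scale, the table's query-side numbers work out to roughly a 400x speedup and ~275x fewer buffers touched:

```python
# Ratios derived from the benchmark table above.
query_speedup = 1600 / 4           # query time: jsonb ms vs relational ms
buffer_reduction = 143596 / 522    # buffers accessed: jsonb vs relational

print(f"{query_speedup:.0f}x faster, {buffer_reduction:.0f}x fewer buffers")
```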

@kmoppel (Contributor) commented Aug 30, 2023

My $.02: in practice, the extra data proliferation only becomes visible with this stat_statements metric, and TimescaleDB still reduced old data heavily (around ~10x). So it would indeed be nice, but I'm not sure it's worth the extra complexity, as users with very high instance counts would probably prefer Prometheus anyway.

@pashagolub pashagolub added sinks Where and how to store monitored data and removed metrics Metrics related issues labels Jan 11, 2024
@cybertec-postgresql cybertec-postgresql locked and limited conversation to collaborators May 15, 2024
@pashagolub pashagolub converted this issue into discussion #447 May 15, 2024
