how to use carbon clickhouse with distributed tables ? #119

Open
mcarbonneaux opened this issue Nov 18, 2022 · 10 comments

Comments

@mcarbonneaux

mcarbonneaux commented Nov 18, 2022

How to configure carbon-clickhouse with ClickHouse distributed tables?

@mcarbonneaux mcarbonneaux changed the title distributed tables ? how to use distributed tables ? Nov 18, 2022
@mcarbonneaux mcarbonneaux changed the title how to use distributed tables ? how to use clickhouse distributed tables ? Nov 18, 2022
@Felixoid
Collaborator

I am not sure what you mean. I've used distributed tables for inserts there

@mcarbonneaux
Author

You use partitioned tables, not distributed tables, in the README.

ClickHouse distributed tables:
https://clickhouse.com/docs/en/sql-reference/distributed-ddl

@mcarbonneaux
Author

With a distributed table you can distribute data across the ClickHouse node shards and scale linearly with the number of nodes (depending on the efficiency of the distribution key).

@Felixoid
Collaborator

Felixoid commented Nov 19, 2022

You should rather use a single Distributed table, not the "on cluster" clause.

See https://clickhouse.com/docs/en/engines/table-engines/special/distributed/
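
Roughly, the pattern from that page is a plain local table on every shard plus one thin Distributed table on top for queries. A minimal sketch, with placeholder names (my_cluster, the default database, graphite_local / graphite_dist) and assuming a graphite_rollup section exists in the server config:

-- local data table, created on every shard
CREATE TABLE graphite_local (
    Path      String,
    Value     Float64,
    Time      UInt32,
    Date      Date,
    Timestamp UInt32
)
ENGINE = GraphiteMergeTree('graphite_rollup')
PARTITION BY toYYYYMM(Date)
ORDER BY (Path, Time);

-- thin table that fans reads out to graphite_local on every shard of my_cluster
CREATE TABLE graphite_dist AS graphite_local
ENGINE = Distributed(my_cluster, default, graphite_local);

Writes keep going to the local table on each node; the Distributed table is what you query.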

@mcarbonneaux
Author

mcarbonneaux commented Nov 19, 2022

That documentation is what I'm searching for!

The idea is to store data not on a single node but in a cluster with multiple shards, in order to scale.

The CREATE TABLE instructions in the README are for a single node... or have I missed something?

Would it be possible to have an example of CREATE TABLE in distributed mode in the README?

@mcarbonneaux mcarbonneaux changed the title how to use clickhouse distributed tables ? how to use carbon clickhouse with distributed tables ? Nov 19, 2022
@sheyt0

sheyt0 commented Dec 12, 2022

You should create regular tables on each node in the cluster.
After that you can write into any of the nodes (I use an L7 LB).

For reading from all nodes in one request, use a Distributed table.

When creating the Distributed table, you can set a sharding_key. That allows you to write "to the distributed table" -- meaning all incoming data will be routed by the sharding_key.

Note here:
When you use rollup-conf = "auto" in graphite-clickhouse, you should set rollup-auto-table to point to the regular (local) table.

Here are example configs from my prod:

Tables:

CREATE TABLE IF NOT EXISTS graphite_repl ON CLUSTER datalayer (
    `Path`      String  CODEC(ZSTD(3)),
    `Value`     Float64 CODEC(Gorilla, LZ4),
    `Time`      UInt32  CODEC(DoubleDelta, LZ4),
    `Date`      Date    CODEC(DoubleDelta, LZ4),
    `Timestamp` UInt32  CODEC(DoubleDelta, LZ4)
)
ENGINE = ReplicatedGraphiteMergeTree('/clickhouse/tables/{shard}/graphite_repl', '{replica}', 'graphite_rollup')
PARTITION BY toYYYYMMDD(Date)
ORDER BY (Path, Time)
TTL
    Date + INTERVAL 1 WEEK TO VOLUME 'cold_volume',
    Date + INTERVAL 4 MONTH DELETE
SETTINGS
    index_granularity = 512;

CREATE TABLE IF NOT EXISTS graphite_dist ON CLUSTER datalayer AS graphite_repl
ENGINE = Distributed(datalayer, ..., graphite_repl);
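
For the sharding_key mentioned above, the key goes in as the fourth argument of Distributed. A sketch only -- cityHash64(Path) and the graphite database name are just examples, the real database is elided in the definition above:

CREATE TABLE IF NOT EXISTS graphite_dist_sharded ON CLUSTER datalayer AS graphite_repl
ENGINE = Distributed(datalayer, graphite, graphite_repl, cityHash64(Path));

With a key like that, you could point carbon-clickhouse at the Distributed table instead of graphite_repl, and each metric is routed to one shard by the hash of its Path.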

carbon-clickhouse:

...
[upload.graphite]
type = "points"
table = "graphite_repl"
...

graphite-clickhouse:

...
[[data-table]]
 table = "graphite_dist"
 rollup-conf = "auto"
 rollup-auto-table = "graphite_repl"
...

@mcarbonneaux
Author

I will go test that!

@mcarbonneaux
Author

Could it be useful to use chproxy in front to cache requests (https://www.chproxy.org/)?

@Civil
Member

Civil commented Dec 17, 2022

If you use carbonapi, it can also cache requests. So that depends on what your use case is.

Overall, I would suggest starting with a simple setup and then adding extra pieces once you encounter a bottleneck.

@msaf1980
Collaborator

msaf1980 commented Dec 17, 2022

Could it be useful to use chproxy in front to cache requests (https://www.chproxy.org/)?

No. chproxy can't cache requests with external data (which is used in the points-table queries).

Graphite-clickhouse can cache finder queries (in render requests).
Carbonapi can cache the rest in front, at the API request level (render, find, tags autocomplete).

So there is no reason to use chproxy for caching. But it is useful as a bouncer / connection-pool limiter.
