help in understanding output file format #72

asafalina · 2022-10-16T20:26:20Z

Hi

I was running cleora using the command below:

cleora-v1.2.3-x86_64-apple-darwin --columns transient::cluster_id StarNode --dimension 1024 -n 5 --input fb_cleora_input_star.txt -o output

I got something similar to the following output:
(I added some spacing just for better readability)

39361 1024
1        1    0.029419877 ..... -0.0073362226
16260    7    0.033474464 ..... -0.00906976
.
.
.
22459    1    0.010709517 ..... 0.026430061

I can't figure out what the 1st (1, 16260, ..., 22459) and the 2nd (1, 7, ..., 1) columns represent.

Thanks


piobab · 2022-10-21T15:39:35Z

Hi @asafalina !

First column - the entity. In your case it should be cluster_id.
Second column - the occurrence count, i.e. how many times the entity occurs in the data.

https://github.com/Synerise/cleora/blob/master/src/persistence.rs#L44

Hope it helps.
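For anyone who wants to load such a file, here is a minimal Python sketch of the layout described above. It is not part of Cleora: the function name `parse_cleora_output` is hypothetical, and it assumes that fields are whitespace-separated and that the first line holds the number of entities followed by the embedding dimension (the second number matches `--dimension 1024` in the question; the meaning of the first is an assumption). The sample uses a toy dimension of 2 instead of 1024.

```python
from typing import Dict, List, Tuple


def parse_cleora_output(text: str) -> Tuple[int, int, Dict[str, Tuple[int, List[float]]]]:
    """Parse embedding text laid out as: header line, then one row per entity.

    Assumed layout (not confirmed by the Cleora docs in this thread):
      line 1:  <number of entities> <dimension>
      others:  <entity> <occurrence> <v_1> ... <v_dimension>
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    n_entities, dimension = (int(x) for x in lines[0].split())

    rows: Dict[str, Tuple[int, List[float]]] = {}
    for ln in lines[1:]:
        parts = ln.split()
        entity = parts[0]            # first column: the entity
        occurrence = int(parts[1])   # second column: how often it occurs
        vector = [float(v) for v in parts[2:]]
        if len(vector) != dimension:
            raise ValueError(f"entity {entity!r}: expected {dimension} values, got {len(vector)}")
        rows[entity] = (occurrence, vector)
    return n_entities, dimension, rows


# Toy sample with dimension 2; a real file would have 1024 values per row.
SAMPLE = """3 2
1 1 0.029419877 -0.0073362226
16260 7 0.033474464 -0.00906976
22459 1 0.010709517 0.026430061
"""

n_entities, dimension, rows = parse_cleora_output(SAMPLE)
occurrence, vector = rows["16260"]
print(n_entities, dimension, occurrence, len(vector))
```

Entities are kept as strings because nothing in the format guarantees they are numeric. For a real file, read it with `open(path).read()` and pass the contents to the function.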
