[Lake] Parallelize fetching data across >>1 coins #932

trentmc · 2024-04-22T10:43:25Z

Background / motivation

We currently fetch data from Binance one token at a time.

So if there are 10 coins (BTC, ETH, DOT, ..), fetching takes roughly 10x longer than it does for a single coin.

TODOs / DoD

Parallelize fetching data, in lake/ohclv_data_factory.py. It's likely a straightforward conversion of a for loop over coins into concurrent calls.
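For reference, here is a minimal sketch of what that loop conversion might look like, using a thread pool from the standard library. It is not the actual code in the lake module: `fetch_one`, `fetch_all_serial`, `fetch_all_parallel`, and `fake_fetch_one` are hypothetical names, and the fake fetcher only simulates network latency in place of a real exchange call.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Sequence

# Hypothetical row type: [timestamp, open, high, low, close, volume]
Rows = List[List[float]]


def fetch_all_serial(
    symbols: Sequence[str], fetch_one: Callable[[str], Rows]
) -> Dict[str, Rows]:
    """Baseline: one coin at a time, like a plain for loop."""
    return {symbol: fetch_one(symbol) for symbol in symbols}


def fetch_all_parallel(
    symbols: Sequence[str],
    fetch_one: Callable[[str], Rows],
    max_workers: int = 8,
) -> Dict[str, Rows]:
    """Same result as the serial version, but fetches run in threads.

    Threads suit this workload because each fetch mostly waits on the network.
    """
    if not symbols:
        return {}
    n_workers = min(max_workers, len(symbols))
    with ThreadPoolExecutor(max_workers=n_workers) as executor:
        # executor.map yields results in input order; an exception raised
        # inside fetch_one is re-raised here when its result is consumed.
        results = list(executor.map(fetch_one, symbols))
    return dict(zip(symbols, results))


def fake_fetch_one(symbol: str) -> Rows:
    """Stand-in for a real exchange call (assumption: not the real fetcher)."""
    time.sleep(0.05)  # simulate network latency
    return [[0.0, 1.0, 2.0, 0.5, 1.5, 100.0]]


if __name__ == "__main__":
    coins = ["BTC/USDT", "ETH/USDT", "DOT/USDT"]
    data = fetch_all_parallel(coins, fake_fetch_one, max_workers=3)
    print(sorted(data.keys()))
```

A cap like `max_workers` is probably worth keeping, since exchanges typically rate-limit API requests and an unbounded number of concurrent calls could trigger throttling.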

Related github issues

#804 "[Sim, make $] Make benchmarking parallel, via threading"

Note: we could also parallelize grabbing data within a feed. However, this isn't as easy, because the algorithm needs to check what's there already. So consider this for later. It may also not be needed, because at some point we'll have a historical data repo / bundle.

trentmc · 2024-04-22T10:43:35Z

cc @calina-c @idiom-bytes

calina-c · 2024-04-23T07:20:38Z

I don't recommend doing this now, since the structure of the lake is changing either way. The work would only result in difficult conflicts, or be lost entirely while fixing said conflicts. I agree it is a good thing, but we should wait until the lake/ETL part is done.

trentmc · 2024-04-23T09:54:40Z

> I don't recommend doing this now, since the structure of the lake is changing either way. The work would only result in difficult conflicts, or be lost entirely while fixing said conflicts. I agree it is a good thing, but we should wait until the lake/ETL part is done.

OK. Makes sense.

trentmc added the "Type: Enhancement" label Apr 22, 2024

trentmc changed the title ~~[Lake] Parallelize fetching data across >>1 tokens~~ [Lake] Parallelize fetching data across >>1 coins Apr 30, 2024
