Native Support for Deltalake #4463
Replies: 8 comments 6 replies
-
My personal take on this is to let Iceberg and DeltaLake fight to the death and then support the eventual winner. There is no recommended way currently, unless @Alex-Monahan has something on this.
-
PyArrow is the recommended method! You don't have to load it all into memory as a table; you can load it as a dataset as well (which DuckDB can read!). https://github.com/delta-io/delta-rs/blob/main/python/tests/test_table_read.py One of Arrow's goals is to make this kind of integration easier. All you need is format to Arrow (Delta to Arrow), then Arrow to engine (Arrow to DuckDB). So it's a really good option here.
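As a rough sketch of the Delta to Arrow to DuckDB route described above: this assumes the `deltalake` (delta-rs) and `duckdb` Python packages are installed. The helper name `query_delta_via_arrow` and the view name `delta_tbl` are made up for illustration and are not part of either library.

```python
def query_delta_via_arrow(table_path: str, sql: str):
    """Run `sql` against a Delta table exposed to DuckDB as a view named delta_tbl.

    Hypothetical helper: the function name and the view name are illustrative only.
    """
    # Imported lazily so the module loads even without the optional dependencies.
    import duckdb
    from deltalake import DeltaTable

    # Delta to Arrow: a lazy PyArrow dataset, so the table is not
    # materialized in memory up front.
    dataset = DeltaTable(table_path).to_pyarrow_dataset()

    # Arrow to DuckDB: register the dataset under a name SQL can refer to.
    con = duckdb.connect()
    con.register("delta_tbl", dataset)
    return con.execute(sql).fetchall()
```

For example, `query_delta_via_arrow("/path/to/table", "SELECT count(*) FROM delta_tbl")` would return the row count as a list with one tuple. Going through a dataset rather than a table should let DuckDB read only what the query needs.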
-
Looks like Databricks just added a UniForm layer in Delta Lake 3.0 to be compatible with Hudi and Iceberg. Delta Lake support would be great.
-
Wonder if this could be used as a basis for Delta? The APIs are likely very different, but some aspects of the implementation would be similar: https://github.com/duckdblabs/duckdb_iceberg
-
Polars bears can quack and have integration with Delta: https://www.edgarbahilo.com/poors-man-data-lake-with-polars-deltalake/ Also, Polars is going to support merge operations over Delta in the next release, so potentially:
-
Another related demo: https://github.com/milicevica23/dbt-duckdb-delta-plugin-demo
-
@hannes We need this not only for direct use in DuckDB itself, but mostly so that third-party tools built on DuckDB, whose internals we cannot access, can read and write Delta tables. Examples are Rill Data and the Metabase DuckDB connector's visualizations. Workarounds that rely on other tools do not cover these cases; we need native support.
-
It is done: https://github.com/duckdb/duckdb_delta
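A minimal sketch of what using that extension from Python might look like, assuming it is installable under the name `delta` and exposes a `delta_scan` table function as its README describes. The helper names `delta_scan_sql` and `read_delta` are hypothetical, and installing the extension needs network access.

```python
def delta_scan_sql(table_path: str) -> str:
    """Build a SELECT over delta_scan for the given path (hypothetical helper)."""
    # Double any single quotes so the path is a valid SQL string literal.
    escaped = table_path.replace("'", "''")
    return f"SELECT * FROM delta_scan('{escaped}')"


def read_delta(table_path: str):
    """Read all rows of a Delta table through the DuckDB delta extension."""
    # Imported lazily so the SQL builder works without duckdb installed.
    import duckdb

    con = duckdb.connect()
    con.execute("INSTALL delta")  # assumption: extension is published as 'delta'
    con.execute("LOAD delta")
    return con.execute(delta_scan_sql(table_path)).fetchall()
```

This is close to the `read_deltalake('....')` shape asked for in the original question, just under a different function name.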
-
What is the recommended way to read DeltaLake tables?
Can we have a way to read DeltaLake tables, something like
read_deltalake('....')
natively, as part of the SQL syntax?