DataFusion write to delta #2422

Open
dhruvils414 opened this issue Apr 15, 2024 · 5 comments
Labels
enhancement New feature or request

Comments

@dhruvils414

Hello

I would like to write to Delta Lake via DataFusion, just as Spark can.

Requested support:

Append, overwrite, and merge into

dhruvils414 added the enhancement (New feature or request) label on Apr 15, 2024
@MrPowers
Collaborator

Let's make sure to get the docs page updated too when/if this gets completed: https://delta-io.github.io/delta-rs/integrations/delta-lake-datafusion/

@ion-elgreco
Collaborator

We already use DataFusion in delta-rs, so I'm not sure what you mean.

@MrPowers
Collaborator

@ion-elgreco - any chance we can expose the syntax for writing to a Delta table with DataFusion in the docs, so it's easy for me to learn how to do it?

@ion-elgreco
Collaborator

@MrPowers what docs are we talking about here? We use DataFusion on the Rust side, but this is not well documented. All of the writing is just dispatched to Rust from Python.

@dhruvils414
Author

I don't think it is supported directly. Based on Article 1, we need to convert the Delta table to a PyArrow dataset before we can read it from DataFusion.

from datafusion import SessionContext
from deltalake import DeltaTable

ctx = SessionContext()
# Open the Delta table and register it with DataFusion as a PyArrow dataset
table = DeltaTable("G1_1e9_1e2_0_0")
ctx.register_dataset("my_delta_table", table.to_pyarrow_dataset())

Article 1 https://delta-io.github.io/delta-rs/integrations/delta-lake-datafusion/#delta-lake-performance-benefits-for-datafusion-users
