Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable consistentDataPush for Spark execution framework #12941

Open
lrao-stripe opened this issue Apr 16, 2024 · 1 comment
Open

Enable consistentDataPush for Spark execution framework #12941

lrao-stripe opened this issue Apr 16, 2024 · 1 comment

Comments

@lrao-stripe
Copy link

#9295 enabled consistent data push for standalone execution framework. This would be a great feature to extend to Spark based ingestion as well.

This will be useful for scenarios for our users where every run of a batch job may produce a different number of partition files and an atomic replace of one set of segments with another will help mitigate the issue of serving duplicate data.

@swaminathanmanish
Copy link
Contributor

@Jackie-Jiang - Please assign this to me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants