Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data quality framework #802

Open
explicite opened this issue Jun 5, 2023 · 1 comment
Open

Data quality framework #802

explicite opened this issue Jun 5, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@explicite
Copy link

explicite commented Jun 5, 2023

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

(This section helps Arrow developers understand the context and why for this feature, in addition to the what)

Describe the solution you'd like
When building DAG of transformations, I want to be able define tests which can prove data correctness. On the end of the DAG I should be able to o review data quality and provide context to end user if required.

Like in Deequ I can check if all id's are unique or in some column I can find data in correct format. Other approaches Apache Glue, dbt test or Great Expectation

Describe alternatives you've considered
Instead of building framework it's maybe possible to extend Great Expectation

Additional context
Add any other context or screenshots about the feature request here.

@explicite explicite added the enhancement New feature or request label Jun 5, 2023
@YuriyGavrilov
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants