Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Lake][Data Quality] Add pydantic to validate schema at the lake-level #973

Closed
2 tasks
idiom-bytes opened this issue Apr 30, 2024 · 1 comment
Closed
2 tasks
Assignees
Labels
Type: Enhancement New feature or request

Comments

@idiom-bytes
Copy link
Member

Background / motivation

The current table schemas use types but are poorly validated. We also can't leverage validation wrappers because it's just using a generic type.

Rather: let's implement a validation tool like pydantic to validate data through our system, and in our lake

DoD

  • Update table schema definitions to use pydantic
  • Update lake validate cli to validate lake schema vs. expected schema
@idiom-bytes
Copy link
Member Author

We're not going to do the pydantic working any longer, since we updated our schema and tables recently

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Type: Enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants