- Completeness - data is complete not partially available
- Accuracy - accurate phone number, birth date etc - needs reference to source of truth
- Consistency - same dimensions used in multiple instances refer to the same thing
- Validity - valid phone number, valid date of birth
- Uniqueness - Ensure no duplicates
- Integrity - Relationships different and related data is maintained through out the data journey. Should be traceble throught the org. Customer address -> Customer Profile
- Accessibility - is data accessible, searchable etc
- Timeliness - freshness - is data available when you need it?
- Relevance - What data is used to support business initiatives?
Data consumers must define what’s most important and creators must focus on delivering that most important data.
https://www.collibra.com/blog/the-6-dimensions-of-data-quality
Name | Source based | Regex based | Destination based | Time based |
---|---|---|---|---|
Completeness | yes | yes | ||
Accuracy | no | yes | ||
Consistency | yes | yes | yes | yes |
Validity | no | yes | ||
Uniqueness | yes | no | ||
Integrity | yse | no | yes | |
Accessibility | ? | ? | ? | ? |
Timeliness | yes | |||
Relevance |