Skip to content
This repository has been archived by the owner on Nov 10, 2018. It is now read-only.

Wrangler - Normalizing Cols with Multiple Vals per Record #216

Open
JVonderwell opened this issue Apr 20, 2018 · 0 comments
Open

Wrangler - Normalizing Cols with Multiple Vals per Record #216

JVonderwell opened this issue Apr 20, 2018 · 0 comments
Labels

Comments

@JVonderwell
Copy link
Contributor

Attributes will sometimes consist of records that are a list of values e.g. PHARM_CLASSES for a record might contain: Radioactive Diagnostic Agent [EPC], Antiarrhythmic [EPC], ...

The wrangler normalizes them collectively as though they were one string. We should add an option to split independent values and normalize them separately.

Benefit: More effective normalization for certain types of attributes.

Cost: Ballparking at 3 hours for development and testing.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

1 participant