Separate out clean so it doesn't rely on pandas/dask #969

Open
meyerovb opened this issue Jul 14, 2023 · 0 comments
Labels: type: enhancement (New feature or request)

Comments

@meyerovb

I've been banging my head for hours because AWS Lambda has a 250 MB code limit and all I wanted to do was clean email addresses. I'd have to build a damn Docker image to run 10 lines of code. So instead I'm ripping out _format_email from clean_emails.py to run it directly against my emails. It would be great if the actual CLEANING code was separated out into different files that don't need to import pandas, dask, and all the other nonsense that has nothing to do with CLEANING DATA.
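
To make the request concrete, here is a rough sketch of the kind of dependency-free helper I mean. It is not the library's actual `_format_email` logic: the function name `clean_email_value` and the regex are made up for illustration, and the validation rule is deliberately simplistic. The point is only that it imports nothing beyond the standard library, so it fits in a Lambda package without pandas or dask.

```python
import re
from typing import Optional

# Hypothetical, deliberately simple pattern (an assumption for this sketch,
# not the rule the library applies): local part, "@", dotted domain.
_EMAIL_RE = re.compile(r"^[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(\.[A-Za-z0-9-]+)+$")


def clean_email_value(value: object) -> Optional[str]:
    """Return a trimmed, lowercased email, or None if it does not look valid."""
    if not isinstance(value, str):
        return None
    candidate = value.strip().lower()
    if _EMAIL_RE.match(candidate) is None:
        return None
    return candidate


if __name__ == "__main__":
    raw = ["  Foo.Bar@Example.COM ", "not-an-email", None]
    print([clean_email_value(v) for v in raw])
```

If the per-value cleaning lived in a module like this, the pandas/dask wrappers could import it and apply it column-wise, while people with small workloads could call it directly.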

meyerovb added the type: enhancement label Jul 14, 2023
2 participants