dataprep.eda TypeError: Please provide npartitions as an int, or possibly as None if you specify chunksize #901

Answered by rfinatan
rfinatan asked this question in Q&A

Hi all,

User SultanOrazbayev provided a workaround for anyone who runs into this issue.

His response is below:

There is a small bug in the dataprep package; you can track it in this PR.

In the meantime, one way to avoid the bug is to explicitly convert the data to a Dask DataFrame and pass that into the function:

import numpy as np
import pandas as pd
from dask.dataframe import from_pandas
from dataprep.clean import clean_phone

df = pd.DataFrame(
    {
        "phone": [
            "555-234-5678",
            "(555) 234-5678",
            "555.234.5678",
            "555/234/5678",
            15551234567,
            "(1) 555-234-5678",
            "+1 (234) 567-89…

Answer selected by rfinatan