Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: add "no inference mode" to add from URL #120

Open
aborruso opened this issue May 5, 2019 · 7 comments
Open

Feature request: add "no inference mode" to add from URL #120

aborruso opened this issue May 5, 2019 · 7 comments

Comments

@aborruso
Copy link

aborruso commented May 5, 2019

Hi,
when I add from URL a file, workbenchdata does inferencing to map the field types. It's a great feature but sometimes gives wrong results.

In example here (https://app.workbenchdata.com/workflows/17120) I import an XLS file and it maps the field "CODISTAT" as number and it's a problem, because in the source xls file it's a text field. And then in workbenchdata the value "001801" becomes "1801" and it's not so good.

It would be great to have an option in the module to have "no inference", and have all fields as text field.

Thank you

@aborruso
Copy link
Author

aborruso commented May 5, 2019

Moreover if (it's in tab2 ) I apply CODISTAT.rjust(6,"0") python function, I have wrong result: once again "1801" and not "001801", because the output field type is a number.

@pierreconti
Copy link
Collaborator

pierreconti commented May 6, 2019 via email

@aborruso
Copy link
Author

aborruso commented May 6, 2019

Hi @pierreconti I think it could be more useful to write here about feature requests and bugs. I think this avoids duplication.
Then I wait with great interest, because in some cases it becomes uncomfortable.

Thank you

@adamhooper
Copy link
Contributor

@aborruso I've seen something similar before. My workaround, using the Python module:

def process(table):
    table['Zip'] = table['Zip'].astype(str).str.zfill(5)
    return table

@aborruso
Copy link
Author

@adamhooper I will use it waiting for an official "solution".

Thank you

@adamhooper
Copy link
Contributor

I deployed new fetch logic that stores raw files. And our new CSV parser backend has this option ... but we don't expose it to users.

Now, the missing pieces are:

@aborruso
Copy link
Author

@adamhooper thank you, it's a good thing!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants