Hi, thanks for your report. This is expected because we are now utilising an option that converts all string-like columns to PyArrow backed strings. You can disable that behaviour with
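The snippet that followed this sentence is missing from the thread as captured here. As a sketch of what it most likely referred to, and assuming the option in question is the `dataframe.convert-string` config key (an assumption, not something the comment itself confirms), the conversion can be switched off like this:

```python
import dask

# Assumption: "dataframe.convert-string" is the option referred to above.
# Setting it to False keeps string-like columns as object dtype instead of
# converting them to PyArrow-backed strings.
dask.config.set({"dataframe.convert-string": False})

# Read the value back to confirm the setting took effect.
convert_string = dask.config.get("dataframe.convert-string")
print(convert_string)
```

`dask.config.set` can also be used as a context manager (`with dask.config.set({...}):`) if the change should only apply to a single block of code.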
Hopefully not too many that are actually user-facing. We now optimise queries before we submit them to the scheduler, but we aimed for as much compatibility from an end-user perspective as possible. Please let us know if you encounter something that breaks.
Just checking here, I hope that we're considering this to be a bug. We may not have a good way to fix the bug short term, but certainly converting a range object to a string is unexpected and suboptimal behavior.
I think when we convert to dataframe we're looking at a few values, right? We can probably do some simple Python logic there to see if strings or objects are appropriate, right? (This may not be right, but I'm curious why not if not)
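To make the suggestion concrete, here is a rough sketch of the kind of check meant above. It is plain Python and not dask's actual implementation; the helper names `string_or_object` and `infer_column_dtypes` are hypothetical, and it only decides between "string" and "object" for columns that would otherwise be treated as string-like.

```python
from typing import Any, Dict, Iterable, List


def string_or_object(sample: Iterable[Any]) -> str:
    """Return "string" only if every non-missing sampled value is a str."""
    seen_str = False
    for value in sample:
        if value is None:
            continue  # missing values do not rule out a string column
        if not isinstance(value, str):
            return "object"  # e.g. a range, list or dict: keep it as object
        seen_str = True
    # A sample with no actual strings gives no evidence for a string dtype.
    return "string" if seen_str else "object"


def infer_column_dtypes(records: List[Dict[str, Any]]) -> Dict[str, str]:
    """Group sampled records by key and classify each column."""
    columns: Dict[str, List[Any]] = {}
    for record in records:
        for key, value in record.items():
            columns.setdefault(key, []).append(value)
    return {key: string_or_object(values) for key, values in columns.items()}


# Hypothetical sample, standing in for the first few elements of a bag.
head = [{"name": "a", "span": range(3)}, {"name": "b", "span": range(5)}]
dtypes = infer_column_dtypes(head)
print(dtypes)  # {'name': 'string', 'span': 'object'}
```

The obvious limitation is that this only looks at a sample, so a column whose first few values are all strings but which contains other objects later would still be misclassified.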
Describe the issue:
Since the last update, dask bag's to_dataframe generates DataFrames with a string dtype by default rather than object.
Minimal Complete Verifiable Example:
Anything else we need to know?:
Environment: