Support for PySpark #1055

gracemiguel · 2023-10-26T17:46:23Z

Is your feature request related to a problem? Please describe.

Hello, I see that this package supports pandas, but does it support PySpark? I'd like to use it on large datasets, and pandas is insufficient for my use case.

Describe the outcome you'd like:
I'd like to be able to run this on large datasets of 10k+ rows. Do you think this would be possible?

taylorfturner · 2023-10-26T17:49:26Z

It also depends on how many columns you are dealing with, but my first thought is that you should be fine at that data size with pandas, @gracemiguel. Thanks!
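
For a rough sense of scale, here is a back-of-the-envelope sketch of how column count drives memory use at 10,000 rows. It is not part of this package: the function name `estimate_frame_megabytes` is hypothetical, and it assumes every column is a 64-bit numeric type (8 bytes per value). String or object columns would typically take noticeably more.

```python
# Back-of-the-envelope estimate only. Assumption: every column is
# float64/int64, i.e. 8 bytes per value. Object/string columns in
# pandas usually need more memory than this.
BYTES_PER_NUMERIC_VALUE = 8


def estimate_frame_megabytes(n_rows: int, n_cols: int) -> float:
    """Approximate in-memory size of an all-numeric table, in MB (10**6 bytes)."""
    return n_rows * n_cols * BYTES_PER_NUMERIC_VALUE / 1_000_000


# Hold the row count at 10,000 and vary the number of columns.
for n_cols in (10, 100, 1_000):
    size_mb = estimate_frame_megabytes(10_000, n_cols)
    print(f"10,000 rows x {n_cols} columns ~ {size_mb:.1f} MB")
```

Under these assumptions, even 1,000 numeric columns at 10,000 rows comes to roughly 80 MB, which normally fits in memory on a single machine.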

taylorfturner · 2023-11-20T14:57:26Z

@gracemiguel any additional questions on this? Any luck using it? Thanks!

gracemiguel added the "New Feature" label (a feature addition not currently in the library) Oct 26, 2023

gracemiguel assigned ksneab7, micdavis, taylorfturner and tyfarnan Oct 26, 2023
