Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ENH: Explore SIMDJSON for JSON I/O #58278

Open
1 of 3 tasks
WillAyd opened this issue Apr 16, 2024 · 0 comments
Open
1 of 3 tasks

ENH: Explore SIMDJSON for JSON I/O #58278

WillAyd opened this issue Apr 16, 2024 · 0 comments
Labels
Enhancement IO JSON read_json, to_json, json_normalize

Comments

@WillAyd
Copy link
Member

WillAyd commented Apr 16, 2024

Feature Type

  • Adding new functionality to pandas

  • Changing existing functionality in pandas

  • Removing existing functionality in pandas

Problem Description

We have used a vendored port of ultrajson for quite some time, but I think it is starting to show its age and could use a fresh look. The SIMDJSON project looks promising for our needs and claims really faster performance:

https://github.com/simdjson/simdjson?tab=readme-ov-file

Feature Description

Leverage SIMDJSON. There is a Python package that is worth exploring, although I think to get the most out of it we are looking at a C++ extension with nanoarrow

Luckily SIMDJSON is supported in the Meson Wrap DB, and nanoarrow should be with 0.5.0. So vendoring would be less of a thing - just would integrate with Meson

Alternative Solutions

status quo

Additional Context

No response

@WillAyd WillAyd added Enhancement IO JSON read_json, to_json, json_normalize labels Apr 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Enhancement IO JSON read_json, to_json, json_normalize
Projects
None yet
Development

No branches or pull requests

1 participant