Port to UDF/SQLAlchemy #30

Open
jayceslesar opened this issue Mar 19, 2023 · 5 comments

Comments

@jayceslesar
Contributor

This is definitely a non-trivial task, but there is a growing library, Ibis, which aims to compile pandas-like Python syntax to many different SQL backends. Ibis supports UDFs for many of those backends, so it would be super interesting to port the functionality of this library to SQLAlchemy so that it can be used with any database. There are some resources on how this was done for TimescaleDB, but moving to something like Ibis/SQLAlchemy would be awesome, as right now this is only really available in TimescaleDB.
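To make the idea concrete, here is a minimal sketch of what pushing a downsampling step into the database could look like. It uses only the standard-library `sqlite3` module rather than Ibis or SQLAlchemy, and it assumes a simple min/max-per-bucket aggregation. The table name `series`, its columns, and the helper `minmax_per_bucket` are all hypothetical and not part of this library.

```python
import sqlite3


def minmax_per_bucket(conn, bucket_size):
    """Return (bucket, min_value, max_value) rows computed inside the database.

    Hypothetical helper: assumes a table `series(idx INTEGER, value REAL)`.
    """
    query = """
        SELECT idx / ? AS bucket, MIN(value) AS lo, MAX(value) AS hi
        FROM series
        GROUP BY bucket
        ORDER BY bucket
    """
    # Integer division of idx by bucket_size assigns each row to a bucket.
    return conn.execute(query, (bucket_size,)).fetchall()


# Build a tiny in-memory table to run the query against.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE series (idx INTEGER PRIMARY KEY, value REAL)")
values = [3.0, 1.0, 4.0, 1.5, 5.0, 9.0, 2.0, 6.0]
conn.executemany(
    "INSERT INTO series (idx, value) VALUES (?, ?)", list(enumerate(values))
)

rows = minmax_per_bucket(conn, 4)
print(rows)
```

Only the aggregated rows leave the database, which is the appeal of a SQL-side port. Algorithms that need more than a plain aggregate per bucket would likely need a UDF instead.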

@jvdd
Member

jvdd commented Mar 19, 2023

(quote from #29)
It would also be interesting to integrate this into polars using the UDF/pipe methodology, but you might want a separation of concerns there: this library does what it needs to really well, polars does what it needs to really well, and ideally all the magic happens on the Rust side of things anyway.

I am considering updating this library to make it more flexible. The argminmax Rust crate is the beating heart of this library, and I recently updated it to support slices, vec, and comply with Apache Arrow while also adding nan-handling capabilities. I plan to propagate these changes as soon as I find the time. Unfortunately, my spare time is limited for the next 1.5 months due to some paper deadlines (where we - including @jonasvdd - will present some exciting new findings on time series downsampling 😉).

I am not acquainted with the UDF/pipe methodology, but I'll certainly look into it! Thank you for bringing this to my attention 🤝

P.S.: the polars author also expressed his interest in integrating the argminmax project into polars: jvdd/argminmax#22

@jayceslesar
Contributor Author

(quote from @jvdd)
I am considering updating this library to make it more flexible. The argminmax Rust crate is the beating heart of this library, and I recently updated it to support slices, vec, and comply with Apache Arrow while also adding nan-handling capabilities. I plan to propagate these changes as soon as I find the time. Unfortunately, my spare time is limited for the next 1.5 months due to some paper deadlines (where we - including @jonasvdd - will present some exciting new findings on time series downsampling 😉).

I am in a similar boat where I need to also crank out a few papers for my coursework haha! Need to stop procrastinating but hoping to use this library as an example for one of the papers :D

@jvdd
Member

jvdd commented Apr 5, 2023

When working on predict-idlab/plotly-resampler#154, I realized that instead of pandas, we were now coupling our downsampling approach to numpy... Implementing it on top of something like Ibis would offer great flexibility and allow out-of-core support! I am just not 100% sure whether this can achieve the same runtime performance as the current numpy implementation 🤔

This is definitely on my radar to investigate in the near future :)
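For readers following along, a numpy-coupled downsampler of the kind discussed here can be sketched as a per-bucket argmin/argmax. This is only an illustration, assuming equally sized buckets and min/max selection. The function name `minmax_downsample` is hypothetical and is not this library's API.

```python
import numpy as np


def minmax_downsample(y, n_buckets):
    """Return sorted indices of the min and max of each bucket.

    Hypothetical sketch; assumes len(y) >= n_buckets so no bucket is empty.
    """
    y = np.asarray(y)
    # Split the index range [0, len(y)) into n_buckets contiguous buckets.
    edges = np.linspace(0, len(y), n_buckets + 1).astype(int)
    picked = []
    for start, stop in zip(edges[:-1], edges[1:]):
        chunk = y[start:stop]
        # argmin/argmax are relative to the chunk, so offset by the bucket start.
        picked.append(int(start) + int(np.argmin(chunk)))
        picked.append(int(start) + int(np.argmax(chunk)))
    # Deduplicate (min and max can coincide) and keep the indices ordered.
    return np.unique(picked)


y = np.array([3.0, 1.0, 4.0, 1.5, 5.0, 9.0, 2.0, 6.0])
idx = minmax_downsample(y, 2)
print(idx, y[idx])
```

The function takes an in-memory array and returns indices into it, which is exactly the coupling to numpy mentioned above. A backend-agnostic version would have to express the same selection as a query.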

@jvdd
Member

jvdd commented Apr 5, 2023

On another note, if you have finished any visualization-related papers that you are able to share, @jonasvdd & I would love to read & learn from it :)

P.S.: our 1st paper is just submitted 🚀
-> preprint: https://arxiv.org/abs/2304.00900

@jvdd
Member

jvdd commented Apr 17, 2023
