You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hello - I have a general question about whether Dataframes.jl uses all of the physical cores available on the machine when executing code (the way polars does - https://www.pola.rs/) - I'd greatly appreciate it if someone could share any resources on tips to improve the performance of Dataframes.jl.
it'd also be super helpful to get some feedback on whether there is any way to improve the performance of Dataframes.jl in the recently updated db-benchmark:
it'd also be super helpful to get some feedback on whether there is any way to improve the performance of Dataframes.jl
Yes. However, it currently was not considered as top priority. Having said that:
If someone is willing to work on this I can give information what needs to be done and in what parts of code.
If you have some specific operation that you believe is slow for you we can work on it specifically to improve things - can you please indicate the case where you have a performance problem?
Also note that in the benchmarks you reference DuckDB not Polars is generally the fastest solution and we treat it as a reference benchmark.
thank you for the detailed response @bkamins - my question was based on a discussion with a colleague about the db-benchmark - I will look into the multi-threading support and get back
I'm not sure about my bandwidth or capability to help with the source code to improve on the db-benchmark, but it's a very popular benchmark that does influence the usage of libraries, so it'd be great to see the Julia performance improve - thank you again!
Help with the code is always welcome. However, as I have commented, even sharing real-life examples that are slow in practice would help.
The point is that this benchmark is run on a large multi-core server, while probably typically people run their code on laptops /smaller servers that have a different performance characteristic (and this is the target we want to optimize for in the first place).
hello - I have a general question about whether Dataframes.jl uses all of the physical cores available on the machine when executing code (the way polars does - https://www.pola.rs/) - I'd greatly appreciate it if someone could share any resources on tips to improve the performance of Dataframes.jl.
it'd also be super helpful to get some feedback on whether there is any way to improve the performance of Dataframes.jl in the recently updated db-benchmark:
https://duckdblabs.github.io/db-benchmark/
thank you
The text was updated successfully, but these errors were encountered: