Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

formula for ideal executor plot #32

Open
chen116 opened this issue Mar 9, 2019 · 1 comment
Open

formula for ideal executor plot #32

chen116 opened this issue Mar 9, 2019 · 1 comment

Comments

@chen116
Copy link

chen116 commented Mar 9, 2019

Hi,

First of all, amazing project!

From the report generated from http://sparklens.qubole.com/ , I see the ideal executor plot where it plot "the minimal number of executors (ideal) which could have finished the same work in same amount of wall clock time"

I am curious what are the formulas, equations for such plot. If you can give me some explanation on how you guys approach it, that would be great. Thanks!

@iamrohit
Copy link
Contributor

Hi @chen116,

Thanks for the wonderful words. The calculation you are referring to comes from simulation. It works just like the completion time vs efficiency graph at different executor counts. Instead of simulating the complete application, here we simulate each spark job individually. To find the minimal number of executors required for a particular job, we do a binary search between 1 and total number of executors. This graph is basically trying to show if autoscaling would be useful and second to judge how a given autoscaling policy is doing compared to "ideal" autoscaling.

We will be talking more about it here: https://conferences.oreilly.com/strata/strata-eu/public/schedule/detail/74183

thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants