New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Plotting issues with hierarchical roofline #90
Comments
ert_results.json can be generated from |
Requires changes in #89, specifically: timemory/timemory/roofline/roofline.py Line 180 in 88d7b91
if self.units is not None:
for i in range(len(self.data)):
self.data[i] /= self.units |
OK, actually that's the issue I was referring to: the L1 and L2 data may be there in the |
Would it help to have the L1, L2, and (if exists) L3 data cache sizes in the JSON so you can extract the ERT tests around those values? |
That will definitely help the L2 and L3 detection, the major issue is that ERT can never reach the L1 bandwidth on either Skylake or P100/V100. I need to try some new kernels (micro-benchmarks) on ERT. |
Using output
ert_results.json
, only one memory level is plotted:The text was updated successfully, but these errors were encountered: