Skip to content
This repository was archived by the owner on May 17, 2025. It is now read-only.

Commit a5009aa

Browse files
committed
updated README
1 parent cdbbfa6 commit a5009aa

File tree

1 file changed

+7
-13
lines changed

1 file changed

+7
-13
lines changed

README

Lines changed: 7 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ hopscotch/
2727
| |---4_cache/ Benchmark for evaluating caching.
2828
| |---common/ Source code common to all benchmarks.
2929
| |---include/ Common header files.
30-
| |---kernels/ Common kernels. May get used by different benchmarks.
30+
| |---kernels/ Common kernels. Used by different benchmarks.
3131
|
3232
|---gpu/
3333
| |
@@ -59,12 +59,12 @@ IV. CPU Benchmarks
5959

6060
1_roofline
6161
==========
62-
Measures the maximum attainable performance with varying arithmetic intensity.
62+
Measures the maximum attainable performance with varying arithmetic intensity and the machine balance.
6363

64-
To run: ./roofline_cpu.py
64+
To run: ./roofline.py
6565

6666
The python script will generate a pdf for the roofline plot. Available options can be found
67-
using ./roofline_cpu.py --help
67+
using ./roofline.py --help
6868

6969

7070
2_bandwidth
@@ -74,10 +74,9 @@ IV. CPU Benchmarks
7474
To run: a) make
7575
b) ./bandwidth
7676

77-
Working set size can be changed by defining HS_ARRAY_ELEM, which denotes number of elements in the working set array.
78-
HS_ARRAY_ELEM can be defined directy if manually compiling, or can be passes with USER_DEF.
77+
Working set size can be changed by defining WSS_EXP. Number of elements in the working set is (2 ^ WSS_EXP). WSS_EXP can be defined directy if manually compiling, or can be passes with USER_DEFS.
7978

80-
Example: a) make USER_DEF="-DHS_ARRAY_ELEM=1000000"
79+
Example: a) make USER_DEFS="-DWSS_EXP=32"
8180
b) ./bandwidth
8281

8382

@@ -98,18 +97,13 @@ IV. CPU Benchmarks
9897
To run: a) make
9998
b) ./cache
10099

101-
Working set size can be changed by defining HS_ARRAY_ELEM, which denotes number of elements in the working set array.
102-
HS_ARRAY_ELEM can be defined directy if manually compiling, or can be passes with USER_DEF.
103100

104-
Example: a) make USER_DEF="-DHS_ARRAY_ELEM=1000000"
105-
b) ./cache
106-
107101

108102
IV. GPU Benchmarks
109103

110104
1_roofline
111105
==========
112-
Measures the maximum attainable performance with varying arithmetic intensity.
106+
Measures the maximum attainable performance with varying arithmetic intensity and the machine balance.
113107
Supports single and double precision floating point operations.
114108

115109
To run: ./roofline.py

0 commit comments

Comments
 (0)