Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more kernels to the benchmark and AD benchmarks #434

Open
wants to merge 13 commits into
base: master
Choose a base branch
from

Conversation

theogf
Copy link
Member

@theogf theogf commented Feb 1, 2022

Adds a pair more kernels and benchmarks for AD
Right now the issue is with ForwardDiff where I am not sure how to turn the parameters into a single Array, if we want to use ParameterHandling we first need JuliaGaussianProcesses/ParameterHandling.jl#39 to be merged.

@theogf theogf added the performance critical Triggers benchmarking CI label Feb 1, 2022
benchmark/benchmarks.jl Outdated Show resolved Hide resolved
benchmark/benchmarks.jl Outdated Show resolved Hide resolved
benchmark/benchmarks.jl Outdated Show resolved Hide resolved
benchmark/benchmarks.jl Outdated Show resolved Hide resolved
benchmark/benchmarks.jl Outdated Show resolved Hide resolved
benchmark/benchmarks.jl Outdated Show resolved Hide resolved
theogf and others added 4 commits February 1, 2022 15:30
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@willtebbutt
Copy link
Member

Shall we just stick with Zygote for now? I'm not going to have a chance to look at the above ParameterHandling PR for a while, and it needs some proper thought to ensure that it doesn't mess up anything else.

The performance of ForwardDiff is, in general in my (limited) experience, more robust and closely linked to primal performance than Zygote anyway, so I'm less concerned about it (I might be horribly wrong though).

@codecov
Copy link

codecov bot commented Feb 1, 2022

Codecov Report

Merging #434 (cfaf096) into master (09e76cb) will not change coverage.
The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff           @@
##           master     #434   +/-   ##
=======================================
  Coverage   92.99%   92.99%           
=======================================
  Files          52       52           
  Lines        1214     1214           
=======================================
  Hits         1129     1129           
  Misses         85       85           

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 09e76cb...cfaf096. Read the comment docs.

benchmark/benchmarks.jl Outdated Show resolved Hide resolved
theogf and others added 2 commits February 1, 2022 16:23
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@github-actions
Copy link
Contributor

github-actions bot commented Feb 1, 2022

Benchmark result

Judge result

Benchmark Report for /home/runner/work/KernelFunctions.jl/KernelFunctions.jl

Job Properties

  • Time of benchmarks:
    • Target: 1 Feb 2022 - 15:57
    • Baseline: 1 Feb 2022 - 15:58
  • Package commits:
    • Target: 54d89c
    • Baseline: 1a3fa3
  • Julia commits:
    • Target: 905826
    • Baseline: 905826
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: None
    • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID time ratio memory ratio

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

Julia versioninfo

Target

Julia Version 1.6.5
Commit 9058264a69 (2021-12-19 12:30 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.3 LTS
  uname: Linux 5.11.0-1027-azure #30~20.04.1-Ubuntu SMP Wed Jan 12 20:56:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz        754 s          1 s        195 s      18964 s          0 s
       #2  2294 MHz      18953 s          1 s        598 s        459 s          0 s
       
  Memory: 6.788978576660156 GB (2990.96875 MB free)
  Uptime: 2006.58 sec
  Load Avg:  1.08  1.04  0.96
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-11.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.6.5
Commit 9058264a69 (2021-12-19 12:30 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.3 LTS
  uname: Linux 5.11.0-1027-azure #30~20.04.1-Ubuntu SMP Wed Jan 12 20:56:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz        879 s          1 s        203 s      19449 s          0 s
       #2  2294 MHz      19427 s          1 s        617 s        586 s          0 s
       
  Memory: 6.788978576660156 GB (3073.578125 MB free)
  Uptime: 2068.69 sec
  Load Avg:  1.06  1.04  0.98
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-11.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/KernelFunctions.jl/KernelFunctions.jl

Job Properties

  • Time of benchmark: 1 Feb 2022 - 15:57
  • Package commit: 54d89c
  • Julia commit: 905826
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Allocated Functions", "Constant", "ColVecs", "kernelmatrixX"] 725.490 ns (5%) 3.44 KiB (1%) 8
["Allocated Functions", "Constant", "ColVecs", "kernelmatrixXY"] 712.409 ns (5%) 3.45 KiB (1%) 9
["Allocated Functions", "Constant", "ColVecs", "kernelmatrix_diagX"] 434.848 ns (5%) 544 bytes (1%) 9
["Allocated Functions", "Constant", "ColVecs", "kernelmatrix_diagXY"] 287.414 ns (5%) 560 bytes (1%) 10
["Allocated Functions", "Constant", "RowVecs", "kernelmatrixX"] 207.038 ns (5%) 1.06 KiB (1%) 8
["Allocated Functions", "Constant", "RowVecs", "kernelmatrixXY"] 204.874 ns (5%) 1.08 KiB (1%) 9
["Allocated Functions", "Constant", "RowVecs", "kernelmatrix_diagX"] 443.814 ns (5%) 448 bytes (1%) 9
["Allocated Functions", "Constant", "RowVecs", "kernelmatrix_diagXY"] 222.353 ns (5%) 464 bytes (1%) 10
["Allocated Functions", "Constant", "Vecs", "kernelmatrixX"] 744.375 ns (5%) 3.39 KiB (1%) 5
["Allocated Functions", "Constant", "Vecs", "kernelmatrixXY"] 687.195 ns (5%) 3.38 KiB (1%) 4
["Allocated Functions", "Constant", "Vecs", "kernelmatrix_diagX"] 518.947 ns (5%) 496 bytes (1%) 6
["Allocated Functions", "Constant", "Vecs", "kernelmatrix_diagXY"] 400.005 ns (5%) 496 bytes (1%) 6
["Allocated Functions", "Cosine", "ColVecs", "kernelmatrixX"] 8.600 μs (5%) 6.73 KiB (1%) 3
["Allocated Functions", "Cosine", "ColVecs", "kernelmatrixXY"] 8.500 μs (5%) 6.97 KiB (1%) 4
["Allocated Functions", "Cosine", "ColVecs", "kernelmatrix_diagX"] 288.957 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Cosine", "ColVecs", "kernelmatrix_diagXY"] 428.643 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Cosine", "RowVecs", "kernelmatrixX"] 2.750 μs (5%) 3.67 KiB (1%) 4
["Allocated Functions", "Cosine", "RowVecs", "kernelmatrixXY"] 3.138 μs (5%) 5.59 KiB (1%) 6
["Allocated Functions", "Cosine", "RowVecs", "kernelmatrix_diagX"] 166.532 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "Cosine", "RowVecs", "kernelmatrix_diagXY"] 345.146 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "Cosine", "Vecs", "kernelmatrixX"] 9.200 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "Cosine", "Vecs", "kernelmatrixXY"] 9.200 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "Cosine", "Vecs", "kernelmatrix_diagX"] 289.877 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Cosine", "Vecs", "kernelmatrix_diagXY"] 603.454 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponential", "ColVecs", "kernelmatrixX"] 6.975 μs (5%) 6.73 KiB (1%) 3
["Allocated Functions", "Exponential", "ColVecs", "kernelmatrixXY"] 6.625 μs (5%) 6.97 KiB (1%) 4
["Allocated Functions", "Exponential", "ColVecs", "kernelmatrix_diagX"] 203.177 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponential", "ColVecs", "kernelmatrix_diagXY"] 354.206 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponential", "RowVecs", "kernelmatrixX"] 2.333 μs (5%) 3.67 KiB (1%) 4
["Allocated Functions", "Exponential", "RowVecs", "kernelmatrixXY"] 2.533 μs (5%) 5.59 KiB (1%) 6
["Allocated Functions", "Exponential", "RowVecs", "kernelmatrix_diagX"] 134.697 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "Exponential", "RowVecs", "kernelmatrix_diagXY"] 309.091 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "Exponential", "Vecs", "kernelmatrixX"] 8.300 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "Exponential", "Vecs", "kernelmatrixXY"] 7.700 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "Exponential", "Vecs", "kernelmatrix_diagX"] 209.500 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponential", "Vecs", "kernelmatrix_diagXY"] 537.297 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponentiated", "ColVecs", "kernelmatrixX"] 10.500 μs (5%) 6.50 KiB (1%) 2
["Allocated Functions", "Exponentiated", "ColVecs", "kernelmatrixXY"] 11.200 μs (5%) 6.50 KiB (1%) 2
["Allocated Functions", "Exponentiated", "ColVecs", "kernelmatrix_diagX"] 570.621 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponentiated", "ColVecs", "kernelmatrix_diagXY"] 568.681 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponentiated", "RowVecs", "kernelmatrixX"] 3.943 μs (5%) 3.52 KiB (1%) 3
["Allocated Functions", "Exponentiated", "RowVecs", "kernelmatrixXY"] 4.086 μs (5%) 5.28 KiB (1%) 4
["Allocated Functions", "Exponentiated", "RowVecs", "kernelmatrix_diagX"] 328.821 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "Exponentiated", "RowVecs", "kernelmatrix_diagXY"] 325.214 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "Exponentiated", "Vecs", "kernelmatrixX"] 10.500 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "Exponentiated", "Vecs", "kernelmatrixXY"] 9.900 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "Exponentiated", "Vecs", "kernelmatrix_diagX"] 660.256 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "Exponentiated", "Vecs", "kernelmatrix_diagXY"] 668.131 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "GammaExponential", "ColVecs", "kernelmatrixX"] 48.100 μs (5%) 6.83 KiB (1%) 4
["Allocated Functions", "GammaExponential", "ColVecs", "kernelmatrixXY"] 50.000 μs (5%) 7.06 KiB (1%) 5
["Allocated Functions", "GammaExponential", "ColVecs", "kernelmatrix_diagX"] 441.667 ns (5%) 576 bytes (1%) 3
["Allocated Functions", "GammaExponential", "ColVecs", "kernelmatrix_diagXY"] 2.222 μs (5%) 576 bytes (1%) 3
["Allocated Functions", "GammaExponential", "RowVecs", "kernelmatrixX"] 12.600 μs (5%) 3.77 KiB (1%) 5
["Allocated Functions", "GammaExponential", "RowVecs", "kernelmatrixXY"] 15.000 μs (5%) 5.69 KiB (1%) 7
["Allocated Functions", "GammaExponential", "RowVecs", "kernelmatrix_diagX"] 287.586 ns (5%) 416 bytes (1%) 3
["Allocated Functions", "GammaExponential", "RowVecs", "kernelmatrix_diagXY"] 1.220 μs (5%) 416 bytes (1%) 3
["Allocated Functions", "GammaExponential", "Vecs", "kernelmatrixX"] 39.900 μs (5%) 6.69 KiB (1%) 5
["Allocated Functions", "GammaExponential", "Vecs", "kernelmatrixXY"] 41.300 μs (5%) 6.75 KiB (1%) 8
["Allocated Functions", "GammaExponential", "Vecs", "kernelmatrix_diagX"] 473.846 ns (5%) 576 bytes (1%) 3
["Allocated Functions", "GammaExponential", "Vecs", "kernelmatrix_diagXY"] 2.367 μs (5%) 576 bytes (1%) 3
["Allocated Functions", "SqExponential", "ColVecs", "kernelmatrixX"] 6.600 μs (5%) 6.73 KiB (1%) 3
["Allocated Functions", "SqExponential", "ColVecs", "kernelmatrixXY"] 6.620 μs (5%) 6.97 KiB (1%) 4
["Allocated Functions", "SqExponential", "ColVecs", "kernelmatrix_diagX"] 195.688 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "SqExponential", "ColVecs", "kernelmatrix_diagXY"] 325.230 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "SqExponential", "RowVecs", "kernelmatrixX"] 2.311 μs (5%) 3.67 KiB (1%) 4
["Allocated Functions", "SqExponential", "RowVecs", "kernelmatrixXY"] 2.633 μs (5%) 5.59 KiB (1%) 6
["Allocated Functions", "SqExponential", "RowVecs", "kernelmatrix_diagX"] 141.809 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "SqExponential", "RowVecs", "kernelmatrix_diagXY"] 287.456 ns (5%) 320 bytes (1%) 2
["Allocated Functions", "SqExponential", "Vecs", "kernelmatrixX"] 7.000 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "SqExponential", "Vecs", "kernelmatrixXY"] 6.900 μs (5%) 6.59 KiB (1%) 4
["Allocated Functions", "SqExponential", "Vecs", "kernelmatrix_diagX"] 207.317 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "SqExponential", "Vecs", "kernelmatrix_diagXY"] 473.469 ns (5%) 480 bytes (1%) 2
["Allocated Functions", "White", "ColVecs", "kernelmatrixX"] 1.110 μs (5%) 992 bytes (1%) 2
["Allocated Functions", "White", "ColVecs", "kernelmatrixXY"] 1.020 μs (5%) 992 bytes (1%) 2
["Allocated Functions", "White", "ColVecs", "kernelmatrix_diagX"] 280.342 ns (5%) 224 bytes (1%) 2
["Allocated Functions", "White", "ColVecs", "kernelmatrix_diagXY"] 116.988 ns (5%) 224 bytes (1%) 2
["Allocated Functions", "White", "RowVecs", "kernelmatrixX"] 727.152 ns (5%) 2.14 KiB (1%) 3
["Allocated Functions", "White", "RowVecs", "kernelmatrixXY"] 913.695 ns (5%) 3.91 KiB (1%) 4
["Allocated Functions", "White", "RowVecs", "kernelmatrix_diagX"] 302.521 ns (5%) 192 bytes (1%) 2
["Allocated Functions", "White", "RowVecs", "kernelmatrix_diagXY"] 82.030 ns (5%) 192 bytes (1%) 2
["Allocated Functions", "White", "Vecs", "kernelmatrixX"] 3.857 μs (5%) 4.66 KiB (1%) 7
["Allocated Functions", "White", "Vecs", "kernelmatrixXY"] 3.513 μs (5%) 4.66 KiB (1%) 7
["Allocated Functions", "White", "Vecs", "kernelmatrix_diagX"] 379.602 ns (5%) 224 bytes (1%) 2
["Allocated Functions", "White", "Vecs", "kernelmatrix_diagXY"] 248.983 ns (5%) 224 bytes (1%) 2
["Zygote", "Constant", "ColVecs", "kernelmatrixXY_forward"] 6.575 μs (5%) 8.12 KiB (1%) 95
["Zygote", "Constant", "ColVecs", "kernelmatrixXY_reverse"] 52.000 μs (5%) 4.00 KiB (1%) 143
["Zygote", "Constant", "ColVecs", "kernelmatrixX_forward"] 6.325 μs (5%) 8.12 KiB (1%) 95
["Zygote", "Constant", "ColVecs", "kernelmatrixX_reverse"] 51.401 μs (5%) 4.00 KiB (1%) 143
["Zygote", "Constant", "ColVecs", "kernelmatrix_diagXY_forward"] 106.400 μs (5%) 67.25 KiB (1%) 1294
["Zygote", "Constant", "ColVecs", "kernelmatrix_diagXY_reverse"] 534.201 μs (5%) 73.34 KiB (1%) 2389
["Zygote", "Constant", "ColVecs", "kernelmatrix_diagX_forward"] 99.901 μs (5%) 67.25 KiB (1%) 1294
["Zygote", "Constant", "ColVecs", "kernelmatrix_diagX_reverse"] 588.201 μs (5%) 73.38 KiB (1%) 2391
["Zygote", "Constant", "RowVecs", "kernelmatrixXY_forward"] 6.000 μs (5%) 5.75 KiB (1%) 95
["Zygote", "Constant", "RowVecs", "kernelmatrixXY_reverse"] 51.701 μs (5%) 4.00 KiB (1%) 143
["Zygote", "Constant", "RowVecs", "kernelmatrixX_forward"] 6.480 μs (5%) 5.75 KiB (1%) 95
["Zygote", "Constant", "RowVecs", "kernelmatrixX_reverse"] 52.700 μs (5%) 4.00 KiB (1%) 143
["Zygote", "Constant", "RowVecs", "kernelmatrix_diagXY_forward"] 50.700 μs (5%) 34.55 KiB (1%) 664
["Zygote", "Constant", "RowVecs", "kernelmatrix_diagXY_reverse"] 281.501 μs (5%) 37.92 KiB (1%) 1239
["Zygote", "Constant", "RowVecs", "kernelmatrix_diagX_forward"] 48.500 μs (5%) 34.55 KiB (1%) 664
["Zygote", "Constant", "RowVecs", "kernelmatrix_diagX_reverse"] 283.700 μs (5%) 37.95 KiB (1%) 1241
["Zygote", "Constant", "Vecs", "kernelmatrixXY_forward"] 6.775 μs (5%) 8.12 KiB (1%) 95
["Zygote", "Constant", "Vecs", "kernelmatrixXY_reverse"] 56.300 μs (5%) 4.00 KiB (1%) 143
["Zygote", "Constant", "Vecs", "kernelmatrixX_forward"] 6.675 μs (5%) 8.12 KiB (1%) 95
["Zygote", "Constant", "Vecs", "kernelmatrixX_reverse"] 52.300 μs (5%) 4.00 KiB (1%) 143
["Zygote", "Constant", "Vecs", "kernelmatrix_diagXY_forward"] 107.800 μs (5%) 67.36 KiB (1%) 1295
["Zygote", "Constant", "Vecs", "kernelmatrix_diagXY_reverse"] 567.100 μs (5%) 73.75 KiB (1%) 2396
["Zygote", "Constant", "Vecs", "kernelmatrix_diagX_forward"] 109.200 μs (5%) 67.36 KiB (1%) 1295
["Zygote", "Constant", "Vecs", "kernelmatrix_diagX_reverse"] 592.000 μs (5%) 73.78 KiB (1%) 2398
["Zygote", "Cosine", "ColVecs", "kernelmatrixXY_forward"] 41.300 μs (5%) 51.86 KiB (1%) 85
["Zygote", "Cosine", "ColVecs", "kernelmatrixXY_reverse"] 34.800 μs (5%) 30.25 KiB (1%) 103
["Zygote", "Cosine", "ColVecs", "kernelmatrixX_forward"] 28.800 μs (5%) 25.42 KiB (1%) 43
["Zygote", "Cosine", "ColVecs", "kernelmatrixX_reverse"] 14.200 μs (5%) 16.70 KiB (1%) 53
["Zygote", "Cosine", "ColVecs", "kernelmatrix_diagXY_forward"] 1.019 μs (5%) 1.36 KiB (1%) 4
["Zygote", "Cosine", "ColVecs", "kernelmatrix_diagXY_reverse"] 10.702 μs (5%) 7.78 KiB (1%) 75
["Zygote", "Cosine", "ColVecs", "kernelmatrix_diagX_forward"] 12.600 μs (5%) 9.00 KiB (1%) 78
["Zygote", "Cosine", "ColVecs", "kernelmatrix_diagX_reverse"] 8.633 μs (5%) 2.14 KiB (1%) 14
["Zygote", "Cosine", "RowVecs", "kernelmatrixXY_forward"] 21.001 μs (5%) 22.44 KiB (1%) 87
["Zygote", "Cosine", "RowVecs", "kernelmatrixXY_reverse"] 27.102 μs (5%) 18.12 KiB (1%) 107
["Zygote", "Cosine", "RowVecs", "kernelmatrixX_forward"] 12.000 μs (5%) 10.67 KiB (1%) 44
["Zygote", "Cosine", "RowVecs", "kernelmatrixX_reverse"] 7.776 μs (5%) 7.00 KiB (1%) 55
["Zygote", "Cosine", "RowVecs", "kernelmatrix_diagXY_forward"] 653.804 ns (5%) 832 bytes (1%) 4
["Zygote", "Cosine", "RowVecs", "kernelmatrix_diagXY_reverse"] 7.820 μs (5%) 6.91 KiB (1%) 79
["Zygote", "Cosine", "RowVecs", "kernelmatrix_diagX_forward"] 12.901 μs (5%) 8.45 KiB (1%) 78
["Zygote", "Cosine", "RowVecs", "kernelmatrix_diagX_reverse"] 8.401 μs (5%) 1.44 KiB (1%) 14
["Zygote", "Cosine", "Vecs", "kernelmatrixXY_forward"] 95.901 μs (5%) 151.53 KiB (1%) 2426
["Zygote", "Cosine", "Vecs", "kernelmatrixXY_reverse"] 180.900 μs (5%) 400.44 KiB (1%) 2522
["Zygote", "Cosine", "Vecs", "kernelmatrixX_forward"] 95.900 μs (5%) 151.02 KiB (1%) 2420
["Zygote", "Cosine", "Vecs", "kernelmatrixX_reverse"] 184.501 μs (5%) 403.56 KiB (1%) 2538
["Zygote", "Cosine", "Vecs", "kernelmatrix_diagXY_forward"] 4.457 μs (5%) 7.86 KiB (1%) 125
["Zygote", "Cosine", "Vecs", "kernelmatrix_diagXY_reverse"] 3.433 μs (5%) 10.30 KiB (1%) 54
["Zygote", "Cosine", "Vecs", "kernelmatrix_diagX_forward"] 14.400 μs (5%) 8.69 KiB (1%) 88
["Zygote", "Cosine", "Vecs", "kernelmatrix_diagX_reverse"] 12.000 μs (5%) 2.27 KiB (1%) 18
["Zygote", "Exponential", "ColVecs", "kernelmatrixXY_forward"] 28.300 μs (5%) 54.94 KiB (1%) 86
["Zygote", "Exponential", "ColVecs", "kernelmatrixXY_reverse"] 31.700 μs (5%) 30.25 KiB (1%) 103
["Zygote", "Exponential", "ColVecs", "kernelmatrixX_forward"] 17.300 μs (5%) 28.50 KiB (1%) 44
["Zygote", "Exponential", "ColVecs", "kernelmatrixX_reverse"] 13.500 μs (5%) 16.70 KiB (1%) 53
["Zygote", "Exponential", "ColVecs", "kernelmatrix_diagXY_forward"] 608.982 ns (5%) 1.55 KiB (1%) 4
["Zygote", "Exponential", "ColVecs", "kernelmatrix_diagXY_reverse"] 9.600 μs (5%) 7.97 KiB (1%) 75
["Zygote", "Exponential", "ColVecs", "kernelmatrix_diagX_forward"] 12.100 μs (5%) 9.19 KiB (1%) 78
["Zygote", "Exponential", "ColVecs", "kernelmatrix_diagX_reverse"] 8.000 μs (5%) 2.33 KiB (1%) 14
["Zygote", "Exponential", "RowVecs", "kernelmatrixXY_forward"] 17.200 μs (5%) 23.19 KiB (1%) 87
["Zygote", "Exponential", "RowVecs", "kernelmatrixXY_reverse"] 25.400 μs (5%) 18.12 KiB (1%) 107
["Zygote", "Exponential", "RowVecs", "kernelmatrixX_forward"] 8.600 μs (5%) 11.42 KiB (1%) 44
["Zygote", "Exponential", "RowVecs", "kernelmatrixX_reverse"] 8.500 μs (5%) 7.00 KiB (1%) 55
["Zygote", "Exponential", "RowVecs", "kernelmatrix_diagXY_forward"] 467.725 ns (5%) 912 bytes (1%) 4
["Zygote", "Exponential", "RowVecs", "kernelmatrix_diagXY_reverse"] 7.250 μs (5%) 6.98 KiB (1%) 79
["Zygote", "Exponential", "RowVecs", "kernelmatrix_diagX_forward"] 12.800 μs (5%) 8.53 KiB (1%) 78
["Zygote", "Exponential", "RowVecs", "kernelmatrix_diagX_reverse"] 8.600 μs (5%) 1.52 KiB (1%) 14
["Zygote", "Exponential", "Vecs", "kernelmatrixXY_forward"] 90.300 μs (5%) 154.61 KiB (1%) 2427
["Zygote", "Exponential", "Vecs", "kernelmatrixXY_reverse"] 163.000 μs (5%) 400.44 KiB (1%) 2522
["Zygote", "Exponential", "Vecs", "kernelmatrixX_forward"] 93.100 μs (5%) 154.09 KiB (1%) 2421
["Zygote", "Exponential", "Vecs", "kernelmatrixX_reverse"] 159.000 μs (5%) 403.56 KiB (1%) 2538
["Zygote", "Exponential", "Vecs", "kernelmatrix_diagXY_forward"] 4.314 μs (5%) 8.05 KiB (1%) 125
["Zygote", "Exponential", "Vecs", "kernelmatrix_diagXY_reverse"] 2.911 μs (5%) 10.48 KiB (1%) 54
["Zygote", "Exponential", "Vecs", "kernelmatrix_diagX_forward"] 12.700 μs (5%) 8.88 KiB (1%) 88
["Zygote", "Exponential", "Vecs", "kernelmatrix_diagX_reverse"] 11.900 μs (5%) 2.45 KiB (1%) 18
["Zygote", "Exponentiated", "ColVecs", "kernelmatrixXY_forward"] 14.600 μs (5%) 16.53 KiB (1%) 10
["Zygote", "Exponentiated", "ColVecs", "kernelmatrixXY_reverse"] 22.200 μs (5%) 9.22 KiB (1%) 80
["Zygote", "Exponentiated", "ColVecs", "kernelmatrixX_forward"] 14.400 μs (5%) 16.02 KiB (1%) 4
["Zygote", "Exponentiated", "ColVecs", "kernelmatrixX_reverse"] 8.275 μs (5%) 7.84 KiB (1%) 37
["Zygote", "Exponentiated", "ColVecs", "kernelmatrix_diagXY_forward"] 826.897 ns (5%) 1.05 KiB (1%) 4
["Zygote", "Exponentiated", "ColVecs", "kernelmatrix_diagXY_reverse"] 7.640 μs (5%) 5.89 KiB (1%) 69
["Zygote", "Exponentiated", "ColVecs", "kernelmatrix_diagX_forward"] 847.559 ns (5%) 1.05 KiB (1%) 4
["Zygote", "Exponentiated", "ColVecs", "kernelmatrix_diagX_reverse"] 7.940 μs (5%) 7.62 KiB (1%) 70
["Zygote", "Exponentiated", "RowVecs", "kernelmatrixXY_forward"] 6.160 μs (5%) 8.31 KiB (1%) 12
["Zygote", "Exponentiated", "RowVecs", "kernelmatrixXY_reverse"] 19.500 μs (5%) 6.69 KiB (1%) 80
["Zygote", "Exponentiated", "RowVecs", "kernelmatrixX_forward"] 5.617 μs (5%) 6.03 KiB (1%) 5
["Zygote", "Exponentiated", "RowVecs", "kernelmatrixX_reverse"] 6.380 μs (5%) 4.50 KiB (1%) 37
["Zygote", "Exponentiated", "RowVecs", "kernelmatrix_diagXY_forward"] 506.000 ns (5%) 672 bytes (1%) 4
["Zygote", "Exponentiated", "RowVecs", "kernelmatrix_diagXY_reverse"] 7.050 μs (5%) 5.72 KiB (1%) 73
["Zygote", "Exponentiated", "RowVecs", "kernelmatrix_diagX_forward"] 488.832 ns (5%) 672 bytes (1%) 4
["Zygote", "Exponentiated", "RowVecs", "kernelmatrix_diagX_reverse"] 8.320 μs (5%) 7.45 KiB (1%) 74
["Zygote", "Exponentiated", "Vecs", "kernelmatrixXY_forward"] 20.500 μs (5%) 26.44 KiB (1%) 25
["Zygote", "Exponentiated", "Vecs", "kernelmatrixXY_reverse"] 180.300 μs (5%) 397.08 KiB (1%) 2519
["Zygote", "Exponentiated", "Vecs", "kernelmatrixX_forward"] 18.100 μs (5%) 25.92 KiB (1%) 19
["Zygote", "Exponentiated", "Vecs", "kernelmatrixX_reverse"] 188.400 μs (5%) 400.20 KiB (1%) 2535
["Zygote", "Exponentiated", "Vecs", "kernelmatrix_diagXY_forward"] 1.022 μs (5%) 1.61 KiB (1%) 5
["Zygote", "Exponentiated", "Vecs", "kernelmatrix_diagXY_reverse"] 2.320 μs (5%) 7.38 KiB (1%) 45
["Zygote", "Exponentiated", "Vecs", "kernelmatrix_diagX_forward"] 982.143 ns (5%) 1.61 KiB (1%) 5
["Zygote", "Exponentiated", "Vecs", "kernelmatrix_diagX_reverse"] 3.433 μs (5%) 10.75 KiB (1%) 67
["Zygote", "GammaExponential", "ColVecs", "kernelmatrixXY_forward"] 2.296 ms (5%) 1.33 MiB (1%) 25335
["Zygote", "GammaExponential", "ColVecs", "kernelmatrixXY_reverse"] 11.815 ms (5%) 1.55 MiB (1%) 49417
["Zygote", "GammaExponential", "ColVecs", "kernelmatrixX_forward"] 2.278 ms (5%) 1.30 MiB (1%) 25293
["Zygote", "GammaExponential", "ColVecs", "kernelmatrixX_reverse"] 11.478 ms (5%) 1.53 MiB (1%) 49366
["Zygote", "GammaExponential", "ColVecs", "kernelmatrix_diagXY_forward"] 103.000 μs (5%) 69.67 KiB (1%) 1312
["Zygote", "GammaExponential", "ColVecs", "kernelmatrix_diagXY_reverse"] 713.501 μs (5%) 90.58 KiB (1%) 2652
["Zygote", "GammaExponential", "ColVecs", "kernelmatrix_diagX_forward"] 119.300 μs (5%) 77.84 KiB (1%) 1387
["Zygote", "GammaExponential", "ColVecs", "kernelmatrix_diagX_reverse"] 574.100 μs (5%) 84.89 KiB (1%) 2585
["Zygote", "GammaExponential", "RowVecs", "kernelmatrixXY_forward"] 652.200 μs (5%) 351.75 KiB (1%) 6436
["Zygote", "GammaExponential", "RowVecs", "kernelmatrixXY_reverse"] 2.957 ms (5%) 408.83 KiB (1%) 12521
["Zygote", "GammaExponential", "RowVecs", "kernelmatrixX_forward"] 639.701 μs (5%) 340.08 KiB (1%) 6393
["Zygote", "GammaExponential", "RowVecs", "kernelmatrixX_reverse"] 2.999 ms (5%) 397.58 KiB (1%) 12468
["Zygote", "GammaExponential", "RowVecs", "kernelmatrix_diagXY_forward"] 52.600 μs (5%) 36.50 KiB (1%) 682
["Zygote", "GammaExponential", "RowVecs", "kernelmatrix_diagXY_reverse"] 320.001 μs (5%) 49.78 KiB (1%) 1426
["Zygote", "GammaExponential", "RowVecs", "kernelmatrix_diagX_forward"] 68.500 μs (5%) 44.67 KiB (1%) 757
["Zygote", "GammaExponential", "RowVecs", "kernelmatrix_diagX_reverse"] 311.701 μs (5%) 44.27 KiB (1%) 1355
["Zygote", "GammaExponential", "Vecs", "kernelmatrixXY_forward"] 2.183 ms (5%) 1.42 MiB (1%) 27675
["Zygote", "GammaExponential", "Vecs", "kernelmatrixXY_reverse"] 11.916 ms (5%) 1.91 MiB (1%) 51829
["Zygote", "GammaExponential", "Vecs", "kernelmatrixX_forward"] 2.069 ms (5%) 1.42 MiB (1%) 27669
["Zygote", "GammaExponential", "Vecs", "kernelmatrixX_reverse"] 11.781 ms (5%) 1.91 MiB (1%) 51844
["Zygote", "GammaExponential", "Vecs", "kernelmatrix_diagXY_forward"] 108.600 μs (5%) 76.17 KiB (1%) 1433
["Zygote", "GammaExponential", "Vecs", "kernelmatrix_diagXY_reverse"] 584.101 μs (5%) 93.00 KiB (1%) 2626
["Zygote", "GammaExponential", "Vecs", "kernelmatrix_diagX_forward"] 120.900 μs (5%) 77.45 KiB (1%) 1397
["Zygote", "GammaExponential", "Vecs", "kernelmatrix_diagX_reverse"] 582.200 μs (5%) 84.97 KiB (1%) 2589
["Zygote", "SqExponential", "ColVecs", "kernelmatrixXY_forward"] 14.500 μs (5%) 32.58 KiB (1%) 13
["Zygote", "SqExponential", "ColVecs", "kernelmatrixXY_reverse"] 27.600 μs (5%) 20.50 KiB (1%) 100
["Zygote", "SqExponential", "ColVecs", "kernelmatrixX_forward"] 14.500 μs (5%) 31.83 KiB (1%) 6
["Zygote", "SqExponential", "ColVecs", "kernelmatrixX_reverse"] 13.400 μs (5%) 13.38 KiB (1%) 50
["Zygote", "SqExponential", "ColVecs", "kernelmatrix_diagXY_forward"] 692.656 ns (5%) 1.81 KiB (1%) 4
["Zygote", "SqExponential", "ColVecs", "kernelmatrix_diagXY_reverse"] 10.500 μs (5%) 8.00 KiB (1%) 74
["Zygote", "SqExponential", "ColVecs", "kernelmatrix_diagX_forward"] 13.700 μs (5%) 8.88 KiB (1%) 77
["Zygote", "SqExponential", "ColVecs", "kernelmatrix_diagX_reverse"] 9.100 μs (5%) 2.59 KiB (1%) 14
["Zygote", "SqExponential", "RowVecs", "kernelmatrixXY_forward"] 5.343 μs (5%) 12.50 KiB (1%) 14
["Zygote", "SqExponential", "RowVecs", "kernelmatrixXY_reverse"] 25.200 μs (5%) 15.50 KiB (1%) 104
["Zygote", "SqExponential", "RowVecs", "kernelmatrixX_forward"] 4.662 μs (5%) 10.06 KiB (1%) 6
["Zygote", "SqExponential", "RowVecs", "kernelmatrixX_reverse"] 8.400 μs (5%) 6.05 KiB (1%) 52
["Zygote", "SqExponential", "RowVecs", "kernelmatrix_diagXY_forward"] 503.030 ns (5%) 1.05 KiB (1%) 4
["Zygote", "SqExponential", "RowVecs", "kernelmatrix_diagXY_reverse"] 7.800 μs (5%) 6.98 KiB (1%) 78
["Zygote", "SqExponential", "RowVecs", "kernelmatrix_diagX_forward"] 13.700 μs (5%) 8.11 KiB (1%) 77
["Zygote", "SqExponential", "RowVecs", "kernelmatrix_diagX_reverse"] 8.850 μs (5%) 1.67 KiB (1%) 14
["Zygote", "SqExponential", "Vecs", "kernelmatrixXY_forward"] 99.501 μs (5%) 157.73 KiB (1%) 2427
["Zygote", "SqExponential", "Vecs", "kernelmatrixXY_reverse"] 189.100 μs (5%) 400.41 KiB (1%) 2521
["Zygote", "SqExponential", "Vecs", "kernelmatrixX_forward"] 92.300 μs (5%) 157.22 KiB (1%) 2421
["Zygote", "SqExponential", "Vecs", "kernelmatrixX_reverse"] 172.799 μs (5%) 403.53 KiB (1%) 2537
["Zygote", "SqExponential", "Vecs", "kernelmatrix_diagXY_forward"] 4.829 μs (5%) 8.14 KiB (1%) 125
["Zygote", "SqExponential", "Vecs", "kernelmatrix_diagXY_reverse"] 3.344 μs (5%) 10.58 KiB (1%) 54
["Zygote", "SqExponential", "Vecs", "kernelmatrix_diagX_forward"] 14.300 μs (5%) 8.56 KiB (1%) 87
["Zygote", "SqExponential", "Vecs", "kernelmatrix_diagX_reverse"] 12.000 μs (5%) 2.72 KiB (1%) 18
["Zygote", "White", "ColVecs", "kernelmatrixXY_forward"] 2.925 μs (5%) 7.88 KiB (1%) 10
["Zygote", "White", "ColVecs", "kernelmatrixXY_reverse"] 4.967 μs (5%) 752 bytes (1%) 9
["Zygote", "White", "ColVecs", "kernelmatrixX_forward"] 2.600 μs (5%) 7.36 KiB (1%) 4
["Zygote", "White", "ColVecs", "kernelmatrixX_reverse"] 411.224 ns (5%) 496 bytes (1%) 1
["Zygote", "White", "ColVecs", "kernelmatrix_diagXY_forward"] 216.631 ns (5%) 640 bytes (1%) 4
["Zygote", "White", "ColVecs", "kernelmatrix_diagXY_reverse"] 56.110 ns (5%) 112 bytes (1%) 1
["Zygote", "White", "ColVecs", "kernelmatrix_diagX_forward"] 402.500 ns (5%) 640 bytes (1%) 4
["Zygote", "White", "ColVecs", "kernelmatrix_diagX_reverse"] 56.486 ns (5%) 112 bytes (1%) 1
["Zygote", "White", "RowVecs", "kernelmatrixXY_forward"] 1.390 μs (5%) 6.20 KiB (1%) 12
["Zygote", "White", "RowVecs", "kernelmatrixXY_reverse"] 4.750 μs (5%) 448 bytes (1%) 9
["Zygote", "White", "RowVecs", "kernelmatrixX_forward"] 1.160 μs (5%) 3.92 KiB (1%) 5
["Zygote", "White", "RowVecs", "kernelmatrixX_reverse"] 156.923 ns (5%) 192 bytes (1%) 1
["Zygote", "White", "RowVecs", "kernelmatrix_diagXY_forward"] 157.280 ns (5%) 448 bytes (1%) 4
["Zygote", "White", "RowVecs", "kernelmatrix_diagXY_reverse"] 45.390 ns (5%) 96 bytes (1%) 1
["Zygote", "White", "RowVecs", "kernelmatrix_diagX_forward"] 386.070 ns (5%) 448 bytes (1%) 4
["Zygote", "White", "RowVecs", "kernelmatrix_diagX_reverse"] 47.465 ns (5%) 96 bytes (1%) 1
["Zygote", "White", "Vecs", "kernelmatrixXY_forward"] 5.320 μs (5%) 12.00 KiB (1%) 20
["Zygote", "White", "Vecs", "kernelmatrixXY_reverse"] 5.033 μs (5%) 752 bytes (1%) 9
["Zygote", "White", "Vecs", "kernelmatrixX_forward"] 5.880 μs (5%) 11.48 KiB (1%) 14
["Zygote", "White", "Vecs", "kernelmatrixX_reverse"] 412.690 ns (5%) 496 bytes (1%) 1
["Zygote", "White", "Vecs", "kernelmatrix_diagXY_forward"] 406.566 ns (5%) 752 bytes (1%) 5
["Zygote", "White", "Vecs", "kernelmatrix_diagXY_reverse"] 161.134 ns (5%) 384 bytes (1%) 5
["Zygote", "White", "Vecs", "kernelmatrix_diagX_forward"] 535.393 ns (5%) 752 bytes (1%) 5
["Zygote", "White", "Vecs", "kernelmatrix_diagX_reverse"] 247.484 ns (5%) 448 bytes (1%) 6

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Allocated Functions", "Constant", "ColVecs"]
  • ["Allocated Functions", "Constant", "RowVecs"]
  • ["Allocated Functions", "Constant", "Vecs"]
  • ["Allocated Functions", "Cosine", "ColVecs"]
  • ["Allocated Functions", "Cosine", "RowVecs"]
  • ["Allocated Functions", "Cosine", "Vecs"]
  • ["Allocated Functions", "Exponential", "ColVecs"]
  • ["Allocated Functions", "Exponential", "RowVecs"]
  • ["Allocated Functions", "Exponential", "Vecs"]
  • ["Allocated Functions", "Exponentiated", "ColVecs"]
  • ["Allocated Functions", "Exponentiated", "RowVecs"]
  • ["Allocated Functions", "Exponentiated", "Vecs"]
  • ["Allocated Functions", "GammaExponential", "ColVecs"]
  • ["Allocated Functions", "GammaExponential", "RowVecs"]
  • ["Allocated Functions", "GammaExponential", "Vecs"]
  • ["Allocated Functions", "SqExponential", "ColVecs"]
  • ["Allocated Functions", "SqExponential", "RowVecs"]
  • ["Allocated Functions", "SqExponential", "Vecs"]
  • ["Allocated Functions", "White", "ColVecs"]
  • ["Allocated Functions", "White", "RowVecs"]
  • ["Allocated Functions", "White", "Vecs"]
  • ["Zygote", "Constant", "ColVecs"]
  • ["Zygote", "Constant", "RowVecs"]
  • ["Zygote", "Constant", "Vecs"]
  • ["Zygote", "Cosine", "ColVecs"]
  • ["Zygote", "Cosine", "RowVecs"]
  • ["Zygote", "Cosine", "Vecs"]
  • ["Zygote", "Exponential", "ColVecs"]
  • ["Zygote", "Exponential", "RowVecs"]
  • ["Zygote", "Exponential", "Vecs"]
  • ["Zygote", "Exponentiated", "ColVecs"]
  • ["Zygote", "Exponentiated", "RowVecs"]
  • ["Zygote", "Exponentiated", "Vecs"]
  • ["Zygote", "GammaExponential", "ColVecs"]
  • ["Zygote", "GammaExponential", "RowVecs"]
  • ["Zygote", "GammaExponential", "Vecs"]
  • ["Zygote", "SqExponential", "ColVecs"]
  • ["Zygote", "SqExponential", "RowVecs"]
  • ["Zygote", "SqExponential", "Vecs"]
  • ["Zygote", "White", "ColVecs"]
  • ["Zygote", "White", "RowVecs"]
  • ["Zygote", "White", "Vecs"]

Julia versioninfo

Julia Version 1.6.5
Commit 9058264a69 (2021-12-19 12:30 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.3 LTS
  uname: Linux 5.11.0-1027-azure #30~20.04.1-Ubuntu SMP Wed Jan 12 20:56:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz        754 s          1 s        195 s      18964 s          0 s
       #2  2294 MHz      18953 s          1 s        598 s        459 s          0 s
       
  Memory: 6.788978576660156 GB (2990.96875 MB free)
  Uptime: 2006.58 sec
  Load Avg:  1.08  1.04  0.96
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-11.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/KernelFunctions.jl/KernelFunctions.jl

Job Properties

  • Time of benchmark: 1 Feb 2022 - 15:58
  • Package commit: 1a3fa3
  • Julia commit: 905826
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Exponential", "ColVecs", "kernelmatrixX"] 6.200 μs (5%) 6.73 KiB (1%) 3
["Exponential", "ColVecs", "kernelmatrixXY"] 6.200 μs (5%) 6.97 KiB (1%) 4
["Exponential", "ColVecs", "kernelmatrix_diagX"] 200.000 ns (5%) 480 bytes (1%) 2
["Exponential", "ColVecs", "kernelmatrix_diagXY"] 300.000 ns (5%) 480 bytes (1%) 2
["Exponential", "RowVecs", "kernelmatrixX"] 2.300 μs (5%) 3.67 KiB (1%) 4
["Exponential", "RowVecs", "kernelmatrixXY"] 2.500 μs (5%) 5.59 KiB (1%) 6
["Exponential", "RowVecs", "kernelmatrix_diagX"] 100.000 ns (5%) 320 bytes (1%) 2
["Exponential", "RowVecs", "kernelmatrix_diagXY"] 300.000 ns (5%) 320 bytes (1%) 2
["Exponential", "Vecs", "kernelmatrixX"] 7.400 μs (5%) 6.59 KiB (1%) 4
["Exponential", "Vecs", "kernelmatrixXY"] 7.500 μs (5%) 6.59 KiB (1%) 4
["Exponential", "Vecs", "kernelmatrix_diagX"] 200.000 ns (5%) 480 bytes (1%) 2
["Exponential", "Vecs", "kernelmatrix_diagXY"] 500.000 ns (5%) 480 bytes (1%) 2
["SqExponential", "ColVecs", "kernelmatrixX"] 6.700 μs (5%) 6.73 KiB (1%) 3
["SqExponential", "ColVecs", "kernelmatrixXY"] 6.400 μs (5%) 6.97 KiB (1%) 4
["SqExponential", "ColVecs", "kernelmatrix_diagX"] 200.000 ns (5%) 480 bytes (1%) 2
["SqExponential", "ColVecs", "kernelmatrix_diagXY"] 300.000 ns (5%) 480 bytes (1%) 2
["SqExponential", "RowVecs", "kernelmatrixX"] 2.400 μs (5%) 3.67 KiB (1%) 4
["SqExponential", "RowVecs", "kernelmatrixXY"] 2.500 μs (5%) 5.59 KiB (1%) 6
["SqExponential", "RowVecs", "kernelmatrix_diagX"] 100.000 ns (5%) 320 bytes (1%) 2
["SqExponential", "RowVecs", "kernelmatrix_diagXY"] 300.000 ns (5%) 320 bytes (1%) 2
["SqExponential", "Vecs", "kernelmatrixX"] 6.900 μs (5%) 6.59 KiB (1%) 4
["SqExponential", "Vecs", "kernelmatrixXY"] 6.900 μs (5%) 6.59 KiB (1%) 4
["SqExponential", "Vecs", "kernelmatrix_diagX"] 200.000 ns (5%) 480 bytes (1%) 2
["SqExponential", "Vecs", "kernelmatrix_diagXY"] 400.000 ns (5%) 480 bytes (1%) 2

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Exponential", "ColVecs"]
  • ["Exponential", "RowVecs"]
  • ["Exponential", "Vecs"]
  • ["SqExponential", "ColVecs"]
  • ["SqExponential", "RowVecs"]
  • ["SqExponential", "Vecs"]

Julia versioninfo

Julia Version 1.6.5
Commit 9058264a69 (2021-12-19 12:30 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 20.04.3 LTS
  uname: Linux 5.11.0-1027-azure #30~20.04.1-Ubuntu SMP Wed Jan 12 20:56:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz        879 s          1 s        203 s      19449 s          0 s
       #2  2294 MHz      19427 s          1 s        617 s        586 s          0 s
       
  Memory: 6.788978576660156 GB (3073.578125 MB free)
  Uptime: 2068.69 sec
  Load Avg:  1.06  1.04  0.98
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-11.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:                    x86_64
CPU op-mode(s):                  32-bit, 64-bit
Byte Order:                      Little Endian
Address sizes:                   46 bits physical, 48 bits virtual
CPU(s):                          2
On-line CPU(s) list:             0,1
Thread(s) per core:              1
Core(s) per socket:              2
Socket(s):                       1
NUMA node(s):                    1
Vendor ID:                       GenuineIntel
CPU family:                      6
Model:                           79
Model name:                      Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:                        1
CPU MHz:                         2294.685
BogoMIPS:                        4589.37
Hypervisor vendor:               Microsoft
Virtualization type:             full
L1d cache:                       64 KiB
L1i cache:                       64 KiB
L2 cache:                        512 KiB
L3 cache:                        50 MiB
NUMA node0 CPU(s):               0,1
Vulnerability Itlb multihit:     KVM: Mitigation: VMX unsupported
Vulnerability L1tf:              Mitigation; PTE Inversion
Vulnerability Mds:               Mitigation; Clear CPU buffers; SMT Host state unknown
Vulnerability Meltdown:          Mitigation; PTI
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2:        Mitigation; Full generic retpoline, STIBP disabled, RSB filling
Vulnerability Srbds:             Not affected
Vulnerability Tsx async abort:   Mitigation; Clear CPU buffers; SMT Host state unknown
Flags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Vendor :Intel
Architecture :Broadwell
Model Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading hardware capability detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 51200) kbytes
64 byte cache line size
Address Size 48 bits virtual, 46 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@theogf theogf added performance critical Triggers benchmarking CI and removed performance critical Triggers benchmarking CI labels Feb 14, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance critical Triggers benchmarking CI
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants