
Optimizing algorithms code, refactoring acceleration directions #55

Merged
merged 8 commits on Nov 25, 2021

Conversation

lostella
Member

@lostella lostella commented Nov 23, 2021

This PR:

  1. Optimizes the algorithm implementations through various fixes (partially addressing "Further allocation savings in FBS, PANOC" #21)
  2. Refactors acceleration directions for the algorithms that support them (the line-search ones)

Because of 2, this PR is slightly breaking.

Optimizations

Changes include:

  • Avoiding temporary allocations in favor of pre-allocated buffers (arrays) in the state
  • Removing occurrences of variables changing type within their scope
  • Making sure all type parameters are concrete (mainly by avoiding unions)
  • Fixing type inference issues (and adding checks for them in tests)

Such changes result in a significant reduction in allocations, as the benchmarks show, and consequently in runtime. The effect is more apparent in some algorithms than others, and definitely more apparent on small problems than large ones (where the overhead is less significant).
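To illustrate the first point, here is a minimal sketch (not the package's actual code; the function names are illustrative) of the buffer-reuse pattern: per-iteration temporaries get replaced by a buffer pre-allocated once in the state and overwritten in place.

```julia
# Allocating version: creates a fresh array on every call.
step(x, gamma, grad) = x .- gamma .* grad

# In-place version: writes into a buffer pre-allocated once in the state.
function step!(z, x, gamma, grad)
    @. z = x - gamma * grad    # fused broadcast, no temporaries
    return z
end

x = randn(1000); grad = randn(1000)
z = similar(x)                 # allocated once, reused across iterations
step!(z, x, 0.1, grad)
```

The in-place version performs zero allocations per iteration, which is exactly the overhead that dominates on small problems.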

Refactoring

Acceleration strategies now have different "styles". Each style says something about how the underlying object should be used. For example:

  • NesterovStyle denotes sequences of extrapolation coefficients
  • QuasiNewtonStyle denotes linear operators that can be updated

Wherever needed, algorithms can dispatch on the style and use the underlying object accordingly to compute acceleration directions.

A good example of how this is used is DRLS, which supports both of the above styles.
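A rough sketch of the trait mechanism (type and function names here are illustrative, not necessarily the package's actual API): each acceleration type declares its style, and an algorithm branches on the style rather than on the concrete type.

```julia
abstract type AccelerationStyle end
struct NesterovStyle <: AccelerationStyle end
struct QuasiNewtonStyle <: AccelerationStyle end

struct NesterovSequence end        # yields extrapolation coefficients
struct QuasiNewtonOperator end     # an updatable linear operator

acceleration_style(::NesterovSequence) = NesterovStyle()
acceleration_style(::QuasiNewtonOperator) = QuasiNewtonStyle()

# An algorithm like DRLS dispatches on the style, not the concrete type:
direction(accel, res) = direction(acceleration_style(accel), accel, res)
direction(::NesterovStyle, accel, res) = res       # extrapolation step
direction(::QuasiNewtonStyle, accel, res) = -res   # apply the operator
```

This is the standard "Holy trait" pattern in Julia: adding a new acceleration type only requires defining its style, with no changes to the algorithms.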

A (good) side effect of this change is that one no longer needs to provide the size of the decision variable at "configuration time": one specifies L-BFGS with memory 5 as LBFGS(5) instead of LBFGS(x, 5) (where x was usually the initial iterate, used here only to allocate buffers of the appropriate size).
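A sketch of why this works (with hypothetical names, not the package's actual internals): the acceleration object now stores only its parameters, and buffers are allocated lazily once an iterate, with its size and element type, is available.

```julia
struct LBFGS
    memory::Int                 # only parameters at configuration time
end

struct LBFGSState{T}
    s::Matrix{T}                # buffers sized from the iterate
end

# Buffers are created when the algorithm first sees the iterate x.
initialize(accel::LBFGS, x) = LBFGSState(similar(x, length(x), accel.memory))

accel = LBFGS(5)                # no iterate needed anymore
state = initialize(accel, randn(100))
```

Deferring allocation to `initialize` also means the same configured object can be reused across problems of different sizes.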

Other changes

The default maximum number of iterations for SFISTA was increased to 10,000, in line with the other proximal gradient implementations.

@codecov

codecov bot commented Nov 23, 2021

Codecov Report

Merging #55 (e3181da) into master (2a04316) will increase coverage by 0.33%.
The diff coverage is 96.00%.

❗ Current head e3181da differs from pull request most recent head ba692d1. Consider uploading reports for the commit ba692d1 to get more accurate results

@@            Coverage Diff             @@
##           master      #55      +/-   ##
==========================================
+ Coverage   88.65%   88.99%   +0.33%     
==========================================
  Files          20       21       +1     
  Lines         802      854      +52     
==========================================
+ Hits          711      760      +49     
- Misses         91       94       +3     
Impacted Files Coverage Δ
src/ProximalAlgorithms.jl 100.00% <ø> (ø)
src/algorithms/douglas_rachford.jl 95.65% <ø> (ø)
src/algorithms/primal_dual.jl 63.55% <ø> (ø)
src/accel/traits.jl 75.00% <75.00%> (ø)
src/algorithms/zerofpr.jl 92.00% <86.36%> (-3.72%) ⬇️
src/accel/nesterov.jl 90.90% <90.90%> (+2.02%) ⬆️
src/algorithms/panoc.jl 95.65% <97.05%> (-0.86%) ⬇️
src/algorithms/drls.jl 95.00% <97.29%> (+0.17%) ⬆️
src/accel/anderson.jl 100.00% <100.00%> (+6.66%) ⬆️
src/accel/broyden.jl 100.00% <100.00%> (ø)
... and 7 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2a04316...ba692d1. Read the comment docs.

@lostella
Member Author

There are some type inference errors on Julia 1.5; I'll look into those.

@lostella
Member Author

lostella commented Nov 23, 2021

y::Tx = similar(x)
r::Tx = similar(x)
z::Tx = similar(x)
res::Tx = similar(x)
Collaborator

Is this completely safe? Sometimes similar returns NaNs

Member Author

Should be safe, as long as the array is initialized before it's used; otherwise there's a problem in Julia Base!

The thing is that similar is faster than zero or copy (I can get numbers on how much faster). Actually, I should check whether there are other places where similar could be used!

Collaborator

check this out:

julia> x = zeros(10); any([any(isnan.(similar(x))) for i = 1:1000])
true

Collaborator

I just hate bugs involving 'NaN's. 😆 Is there really a significant speed up?

Member Author

I’ll check, I’m just obsessing a bit over getting rid of any superfluous operation 😅

Member Author

@lostella lostella Nov 24, 2021

For n = 1000 and using Float64, similar is about 4x faster than zero (I'm looking at the median). The time ratio gets larger as n increases (which makes sense)

julia> n = 1000; x = randn(n);

julia> @benchmark zero($x)
BenchmarkTools.Trial: 10000 samples with 198 evaluations.
 Range (min … max):  424.631 ns … 42.357 μs  ┊ GC (min … max):  0.00% … 97.30%
 Time  (median):       1.062 μs              ┊ GC (median):     0.00%
 Time  (mean ± σ):     2.132 μs ±  5.725 μs  ┊ GC (mean ± σ):  50.82% ± 17.80%

  ▅█                                                           ▁
  ██▇▅▅▇▃▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▄▃▃▄▄▅▅▅▅▄▅▅▅▆▆▇█▇▇▆▇▇ █
  425 ns        Histogram: log(frequency) by time      33.9 μs <

 Memory estimate: 7.94 KiB, allocs estimate: 1.

julia> @benchmark similar($x)
BenchmarkTools.Trial: 5065 samples with 883 evaluations.
 Range (min … max):  155.805 ns … 10.300 μs  ┊ GC (min … max):  0.00% … 96.60%
 Time  (median):     283.980 ns              ┊ GC (median):     0.00%
 Time  (mean ± σ):     1.112 μs ±  1.958 μs  ┊ GC (mean ± σ):  74.49% ± 34.68%

  █▇▁                                ▂▂▁▁▁                     ▁
  ███▄▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▃▁▆▆▆▄▄████████▇▇▇█▇█▇██▇▇█▇▇▇▇▇▆ █
  156 ns        Histogram: log(frequency) by time      7.48 μs <

 Memory estimate: 7.94 KiB, allocs estimate: 1.

x_d::Tx = zero(x)
xbar_prev::Tx = similar(x)
d::Tx = similar(x)
x_d::Tx = similar(x)
Collaborator

same as before

Collaborator

@nantonel nantonel left a comment

Just some very minor comments, feel free to discard them!

  • Terminology could be kept consistent by using *Acceleration on the acceleration types
  • I prefer using zero over similar: similar sometimes gives NaNs. I'm sure this doesn't happen here, since you write those vectors in place afterwards. However, sometimes you later change the code assuming, for example, that the vector is full of zeros, and get some annoying bugs.

nantonel
nantonel previously approved these changes Nov 24, 2021
@lostella
Member Author

@nantonel I decided to double down on turning zero into similar: while I understand it may be a source of headaches, it's probably the tests' job to surface any issue with uninitialized arrays. And if an array doesn't get properly initialized, it might even be better to have some NaN or Inf spread around and fail a test than to silently have an algorithm apparently "work" (but likely not work as intended).
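For what it's worth, the failure mode being argued for here can be seen in a small sketch: a NaN leaking out of an uninitialized buffer poisons every value it touches, so it trips any downstream check loudly instead of silently looking like a valid result.

```julia
# Stand-in for a `similar`-allocated buffer that is (incorrectly) read
# before being initialized: NaN propagates through arithmetic.
buf = fill(NaN, 3)
y = 2.0 .* buf .+ 1.0
any(isnan, y)        # true — the bug surfaces in any test that checks y
```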

One more approval and I’ll merge this! Thanks for the review!

Collaborator

@nantonel nantonel left a comment

Sure! As I told you mine were only very minor suggestions 😉

@lostella lostella merged commit 60d4958 into JuliaFirstOrder:master Nov 25, 2021
@lostella lostella deleted the algo-improvements-20211108 branch November 25, 2021 16:38