Updating gradients in-place #61

willtebbutt · 2017-10-17T11:24:09Z

Add support for providing pre-allocated elements of the gradient tape, so that gradients can be accumulated in place. This is currently possible only via a hack of the following form:

using Nabla, NNlib

t = Tape()
W = Leaf(t, rand(5,5))
b = Leaf(t, rand(5))
x = rand(5)

y = σ.(W*x .+ b)

# Previously accumulated gradients.
∇W, ∇b = randn(5, 5), randn(5)

# Create a reverse tape that is empty apart from the seed for y.
rvs = Nabla.reverse_tape(y, randn(5))

# Point entries of tape towards previously allocated arrays.
rvs.tape[W.pos], rvs.tape[b.pos] = ∇W, ∇b

# Run the reverse pass.
Nabla.propagate(y.tape, rvs)

but this is not at all robust and will silently compute gradients incorrectly if one isn't careful.
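
One way a hack like this can fail silently is sketched below. This is an assumption for illustration only, not a confirmed description of Nabla's internals: if a reverse pass accumulates out of place, the slot is rebound to a fresh array and the pre-allocated buffer never sees the update. The `slots1` and `slots2` vectors are hypothetical stand-ins for the tape and use plain Base Julia.

```julia
# Hypothetical stand-in for the reverse tape: a plain vector of slots,
# not Nabla's actual Tape type.
function accumulate_demo()
    δ = fill(2.0, 3)                 # new gradient contribution

    # Case 1: out-of-place accumulation rebinds the slot to a fresh array.
    buf1 = ones(3)                   # pre-allocated, previously accumulated gradient
    slots1 = Any[buf1]
    slots1[1] = slots1[1] + δ        # allocates; buf1 is left untouched

    # Case 2: in-place accumulation writes through to the pre-allocated array.
    buf2 = ones(3)
    slots2 = Any[buf2]
    target = slots2[1]
    target .+= δ                     # mutates buf2

    return buf1, slots1[1], buf2, slots2[1]
end

buf1, slot1, buf2, slot2 = accumulate_demo()
```

In the first case the caller's array still holds the old values even though the slot holds the correct sum, so nothing errors and the stale buffer is easy to miss.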

tmigot · 2023-04-13T11:08:54Z

I am wondering if there was any update on this topic?

willtebbutt · 2023-04-13T11:19:49Z

I'm afraid not, and there is not going to be any progress any time soon. (I've just updated the readme to reflect the fact that no one is maintaining this repo anymore)

tmigot · 2023-04-13T11:32:44Z

Oh, sorry to hear that. Thanks for your answer.

willtebbutt added the enhancement label Oct 17, 2017

willtebbutt self-assigned this Oct 17, 2017

ararslan mentioned this issue Apr 12, 2019 in "Add an internal helper function to do more in-place updating" #145 (merged)
