RMAD

Reverse-mode auto-differentiation for Nim. (a.k.a - autograd)

Status

STALLED

Nim-rmad works great for scalars. I don't think there is a need for a pure scalar non-vector/non-matrix autograd however. Feel free to prove me wrong by either asking features, doing PR or adopting/forking this package.

Autograd development on matrices/vectors/n-dimensional tensors is done in Arraymancer/autograd.

Goal

Neural networks pass an input through a series of transformations (layers) to get an output value called forward pass.

During the learning phase, neural networks needs to know the contribution of each transformation steps with regards to each input values so its weights can be adjusted. This is the backward propagation pass, a.k.a. backprop, which retrieves the gradients of each transformations with regards to each input values. Those will be used to adjust the weights in the proper direction and a reasonable amplitude.

RMAD, also called autograd is an efficient way to automatically differentiate any transformation applied during the forward pass.

Examples

Examples are available in the examples folder. You will find examples from differentiating addition and substraction to building a Support Vector machine and a neural network with 2-hidden layers.

Here is a port of Andrej Karpathy's Javascript old Neural Network tutorial.

let ctx = newContext[float32]()

let
    a = ctx.variable(1)
    b = ctx.variable(2)
    c = ctx.variable(-3)
    x = ctx.variable(-1)
    y = ctx.variable(3)

proc forwardNeuron[T](a,b,c,x,y: T): T =
    let
        ax = a * x
        by = b * y
        axpby = ax + by
        axpbypc = axpby + c
        s = axpbypc.sigmoid()
    return s

var s = forwardNeuron(a,b,c,x,y)

echo s.value() # 0.8807970285415649

let gradient = s.grad()
echo gradient.wrt(a) # -0.1049936264753342
echo gradient.wrt(b) # 0.3149808645248413
echo gradient.wrt(c) # 0.1049936264753342
echo gradient.wrt(x) # 0.1049936264753342
echo gradient.wrt(y) # 0.2099872529506683

Strengths

Everything can be differentiated as long as it can be computed, even a for-loop. This is a key advantage for Recurrent Neural Networks. Plus, if you implement your own layer you don't need to derive the function yourself.

Weaknesses

In place operation of variables (including changing its value) is not supported. As Facebook says, it is a hard matter.

Todo

Support matrices and vectors
Setting up continuous integration
GPU support
Support gradient freezing (for transfer learning)

Low priority

Detect and prevent operation with variables from different contexts: #1
Tests & profiling: memory usage, speed - overhead, bottlenecks, inline procs/template
Benchmarks: comparison with PyTorch, TensorFlow, Theano, HIPS/autograd
Support 2nd order and n-th order differentiation
Computation graph pretty-printing
Canonical form and rewrite rules: 2 / (1 + e^-x) => 2 * x.sigmoid()

License

The Apache License version 2

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
examples		examples
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO_Pending_Nim_features.md		TODO_Pending_Nim_features.md
rmad.nim		rmad.nim

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

src

src

tests

tests

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

TODO_Pending_Nim_features.md

TODO_Pending_Nim_features.md

rmad.nim

rmad.nim

Repository files navigation

RMAD

Status

Goal

Examples

Strengths

Weaknesses

Todo

License

About

Releases

Packages

Languages

License

mratsim/nim-rmad

Folders and files

Latest commit

History

Repository files navigation

RMAD

Status

Goal

Examples

Strengths

Weaknesses

Todo

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages