gradient_estimator

This algorithm are focused on how to efficiently estimate an gradient.

The gradient estimator is important in application as in many cases, we try to optimize (min/max) our objective function use gradient descent method.

However, we might encounter some challenges (1) estimator is biased. (2) High Variances. (3) Discrete random variable (can't use reparameterization trick directly) (4) Expected Function un-differentiable

In this project, we aim to solve all of above troubles by introduce RELAX estimator.

Please see PPT for details of the algorithm

Reference: "Backpropagation through the Void: Optimizing control variates for black-box gradient estimation"

Reference: "REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models"

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
grad_est.pdf		grad_est.pdf
my_reLAX.py		my_reLAX.py
reLax_tf.py		reLax_tf.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

grad_est.pdf

grad_est.pdf

my_reLAX.py

my_reLAX.py

reLax_tf.py

reLax_tf.py

Repository files navigation

gradient_estimator

About

Releases

Packages

Languages

ElleryL/gradient_estimator

Folders and files

Latest commit

History

Repository files navigation

gradient_estimator

About

Resources

Stars

Watchers

Forks

Languages