Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
-
Updated
Apr 1, 2021 - Jupyter Notebook
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
PyDiffGame is a Python implementation of a Nash Equilibrium solution to Differential Games, based on a reduction of Game Hamilton-Bellman-Jacobi (GHJB) equations to Game Algebraic and Differential Riccati equations, associated with Multi-Objective Dynamical Control Systems
Solving high dimensional HJB equation using tensor decomposition
Infinite horizon policy optimization for drone navigation. Graded project for the ETH course "Dynamic Programming and Optimal Control".
CSCI-561 AI Assignments.
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
Q-Learning from scratch in Python
calculate the optimum route in a warehouse using the Q-Learning algorithm (Bellman equation)
Implementation of several algorithms in RL based on Prof. sutton's book
Reinforcement learning
This project aims to explore the basic concepts of Reinforcement Learning using the FrozenLake environment from the OpenAI Gym library.
Q-Value (Reinforcement Learning) on Grid World
Dynamic Optimization project working on an economic model
A visualization tool for policy iteration and value iteration
Iterative Policy Evaluation for the world of linear-equation-solving proofs. Given a policy for how to solve a linear equation, we find the corresponding value function--that is, the function that assigns values to each state.
Find the shortest route using A* algorithm and graphs (Route Planner application)
Evolutionary algorithm to make better trade decisions based on Bellman equation. (Experimental)
A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservation of momentum problem (being equivalent to the derivative of Hamilton-Jacobi equations) in one spatial dimension. We resolve such hyperbolic differential equations using wave-front propagating schemes on a spat…
Design and Implementation of Pac-Man Strategies with Embedded Markov Decision Process in a Dynamic, Non-Deterministic, Fully Observable Environment
Implementation of Policy Iteration and Value Iteration Agents for Taxi game of OpenAI gym
Add a description, image, and links to the bellman-equation topic page so that developers can more easily learn about it.
To associate your repository with the bellman-equation topic, visit your repo's landing page and select "manage topics."