
Inverse Reinforcement Learning Experiments

This repository implements and showcases experiments based on the paper "Algorithms for Inverse Reinforcement Learning" by Ng & Russell (2000).

Experiment Descriptions

1. 5 x 5 Grid World

In the initial experiment, a 5 x 5 grid world is used. The agent starts from the lower-left grid square and navigates to the absorbing upper-right grid square. The actions correspond to the four compass directions, but with a 30% chance of moving in a random direction instead. The objective is to recover the reward structure given the policy and problem dynamics.

Results:

  • Obtained a reward function that closely approximates the true reward by observing the policy of a trained agent (see the LP sketch below).
  • Also derived a reward function from a given policy.
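
For a finite state space like this grid world, the reward recovery in Ng & Russell (2000) can be posed as a linear program over candidate reward vectors. Below is a minimal, illustrative sketch of that formulation, assuming transition matrices and an observed policy as inputs; the function name, the cvxpy modelling, and the penalty weight are my own choices and may differ from the code in this repository.

```python
# Sketch of the finite-state LP formulation from Ng & Russell (2000).
# Assumptions (not necessarily what this repo does): P has shape
# (n_actions, n_states, n_states), `policy` maps each state to the observed
# optimal action, and the reward is bounded by r_max.
import numpy as np
import cvxpy as cp

def lp_irl(P, policy, gamma=0.9, r_max=1.0, l1=1.1):
    n_actions, n_states, _ = P.shape
    R = cp.Variable(n_states)

    # Transition rows under the observed policy, and (I - gamma * P_pi)^{-1}
    P_pi = np.array([P[policy[s], s] for s in range(n_states)])
    inv_term = np.linalg.inv(np.eye(n_states) - gamma * P_pi)

    constraints = [cp.abs(R) <= r_max]
    margins = []
    for s in range(n_states):
        diffs = []
        for a in range(n_actions):
            if a == policy[s]:
                continue
            # Advantage of the observed action over alternative a
            diff = (P[policy[s], s] - P[a, s]) @ inv_term @ R
            constraints.append(diff >= 0)   # observed policy must stay optimal
            diffs.append(diff)
        margins.append(cp.min(cp.hstack(diffs)))

    # Maximize the worst-case margins while penalizing ||R||_1 (favors simple rewards)
    objective = cp.Maximize(cp.sum(cp.hstack(margins)) - l1 * cp.norm(R, 1))
    cp.Problem(objective, constraints).solve()
    return R.value
```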


2. Mountain Car Task

The second experiment involves the "mountain-car" task, where the goal is to drive the car to the top of the hill. The true, undiscounted reward is -1 per step until the car reaches the goal. The state is the car's position and velocity, and the state space is continuous.

[Figure: Mountain Car environment]

Results:

  • Approximating the reward as a linear combination of 26 Gaussian-shaped basis functions of the car's position, the algorithm recovers a reward function that captures the structure of the true reward (see the sketch below).

[Figure: Obtained reward function]
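
The reward parameterization in this experiment lends itself to a short sketch: the learned reward is a linear combination of Gaussian basis functions of the car's position. The centers, width, and names below are illustrative assumptions rather than the exact values used in the repository.

```python
# Illustrative Gaussian-basis reward for the mountain-car position.
# N_BASIS, the centers, and the width are assumptions for this sketch.
import numpy as np

N_BASIS = 26
centers = np.linspace(-1.2, 0.6, N_BASIS)   # standard mountain-car position range
width = centers[1] - centers[0]

def phi(position):
    """Feature vector: one Gaussian bump per basis center."""
    return np.exp(-0.5 * ((position - centers) / width) ** 2)

def reward_hat(position, alpha):
    """Candidate reward: linear combination of the basis functions."""
    return alpha @ phi(position)

# Example: evaluate a random candidate reward over the position range
alpha = 0.1 * np.random.randn(N_BASIS)
positions = np.linspace(-1.2, 0.6, 100)
r_hat = np.array([reward_hat(p, alpha) for p in positions])
```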

3. Continuous Grid World

The final experiment applies the sample-based algorithm to a continuous version of the 5 x 5 grid world. The state space is [0, 1] × [0, 1], and actions move the agent 0.2 in the intended direction with added noise. The true reward is 1 in a non-absorbing square [0.8, 1] × [0.8, 1], and 0 everywhere else.

Results:

  • Using linear combinations of two-dimensional Gaussian basis functions, the algorithm produces reasonable solutions (see the sketch below).
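
As a rough sketch, the two-dimensional basis can be built from Gaussians centered on an evenly spaced grid over the unit square; the grid size, width, and names here are assumptions, and the weights alpha are what the sample-based algorithm actually optimizes.

```python
# Illustrative 2-D Gaussian basis over the continuous grid world [0, 1] x [0, 1].
# GRID and sigma are assumptions for this sketch.
import numpy as np

GRID = 5
cx, cy = np.meshgrid(np.linspace(0, 1, GRID), np.linspace(0, 1, GRID))
centers = np.stack([cx.ravel(), cy.ravel()], axis=1)   # (GRID * GRID, 2)
sigma = 1.0 / GRID

def phi(state):
    """Feature vector: one 2-D Gaussian bump per grid center."""
    d2 = np.sum((centers - np.asarray(state)) ** 2, axis=1)
    return np.exp(-0.5 * d2 / sigma ** 2)

def reward_hat(state, alpha):
    """Candidate reward: linear combination of the 2-D basis functions."""
    return alpha @ phi(state)

# Example: a reward concentrated near the goal corner [0.8, 1] x [0.8, 1]
alpha = phi([0.9, 0.9])
print(reward_hat([0.9, 0.9], alpha), reward_hat([0.1, 0.1], alpha))
```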

Further Reading

Feel free to explore my introductory presentation on Inverse Reinforcement Learning (IRL) for an overview of the experiments conducted.

References

  • Ng, A. Y., & Russell, S. (2000). Algorithms for Inverse Reinforcement Learning. In Proceedings of the Seventeenth International Conference on Machine Learning (ICML).
  • ShivinDass. inverse_rl. GitHub repository.
  • Neka-Nat. inv_rl: Inverse reinforcement learning algorithms. GitHub repository.
