reinforcement-learning-project

Reinforcement Learning - AI Projects #7

This repository includes a drone environment that does not include any obstacles, and the aim of the drone is to fly to an arbitrary target and hover. It contains the Soft Actor Critic and Hindsight Experience Replay algorithms to train the drone.

To run the code, first download the base drone environment, which can be found here. Then, to run the training code modify the main.py file that includes all hyperparameters and the main training loop.

To start training run

$ python main.py

For GPU add --cuda attribute.

This repository supports tensorboard for logging the values.

The eval folder contains code for evaluating the trained agent visually and extract a dataset for offline learning/imitation learning for RL. The previously created dataset can also downloaded from here. It includes a checkpoint from the previous trainings to run the model.

To start the PyBullet simulation window

$ python eval.py

References

Haarnoja, T., Zhou, A., Abbeel, P. & Levine, S.. (2018). Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. Proceedings of the 35th International Conference on Machine Learning, in Proceedings of Machine Learning Research 80:1861-1870 Available from https://proceedings.mlr.press/v80/haarnoja18b.html.
Andrychowicz, M., Crow, D., Ray, A., Schneider, J., Fong, R., Welinder, P., McGrew, B., Tobin, J., Abbeel, P., & Zaremba, W. (2017). Hindsight Experience Replay. NIPS.
Panerati, J., Zheng, H., Zhou, S., Xu, J., Prorok, A., Studies, A.P., Intelligence, V.I., & Cambridge, U.O. (2021). Learning to Fly—a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control. 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 7512-7519.
Chen, L., Lu, K., Rajeswaran, A., Lee, K., Grover, A., Laskin, M., Abbeel, P., Srinivas, A., & Mordatch, I. (2021). Decision Transformer: Reinforcement Learning via Sequence Modeling. ArXiv, abs/2106.01345.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
decision_transformer		decision_transformer
eval		eval
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
env.py		env.py
her.py		her.py
main.py		main.py
model.py		model.py
replay_memory.py		replay_memory.py
sac.py		sac.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

decision_transformer

decision_transformer

eval

eval

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

env.py

env.py

her.py

her.py

main.py

main.py

model.py

model.py

replay_memory.py

replay_memory.py

sac.py

sac.py

utils.py

utils.py

Repository files navigation

reinforcement-learning-project

References

About

Releases

Packages

Contributors 2

Languages

License

inzva/reinforcement-learning-project

Folders and files

Latest commit

History

Repository files navigation

reinforcement-learning-project

References

About

Resources

License

Stars

Watchers

Forks

Languages