
Learning to Race using RL PPO and MPC

Project Overview

This project compares the performance of a Reinforcement Learning (RL) agent with a Model Predictive Control (MPC) approach for autonomous racing in the F1Tenth simulated environment. The goal was to develop an autonomous racing agent that completes laps around the F1Tenth example track as quickly as possible. Two main approaches were explored in parallel:

Reinforcement Learning (RL): An agent was trained using stable_baselines3's Proximal Policy Optimization (PPO) on the F1Tenth gym environment. The final agent achieved a lap time of 16.58 seconds on the example track.

Model Predictive Control (MPC): Two vehicle dynamics models were implemented: a unicycle model and a kinematic bicycle model. The MPC solutions tracked the centerline waypoints while optimizing for minimum lap time under constraints such as steering limits. The unicycle model achieved a lap time of 8.7 seconds and the kinematic bicycle model 10.81 seconds on the example track.
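For reference, the two prediction models can be summarized by their discrete-time update equations. The sketch below is a minimal Euler discretization of both models; the timestep, wheelbase, and state/control ordering are assumptions for illustration and are not necessarily the values used in the notebook.

```python
import numpy as np

def unicycle_step(state, control, dt=0.05):
    """One Euler step of the unicycle model.
    state = [x, y, heading, speed], control = [yaw_rate, accel]."""
    x, y, theta, v = state
    omega, a = control
    return np.array([x + v * np.cos(theta) * dt,
                     y + v * np.sin(theta) * dt,
                     theta + omega * dt,
                     v + a * dt])

def kinematic_bicycle_step(state, control, dt=0.05, wheelbase=0.33):
    """One Euler step of the kinematic bicycle model.
    state = [x, y, heading, speed], control = [steering_angle, accel].
    The 0.33 m wheelbase is a typical F1Tenth value, used here only for illustration."""
    x, y, theta, v = state
    delta, a = control
    return np.array([x + v * np.cos(theta) * dt,
                     y + v * np.sin(theta) * dt,
                     theta + (v / wheelbase) * np.tan(delta) * dt,
                     v + a * dt])
```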

The project involved setting up the simulation environments, defining reward functions, tuning hyperparameters, and addressing challenges in dynamics modeling and constraint handling. This project was a collaboration among myself, @EdwardShiBerkely, @FahimChoudhury007, and @yashopadmin.
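As an illustration of the reward-shaping step mentioned above, the snippet below is a hypothetical progress-based reward: it pays the agent for arc-length progress along the centerline, charges a small per-step cost to discourage idling, and penalizes collisions. The function name, weights, and signals are illustrative and are not the exact reward defined in rl_train_test.py.

```python
def lap_progress_reward(prev_s, curr_s, collided, step_penalty=0.01):
    """Hypothetical shaped reward for an F1Tenth-style racing environment.
    prev_s / curr_s: arc-length position along the centerline before/after the step.
    collided: whether the simulator reported a crash this step."""
    reward = (curr_s - prev_s) - step_penalty   # reward forward progress, discourage idling
    if collided:
        reward -= 10.0                          # illustrative crash penalty
    return reward
```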

How to Run (Reinforcement Learning)

First, clone the repository. Then, in a terminal window, create a new conda environment, activate it, and install the required packages:

conda create -n <name_of_env> python=3.8
conda activate <name_of_env>
cd ME292B_FinalProject
pip install -r requirements.txt

Training (Train using PPO)

python rl_train_test.py --run train 
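Under the hood this follows the standard stable_baselines3 PPO training pattern. The sketch below shows that pattern in minimal form; the environment id, map keyword arguments, hyperparameters, and checkpoint paths are assumptions, and rl_train_test.py adds its own wrappers and reward shaping on top.

```python
import gym
from stable_baselines3 import PPO
from stable_baselines3.common.callbacks import EvalCallback

# Assumed environment id/kwargs; the project script builds and wraps the
# F1Tenth gym environment itself (observation handling, reward shaping, etc.).
env = gym.make("f110_gym:f110-v0", map="example_map", num_agents=1)

model = PPO("MlpPolicy", env, learning_rate=3e-4, n_steps=2048, verbose=1)

# Evaluate periodically and keep the best checkpoint, mirroring the
# "test best saved model" step described below.
eval_cb = EvalCallback(env, best_model_save_path="./best_model/", eval_freq=10_000)

model.learn(total_timesteps=250_000, callback=eval_cb)
model.save("ppo_f1tenth_final")
```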

Testing (Test best saved model from training)

python rl_train_test.py --run test --model_path <path/to/model>

Quick Run Best Model (Best results)

python rl_best_model.py 
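For reference, rolling out a saved stable_baselines3 PPO checkpoint generally looks like the sketch below; the environment id and model path are placeholders, and rl_best_model.py bundles its own environment setup and checkpoint.

```python
import gym
from stable_baselines3 import PPO

env = gym.make("f110_gym:f110-v0", map="example_map", num_agents=1)  # assumed id/kwargs
model = PPO.load("path/to/best_model.zip")                           # placeholder path

obs = env.reset()
done = False
while not done:
    action, _ = model.predict(obs, deterministic=True)  # greedy action at test time
    obs, reward, done, info = env.step(action)
    env.render()
```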

How to Run (Model Predictive Control)

Open the notebook MPC_Laptime_Racing.ipynb.

Run all the cells in order:
- For the 4th cell, make sure the file paths are correct to avoid errors.
- For the 6th cell, choose the track and dynamics model (see the sketch below).
- The 7th cell plots the MPC results.
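To make the approach concrete, the sketch below is a minimal receding-horizon loop for centerline tracking with the kinematic bicycle model: optimize a short control sequence, apply only the first control, and repeat. It uses scipy single shooting as a stand-in for the notebook's solver; the horizon, timestep, cost weights, and actuation bounds are illustrative values, not the project's.

```python
import numpy as np
from scipy.optimize import minimize

H, DT = 10, 0.1                       # horizon length and timestep (assumed)
STEER_MAX, ACCEL_MAX = 0.4, 3.0       # illustrative actuation limits
WHEELBASE = 0.33                      # illustrative wheelbase [m]

def bicycle_step(s, u, dt=DT, L=WHEELBASE):
    """One Euler step of the kinematic bicycle model; s = [x, y, heading, speed]."""
    x, y, th, v = s
    d, a = u
    return np.array([x + v * np.cos(th) * dt,
                     y + v * np.sin(th) * dt,
                     th + (v / L) * np.tan(d) * dt,
                     v + a * dt])

def tracking_cost(controls, state, waypoints):
    """Penalize distance to the next H centerline waypoints and mildly reward speed."""
    cost, s = 0.0, np.asarray(state, dtype=float)
    for k, u in enumerate(controls.reshape(H, 2)):
        s = bicycle_step(s, u)
        cost += np.sum((s[:2] - waypoints[k]) ** 2)   # stay near the centerline
        cost -= 0.1 * s[3]                            # incentive to carry speed
    return cost

def mpc_control(state, waypoints):
    """waypoints: array of shape (H, 2) with the upcoming centerline points."""
    u0 = np.zeros(2 * H)
    bounds = [(-STEER_MAX, STEER_MAX), (-ACCEL_MAX, ACCEL_MAX)] * H
    res = minimize(tracking_cost, u0, args=(state, waypoints), bounds=bounds)
    return res.x[:2]                                  # apply only the first [steer, accel]
```

Swapping `bicycle_step` for the unicycle update gives the unicycle variant described in the overview.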

Results

The results from RL and MPC are shown below. The RL agent completes the example track in 16.58 seconds, while the kinematic bicycle model MPC finishes it in 10.81 seconds and the unicycle model MPC in 8.7 seconds.

RL Result

RL training result on the example track (250,000 training steps)

RL result on the example track (5,000,000 training steps, following the raceline)

MPC Result

MPC results on three tracks: Example Track, Brands Hatch, and IMS

Acknowledgements

  1. F1Tenth Gym environment and documentation.
  2. Stable Baselines3 documentation and examples of PPO usage.

All other citations are in the project report under "References".