Skip to content

Patrickjliu/CSE-19-Reports

Repository files navigation

Project Proposal: Applying Reinforcement Learning Algorithms in Simulated Car Lane Following

Project Description

The goal of this project is to apply reinforcement learning (RL) algorithms to enable simulated cars to autonomously follow lanes. We will focus primarily on policy gradient (PG) methods, which have shown promise in autonomous driving scenarios. The project aims to explore the use of the proximal policy optimization (PPO) algorithm as a potentially improved PG algorithm.

Objectives

  • Explore the application of reinforcement learning algorithms in the context of simulated car lane following.
  • Implement and evaluate different policy gradient algorithms, including REINFORCE, Advantage Actor-Critic (A2C), and Proximal Policy Optimization (PPO).
  • Assess the performance, stability, and convergence properties of the implemented algorithms.
  • Compare and analyze the effectiveness of PPO against simpler PG algorithms in simulated car lane following tasks.
  • Provide recommendations and insights for future improvements and research in the field of autonomous driving using reinforcement learning.

Project Tasks

  1. Environment Selection: Each intern will choose a gym environment, such as Pac-Man, spider walking, and car racing. They will thoroughly explore their selected environment, studying its characteristics, reward structures, and state representations.

  2. Algorithm Implementation: Interns will implement the selected RL algorithms (REINFORCE, A2C, PPO) in Python. They will design neural network models and train them using the algorithms.

  3. Evaluation and Analysis: The implemented algorithms will be evaluated based on their performance in the chosen gym environments. Interns will analyze key metrics such as convergence rate, stability, and overall task completion to assess the effectiveness of each algorithm.

  4. Comparison and Discussion: Interns will compare the performance of PPO against the simpler PG algorithms (REINFORCE, A2C) in terms of training efficiency, stability, and generalization capabilities. They will discuss the advantages and limitations of each algorithm and provide insights into the suitability of PPO for simulated car lane following.

Conclusion

This project offers an exciting opportunity to explore and apply reinforcement learning algorithms in the context of simulated car lane following. By implementing and evaluating various policy gradient methods, we aim to gain insights into the strengths and limitations of these algorithms in autonomous driving scenarios. The project outcomes will contribute to the growing field of reinforcement learning and pave the way for further research and improvements in the domain of autonomous vehicles.

If you have any questions or suggestions, please feel free to reach out.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published