Skip to content

Course Project - Advanced Topics in Machine Learning - Autumn Semester 2023 - Indian Institute of Technology Bombay

Notifications You must be signed in to change notification settings

rohankalbag/deep-recurrent-q-learning-for-pomdps

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deep Recurrent Q-Learning for Partially Observable Markov Decision Processes

EE782 : Advanced Topics in Machine Learning

Abstract

This project presents our unique implementation of Deep Recurrent Q-Learning (DRQL) that incorporates Transfer Learning for feature extraction, a customized LSTM for temporal recurrence, and a domain-informed reward function. This tailored approach aims to expedite convergence compared to the vanilla implementation outlined in the original paper. The performance evaluation focuses on two adaptive Atari 2600 games: Assault-v5 and Bowling, where game difficulty scales with player proficiency. Comparative analysis between the convergence of our optimized reward function and the vanilla version is conducted using StepLR and CosineAnnealingLR learning rate schedulers, complemented by theoretical explanations. Additionally, an efficient windowed episodic memory implementation employing bootstrapped sequential updates is proposed to optimize GPU memory utilization

Watch our agent play in action

Assault-v5 Bowling

Environment Setup

python3 -m venv mlproj
source mlproj/bin/activate
pip install -r requirements.txt

Link to Jupyter Notebook Detailed Report with Code, Experimentation and Results

Collaborators:

  • Rohan Kalbag
  • Vansh Kapoor
  • Sankalp Bhamare

About

Course Project - Advanced Topics in Machine Learning - Autumn Semester 2023 - Indian Institute of Technology Bombay

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published