Generalized Population-Based Training With Pairwise Learning (GPBT-PL)

Code for the Generalized Population-Based Training With Pairwise Learning (GPBT-PL) algorithm, from the paper "Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning".

The GPBT framework is built on Ray. GPBT-PL is heavily inspired by the Ray Tune PBT example and is included in the ray.tune library, which is the officially supported implementation.
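For readers new to PBT, the following is a minimal sketch of the classic exploit-and-explore step in plain Python. It is not the GPBT-PL update rule (that lives in gpbt_pl.py and the paper) and it does not use Ray; the function name, the dictionary layout, the quantile, and the perturbation factors are all illustrative assumptions.

```python
import random


def exploit_and_explore(population, rng, quantile=0.25, factors=(0.8, 1.2)):
    """One classic PBT step on a list of {"score", "hparams"} dicts.

    Bottom-quantile members copy the hyperparameters of a random
    top-quantile member (exploit), then scale each value by a random
    factor (explore). Model weights are omitted to keep the sketch short.
    """
    ranked = sorted(population, key=lambda m: m["score"], reverse=True)
    cutoff = max(1, int(len(ranked) * quantile))
    top, bottom = ranked[:cutoff], ranked[-cutoff:]
    for member in bottom:
        donor = rng.choice(top)
        member["hparams"] = {
            name: value * rng.choice(factors)
            for name, value in donor["hparams"].items()
        }
    return population


# Hypothetical population of 8 members with one hyperparameter each.
rng = random.Random(0)
population = [
    {"score": float(i), "hparams": {"lr": 1e-3 * (i + 1)}} for i in range(8)
]
exploit_and_explore(population, rng)
```

With eight members and a quantile of 0.25, the two lowest-scoring members take a perturbed copy of a learning rate from one of the two highest-scoring members; everyone else is left untouched.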

Running the code

To run the PPO experiment, use the command:

python run_ppo.py

To run the IMPALA experiment, use the command:

python run_impala.py
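To run both experiments back to back, a small launcher along these lines may be convenient. This is only a sketch: the two script names come from this repository, while the `EXPERIMENTS` mapping, `build_command`, `run_all`, and the `runner` hook are hypothetical helpers that are not part of the codebase.

```python
import subprocess
import sys

# Script names are taken from this repository; the mapping itself is illustrative.
EXPERIMENTS = {"ppo": "run_ppo.py", "impala": "run_impala.py"}


def build_command(name):
    """Return the argv list for one experiment, using the current interpreter."""
    return [sys.executable, EXPERIMENTS[name]]


def run_all(names=("ppo", "impala"), runner=subprocess.run):
    """Run the experiments in order; check=True stops at the first failure."""
    for name in names:
        runner(build_command(name), check=True)
```

Calling `run_all()` from the repository root runs PPO first and then IMPALA. The `runner` argument exists only so the launch order can be checked without starting a real training run.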

Citing GPBT-PL

@article{bai2024generalized,
  title={Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning},
  author={Hui Bai and Ran Cheng},
  journal={IEEE Transactions on Emerging Topics in Computational Intelligence},
  publisher={IEEE},
  year={2024},
  doi={10.1109/TETCI.2024.3389777}
}

