OSRS PvP Reinforcement Learning

This project leverages deep reinforcement learning, specifically PPO, and self-play techniques to develop an AI agent capable of mastering 'no honor' player versus player fights in Old School RuneScape (OSRS) (such as in Last Man Standing).

Project Components:

ML Training System
Game Simulation

The training system launches and manages the simulation when training, though the simulation can be run on its own too. The training system also contains an API to serve trained models.

Note: A third component for evaluating the models in the live game as a third-party client plugin was developed but is not publicly available to prevent affecting the real game.

Showcase

The most recent model version shows incredibly promising results. FineTunedNh maintained a near perfect win rate on the real game in LMS, primarily losing only due to the inherent randomness of the game.

Youtube Video

https://youtu.be/jArLZ8nC5Nw

Evaluation session stats:

Evaluation footage:

View Additional Screenshots

Win rate against scripted baseline throughout training
LMS! (note: this account was used for testing/data collection too, the final trained model has ~99% win rate)
First to 10k+ PvP Arena rank, and reached #1
In-game PvP Arena rank (the game really says this)

How To Use

Clone the repository: git clone https://github.com/Naton1/osrs-pvp-reinforcement-learning.
Follow the README in each of the two subprojects to set up each project individually.

By following these steps, you'll be able to set up the environment and start training your own AI agent. You'll also be able to serve the pre-trained models via an API.

PvP ML

This Python-based component encompasses the core machine learning system. It's the primary interface for training, orchestrating the entire training workflow, and facilitating connections to the simulation RSPS for training sessions. The system also includes a socket-based API for real-time predictions.

Simulation RSPS

The Simulation RSPS project provides a platform for conducting simulated PvP fights, enhancing the training efficiency of the agent. It's built on top of a pre-existing private server: Elvarg RSPS.

Environment Contracts

Environment contracts are defined in contracts/environments, using JSON files that describe the actions and observations within each environment. These contracts are used by the two components listed above.

Existing Environment Contracts

NH Environment: The primary environment for this project
Dharok Edge-Style Environment: A test environment, partially implemented.

Train Custom Model

You can train a customized model on the simulation for any defined environment. To train on a new environment, you can add one (see below).

Experiment parameters are defined in a YAML file. There are several provided configurations. These configurations can be customized to produce train new and unique models.

See start a training job for more detailed information.

Add New Environment

You can define a new environment in addition to the pre-existing environments. Use the already-implemented environments as a reference.

Implement the environment contract at contracts/environments.
Follow the simulation steps to add a new environment.
Create or update a training configuration to use the new environment by setting env-name.

Use Existing Models

To use an existing model via API:

Set up the pvp-ml project.
Run the pvp-ml API.

Evaluate Trained Models

To play against the trained models on the simulation:

Set up the pvp-ml project.
Set up the simulation-rsps project and start the server.
Start up a RSPS client and connect to the server.
Start an agent evaluation session for the model.

Name		Name	Last commit message	Last commit date
Latest commit History 572 Commits
.github/workflows		.github/workflows
assets		assets
contracts/environments		contracts/environments
pvp-ml		pvp-ml
simulation-rsps		simulation-rsps
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OSRS PvP Reinforcement Learning

Showcase

Youtube Video

How To Use

PvP ML

Simulation RSPS

Environment Contracts

Existing Environment Contracts

Train Custom Model

Add New Environment

Use Existing Models

Evaluate Trained Models

About

Uh oh!

Uh oh!

Languages

Naton1/osrs-pvp-reinforcement-learning

Folders and files

Latest commit

History

Repository files navigation

OSRS PvP Reinforcement Learning

Showcase

Youtube Video

How To Use

PvP ML

Simulation RSPS

Environment Contracts

Existing Environment Contracts

Train Custom Model

Add New Environment

Use Existing Models

Evaluate Trained Models

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages