Skip to content

v1.4

Latest
Compare
Choose a tag to compare
@Kaixhin Kaixhin released this 18 Jun 12:47
· 28 commits to master since this release

Pretrained models for data-efficient Rainbow. Reported scores matched for most games (sometimes models are a bit worse, sometimes a bit better).

Alien

Reward Q-values
newplot newplot (1)

Amidar

Reward Q-values
newplot (2) newplot (3)

Assault

Reward Q-values
newplot (4) newplot (5)

Asterix

Reward Q-values
newplot (6) newplot (7)

Bank Heist

Reward Q-values
newplot (8) newplot (9)

Battlezone

Reward Q-values
newplot (10) newplot (11)

Boxing

Reward Q-values
newplot (12) newplot (13)

Breakout

Reward Q-values
newplot (14) newplot (15)

Chopper Command

Reward Q-values
newplot (16) newplot (17)

Crazy Climber

Reward Q-values
newplot (18) newplot (19)

Demon Attack

Reward Q-values
newplot (20) newplot (21)

Freeway

Reward Q-values
newplot (22) newplot (23)

Frostbite

Reward Q-values
newplot (24) newplot (25)

Gopher

Reward Q-values
newplot (26) newplot (27)

H.E.R.O.

Reward Q-values
newplot (28) newplot (29)

James Bond 007

Reward Q-values
newplot (30) newplot (31)

Kangaroo

Reward Q-values
newplot (32) newplot (33)

Krull

Reward Q-values
newplot (34) newplot (35)

Kung-Fu Master

Reward Q-values
newplot (36) newplot (37)

Ms. Pac-Man

Reward Q-values
newplot (38) newplot (39)

Pong

Reward Q-values
newplot (40) newplot (41)

Private Eye

Reward Q-values
newplot (42) newplot (43)

Q*bert

Reward Q-values
newplot (44) newplot (45)

Road Runner

Reward Q-values
newplot (46) newplot (47)

Seaquest

Reward Q-values
newplot (48) newplot (49)

Up'n Down

Reward Q-values
newplot (50) newplot (51)