fix: term vs trunc #983

sash-a · 2024-01-09T15:04:12Z

Note

This is quite confusing so please check this carefully and make sure I haven't mixed up where to use each one!

What?

Mostly explained in #951. We are mixing up termination and truncation. This is especially prevalent in recurrent systems, but all systems have this issue.

Now very explicit about termination vs truncation, removed all mentions of done.
Done attribute remove from RnnLearnerState and replaced with truncated in the PpoTransition.

How?

Explicitly name variables termination and trunctation instead of done

OmaymaMahjoub

Thanks @sash-a! I will continue the review once we verify the recurrent systems performance :)

OmaymaMahjoub · 2024-01-16T08:46:30Z

mava/advanced_usage/ff_ippo_store_experience.py

+            trunc = jnp.repeat(timestep.last(), config.system.num_agents)
+            trunc = trunc.reshape(config.arch.num_envs, -1)
+            term = 1 - timestep.discount


If you can add documentation to timestep.last() and timestep.discount

OmaymaMahjoub · 2024-01-16T08:50:41Z

mava/systems/ff_ippo.py

            info = {
                "episode_return": env_state.episode_return_info,
                "episode_length": env_state.episode_length_info,
            }

            transition = PPOTransition(
-                done, action, value, timestep.reward, log_prob, last_timestep.observation, info
+                terminal=term,
+                truncated=trunc,


Random question, why do we include truncation if we don't use it 😅

OmaymaMahjoub · 2024-01-16T09:04:34Z

mava/systems/rec_ippo.py

I really like the changes of removing last_done from the learner_state

OmaymaMahjoub · 2024-01-16T09:06:24Z

mava/systems/rec_ippo.py

                hstates,
            ) = learner_state

            rng, policy_rng = jax.random.split(rng)

+            last_trunc = jnp.repeat(last_timestep.last(), n_agents).reshape(n_envs, -1)


if you can add documentation # Add last_trunc to the input to the network.

sash-a added 3 commits January 9, 2024 15:54

feat: ff systems now use trunc and term instead of done

c03e838

feat: truncate/terminate bugfix for rec ippo

4514895

fix: truncated added to PPOTransition

88b1f4b

sash-a self-assigned this Jan 9, 2024

pull-request-size bot added the size/L label Jan 9, 2024

sash-a added the bug Something isn't working label Jan 9, 2024

sash-a added 7 commits January 9, 2024 17:09

Merge branch 'develop' into fix/term-vs-trunc

f293e5d

Merge branch 'develop' into fix/term-vs-trunc

68db7cd

feat: rec mappo term/trunc fix

0c1396d

feat: remove truncated from RnnLearnerState for rec mappo

2fdd0f5

feat: remove truncated from RnnLearnerState for ff mappo

ad9b2fc

feat: term/trunc distinction for vault recording

8e79be5

chore: merge develop

fb74b0a

sash-a marked this pull request as ready for review January 10, 2024 13:37

sash-a requested review from arnupretorius, DriesSmit, RuanJohn, jcformanek, siddarthsingh1, OmaymaMahjoub, ulricharmel, callumtilbury and WiemKhlifi as code owners January 10, 2024 13:37

Merge branch 'develop' into fix/term-vs-trunc

0ab4bea

OmaymaMahjoub added the benchmark required Docker images get pushed if PR has this label. label Jan 15, 2024

sash-a added the priority/high label Jan 15, 2024

OmaymaMahjoub reviewed Jan 16, 2024

View reviewed changes

sash-a marked this pull request as draft February 16, 2024 10:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: term vs trunc #983

fix: term vs trunc #983

sash-a commented Jan 9, 2024 •

edited

OmaymaMahjoub left a comment

OmaymaMahjoub Jan 16, 2024

OmaymaMahjoub Jan 16, 2024

OmaymaMahjoub Jan 16, 2024

OmaymaMahjoub Jan 16, 2024

fix: term vs trunc #983

Are you sure you want to change the base?

fix: term vs trunc #983

Conversation

sash-a commented Jan 9, 2024 • edited

Note

What?

How?

OmaymaMahjoub left a comment

Choose a reason for hiding this comment

OmaymaMahjoub Jan 16, 2024

Choose a reason for hiding this comment

OmaymaMahjoub Jan 16, 2024

Choose a reason for hiding this comment

OmaymaMahjoub Jan 16, 2024

Choose a reason for hiding this comment

OmaymaMahjoub Jan 16, 2024

Choose a reason for hiding this comment

sash-a commented Jan 9, 2024 •

edited