Fully deterministic runs #43

jadkins99 · 2023-04-15T21:52:08Z

Awesome repo. quick question,

I ran the DMC WalkerWalk experiment 3 different times with the same seeds and got 3 different learning curves. How can I get reproducible experiments?Awesome repo. quick question,

I ran the DMC WalkerWalk experiment 3 different times with the same seeds and got 3 different learning curves. How can I get reproducible experiments?

danijar · 2023-04-16T19:37:06Z

Hi, are you asking for fully deterministic runs? I haven't paid much attention to this but I think the agent is already fully deterministic, so you'd probably just have to set the environment seed (make sure if you use more than 1 environment instance, that the environments have different seeds so they produce different data).

jadkins99 · 2023-04-18T19:09:44Z

Okay I will try that. Thank you for the quick response! What exactly is an "environment instance"? I couldn't find a clear definition in the paper.

jadkins99 · 2023-04-18T19:27:35Z

Also, how many seeds were the non-Minecraft experiments run for?

subho406 · 2023-04-18T19:33:03Z

+1 on the question above. Maybe it's not that apparent in the paper, could you also provide some clarification on what the confidence intervals denote in the non-minecraft experiments (DMLab, DMC Proprio, Crafter, etc)? Is it std-error across multiple seeds, or std-error across a window of timesteps with a single seed, or something else?

danijar · 2023-04-19T20:36:16Z

It's mean/std across seeds and at least 3 seeds per task, often more.

jadkins99 · 2023-04-25T03:21:55Z

Update: I seeded dmc_control here. And still got non-deterministic runs. Are there other non-environment sources of randomness not seeded?

jadkins99 · 2023-04-26T02:27:16Z

I found some non-seeded randomness in the repo. Namely here and here. Wouldn't these affect the agent?

danijar · 2023-04-27T23:02:22Z

I don't think those two methods are run ever. Could you check e.g. by adding asdf to the two methods to see if it errors?

swannercjj · 2023-08-17T21:46:15Z

Seeding this it removes randomness from the first 1000 steps, but runs are non-deterministic afterwards.

danijar changed the title ~~Reprocibility~~ Fully deterministic runs Apr 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fully deterministic runs #43

Fully deterministic runs #43

jadkins99 commented Apr 15, 2023 •

edited

danijar commented Apr 16, 2023

jadkins99 commented Apr 18, 2023

jadkins99 commented Apr 18, 2023

subho406 commented Apr 18, 2023 •

edited

danijar commented Apr 19, 2023

jadkins99 commented Apr 25, 2023 •

edited

jadkins99 commented Apr 26, 2023 •

edited

danijar commented Apr 27, 2023

swannercjj commented Aug 17, 2023

Fully deterministic runs #43

Fully deterministic runs #43

Comments

jadkins99 commented Apr 15, 2023 • edited

danijar commented Apr 16, 2023

jadkins99 commented Apr 18, 2023

jadkins99 commented Apr 18, 2023

subho406 commented Apr 18, 2023 • edited

danijar commented Apr 19, 2023

jadkins99 commented Apr 25, 2023 • edited

jadkins99 commented Apr 26, 2023 • edited

danijar commented Apr 27, 2023

swannercjj commented Aug 17, 2023

jadkins99 commented Apr 15, 2023 •

edited

subho406 commented Apr 18, 2023 •

edited

jadkins99 commented Apr 25, 2023 •

edited

jadkins99 commented Apr 26, 2023 •

edited