Error in value_iteration function of DP example #3

olmerg · 2018-07-16T15:06:33Z

the code of the example of DP (chapter 3 ), do not run the part of value iteraion because in line 209 you pass the action_values but is necesary to pass V.

action_values = one_step_lookahead(environment, state, V, discount_factor)

The text was updated successfully, but these errors were encountered:

OnurcanKoken · 2022-05-11T16:37:23Z

for the code of (chapter 3), frozen_lake.py, a small change:
print(''.join([action_mappings[action] for action in np.argmax(policy, axis=1)]))

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error in value_iteration function of DP example #3

Error in value_iteration function of DP example #3

olmerg commented Jul 16, 2018

OnurcanKoken commented May 11, 2022

Error in value_iteration function of DP example #3

Error in value_iteration function of DP example #3

Comments

olmerg commented Jul 16, 2018

OnurcanKoken commented May 11, 2022