How does Temperature fallback with beam search work? #549
Lines 102 to 128 in 02aa851

If we `need_fallback`, then we try again with the next temperature value.

But the temperature values only seem to affect the … Does the fallback just do exactly the same thing again when decoding with beam search?
Replies: 2 comments 4 replies
So the fallback must always go to greedy decoding.
Both beam search and greedy decoding are deterministic algorithms and make sense only with temperature 0. With nonzero temperature, the implementation becomes nondeterministic and uses the `best_of` parameter, which defaults to 5 in the CLI, where it makes `best_of` independent samples and selects the one with the highest log probability.
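
To make that concrete, here is a rough sketch of how a fallback loop like this could behave. It is not the code from the repository: `options_for`, `pick_best`, `decode_with_fallback`, the `decode` callback and the `logprob_threshold` check are all hypothetical names for illustration, and dropping `beam_size` at nonzero temperature is an assumption based on the explanation above.

```python
from typing import Callable, Dict, Optional, Sequence, Tuple

# (decoded text, average log probability) -- a simplified stand-in result type
Result = Tuple[str, float]


def options_for(temperature: float, base: Dict[str, object]) -> Dict[str, object]:
    """Keep only the options that make sense at the given temperature."""
    opts = dict(base)
    if temperature > 0:
        # Assumption: sampling replaces beam search, so beam_size is dropped.
        opts.pop("beam_size", None)
    else:
        # Deterministic decoding yields one candidate, so best_of is unused.
        opts.pop("best_of", None)
    opts["temperature"] = temperature
    return opts


def pick_best(candidates: Sequence[Result]) -> Result:
    """From independent samples, keep the one with the highest log probability."""
    return max(candidates, key=lambda r: r[1])


def decode_with_fallback(
    decode: Callable[[Dict[str, object]], Result],
    temperatures: Sequence[float],
    base: Dict[str, object],
    logprob_threshold: float = -1.0,  # hypothetical quality check
) -> Optional[Result]:
    """Try each temperature in order until a result passes the quality check."""
    result = None
    for t in temperatures:
        result = decode(options_for(t, base))
        if result[1] >= logprob_threshold:
            break  # good enough, no further fallback needed
    return result
```

With `temperatures = (0.0, 0.2, 0.4)` and both `beam_size` and `best_of` set, the first attempt in this sketch is deterministic (beam search), and only the retries sample, so a retry is never an exact repeat of the first attempt.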