How does Temperature fallback with beam search work? #549

Answered by jongwook
tom-huntington asked this question in Q&A

Both beam search and greedy decoding are deterministic algorithms, so they make sense only at temperature 0. With a nonzero temperature, decoding becomes nondeterministic and the implementation uses the best_of parameter instead (it defaults to 5 in the CLI): it draws best_of independent samples and selects the one with the highest log probability.
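
To make the best_of behaviour concrete, here is a rough toy sketch in plain Python. It is not Whisper's code: the `step_logits` "model" (a fixed list of logits per step, independent of previous tokens), the `decode` and `sample_sequence` helpers, and the `seed` argument are all hypothetical names made up for illustration. The sketch also assumes that candidates are ranked by their log probability under the untempered distribution, and it shows only greedy decoding for the temperature 0 case, not beam search.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over logits scaled by 1/temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_sequence(step_logits, temperature, rng):
    """Sample one token per step from the tempered distribution.

    Returns (tokens, sum_logprob). The score is accumulated under the
    untempered distribution, which is an assumption of this sketch.
    """
    tokens, sum_logprob = [], 0.0
    for logits in step_logits:
        probs = softmax(logits, temperature)
        token = rng.choices(range(len(logits)), weights=probs)[0]
        tokens.append(token)
        sum_logprob += math.log(softmax(logits)[token])
    return tokens, sum_logprob


def decode(step_logits, temperature, best_of=5, seed=0):
    """Toy decoder: greedy at temperature 0, best-of-N sampling otherwise."""
    if temperature == 0:
        # Deterministic path: take the argmax at every step.
        tokens = [max(range(len(l)), key=l.__getitem__) for l in step_logits]
        sum_logprob = sum(
            math.log(softmax(l)[t]) for l, t in zip(step_logits, tokens)
        )
        return tokens, sum_logprob

    # Nondeterministic path: draw best_of independent samples and keep
    # the candidate with the highest total log probability.
    rng = random.Random(seed)
    candidates = [
        sample_sequence(step_logits, temperature, rng) for _ in range(best_of)
    ]
    return max(candidates, key=lambda c: c[1])


if __name__ == "__main__":
    toy_logits = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3]]  # hypothetical 2-step model
    print(decode(toy_logits, temperature=0.0))
    print(decode(toy_logits, temperature=1.0, best_of=5))
```

In this toy setup, raising best_of can only keep or improve the score of the selected sample for a given seed, since the extra candidates are added to the same pool that the maximum is taken over.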

Answer selected by jongwook