I want use the function prefix_allowed_tokens_fn, where of basaran's source code shall I modify? #220

zoubaihan · 2023-06-29T08:07:58Z

Hello, we all know that in huggingface transformers' origin model.generate() method, we can set the function paremeterprefix_allowed_tokens_fn to restrict the generate rule. I want to use this function in basaran just like I used in origin model.generate(), could you please tell me where of the source code shall I modify to make the model generation obey my custom prefix_allowed_tokens_fn?

The text was updated successfully, but these errors were encountered:

peakji · 2023-06-29T14:43:22Z

Generation related features can be implemented by modifying StreamModel.generate().

However, the original implementation from HF Transformers may require significant modifications to support streaming. This is also the main obstacle that prevents us from achieving feature parity...

peakji added the question Further information is requested label Jun 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I want use the function prefix_allowed_tokens_fn, where of basaran's source code shall I modify? #220

I want use the function prefix_allowed_tokens_fn, where of basaran's source code shall I modify? #220

zoubaihan commented Jun 29, 2023

peakji commented Jun 29, 2023

I want use the function prefix_allowed_tokens_fn, where of basaran's source code shall I modify? #220

I want use the function prefix_allowed_tokens_fn, where of basaran's source code shall I modify? #220

Comments

zoubaihan commented Jun 29, 2023

peakji commented Jun 29, 2023