Llama3-8b-Instruct won't stop generating #442

ekim322 · 2024-04-27T23:56:28Z

System Info

lorax-client==0.5.0

Information

Docker
The CLI directly

Tasks

An officially supported command
My own modifications

Reproduction

.

Expected behavior

I use below code to get LLM response.

pb = Predibase(api_token=os.environ.get("PREDIBASE_API_TOKEN"))
lorax_client = pb.deployments.client("llama-3-8b-instruct")
lorax_client.generate(
    prompt,
    adapter_id="ekim322/cpAdapter",
    adapter_source="hub",
    api_token=os.environ.get("HF_W_TOKEN"),
    max_new_tokens=512,
).generated_text

Llama3 keeps generating tokens until max_new_tokens. It looks like the eos_token_id is never registered.
I had similar issue running locally, and updating transformers to >4.40 solved the issue. Issue related to llama3b and llama3b-instruct having different eos_tokens.

I tried setting stop_sequence

lorax_client.generate(
    prompt,
    adapter_id="ekim322/cpAdapter",
    adapter_source="hub",
    api_token=os.environ.get("HF_W_TOKEN"),
    max_new_tokens=512,
    stop_sequences=['<|end_of_text|>', '<|eot_id|>']
).generated_text

but this returns empty string response. What is the proper way to set stopping tokens?

Am I setting up Predibase correctly?

The text was updated successfully, but these errors were encountered:

tgaddair · 2024-05-23T19:47:41Z

Hey @ekim322, we recently made some changes to fix this in #456. Can you try with the latest LoRAX version to see if the error persists?

tgaddair added the bug Something isn't working label May 23, 2024

tgaddair self-assigned this May 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama3-8b-Instruct won't stop generating #442

Llama3-8b-Instruct won't stop generating #442

ekim322 commented Apr 27, 2024

tgaddair commented May 23, 2024

Llama3-8b-Instruct won't stop generating #442

Llama3-8b-Instruct won't stop generating #442

Comments

ekim322 commented Apr 27, 2024

System Info

Information

Tasks

Reproduction

Expected behavior

tgaddair commented May 23, 2024