
How to write the prompt to tell the model that it should only generate the answers based on the context? #4

Open
gabriead opened this issue Feb 21, 2024 · 1 comment


@gabriead

gabriead commented Feb 21, 2024

Hi guys,
we have tried your model with a larger number of documents (than used in the example code) and found that the model does not use the provided documents at all. How can I tell the model to only use the provided documents? A general question: if the model has to use content from different documents, how does it choose which ones to use? (We couldn't find an internal similarity metric in your code that would do that.)

So our small example looks like this:

```python
import json

from gritlm import GritLM

instruction = """You are the Kitchen Owner's Manual. Based ONLY on the documents answer the Question."""

queries = ["How do I attach the kitchen cabinet?"]

# Load the documents (a list of strings) from disk.
with open("some_json.json", "r") as file:
    documents = json.load(file)

def gritlm_instruction(instruction):
    return "<|user|>\n" + instruction + "\n<|embed|>\n" if instruction else "<|embed|>\n"

model = GritLM("GritLM/GritLM-7B", torch_dtype="auto")

# Embed the documents and the queries.
d_rep = model.encode(documents, instruction=gritlm_instruction(""))
q_rep = model.encode(queries, instruction=gritlm_instruction(instruction))

prompt_instr = "Question: How do I attach the kitchen cabinet?"

messages = [{"role": "user", "content": prompt_instr}]
encoded = model.tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
encoded = encoded.to(model.device)
gen = model.generate(encoded, max_new_tokens=256, do_sample=False)
print(model.tokenizer.batch_decode(gen)[0])
```

The answer is super generic and you can tell instantly that the model didn't use any of the provided information from the documents. So how can one fix that?

@Muennighoff
Contributor

Hey, sorry, that demo did not do any RAG; it just showed how you can use the model to embed / generate. I.e., when you call generate, it is not using any of the text you encoded previously, only the text provided in the prompt.

I've added a new demo here to show how you can do RAG with the different caching alternatives:
https://github.com/ContextualAI/gritlm/tree/main?tab=readme-ov-file#caching
You can also remove the caching if you don't need it.
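Roughly, the no-caching flow is: embed the documents once, embed the query, rank the documents by cosine similarity, and then paste the top hits into the generation prompt itself. Here is a minimal sketch of that, assuming `documents` is a list of plain strings; the retrieval instruction, the top-2 cutoff, and the prompt wording are placeholder choices, not anything baked into the model:

```python
import numpy as np
from gritlm import GritLM

def gritlm_instruction(instruction):
    return "<|user|>\n" + instruction + "\n<|embed|>\n" if instruction else "<|embed|>\n"

model = GritLM("GritLM/GritLM-7B", torch_dtype="auto")

documents = ["Cabinet mounting: ...", "Sink installation: ..."]  # placeholder strings
query = "How do I attach the kitchen cabinet?"

# 1) Retrieval: embed documents (no instruction) and the query (with an
#    instruction), then rank documents by cosine similarity to the query.
d_rep = model.encode(documents, instruction=gritlm_instruction(""))
q_rep = model.encode([query], instruction=gritlm_instruction("Retrieve the relevant manual section."))

sims = (d_rep @ q_rep[0]) / (np.linalg.norm(d_rep, axis=1) * np.linalg.norm(q_rep[0]))
top_docs = [documents[i] for i in sims.argsort()[::-1][:2]]  # keep the 2 best matches

# 2) Generation: the model only "sees" what is in the prompt, so the retrieved
#    text has to be injected into the prompt explicitly.
context = "\n\n".join(top_docs)
messages = [{"role": "user", "content": f"Answer ONLY based on these documents:\n\n{context}\n\nQuestion: {query}"}]
encoded = model.tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
gen = model.generate(encoded.to(model.device), max_new_tokens=256, do_sample=False)
print(model.tokenizer.batch_decode(gen)[0])
```

The key difference from the snippet above is that the embeddings are actually used, namely to select which documents go into the prompt; the caching demo in the README additionally avoids re-encoding that context at generation time, but the overall logic is the same.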
