
Prompts refactoring #517

Merged
@quovadim merged 38 commits into prerelease from quovadim/qna-generation-improvement on May 20, 2024

Conversation

@quovadim (Collaborator) commented Apr 30, 2024:

Closes #505.
Prompts have been moved into their own subclasses, better prompts for text generation have been added, and chain-of-thought capabilities have been added to prompts.

Main new features are:

  • Added native, automatic support for JSON generation using the OpenAI response format structure (see the first sketch after this list)
  • Added prompt-level validation of generated answers
  • Added prompt classes with automatic validation of prompts and their parameters
  • Added support for non-strict queries: a particular prompt can now be marked as allowed to fail, and a failed response is tolerated (see the second sketch after this list)
  • Added support for chain-of-thought reasoning (take a look at the examples in the Q&A generation prompts)
  • Moved prompts into separate .txt files
  • Q&A generation can now fail to generate some questions without consequences, as long as at least one question is generated
  • Changed the Python version to 3.11
  • Some prompts that previously returned text now return JSON
  • Some prompts that previously returned JSON now return text
  • The signature of generate_response now depends on the prompt's input parameters
  • Refactored all prompts (except ragas) and added examples
  • Removed some try/except blocks under the new system for handling responses to non-strict queries: every exception raised from response_generator is now treated as critical. If the NonStrict tag is added, a failed execution returns None instead, unless the exception is critical (i.e. a bug in the code rather than an LLM generation failure)
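
As a first sketch of the JSON generation support, here is a minimal example using OpenAI's JSON mode. The model name and the generate_json helper are illustrative assumptions, not the actual code in this PR:

```python
import json

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def generate_json(system_prompt: str, user_prompt: str) -> dict:
    """Request a JSON object from the model using OpenAI's JSON mode."""
    response = client.chat.completions.create(
        model="gpt-4-turbo",  # illustrative model name
        response_format={"type": "json_object"},  # model must emit valid JSON
        # JSON mode requires the word "json" to appear somewhere in the messages
        messages=[
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
    )
    return json.loads(response.choices[0].message.content)
```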

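And a second, minimal sketch of how non-strict prompts are intended to behave. The class and function names here are hypothetical, chosen for illustration only:

```python
from typing import Any, Callable


class PromptValidationError(Exception):
    """Raised when a generated answer fails the prompt's validator."""


class Prompt:
    """A prompt template with its own answer validator and strictness flag."""

    def __init__(self, template: str, validator: Callable[[str], bool], strict: bool = True):
        self.template = template
        self.validator = validator
        self.strict = strict


def generate_response(prompt: Prompt, llm_call: Callable[[str], str], **params: Any) -> str | None:
    """Render the prompt with its parameters, call the LLM, and validate.

    Non-strict prompts swallow validation failures and return None;
    code bugs (e.g. a missing template parameter) always raise.
    """
    rendered = prompt.template.format(**params)  # a KeyError here is a code bug, so it propagates
    answer = llm_call(rendered)
    if not prompt.validator(answer):
        if prompt.strict:
            raise PromptValidationError(f"answer failed validation: {answer!r}")
        return None  # non-strict: a failed generation is acceptable
    return answer
```

Under a scheme like this, Q&A generation can simply drop the None results and still succeed as long as at least one question is generated.
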
Metrics and comparison

Comparison on data generated using this branch

| Metric | New Data, New Prompts | New Data, Old Prompts |
| --- | --- | --- |
| fuzzy | 75.88 | 77.05 |
| bert_all_MiniLM_L6_v2 | 80.58 | 79.60 |
| cosine | 75.21 | 68.88 |
| bert_distilbert_base_nli_stsb_mean_tokens | 76.90 | 76.14 |
| llm_answer_relevance | 72.59 | 67.60 |
| llm_context_precision | 91.03 | 84.62 |

Comparison on data generated using the development branch

| Metric | Old Data, New Prompts | Old Data, Old Prompts |
| --- | --- | --- |
| fuzzy | 81.14 | 80.56 |
| bert_all_MiniLM_L6_v2 | 69.71 | 66.58 |
| cosine | 66.78 | 57.51 |
| bert_distilbert_base_nli_stsb_mean_tokens | 70.49 | 67.78 |
| llm_answer_relevance | 62.18 | 57.63 |
| llm_context_precision | 84.62 | 78.85 |

@martinpeck (Member) commented:

You've removed quite a few try/except blocks.
This might be fine, but it's not mentioned in the PR notes, and I wasn't sure if this change was desired/expected.
Probably best to mention how the error handling has changed in the PR notes.

@ritesh-modi self-requested a review May 1, 2024 11:00
@quovadim requested a review from martinpeck May 3, 2024 07:43
@quovadim self-assigned this May 3, 2024
@martinpeck removed their request for review May 7, 2024 10:01
@quovadim requested a review from martinpeck May 9, 2024 07:58
@ritesh-modi (Collaborator) left a comment:

LGTM

rag_experiment_accelerator/config/config.py (review thread resolved)
@quovadim changed the base branch from development to prerelease May 20, 2024 09:37
@quovadim merged commit 0341e88 into prerelease May 20, 2024
3 checks passed
@quovadim deleted the quovadim/qna-generation-improvement branch May 20, 2024 11:48
Development

Successfully merging this pull request may close these issues:

  • Bring better Q&A generation capabilities (#505)