
data split #4

Open
xiaoyuxin1002 opened this issue Jul 24, 2023 · 1 comment

Comments

@xiaoyuxin1002

Hi, after reading your code, could I confirm the following about how you split your dataset?

  1. 5 samples are drawn from prompt_gen_data to induce the instruction from the open-source LLM.
  2. 20 samples are drawn from eval_data to evaluate the quality of the induced instruction during the BO iterations.
  3. 100 samples are drawn from test_data to evaluate the quality of the proposed instruction after the BO iterations.

Thanks!
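If my reading is right, the split could be sketched as below (a minimal illustration of my understanding, not your actual code; the function name `split_dataset`, the shuffle-then-slice order, and the seed are my assumptions):

```python
import random

def split_dataset(examples, seed=0):
    """Partition examples into the three disjoint subsets described above:
    5 for instruction induction, 20 for BO-time evaluation, 100 for final test."""
    rng = random.Random(seed)
    shuffled = examples[:]          # copy so the caller's list is untouched
    rng.shuffle(shuffled)
    prompt_gen_data = shuffled[:5]       # induce the instruction from the LLM
    eval_data = shuffled[5:25]           # score candidate instructions during BO
    test_data = shuffled[25:125]         # evaluate the final instruction after BO
    return prompt_gen_data, eval_data, test_data

# usage
examples = [f"example_{i}" for i in range(200)]
gen, ev, test = split_dataset(examples)
print(len(gen), len(ev), len(test))  # 5 20 100
```

Please correct me if the subsets are instead drawn from separate files rather than one shuffled pool.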
@xiaoyuxin1002
Author

Hi, it also seems that your data/instruction_induction_raw/induce folder contains 33 datasets, while your data/instruction_induction_raw/execute folder contains only 31.
In addition, Table 2 in your paper doesn't include the tuned prompts for word_in_context, but has two rows that both show the prompts for negation.
Could you please fix these? Thanks!
