9 forgetting pipeline #18

jack89roberts · 2024-04-25T08:45:12Z

Adds a Forgetter class for each of the Gradient Ascent, Gradient Difference, KL, and "I don't know" forgetters. All of them inherit from the HuggingFace Trainer and modify only compute_loss.
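As a rough illustration of how the four forgetters differ, here is a minimal sketch of the loss combinations. It assumes TOFU-style definitions. The function names, the equal weighting of terms, and the KL direction (oracle relative to current model) are assumptions for illustration, not the code in this PR. Plain floats stand in for the losses that compute_loss would get from the model.

```python
import math
from typing import Sequence


def gradient_ascent_loss(forget_loss: float) -> float:
    """Maximise the forget loss by minimising its negative."""
    return -forget_loss


def gradient_difference_loss(forget_loss: float, retain_loss: float) -> float:
    """Ascend on the forget data while descending on the retain data."""
    return retain_loss - forget_loss


def kl_divergence(p: Sequence[float], q: Sequence[float]) -> float:
    """KL(p || q) for two discrete distributions over the same support."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def kl_forget_loss(
    forget_loss: float,
    oracle_probs: Sequence[float],
    current_probs: Sequence[float],
) -> float:
    """Ascend on the forget data, and penalise drift from the oracle
    (original) model's predictions on the retain data.
    Assumption: the penalty is KL(oracle || current) with weight 1."""
    return -forget_loss + kl_divergence(oracle_probs, current_probs)


def idk_loss(idk_forget_loss: float, retain_loss: float) -> float:
    """Ordinary descent, where the forget answers have been replaced by
    "I don't know"-style responses. Assumption: the terms are summed."""
    return idk_forget_loss + retain_loss
```

In a Trainer subclass these would be computed from model outputs inside compute_loss; only the way the forget and retain terms are combined changes between forgetters.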

TODOs

Configs/Job Submission

All of the following needs testing:

New top-level config structure (see here) + associated changes for converting it to individual configs etc.
The top level to individual full/retain/forget configs generation code in arcsf.config.experiment has been adapted for the new config/job submission structure. It generates separate full, retain, and forget train scripts and configs.
train.py has been modified to accept the new config structure and to work for both training and forgetting jobs, but not checked/tested at all yet.
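To make the idea of expanding one top-level config into per-job configs concrete, here is a hypothetical sketch. The keys (`model`, `seed`, `forget_methods`, `job_type`, `forget_method`) and the function name are invented for illustration. The real structure lives in arcsf.config.experiment and may differ.

```python
from typing import Any, Dict, List


def expand_experiment(top: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Turn one top-level experiment config into individual job configs.

    Hypothetical layout: one "full" job, one "retain" job, and one
    "forget" job per forgetting method listed in the top-level config.
    """
    shared = {"model": top["model"], "seed": top["seed"]}
    jobs: List[Dict[str, Any]] = [
        {**shared, "job_type": "full"},
        {**shared, "job_type": "retain"},
    ]
    for method in top["forget_methods"]:
        jobs.append({**shared, "job_type": "forget", "forget_method": method})
    return jobs
```

A single train script can then branch on `job_type` to decide whether to build an ordinary trainer or a forgetter.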

Forgetting Evaluation

Currently it's not possible to do forgetting with an eval_dataset, or any kind of evaluation during training, because the evaluate function in the trainer class doesn't know what to do when given two data inputs (forget / retain). The evaluate function in arcsf.forget.base needs to be implemented to use Jack D's evaluation code, and the trainer should be initialised with eval_dataset set to whatever our appropriate eval dataset instance is.

TOFU says it didn't work well

Steals a lot from fine-tuning branch
- some placeholder forget configs and config classes
- adds a data collator compatible with QAForgetDataset
- adapts trainer loading etc. to have option of loading a forgetter instead

BUG - Evaluation does not work, likely also needs adapting for expected data input format.
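The forget/retain data input format mentioned above can be sketched as follows. This is a minimal illustration, assuming each dataset item is a (forget, retain) pair of tokenized examples with an `input_ids` list. The actual QAForgetDataset item format and collator in this PR are not shown here, and the names `pad_batch` and `collate_forget_retain` are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Example = Dict[str, List[int]]
Batch = Dict[str, List[List[int]]]


def pad_batch(examples: Sequence[Example], pad_id: int = 0) -> Batch:
    """Right-pad every example to the longest sequence in the batch."""
    max_len = max(len(ex["input_ids"]) for ex in examples)
    input_ids, attention_mask = [], []
    for ex in examples:
        ids = ex["input_ids"]
        n_pad = max_len - len(ids)
        input_ids.append(ids + [pad_id] * n_pad)
        attention_mask.append([1] * len(ids) + [0] * n_pad)
    return {"input_ids": input_ids, "attention_mask": attention_mask}


def collate_forget_retain(
    pairs: Sequence[Tuple[Example, Example]], pad_id: int = 0
) -> Tuple[Batch, Batch]:
    """Split (forget, retain) pairs and pad each side separately,
    returning a (forget_batch, retain_batch) tuple."""
    forget = [f for f, _ in pairs]
    retain = [r for _, r in pairs]
    return pad_batch(forget, pad_id), pad_batch(retain, pad_id)
```

Because the result is a tuple of two batches rather than a single dict, a stock evaluation loop that expects one batch cannot consume it unchanged, which is consistent with the evaluation bug noted above.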

jack89roberts added this to the Milestone 1: Working pipeline on small novel usecase milestone Apr 25, 2024

jack89roberts force-pushed the 9-forgetting-pipeline branch 2 times, most recently from 7b122f7 to 2e186da Compare May 3, 2024 17:13

jack89roberts force-pushed the 9-forgetting-pipeline branch from b8740ff to 392151e Compare May 28, 2024 08:39

jack89roberts added 26 commits May 28, 2024 16:13

⬆️ add/update some basic dependencies

c711c7c

🚧 start adapting tofu forgetting code

947370c

⬆️ optional notebook dependencies

53bfdf7

🚧 rename tokenizer in dataset

2edfee6

🔥 delete placeholder data class

ff1116c

🔥 remove dpo for now

573c699

TOFU says it didn't work well

✨ add lookup dict for forget classes

8387fca

wip start working on forget tests

1db3222

wip tests

d205428

✅ Add passing test for each forgetter

c1a47c3

➕ re-add scikit-learn dependency

c97e119

💡 remove commented code

a91d087

📝 Docstrings

9ae742a

wip forget script

2cd653e

✅ fix broken test due to changed forget/retain order

0168a47

⬆️ update pyproject.toml to match fine-tuning branch

f5dac6f

🔨 sample forget submit script

8e8fa21

🚨 lint

ba32ad2

print diffs in actions

60f5e4f

fix bugs in running kl and idk

25019ff

reduce walltime

65f3fb4

auto device map for oracle model, unique output dir

b3eabc1

🐛 job type not set

70fe2a4

add not implemented evaluate function

23cb6ad

data collator return docstring

b95f876

jack89roberts added 5 commits May 28, 2024 16:14

📌 re-generate lock file after rebase

c952f70

🚨 remove unused import

619eaf2

🔥 remove actions file from template

3782749

wip refactoring forget for new config structure

13eb867

📝 fix all -> full in docstrings

ad3522c

jack89roberts force-pushed the 9-forgetting-pipeline branch from a877f2c to ad3522c Compare May 28, 2024 15:14

jack89roberts added 3 commits May 31, 2024 12:11

wip forget config refactoring

e2b8f87

untested general full/retain/forget train script

6551f1f

debug tests

8bc3a6b
