
8 fine tuning #14

Merged: 101 commits, May 28, 2024

Conversation

@philswatton (Collaborator) commented Apr 18, 2024

UPDATED: A PR to resolve #8, now open for review.

Big picture:

  • Implements model fine-tuning
  • Implements a partial config structure for the project, designed so that only minimal changes should be needed to cover the full project pipeline
  • Implements Baskerville array jobs
  • Implements wandb usage

The code here:

  • Adds functions for loading the model, tokenizer, and trainer
  • Adds a base config class, along with experiment, data, and model config classes
    • The idea is a modular setup, with one config per experiment component (data, model, forgetting technique), so that individual components can be changed without touching the others
    • I anticipate being able to add forget-specific fine-tuning details to the model hyperparameter configs
  • Adds a function that, given a fairly simple top-level experiment config, spawns several single-experiment configs (and, optionally, an associated Baskerville array script)
  • Adds example experiment (both normal and top-level), data, model, and hyperparameter configs
  • Adds a scripts/README containing instructions for fine-tuning
  • Adds some additional dependencies
  • Adds unit tests
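The top-level-config spawning described above can be sketched as a cross product over the component configs: the top-level config lists the data, model, and hyperparameter configs to sweep over, and each combination becomes one single-experiment config (one per array task). This is a hypothetical sketch, not the actual `src/arcsf/config/experiment.py` implementation; the class, function, and config names below are invented for illustration.

```python
import itertools
from dataclasses import dataclass


@dataclass
class ExperimentConfig:
    """One runnable experiment: a named config per component (hypothetical)."""
    data_config: str
    model_config: str
    hyperparam_config: str


def spawn_experiment_configs(top_level: dict) -> list[ExperimentConfig]:
    """Expand a top-level experiment config into single-experiment configs.

    Each combination of (data, model, hyperparam) configs becomes one
    ExperimentConfig, which an array job can then run independently.
    """
    combos = itertools.product(
        top_level["data_configs"],
        top_level["model_configs"],
        top_level["hyperparam_configs"],
    )
    return [ExperimentConfig(d, m, h) for d, m, h in combos]


# Hypothetical top-level config: 1 dataset x 2 models x 2 hyperparam sets
top = {
    "data_configs": ["dataset_a"],
    "model_configs": ["gpt2", "phi"],
    "hyperparam_configs": ["short", "longer"],
}
experiments = spawn_experiment_configs(top)
print(len(experiments))  # 1 * 2 * 2 = 4 single-experiment configs
```

This keeps the modularity described above: swapping in a new model or forgetting technique means adding one component config, not editing every experiment.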

@jack89roberts (Contributor) left a comment


A few initial thoughts/comments:

Review threads on: src/arcsf/config/config_class.py, src/arcsf/models/config_class.py, src/arcsf/config/experiment_config.py, src/arcsf/models/model.py, src/arcsf/models/trainer.py, src/arcsf/__init__.py
@philswatton marked this pull request as ready for review May 23, 2024 08:22
Further review threads on: scripts/gpt2_longer_all.sh, src/arcsf/config/experiment.py, scripts/train.py, src/arcsf/models/model.py
@jack89roberts (Contributor) commented:

^ mostly some small refactoring/docs comments. I haven't tried running it but looks good to me. Probably some upcoming headaches thinking about adding in forgetting nicely.

philswatton and others added 16 commits May 28, 2024 13:49
Co-authored-by: Jack Roberts <jroberts@turing.ac.uk>
@philswatton merged commit a9d3899 into develop May 28, 2024
1 check passed
@philswatton deleted the 8-fine-tuning branch May 28, 2024 15:01

Successfully merging this pull request may close these issues.

Add fine-tuning to pipeline