integrate mock vision backbone into model #441

epwalsh · 2024-02-08T23:41:40Z

Adds an abstraction for vision backbones, along with a mock dataset to test it. You can run a test training job on 2 GPUs with the mock dataset like so:

torchrun --nproc-per-node=2 scripts/train.py configs/mm-tiny.yaml

epwalsh · 2024-02-08T23:47:23Z

olmo/model.py

+            # Inject image patch embeddings into input embeddings.
+            assert image_offsets is not None
+            image_offsets_mask = image_offsets > 0
+            batch_idx = torch.arange(0, batch_size).repeat_interleave(image_offsets_mask.sum(dim=-1))
+            x.index_put_((batch_idx, image_offsets[image_offsets_mask]), img_emb[image_offsets_mask])


This took some thinking, but you can validate it with this little example:

import torch B = 2 S = 8 D = 16 P = 3 # num patches (max across instances) x = torch.zeros(B, S, D) img_emb = torch.rand(B, P, D) # use -1 for padding image_offsets = torch.tensor([[1, 5, 6], [3, -1, -1]]) assert image_offsets.shape == (B, P) image_offsets_mask = image_offsets > 0 batch_idx = torch.arange(0, B).repeat_interleave(image_offsets_mask.sum(dim=-1)) x.index_put_((batch_idx, image_offsets[image_offsets_mask]), img_emb[image_offsets_mask])

notion-workspace · 2024-02-09T01:39:24Z

Integrate mock vision backbone into OLMo

notion-workspace · 2024-02-14T21:24:28Z

Figure out how to put the image data into the sequences

integrate mock vision backbone into model

aea19b7

epwalsh commented Feb 8, 2024

View reviewed changes

Pass image inputs to model within trainer

ca4673e

epwalsh added 6 commits February 12, 2024 14:22

Update data collator for image fields

472c386

Add mock multi-modal dataset

b81a99e

Add config for testing

25b9f3c

Merge branch 'mm-dev' into epwalsh/vision-backbone

1dc4aea

fix dtype

8976f36

Add some comments

e397373

epwalsh added 6 commits February 28, 2024 19:22

fix merge conflicts

397436e

fix merge conflicts

57fa68a

Merge branch 'mm-dev' into epwalsh/vision-backbone

2723470

Merge branch 'main' into epwalsh/vision-backbone

2eb1b4a

Merge branch 'mm-dev' into epwalsh/vision-backbone

3b94d08

Merge branch 'mm-dev' into epwalsh/vision-backbone

4006195

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

integrate mock vision backbone into model #441

integrate mock vision backbone into model #441

epwalsh commented Feb 8, 2024 •

edited

epwalsh Feb 8, 2024

notion-workspace bot commented Feb 9, 2024

notion-workspace bot commented Feb 14, 2024

integrate mock vision backbone into model #441

Are you sure you want to change the base?

integrate mock vision backbone into model #441

Conversation

epwalsh commented Feb 8, 2024 • edited

epwalsh Feb 8, 2024

Choose a reason for hiding this comment

notion-workspace bot commented Feb 9, 2024

notion-workspace bot commented Feb 14, 2024

epwalsh commented Feb 8, 2024 •

edited