Pull requests: allenai/OLMo

Pull requests list

mask instances with too many repetitions
#568 opened May 3, 2024 by epwalsh
Reverse weight decay
#567 opened May 3, 2024 by AkshitaB
1 task
Preliminary MoE extension
#541 opened Apr 11, 2024 by Muennighoff Draft
Add reorder cache for beam search
#526 opened Mar 26, 2024 by cshaib
Add scripts for Dave
#516 opened Mar 21, 2024 by epwalsh Draft
Scripts for QKV experiments
#510 opened Mar 20, 2024 by AkshitaB
OLMo 70B official training run
#507 opened Mar 19, 2024 by epwalsh Draft
hf_olmo: support flash attn 2
#471 opened Feb 29, 2024 by wade3han
integrate mock vision backbone into model
#441 opened Feb 8, 2024 by epwalsh
DeepSpeed
#384 opened Nov 27, 2023 by Muennighoff Draft
Kebab7
#360 opened Nov 3, 2023 by dirkgr Draft
Mitchish kempner
#352 opened Oct 31, 2023 by ibeltagy
Activation logging
#330 opened Oct 12, 2023 by saurabh111233212
Configs for LUMI ablations
#324 opened Oct 10, 2023 by epwalsh Draft
3 tasks
truncated init + bloom init
#285 opened Sep 23, 2023 by ibeltagy
Compiling the AMD layer norm
#284 opened Sep 22, 2023 by dirkgr Draft
Add triton implementation of layer norm (label: status/blocked)
#260 opened Sep 6, 2023 by epwalsh Draft