Issues: huggingface/transformers
Issues list
RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::BFloat16
PEFT
PyTorch FSDP
#30914
opened May 20, 2024 by
mosama1994
4 tasks
Title: CUDA RuntimeError: Unspecified Launch Failure during Training
#30913
opened May 20, 2024 by
Hongjie1Chu
2 of 4 tasks
Training hangs at the first gradient syncing of an MoE model while using deepspeed
DeepSpeed
#30911
opened May 20, 2024 by
negar-foroutan
2 of 4 tasks
Unable to run FLAN-T5 inference on GCP TPU v3 (TF 2.16.1)
Core: Tokenization
Internals of the library; Tokenization.
TensorFlow
Anything TensorFlow
TPU
#30901
opened May 20, 2024 by
sumanthratna
2 of 4 tasks
WandbCallback always (!) uploads entire model checkpoint to wandb
#30896
opened May 19, 2024 by
mgerstgrasser
2 of 4 tasks
Logging to wandb breaks FSDP in 4.41.0
PyTorch FSDP
#30895
opened May 19, 2024 by
mgerstgrasser
2 of 4 tasks
add_generation_prompt=False in Tokenizer.apply_chat_template has no effect
Chat Template
#30893
opened May 18, 2024 by
AndreiMuresanu
3 of 4 tasks
transformers 4.41.0 breaks generate() for T5
Generation
Should Fix
This has been identified as a bug and should be fixed.
#30892
opened May 18, 2024 by
abdulfatir
1 of 4 tasks
GGUF interaction with Transformers using AutoModel Class
GGUF
#30889
opened May 18, 2024 by
Abdullah-kwl
RuntimeError: Failed to import transformers.generation.utils because of the following error (look up to see its traceback): cannot import name 'GenerateOutput' from partially initialized module 'transformers.generation.utils' (most likely due to a circular import)
#30888
opened May 18, 2024 by
sysuls1
4 tasks
Wav2vec2 model has unknown attributes weight_g/weight_v when DeepSpeed ZeRO-3 is enabled
Audio
DeepSpeed
#30881
opened May 17, 2024 by
jonnyli1125
1 of 4 tasks
Have _is_peft_model check if there's any peft submodule/Allow quantised training
PEFT
#30878
opened May 17, 2024 by
ambroser53
Kosmos-2.5 implementation in transformers
Multimodal
New model
#30877
opened May 17, 2024 by
Natyren
2 tasks done
Owlv2 model keeps crashing
Examples
Which is related to examples in general
Vision
#30874
opened May 17, 2024 by
preethiseshadri518
Unsuppressable warning: "<model> will not detect padding tokens in inputs_embeds"
#30871
opened May 16, 2024 by
naimenz
2 of 4 tasks
Cache problem while running on multiple nodes with GPU
#30859
opened May 16, 2024 by
yuane4
2 of 4 tasks
scores_for_ground_truths Error for deepset/roberta-base-squad2 model and squad_v2 dataset
Examples
TensorFlow
#30856
opened May 16, 2024 by
rahuljauhari3
2 of 4 tasks
Mamba: use_cache is not passed through in prepare_inputs_for_generation
#30849
opened May 16, 2024 by
uwu-420
[BLIP2] BLIP2QFormerLayer is missing the self.intermediate parameter, which makes training from scratch impossible
#30846
opened May 16, 2024 by
tongda
1 of 4 tasks
Significant performance degradation with multi-GPU training on newer torch/transformers
#30840
opened May 15, 2024 by
abdulfatir
2 of 4 tasks