Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
FutureWarning about resume_download is raised after huggingface-hub 0.23.0 release
#30618
opened May 2, 2024 by
albertvillanova
TypeError: WhisperForConditionalGeneration.forward() got an unexpected keyword argument 'model'
#30616
opened May 2, 2024 by
kadirnar
2 of 4 tasks
Error During Training with PatchTSMixerForTimeSeriesClassification for Time Series Classification
#30614
opened May 2, 2024 by
tdg2088
2 of 4 tasks
Whisper assistant decoding not working with pipeline
#30611
opened May 2, 2024 by
kamilakesbi
4 tasks
Some functional problems in the implementation of Speculative Decoding
#30608
opened May 2, 2024 by
transcend-0
3 of 4 tasks
model.safetensors
missing in model file not found error in default case
#30601
opened May 1, 2024 by
davidgxue
2 of 4 tasks
AutoModal how to enable TP for extremly large models?
#30596
opened May 1, 2024 by
MonolithFoundation
ChatGLM3-6b测试模型时报错AttributeError: can't set attribute
#30594
opened May 1, 2024 by
padsasdasd
2 of 4 tasks
Cache in different devices when use split model with dispatch_model() function and model.generate()
#30593
opened May 1, 2024 by
ZetangForward
2 of 4 tasks
Whisper Translation on low resource languages
Audio
#30592
opened May 1, 2024 by
RohitMidha23
2 of 4 tasks
i cannot find the code that transformers trainer model_wrapped by deepspeed , i can find the theory about model_wrapped was wraped by DDP(Deepspeed(transformer model )) ,but i only find the code transformers model wrapped by ddp, where is the deepspeed wrapped ? thanks ^-^
#30591
opened May 1, 2024 by
ldh127
The Phi-3 tokenizer is not inverting the chat template correctly.
#30590
opened May 1, 2024 by
tbenthompson
ReadTimeOutError with from_pretrained for some model checkpoints only
#30589
opened Apr 30, 2024 by
SRGAnalytics-MD
4 tasks
mistralai/Mixtral-8x7B-v0.1 bfloat16 much slower than FP32 on Intel EMR CPU
#30588
opened Apr 30, 2024 by
badhri-intel
2 of 4 tasks
Community contribution: enable dynamic resolution input for more vision models.
Good First Issue
Vision
#30579
opened Apr 30, 2024 by
amyeroberts
29 tasks
Make fx traced model with the use of Issues related to torchdynamo and torchinductor
past_key_values
pickable again?
Compilation
#30575
opened Apr 30, 2024 by
michaelbenayoun
Correct check for SDPA in Vision Language Models
Should Fix
This has been identified as a bug and should be fixed.
Vision
#30565
opened Apr 30, 2024 by
zucchini-nlp
7 tasks
MLFlowCallback MLFLOW_RUN_ID not used
Integrations
trainer
#30563
opened Apr 30, 2024 by
daniel-ibanez-merlyn
2 of 4 tasks
Wav2Vec2CTCTokenizer adds random unknown tokens to encoded input
Core: Tokenization
Internals of the library; Tokenization.
#30561
opened Apr 30, 2024 by
tshmak
2 of 4 tasks
Idefics2 fine-tuning: Error when unscale_gradients called on FP16 gradients during training with transformers and accelerate
#30559
opened Apr 29, 2024 by
rabiulcste
1 of 4 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.