Pull requests: huggingface/transformers
Pull requests list
Add YaRN and Dynamic-YaRN RoPE Scaling Methods
#30910
opened May 20, 2024 by
mig-mfreitas
2 of 5 tasks
FIX / TST: Fix expected results on Mistral slow test (A10)
#30909
opened May 20, 2024 by
younesbelkada
fix the get_size_with_aspect_ratio in max_size situation
#30902
opened May 20, 2024 by
SangbumChoi
3 of 5 tasks
Generation: get special tokens from model config
#30899
opened May 19, 2024 by
zucchini-nlp
Bugfix: WandbCallback uploads initial model checkpoint
#30897
opened May 19, 2024 by
mgerstgrasser
2 of 5 tasks
TST / Workflows: Get slack notifications for docker image build
#30891
opened May 18, 2024 by
younesbelkada
[docs] Spanish translation of model_memory_anatomy.md
#30885
opened May 17, 2024 by
aaronjimv
2 tasks
_is_peft_model update to recognise peft submodules, allowing training quantised models with peft submodules
#30884
opened May 17, 2024 by
ambroser53
2 of 5 tasks
[trainer] allow processor instead of tokenizer
#30864
opened May 16, 2024 by
sanchit-gandhi
[Whisper] Bump threshold of FA2 equivalence tests
#30861
opened May 16, 2024 by
sanchit-gandhi
Paligemma - fix slow tests, add bf16 and f16 slow tests
run-slow
#30851
opened May 16, 2024 by
molbap
[Draft] Fix tool pushing with new inputs
Agents
#30838
opened May 15, 2024 by
aymeric-roucher
test_custom_4d_attention_mask skip with sliding window attn
#30833
opened May 15, 2024 by
poedator