-
Notifications
You must be signed in to change notification settings - Fork 28.6k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
convert scale and zero to cuda when using HQQ backend
#37425
opened Apr 10, 2025 by
phymhan
Loading…
2 of 5 tasks
guard on model.eval when using torch.compile + FSDP2
#37413
opened Apr 10, 2025 by
winglian
Loading…
5 tasks
chore: standardize DeBERTa model card
#37409
opened Apr 10, 2025 by
Shoumik-Gandre
•
Draft
5 tasks done
[Regression] Fix Quark quantized model loading after refactorization
#37407
opened Apr 9, 2025 by
BowenBao
Loading…
1 of 5 tasks
[
Flex Attn
] Fix torch 2.5.1 incompatibilities
#37406
opened Apr 9, 2025 by
vasqu
Loading…
1 of 5 tasks
[Cache] Support compilable cache reuse with smaller batch sizes
#37394
opened Apr 9, 2025 by
gante
Loading…
Fix mask handling for flex attention in llama/gemma2/mistral/qwen2
#37381
opened Apr 9, 2025 by
flukeskywalker
Loading…
3 of 5 tasks
prevent creating a view/leaf param for low rank optimizers w FSDP
#37379
opened Apr 8, 2025 by
winglian
Loading…
5 tasks
Implement improved window attention in eager/sdpa version for Qwen2.5VL
#37363
opened Apr 8, 2025 by
JJJYmmm
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.