Skip to content

Pull requests: quic/efficient-transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Enabling support of rerankers models 2B and 8B of qwen3vl
#921 opened Apr 18, 2026 by quic-amitraj Contributor Loading…
[Tests]: gemma3 tests are enabled
#918 opened Apr 17, 2026 by abukhoy Contributor Loading…
MLA perf
#910 opened Apr 8, 2026 by quic-mamta Contributor Loading…
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures enhancement New feature or request
#906 opened Apr 3, 2026 by vbaddi Contributor Loading…
qwen3_5_linear_attn
#901 opened Apr 1, 2026 by mohiso22 Contributor Loading…
[Nightly CI]: Creating CI Pipeline for Nightly Build
#828 opened Mar 5, 2026 by abukhoy Contributor Draft
FirstCache for Diffusers
#803 opened Feb 23, 2026 by quic-amitraj Contributor Draft
Add support for num_crops and valid_size from vLLM
#796 opened Feb 17, 2026 by quic-vargupt Contributor Loading…
MLA
#789 opened Feb 10, 2026 by quic-mamta Contributor Loading…
feat(QEff: Attn): add KV & Q blocking strategies for causal LMs enhancement New feature or request qeff.blocking
#774 opened Feb 3, 2026 by vbaddi Contributor Loading…
2 of 3 tasks
["QEff.finetuning"] Inference script for HF_trainer fine-tuning
#749 opened Jan 21, 2026 by tchawada Contributor Loading…
Adding blocked kv and skip softmax for gpt oss
#745 opened Jan 20, 2026 by kdulla Contributor Draft
Logger Module For Efficient Transformers
#696 opened Jan 2, 2026 by abhishek-singh591 Contributor Loading…
[PY-3.12]: Updating python3.12 and Removing multidict
#685 opened Dec 22, 2025 by abukhoy Contributor Draft
ProTip! Mix and match filters to narrow down what you’re looking for.