Pull requests: NVIDIA/TransformerEngine

Pull requests list

add mark_not_offload() interface for cpu_offload_v1
#2770 opened Mar 17, 2026 by lhb8125
13 tasks
GEMM + Swiglu fused Grouped MLP for MXFP8 (labels: 2.14.0, MoE)
#2769 opened Mar 17, 2026 by ksivaman
13 tasks
[Draft]Support for score_mod and score_mod_bprop in cuDNN's sdpa
#2767 opened Mar 16, 2026 by vcherepanov-nv
2 of 13 tasks
Fix the retrieval of overwrite_main_grad
#2764 opened Mar 16, 2026 by shjwudp Draft
13 tasks
[JAX] MXFP8 Grouped GEMM
#2763 opened Mar 14, 2026 by jberchtold-nvidia Draft
13 tasks
[JAX] MXFP8 Grouped Quantize V2
#2760 opened Mar 13, 2026 by jberchtold-nvidia Draft
13 tasks
[PyTorch] transformer_engine.pytorch.autocast support inside torch.compile
#2759 opened Mar 13, 2026 by pggPL
4 of 26 tasks
[JAX] Grouped GEMM Refactor to use first_dims and last_dims
#2749 opened Mar 10, 2026 by jberchtold-nvidia
1 of 13 tasks
[Common] Persistent Grouped MXFP8 quantization kernel (labels: enhancement, MoE)
#2738 opened Mar 5, 2026 by Oleg-Goncharov Draft
9 of 13 tasks
Feat/cp nvshmem enhanced (label: community-contribution)
#2737 opened Mar 5, 2026 by Knight-of-Thunder
1 of 13 tasks
Feature/unswizzle (label: community-contribution)
#2732 opened Mar 4, 2026 by int-smart
9 of 13 tasks
fix: scope get_full_cu_seqlens cache key by device and inference mode
#2728 opened Mar 3, 2026 by DmCarpe93
8 of 13 tasks
[CI] Refactor CI build on GitHub
#2723 opened Mar 2, 2026 by ptrendx
1 of 13 tasks
[Common, PyTorch] Grouped MXFP8 dequantize support
#2722 opened Mar 2, 2026 by ptrendx
1 of 13 tasks
Add MXFP8 attention
#2719 opened Mar 1, 2026 by cyanguwa Draft
13 tasks
Add DCP compatibility for FSDP2-TP sharding in TransformerEngine.
#2713 opened Feb 26, 2026 by cspades
3 of 13 tasks
Newton-Schulz via cuSOLVERMp
#2706 opened Feb 25, 2026 by vcherepanov-nv
6 of 13 tasks
[Draft][PyTorch] torch.compile support for TE Linear
#2701 opened Feb 24, 2026 by pggPL Draft
13 tasks
[PyTorch] torch.compile support for permutation functions
#2686 opened Feb 17, 2026 by pggPL
9 of 13 tasks