Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[https://nvbugs/6293536][fix] Stage KV block offsets through a fresh host buffer
#15546 opened Jun 23, 2026 by thorjohnsen Collaborator
1 task done
[None][test] Refine Qwen3.5 397B test cases
#15544 opened Jun 23, 2026 by nv-guomingz Collaborator
1 task done
[TRTLLM-13575][feat] Add eplb support for qwen3.5
#15543 opened Jun 23, 2026 by nv-guomingz Collaborator
1 task done
[None][test] Add modularized perf tests (attention + MoE discrete/continuous)
#15541 opened Jun 23, 2026 by ruodil Collaborator
1 task done
[None][fix] Allow fail-early when reuse block and legacy mamba cache
#15540 opened Jun 23, 2026 by Wanli-Jiang Collaborator
1 task done
[https://nvbugs/6344108][fix] skip TestNemotron3Super120B on pre-blackwell
#15539 opened Jun 23, 2026 by bo-nv Collaborator
1 task
[https://nvbugs/6166097][fix] Fix CuteDSL NVFP4 EPLB weight layout
#15538 opened Jun 23, 2026 by nv-xtf
1 task done
[None][chore] Clean deprecated CppMambaCacheManager
#15533 opened Jun 23, 2026 by bo-nv Collaborator
1 task done
[#14874][feat] AutoDeploy : Perf optimization for gpt-oss-120b for low conc AutoDeploy <NV> AutoDeploy Backend
#15531 opened Jun 23, 2026 by taylor-yb-lee Collaborator
1 task done
[None][chore] Autodeploy disable the pipeline cache by default
#15530 opened Jun 22, 2026 by nvchenghaoz Collaborator
1 task
[None][CI] Waive flaky test_vbench_dimension_score_wan (nvbugs/6357628)
#15529 opened Jun 22, 2026 by chang-l Collaborator
[None][feat] Support FP8 base weights for MoE LoRA
#15528 opened Jun 22, 2026 by brb-nv Collaborator Draft
1 task
[None][feat] Add prefix-aware scheduling config flag to support opt-out
#15526 opened Jun 22, 2026 by SimengLiu-nv Collaborator
1 task done
[TRTLLM-12557][feat] WideEP FT: add AlltoAll watchdog (1a.4)
#15524 opened Jun 22, 2026 by chienchunhung Collaborator
[None][fix] Preserve Kimi 2.5 tool call IDs
#15523 opened Jun 22, 2026 by hvagadia Contributor