[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode #29020
Open
jvlunteren wants to merge 19 commits into vllm-project:main from
Commits
Commits on Nov 19, 2025
Commits on Nov 20, 2025
Commits on Nov 21, 2025
Commits on Nov 29, 2025
Commits on Dec 1, 2025