[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode#29020

Open

jvlunteren wants to merge 19 commits intovllm-project:mainfrom

jvlunteren:jvl-triton-attn-upd5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode#29020

[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode#29020
jvlunteren wants to merge 19 commits intovllm-project:mainfrom
jvlunteren:jvl-triton-attn-upd5

Commits on Nov 19, 2025

separate attention kernel launches for prefill and decode

remove comment

Merge branch 'vllm-project:main' into jvl-triton-attn-upd5

various modifications

small modifications

Commits on Nov 20, 2025

address gemini-code-assist feedback

Commits on Nov 21, 2025

Merge branch 'main' into jvl-triton-attn-upd5

Merge branch 'main' into jvl-triton-attn-upd5

Commits on Nov 29, 2025

Merge branch 'main' into jvl-triton-attn-upd5

partial code reorganisation

Commits on Dec 1, 2025

formatting

partial code reorganisation

Commits on Dec 2, 2025

modified _cudagraph_support

replace _cudagraph_support modification by assert statement

Commits on Dec 3, 2025

Merge branch 'main' into jvl-triton-attn-upd5

Commits on Dec 4, 2025

Merge branch 'vllm-project:main' into jvl-triton-attn-upd5

override get_cudagraph_support()

Add comment

formatting

Uh oh!

[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode#29020jvlunteren wants to merge 19 commits intovllm-project:mainvllm-project/vllm:mainfrom jvlunteren:jvl-triton-attn-upd5jvlunteren/vllm:jvl-triton-attn-upd5Copy head branch name to clipboard

Commits

Commits on Nov 19, 2025

Commits on Nov 20, 2025

Commits on Nov 21, 2025

Commits on Nov 29, 2025

Commits on Dec 1, 2025

Commits on Dec 2, 2025

Commits on Dec 3, 2025

Commits on Dec 4, 2025

[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode#29020
jvlunteren wants to merge 19 commits intovllm-project:mainfrom
jvlunteren:jvl-triton-attn-upd5