Prerequisites
Problem Description
We'd like to integrate FlashInfer into the project to speed up both the prefill and decode phases of inference.
FlashInfer provides optimized attention kernels for these phases, so the integration should make inference faster.
Proposed Solution
Alternatives Considered
No response
Additional Context
No response
Importance
Important
Usage Statistics (Optional)
No response