I noticed that the code here in fp8_paged_mqa_logits only supports next_n <= 2. May I ask what the reason for this is? Additionally, what modifications would be needed to support next_n > 2?
Any reply would be very helpful.