Commit da1644d
committed
[fix][cpu] Use a SwigluOAI impl which supports interleaved gate-up weights
Current impl of `swigluoai_and_mul` for CPU assumes that gate-up weights
have been de-interleaved at load time, which is not the case.
The new impl we dispatch to is the same one used for the BF16 path on
GPU and handles interleaved gate-up.
Signed-off-by: Fadi Arafeh <[email protected]>1 parent 6fb0215 commit da1644d
1 file changed
+2
-13
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6 | 6 | | |
7 | 7 | | |
8 | 8 | | |
| 9 | + | |
9 | 10 | | |
10 | 11 | | |
11 | 12 | | |
12 | 13 | | |
13 | 14 | | |
14 | 15 | | |
15 | | - | |
16 | | - | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
24 | | - | |
25 | | - | |
26 | | - | |
27 | 16 | | |
28 | 17 | | |
29 | 18 | | |
| |||
284 | 273 | | |
285 | 274 | | |
286 | 275 | | |
287 | | - | |
| 276 | + | |
288 | 277 | | |
289 | 278 | | |
290 | 279 | | |
| |||
0 commit comments