Commit 561253b
[Performance][Fix] update nvfp4 code to support renorm routing (#28569)
Signed-off-by: jiahanc <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
1 parent 80b6080 commit 561253b
File tree
2 files changed, +15 -8 lines changed
- vllm/model_executor/layers/quantization
- utils
Lines changed: 11 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| ... | ... | | |
| 15 | 15 | | |
| 16 | 16 | | |
| 17 | 17 | | |
| | 18 | + | |
| 18 | 19 | | |
| 19 | 20 | | |
| 20 | 21 | | |
| ... | ... | | |
| 1657 | 1658 | | |
| 1658 | 1659 | | |
| 1659 | 1660 | | |
| 1660 | | - | |
| | 1661 | + | |
| 1661 | 1662 | | |
| 1662 | | - | |
| | 1663 | + | |
| | 1664 | + | |
| | 1665 | + | |
| | 1666 | + | |
| | 1667 | + | |
| | 1668 | + | |
| 1663 | 1669 | | |
| 1664 | 1670 | | |
| 1665 | 1671 | | |
| 1666 | 1672 | | |
| 1667 | | - | |
| 1668 | | - | |
| 1669 | | - | |
| | 1673 | + | |
| 1670 | 1674 | | |
| 1671 | 1675 | | |
| 1672 | 1676 | | |
| ... | ... | | |
| 1690 | 1694 | | |
| 1691 | 1695 | | |
| 1692 | 1696 | | |
| 1693 | | - | |
| 1694 | | - | |
| | 1697 | + | |
| | 1698 | + | |
| 1695 | 1699 | | |
| 1696 | 1700 | | |
| 1697 | 1701 | | |
| ... | ... | | |
Lines changed: 4 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| ... | ... | | |
| 291 | 291 | | |
| 292 | 292 | | |
| 293 | 293 | | |
| 294 | | - | |
| | 294 | + | |
| | 295 | + | |
| | 296 | + | |
| | 297 | + | |
| 295 | 298 | | |