v0.8.4+rocm
What's Changed
- Remove duplicate code in config.py by @sstamenk in #494
- In light of the breaking cmake v4 release by @gshtras in #495
- Docs_update_20250327 by @arakowsk-amd in #493
- Upstream merge 2025 03 31 by @gshtras in #497
- Triton MLA parameter tweak for AMD GPU by @qli88 in #498
- Upstream merge 2025 04 02 by @gshtras in #499
- Bump aiter version by @gshtras in #500
- Adding 2stage MoE support separately until it is added upstream by @gshtras in #501
- Fused FP8 conversion in attention for v1 by @gshtras in #502
- Upstream merge 2025 04 07 by @gshtras in #503
- Fix fused moe by @gshtras in #506
- Update moe_tune_script.sh by @divakar-amd in #507
- Doubled size to wa issue and preserve CAR perf by @maleksan85 in #510
- Re-enable custom paged attention for V0 by @charlifu in #511
- Updated README.md with April 10 results by @Mcirino1 in #512
- Update README.md by @faisalgulfam32 in #514
- Updating base image by @charlifu in #515
- Update test-template.j2 to enable building by @Alexei-V-Ivanov-AMD in #517
- Update test-template.j2 to fix new location of run-amd-test.sh by @Alexei-V-Ivanov-AMD in #518
- Rocm 6.4 docker by @gshtras in #519
- Update README.md by @t-parry in #521
- Update README.md by @t-parry in #523
- Upstream merge 2025 04 21 by @gshtras in #522
- Upstream merge 2025 04 25 by @gshtras in #524
New Contributors
- @sstamenk made their first contribution in #494
- @faisalgulfam32 made their first contribution in #514
Full Changelog: v0.8.2+rocm...v0.8.4+rocm