Commit 3f42b05
authored
[Refactor] [1/N] to simplify the vLLM serving architecture (#28040)
Signed-off-by: chaunceyjiang <[email protected]>1 parent 69520bc commit 3f42b05
File tree
27 files changed
+850
-455
lines changed- tests/entrypoints/openai
- vllm/entrypoints
- openai
- sagemaker
- serve
- disagg
- elastic_ep
- instrumentator
- lora
- profile
- rlhf
- sleep
- tokenize
27 files changed
+850
-455
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
232 | 232 | | |
233 | 233 | | |
234 | 234 | | |
235 | | - | |
| 235 | + | |
236 | 236 | | |
237 | 237 | | |
238 | 238 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
118 | 118 | | |
119 | 119 | | |
120 | 120 | | |
| 121 | + | |
121 | 122 | | |
122 | 123 | | |
123 | 124 | | |
| |||
0 commit comments