Commit 3f42b05

[Refactor] [1/N] to simplify the vLLM serving architecture (#28040)
Signed-off-by: chaunceyjiang <[email protected]>
1 parent 69520bc commit 3f42b05

27 files changed: +850 -455 lines changed

tests/entrypoints/openai/test_basic.py

Lines changed: 1 addition & 1 deletion
@@ -232,7 +232,7 @@ def make_long_completion_request():
 @pytest.mark.asyncio
 async def test_health_check_engine_dead_error():
     # Import the health function directly to test it in isolation
-    from vllm.entrypoints.openai.api_server import health
+    from vllm.entrypoints.serve.instrumentator.health import health

     # Create a mock request that simulates what FastAPI would provide
     mock_request = Mock(spec=Request)
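
The hunk above only changes where `health` is imported from; the test still calls the handler directly with a mocked request. A minimal, standard-library-only sketch of that testing pattern follows. It does not reproduce vLLM's handler: `health`, `EngineDeadError`, and the `app.state.engine_client.check_health()` access path are stand-ins assumed for illustration, and the stand-in returns a bare status code rather than a FastAPI `Response`.

```python
import asyncio
from unittest.mock import AsyncMock, Mock


class EngineDeadError(Exception):
    """Hypothetical stand-in for the engine-failure exception."""


async def health(raw_request) -> int:
    """Stand-in handler: returns an HTTP status code, not a Response object."""
    try:
        # Assumed access path; the real handler may reach the engine differently.
        await raw_request.app.state.engine_client.check_health()
        return 200
    except EngineDeadError:
        return 503


def run_health_with_dead_engine() -> int:
    # Request-like mock: no server and no real engine are started.
    mock_request = Mock()
    mock_request.app.state.engine_client.check_health = AsyncMock(
        side_effect=EngineDeadError()
    )
    return asyncio.run(health(mock_request))


def run_health_with_live_engine() -> int:
    # Same mock shape, but the health check completes without raising.
    mock_request = Mock()
    mock_request.app.state.engine_client.check_health = AsyncMock(return_value=None)
    return asyncio.run(health(mock_request))
```

Because the handler is an ordinary coroutine that takes the request as an argument, moving it to another module should only require updating the import in the test, which is consistent with the one-line change in this file.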

vllm/entrypoints/api_server.py

Lines changed: 1 addition & 0 deletions
@@ -118,6 +118,7 @@ async def init_app(
         )
     )
     app.state.engine_client = engine
+    app.state.args = args
     return app

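
The added line stores the parsed arguments on `app.state` next to the engine client. The commit does not state the motivation; one plausible reading, given the refactor's title, is that handlers living in other modules can then read startup arguments through the request instead of through a module-level global. The sketch below illustrates that access pattern only. It models the app with `types.SimpleNamespace` rather than FastAPI, and `served_root_path` and the `root_path` field are hypothetical names chosen for the example.

```python
from types import SimpleNamespace


def init_app(args, engine):
    # Stand-in for the FastAPI app: only the `state` namespace is modeled.
    app = SimpleNamespace(state=SimpleNamespace())
    app.state.engine_client = engine
    app.state.args = args  # mirrors the attribute added by this hunk
    return app


def served_root_path(request) -> str:
    # Hypothetical handler helper: reads startup args via the request object,
    # so it needs no import from the module that parsed them.
    return request.app.state.args.root_path or "/"
```

A handler written this way depends only on the request it receives, which is what allows it to be moved between modules without carrying global state along.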
