[Bug] DocSum UT test fails on Xeon #2251

@ZePan110

Priority

P1-Stopper

OS type

Ubuntu

Hardware type

Xeon-GNR

Installation method

  • Pull docker images from hub.docker.com
  • Build docker images from source
  • Other
  • N/A

Deploy method

  • Docker
  • Docker Compose
  • Kubernetes Helm Charts
  • Kubernetes GMC
  • Other
  • N/A

Running nodes

Single Node

What's the version?

main branch

Description

curl --silent --write-out 'HTTPSTATUS:%{http_code}' -X POST -F type=audio -F messages=UklGRigAAABXQVZFZm10IBIAAAABAAEARKwAAIhYAQACABAAAABkYXRhAgAAAAEA -F max_tokens=32 -F language=en -F stream=False -H 'Content-Type: multipart/form-data' 192.168.122.147:8888/v1/docsum
  + HTTP_RESPONSE='Internal Server ErrorHTTPSTATUS:500'
  ++ echo Internal Server ErrorHTTPSTATUS:500
  ++ tr -d '\n'
  ++ sed -e 's/.*HTTPSTATUS://'
  + HTTP_STATUS=500
  ++ echo Internal Server ErrorHTTPSTATUS:500
  ++ sed -e 's/HTTPSTATUS\:.*//g'
  + RESPONSE_BODY='Internal Server Error'
  + docker logs docsum-xeon-backend-server
  [2025-09-09 05:59:15,565] [    INFO] - Base service - CORS is enabled.
  [2025-09-09 05:59:15,565] [    INFO] - Base service - Setting up HTTP server
  [2025-09-09 05:59:15,566] [    INFO] - Base service - Uvicorn server setup on port 8888
  INFO:     Waiting for application startup.
  INFO:     Application startup complete.
  INFO:     Uvicorn running on http://0.0.0.0:8888/ (Press CTRL+C to quit)
  [2025-09-09 05:59:15,587] [    INFO] - Base service - HTTP server setup successful
  ERROR:    Exception in ASGI application
  Traceback (most recent call last):
    File "/usr/local/lib/python3.11/site-packages/urllib3/connectionpool.py", line 534, in _make_request
      response = conn.getresponse()
                 ^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/urllib3/connection.py", line 565, in getresponse
      httplib_response = super().getresponse()
                         ^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/http/client.py", line 1395, in getresponse
      response.begin()
    File "/usr/local/lib/python3.11/http/client.py", line 325, in begin
      version, status, reason = self._read_status()
                                ^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/http/client.py", line 286, in _read_status
      line = str(self.fp.readline(_MAXLINE + 1), "iso-8859-1")
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/socket.py", line 718, in readinto
      return self._sock.recv_into(b)
             ^^^^^^^^^^^^^^^^^^^^^^^
  TimeoutError: timed out
  
  The above exception was the direct cause of the following exception:
  
  Traceback (most recent call last):
    File "/usr/local/lib/python3.11/site-packages/requests/adapters.py", line 644, in send
      resp = conn.urlopen(
             ^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/urllib3/connectionpool.py", line 841, in urlopen
      retries = retries.increment(
                ^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/urllib3/util/retry.py", line 474, in increment
      raise reraise(type(error), error, _stacktrace)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/urllib3/util/util.py", line 39, in reraise
      raise value
    File "/usr/local/lib/python3.11/site-packages/urllib3/connectionpool.py", line 787, in urlopen
      response = self._make_request(
                 ^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/urllib3/connectionpool.py", line 536, in _make_request
      self._raise_timeout(err=e, url=url, timeout_value=read_timeout)
    File "/usr/local/lib/python3.11/site-packages/urllib3/connectionpool.py", line 367, in _raise_timeout
      raise ReadTimeoutError(
  urllib3.exceptions.ReadTimeoutError: HTTPConnectionPool(host='192.168.122.147', port=9000): Read timed out. (read timeout=2000)
  
  During handling of the above exception, another exception occurred:
  
  Traceback (most recent call last):
    File "/usr/local/lib/python3.11/site-packages/uvicorn/protocols/http/h11_impl.py", line 403, in run_asgi
      result = await app(  # type: ignore[func-returns-value]
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/uvicorn/middleware/proxy_headers.py", line 60, in __call__
      return await self.app(scope, receive, send)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in __call__
      await super().__call__(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/applications.py", line 113, in __call__
      await self.middleware_stack(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 186, in __call__
      raise exc
    File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 164, in __call__
      await self.app(scope, receive, _send)
    File "/usr/local/lib/python3.11/site-packages/prometheus_fastapi_instrumentator/middleware.py", line 177, in __call__
      raise exc
    File "/usr/local/lib/python3.11/site-packages/prometheus_fastapi_instrumentator/middleware.py", line 175, in __call__
      await self.app(scope, receive, send_wrapper)
    File "/usr/local/lib/python3.11/site-packages/starlette/middleware/cors.py", line 85, in __call__
      await self.app(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 63, in __call__
      await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
      raise exc
    File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
      await app(scope, receive, sender)
    File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 716, in __call__
      await self.middleware_stack(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 736, in app
      await route.handle(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 290, in handle
      await self.app(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 78, in app
      await wrap_app_handling_exceptions(app, request)(scope, receive, send)
    File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
      raise exc
    File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
      await app(scope, receive, sender)
    File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 75, in app
      response = await f(request)
                 ^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 302, in app
      raw_response = await run_endpoint_function(
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 213, in run_endpoint_function
      return await dependant.call(**values)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/home/user/docsum.py", line 300, in handle_request
      result_dict, runtime_graph = await self.megaservice.schedule(
                                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/home/user/comps/cores/telemetry/opea_telemetry.py", line 61, in wrapper
      res = await func(*args, **kwargs)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/home/user/comps/cores/mega/orchestrator.py", line 151, in schedule
      response, node = await done_task
                       ^^^^^^^^^^^^^^^
    File "/home/user/comps/cores/telemetry/opea_telemetry.py", line 61, in wrapper
      res = await func(*args, **kwargs)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/home/user/comps/cores/mega/orchestrator.py", line 281, in execute
      response = requests.post(
                 ^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/requests/api.py", line 115, in post
      return request("post", url, data=data, json=json, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/requests/api.py", line 59, in request
      return session.request(method=method, url=url, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/requests/sessions.py", line 589, in request
      resp = self.send(prep, **send_kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/requests/sessions.py", line 703, in send
      r = adapter.send(request, **kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.11/site-packages/requests/adapters.py", line 690, in send
      raise ReadTimeout(e, request=request)
  requests.exceptions.ReadTimeout: HTTPConnectionPool(host='192.168.122.147', port=9000): Read timed out. (read timeout=2000)
  + '[' 500 -ne 200 ']'
  + echo '[ docsum-xeon-backend-server ] HTTP status is not 200. Received status was 500'
  + exit 1
  [ docsum-xeon-backend-server ] HTTP status is not 200. Received status was 500
  Error: Process completed with exit code 1.

https://github.com/opea-project/GenAIExamples/actions/runs/17554534827/job/49912971296
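
For triage, it may help to know how large the audio in the failing request actually is. The sketch below decodes the `messages` value from the curl command above and reads its RIFF/WAVE header using only the Python standard library. The function name `describe_wav` is hypothetical and not part of the DocSum code; it assumes a plain RIFF layout with `fmt ` and `data` chunks.

```python
import base64
import struct

# Payload copied verbatim from the `messages` form field of the failing curl call.
PAYLOAD_B64 = "UklGRigAAABXQVZFZm10IBIAAAABAAEARKwAAIhYAQACABAAAABkYXRhAgAAAAEA"


def describe_wav(b64_text: str) -> dict:
    """Decode a base64 WAV payload and summarize its header (hypothetical helper)."""
    raw = base64.b64decode(b64_text)
    riff, _riff_size, wave_tag = struct.unpack_from("<4sI4s", raw, 0)
    if riff != b"RIFF" or wave_tag != b"WAVE":
        raise ValueError("not a RIFF/WAVE payload")

    info = {"total_bytes": len(raw)}
    offset = 12  # first chunk starts right after the 12-byte RIFF header
    while offset + 8 <= len(raw):
        chunk_id, chunk_size = struct.unpack_from("<4sI", raw, offset)
        body = offset + 8
        if chunk_id == b"fmt ":
            fmt, channels, rate, _byte_rate, _align, bits = struct.unpack_from(
                "<HHIIHH", raw, body
            )
            info.update(
                format_tag=fmt,
                channels=channels,
                sample_rate=rate,
                bits_per_sample=bits,
            )
        elif chunk_id == b"data":
            info["data_bytes"] = chunk_size
        # RIFF chunks are padded to an even number of bytes.
        offset = body + chunk_size + (chunk_size & 1)
    return info


if __name__ == "__main__":
    print(describe_wav(PAYLOAD_B64))
```

Read this way, the payload is a 48-byte mono 16-bit PCM file at 44100 Hz carrying 2 bytes of audio data, so input size alone seems unlikely to explain the 2000-second read timeout against port 9000.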

Reproduce steps

cd DocSum/tests
bash test_compose_on_xeon.sh
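
When reproducing outside CI, the status check seen in the trace above (curl's `--write-out 'HTTPSTATUS:%{http_code}'` followed by two `sed` calls) can be mirrored in a few lines of Python. This is only a sketch of that check, not the test script itself; `split_curl_output` is a hypothetical name, and it assumes the marker appears once at the end of curl's output.

```python
MARKER = "HTTPSTATUS:"


def split_curl_output(output: str) -> tuple[str, int]:
    """Split curl output of the form '<body>HTTPSTATUS:<code>' into (body, code)."""
    # rpartition splits on the last marker, like the greedy `.*HTTPSTATUS:` in sed.
    body, sep, status = output.rpartition(MARKER)
    if not sep:
        raise ValueError(f"marker {MARKER!r} not found in curl output")
    # The script strips newlines with `tr -d '\n'`; strip() covers the same case here.
    return body, int(status.strip())


if __name__ == "__main__":
    body, status = split_curl_output("Internal Server ErrorHTTPSTATUS:500")
    if status != 200:
        print(f"HTTP status is not 200. Received status was {status}: {body}")
```

Keeping the body alongside the status makes it easier to tell a gateway-level 500 like this one apart from an error message returned by a downstream service.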

Raw log

Attachments

No response

Labels

bug: Something isn't working
