Skip to content

[Bug]: can not work with llama.cpp #832

@calvin2021y

Description

@calvin2021y

Prerequisites

  • I have searched existing issues and discussions to avoid duplicates
  • I am using the latest version (or have tested against main/nightly)

Description

{
  "providers": [
    {
      "name": "a",
      "keys": [
        
      ],
      "network_config": {
        "base_url": "http://127.0.0.1:8081/v1",
        "default_request_timeout_in_seconds": 300,
        "max_retries": 0,
        "retry_backoff_initial": 500,
        "retry_backoff_max": 5000
      },
      "concurrency_and_buffer_size": {
        "concurrency": 1000,
        "buffer_size": 5000
      },
      "proxy_config": null,
      "send_back_raw_response": false,
      "custom_provider_config": {
        "is_key_less": true,
        "base_provider_type": "openai",
        "allowed_requests": {
          "list_models": true,
          "text_completion": true,
          "text_completion_stream": true,
          "chat_completion": true,
          "chat_completion_stream": true,
          "responses": true,
          "responses_stream": true,
          "embedding": true,
          "speech": true,
          "speech_stream": true,
          "transcription": true,
          "transcription_stream": true
        }
      },
      "status": "active"
    }
  ],
  "total": 1
}

Steps to reproduce

curl -X POST http://localhost:8080/v1/chat/completions -v
-H "Content-Type: application/json"
-d '{
"model": "a/a",
"messages": [{"role": "user", "content": "Hello, Bifrost!"}]
}'

Expected behavior

??

Actual behavior

??

Affected area(s)

Core (Go)

Version

the new onw

Environment

macos

try with  "base_url": "http://127.0.0.1:8081/v1", and  "base_url": "http://127.0.0.1:8081", none work.

both result int request: POST /v1/v1/chat/completions 127.0.0.1 404


## The bug reports template this project is very very very bad. ##

Relevant logs/output

{"is_bifrost_error":false,"status_code":404,"error":{"type":"not_found_error","code":"","message":"File Not Found"},"extra_fields":{"provider":"a","model_requested":"a","request_type":"chat_completion"}}
* Connection #0 to host localhost left intact

Regression?

No response

Severity

Low (minor issue or cosmetic)

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions