Adding swagger for EigenAI API #216
Conversation
enum: [system, user, assistant, tool]
content:
  type: string
@mpjunior92 can you comment on how the disable_auto_reasoning_format parameter works / should be documented here?
Also, how is this parameter specified by a non-curl client?
disable_auto_reasoning_format is used to control response parsing. Here is how it is used:

let reasoning_format = if opts.disable_auto_reasoning_format {
    0
} else {
    1
};

If disable_auto_reasoning_format is true, then reasoning_format is set to 0; if it is false, then reasoning_format is set to 1. reasoning_format is a llama.cpp enum:
// reasoning API response format (not to be confused as chat template's reasoning format)
enum common_reasoning_format {
    COMMON_REASONING_FORMAT_NONE,
    COMMON_REASONING_FORMAT_AUTO,            // Same as deepseek, using `message.reasoning_content`
    COMMON_REASONING_FORMAT_DEEPSEEK_LEGACY, // Extract thinking tag contents and return as `message.reasoning_content`, or leave inline in <think> tags in stream mode
    COMMON_REASONING_FORMAT_DEEPSEEK,        // Extract thinking tag contents and return as `message.reasoning_content`, including in streaming deltas.
    // do not extend this enum unless you absolutely have to
    // in most cases, use COMMON_REASONING_FORMAT_AUTO
    // see: https://github.com/ggml-org/llama.cpp/pull/15408
};

TL;DR: this parameter enables/disables reasoning parsing at the llama.cpp level.
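To make the two settings concrete, here is a hypothetical illustration (not actual llama.cpp code or output) of what they mean for the message a client gets back. The `message.reasoning_content` field name and the inline `<think>` tags come from the enum comments above; the example strings and exact response shape are assumptions for the sketch.

```python
import re

# reasoning_format = 1 (COMMON_REASONING_FORMAT_AUTO): the server extracts the
# thinking-tag contents and returns them in a separate `reasoning_content` field.
parsed = {
    "reasoning_content": "First, recall that a transformer predicts tokens ...",
    "content": "An LLM works by predicting the next token ...",
}

# reasoning_format = 0 (COMMON_REASONING_FORMAT_NONE): no extraction; the
# reasoning stays inline in the content, wrapped in <think> tags.
raw = {
    "content": "<think>First, recall that a transformer predicts tokens ...</think>"
               "An LLM works by predicting the next token ...",
}

def visible_text(message):
    """Strip an inline <think> block, if present, leaving only the answer text."""
    return re.sub(r"<think>.*?</think>", "", message["content"], flags=re.DOTALL)

# Both shapes carry the same answer; only where the reasoning lives differs.
assert visible_text(raw) == parsed["content"]
```

With parsing disabled, a client that wants only the answer has to strip the `<think>` block itself, as `visible_text` does here.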
@NimaVaziri curl is, among other things, an HTTP client, so setting this field works the same way across any HTTP client: set the field in the request body using your language's syntax and tooling:
// go
// Request body as a Go struct
body := map[string]interface{}{
	"model": "gpt-oss-20b-f16",
	"max_tokens": 500,
	"messages": []map[string]string{
		{"role": "user", "content": "Explain how LLM works"},
	},
	"disable_auto_reasoning_format": false,
}

// rust
let body = json!({
    "model": "gpt-oss-20b-f16",
    "max_tokens": 500,
    "messages": [
        {"role": "user", "content": "Explain how LLM works"}
    ],
    "disable_auto_reasoning_format": false
});

Note: this is a custom parameter.
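A Python variant of the same idea, for completeness. The endpoint path and `x-eigenai-api-key` header mirror other examples in this thread; the `requests` call is left commented out since it needs a running server, and the URL is an assumption rather than a documented endpoint.

```python
import json

# Request body as a plain dict; the custom field serializes like any other
# top-level key in the JSON body.
body = {
    "model": "gpt-oss-20b-f16",
    "max_tokens": 500,
    "messages": [
        {"role": "user", "content": "Explain how LLM works"},
    ],
    "disable_auto_reasoning_format": False,
}

payload = json.dumps(body)
assert "disable_auto_reasoning_format" in payload

# Sending it (uncomment with a server running locally):
# import requests
# resp = requests.post(
#     "http://localhost:8000/v1/chat/completions",
#     json=body,
#     headers={"x-eigenai-api-key": "sk-dummy-key"},
# )
```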
@mpjunior92 Right, I was more so asking what the implication of this parameter is for other clients like the OpenAI client, the AI SDK client, etc - how can it be set (if it can be set)?
Each SDK may have its own way of setting this value, so users should check the docs for their respective SDK. Here is an example for the OpenAI SDK using extra_body:
from openai import OpenAI
client = OpenAI(
api_key="unused-but-required", # still required by SDK, but ignored by your server
base_url="http://localhost:8000/v1"
)
response = client.chat.completions.create(
model="gpt-oss-20b-f16",
max_tokens=500,
messages=[
{"role": "user", "content": "Explain how LLM works"}
],
extra_body={
"disable_auto_reasoning_format": True
},
extra_headers={
"x-eigenai-api-key": "sk-dummy-key"
}
)
print(response.choices[0].message.content)
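Note that the example above sets disable_auto_reasoning_format=True, which leaves the reasoning inline. When it is left False (the AUTO path), the parsed-out reasoning should come back on the message's `reasoning_content` field, per the llama.cpp enum comments quoted earlier. A hedged sketch of reading it; the attribute-access pattern is an assumption about the SDK's response object, and the SimpleNamespace is a stand-in so the sketch runs without a live server.

```python
from types import SimpleNamespace

# Stand-in for response.choices[0].message from the SDK call above.
message = SimpleNamespace(
    reasoning_content="First, recall that a transformer predicts tokens ...",
    content="An LLM works by predicting the next token ...",
)

# reasoning_content is only present when the server parsed the reasoning out,
# so read it defensively.
reasoning = getattr(message, "reasoning_content", None)
if reasoning is not None:
    print("reasoning:", reasoning)
print("answer:", message.content)
```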
@MadelineAu let's make sure we include this as part of the docs
Taking a look at this and have a couple of questions @NimaVaziri @mpjunior92
The disable_auto_reasoning_format parameter isn't currently included for the EigenAI API - was it missed? Or has it been added recently?
We don't currently have any concept material that contextualizes what 'parsing at llama.cpp level' means - given the target audience, can I assume they would understand this statement? Or find an explanation to link out to?
This is a custom parameter - custom to us? Or custom as in not part of the OpenAI spec?
Added recently.
"parsing at llama.cpp level" we don't need to include this phrasing. All we need to say is that "this parameter is used to control response parsing and separating out the reasoning from the content of the response".
It's not a custom parameter per se; it's a parameter at the llama.cpp level which we expose higher up. But in the context of a client call, you could call it a custom parameter. Hopefully soon we can have a migration path where the parsed output becomes the default behavior and we can deprecate the parameter entirely.
static/openapi.yaml
Outdated
disable_auto_reasoning_format:
  type: boolean
  description: >
    Controls response parsing and separating out of the reasoning from the content of the response. For client calls, this is a custom parameter. Refer to the relevant client SDK documentation for information on how to set this parameter.
Suggested change:

Controls response parsing and separating out the reasoning trace from the content of the response. For client calls, this is a custom parameter. For example, in the OpenAI client, it can be set in the `extra_body` field. Refer to the relevant client SDK documentation for information on how to set this parameter.