Anthropic Messages [BETA]

Create a messages response. This endpoint is compatible with the Anthropic API.

post

Create a messages response. This endpoint is compatible with the Anthropic API.

Authorizations
AuthorizationstringRequired
Bearer authentication header of the form Bearer <token>.
Body

Request message for /v1/messages

preferencestring · enumOptional

The provider preference for handling the request.

Possible values:
allowed_providersstring[] · nullableOptional

The providers that are allowed to be used for the completion.

Example: ["mistral","scaleway"]
eu_nativeboolean · nullableOptional

Whether to consider only providers based and regulated withing the EU. Even when false, all our endpoints are GDPR compliant.

Example: false
allow_quantizationboolean · nullableOptional

Whether to allow quantized endpoints.

Example: true
allow_zero_data_retentionboolean · nullableOptional

Whether to use only ZDR providers.

enable_model_fallbackboolean · nullableOptional

Whether to allow model fallback when a model is currently unavailable.

max_tokensinteger · int32Optional

The maximum number of tokens to generate before stopping. The model may stop before the max_tokens when it reaches the stop sequence.

metadataone ofOptional
nullOptional
or
modelstringOptional

Model name for the model to use.

Example: latest
stop_sequencesstring[] · nullableOptional

(Not supported by reasoning models) Up to 4 sequences where the API will stop generating further tokens.

streamboolean · nullableOptional

If set, partial message deltas will be sent. Tokens will be sent as data-only server-sent events as they become available, with the stream terminated by a data: [DONE] message.

systemone ofOptional
nullOptional
or
stringOptional

Text content of system prompt.

or
temperaturenumber · max: 2 · nullableOptional

What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. It may not work well with reasoning models.

Default: 1
tool_choiceone ofOptional
nullOptional
or
or
or
top_kinteger · nullableOptional

(Unsupported) When generating next tokens, randomly selecting the next token from the k most likely options.

top_pnumber · max: 1 · nullableOptional

An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered. It is generally recommended to alter this or temperature but not both.

Default: 1
Responses
post
/messages

Last updated