Anthropic Messages

⚠️ Du nutzt unsere API in einem Dedicated Deployment? Ersetze einfach api.langdock.com durch die Base URL deines Deployments: <deployment-url>/api/public

Erstellt eine Modellantwort für die gegebene Chat-Konversation. Dieser Endpunkt folgt der Anthropic API-Spezifikation und die Anfragen werden an den AWS Bedrock Anthropic-Endpunkt gesendet.

Um die API zu nutzen, benötigst du einen API-Schlüssel. Administratoren können API-Schlüssel in den Einstellungen erstellen.

Alle Parameter vom Anthropic “Create a message” Endpunkt werden gemäß den Anthropic-Spezifikationen unterstützt, mit folgender Ausnahme:

model: Um zu sehen, welche Modelle für deinen Workspace verfügbar sind, frage den Models-Endpoint ab: GET /anthropic/{region}/v1/models. Die Liste der verfügbaren Modelle kann abweichen, wenn du deine eigenen API-Schlüssel in Langdock verwendest (“Bring-your-own-keys / BYOK”, siehe hier für Details).

Rate Limits

Die Rate Limit für den Messages-Endpunkt beträgt 500 RPM (Anfragen pro Minute) und 60,000 TPM (Token pro Minute). Rate Limits werden auf Workspace-Ebene definiert - und nicht auf API-Schlüssel-Ebene. Jedes Modell hat seine eigene Rate Limit. Wenn du deine Rate Limit überschreitest, erhältst du eine 429 Too Many Requests Antwort. Bitte beachte, dass die Rate Limits Änderungen unterliegen können. Beziehe dich auf diese Dokumentation für die aktuellsten Informationen. Falls du eine höhere Rate Limit benötigst, kontaktiere uns bitte unter support@langdock.com.

Verwendung von Anthropic-kompatiblen Bibliotheken

Da das Anfrage- und Antwortformat dasselbe wie bei der Anthropic-API ist, kannst du beliebte Bibliotheken wie die Anthropic Python-Bibliothek oder das Vercel AI SDK verwenden, um die Langdock-API zu nutzen.

Beispiel mit der Anthropic Python-Bibliothek

from anthropic import Anthropic
client = Anthropic(
  base_url="https://api.langdock.com/anthropic/eu/",
  api_key="<YOUR_LANGDOCK_API_KEY>"
)

message = client.messages.create(
	model="claude-sonnet-4-6-default",
	messages=[
			{ "role": "user", "content": "Write a haiku about cats" }
	],
	max_tokens=1024,
)

print(message.content[0].text)

Beispiel mit dem Vercel AI SDK in Node.js

import { generateText } from "ai";
import { createAnthropic } from "@ai-sdk/anthropic";

const langdockProvider = createAnthropic({
  baseURL: "https://api.langdock.com/anthropic/eu/v1",
  apiKey: "<YOUR_LANGDOCK_API_KEY>",
});

const result = await generateText({
  model: langdockProvider("claude-sonnet-4-6-default"),
  prompt: "Write a haiku about cats",
});

console.log(result.text);

Langdock blockiert bewusst Browser-basierte Anfragen, um deinen API-Schlüssel zu schützen und die Sicherheit deiner Anwendungen zu gewährleisten. Weitere Informationen findest du in unserem Guide zu Best Practices für API-Schlüssel.

Autorisierungen

Authorization

string

header

erforderlich

API key as Bearer token. Format "Bearer YOUR_API_KEY"

Pfadparameter

region

enum<string>

erforderlich

The region of the API to use.

Verfügbare Optionen:

eu,

us

Body

application/json

model

string

erforderlich

The model that will complete your prompt. See models for additional details and options.

messages

InputMessage · object[]

erforderlich

Input messages.

Anthropic's models are trained to operate on alternating user and agent conversational turns. When creating a new Message, you specify the prior conversational turns with the messages parameter, and the model then generates the next Message in the conversation.

Each input message must be an object with a role and content. You can specify a single user-role message, or you can include multiple user and assistant messages. The first message must always use the user role.

If the final message uses the assistant role, the response content will continue immediately from the content in that message. This can be used to constrain part of the model's response.

Example with a single user message:

[{"role": "user", "content": "Hello, Claude"}]

Example with multiple conversational turns:

[
  {"role": "user", "content": "Hello there."},
  {"role": "assistant", "content": "Hi, I'm Claude. How can I help you?"},
  {"role": "user", "content": "Can you explain LLMs in plain English?"},
]

Example with a partially-filled response from Claude:

[
  {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
  {"role": "assistant", "content": "The best answer is ("},
]

Each input message content may be either a single string or an array of content blocks, where each block has a specific type. Using a string for content is shorthand for an array of one content block of type "text". The following input messages are equivalent:

{"role": "user", "content": "Hello, Claude"}

{"role": "user", "content": [{"type": "text", "text": "Hello, Claude"}]}

Starting with Claude 3 models, you can also send image content blocks:

{"role": "user", "content": [
  {
    "type": "image",
    "source": {
      "type": "base64",
      "media_type": "image/jpeg",
      "data": "/9j/4AAQSkZJRg...",
    }
  },
  {"type": "text", "text": "What is in this image?"}
]}

We currently support the base64 source type for images, and the image/jpeg, image/png, image/gif, and image/webp media types.

See examples for more input examples.

Note that if you want to include a system prompt, you can use the top-level system parameter â€" there is no "system" role for input messages in the Messages API.

Anzeigen untergeordnete attribute

max_tokens

integer

erforderlich

The maximum number of tokens to generate before stopping.

Note that Anthropic's models may stop before reaching this maximum. This parameter only specifies the absolute maximum number of tokens to generate.

Different models have different maximum values for this parameter. See models for details.

Erforderlicher Bereich: x >= 1

Beispiel:

[1024]

stop_sequences

string[]

Custom text sequences that will cause the model to stop generating.

Anthropic's models will normally stop when they have naturally completed their turn, which will result in a response stop_reason of "end_turn".

If you want the model to stop generating when it encounters custom strings of text, you can use the stop_sequences parameter. If the model encounters one of the custom sequences, the response stop_reason value will be "stop_sequence" and the response stop_sequence value will contain the matched stop sequence.

stream

boolean

Whether to incrementally stream the response using server-sent events.

See streaming for details.

system

System prompt.

A system prompt is a way of providing context and instructions to Claude, such as specifying a particular goal or role. See Anthropic's guide to system prompts.

Beispiel:

[
  [
    {
      "text": "Today's date is 2024-06-01.",
      "type": "text"
    }
  ],
  "Today's date is 2023-01-01."
]

temperature

number

Amount of randomness injected into the response.

Defaults to 1.0. Ranges from 0.0 to 1.0. Use temperature closer to 0.0 for analytical / multiple choice, and closer to 1.0 for creative and generative tasks.

Note that even with temperature of 0.0, the results will not be fully deterministic.

Erforderlicher Bereich: 0 <= x <= 1

Beispiel:

[1]

tool_choice

ToolChoiceAuto · object

The model will automatically decide whether to use tools.

ToolChoiceAuto
ToolChoiceAny
ToolChoiceTool

Anzeigen untergeordnete attribute

tools

Tool · object[]

Definitions of tools that the model may use.

If you include tools in your API request, the model may return tool_use content blocks that represent the model's use of those tools. You can then run those tools using the tool input generated by the model and then optionally return results back to the model using tool_result content blocks.

Each tool definition includes:

name: Name of the tool.
description: Optional, but strongly-recommended description of the tool.
input_schema: JSON schema for the tool input shape that the model will produce in tool_use output content blocks.

For example, if you defined tools as:

[
  {
    "name": "get_stock_price",
    "description": "Get the current stock price for a given ticker symbol.",
    "input_schema": {
      "type": "object",
      "properties": {
        "ticker": {
          "type": "string",
          "description": "The stock ticker symbol, e.g. AAPL for Apple Inc."
        }
      },
      "required": ["ticker"]
    }
  }
]

And then asked the model "What's the S&P 500 at today?", the model might produce tool_use content blocks in the response like this:

[
  {
    "type": "tool_use",
    "id": "toolu_01D7FLrfh4GYq7yT1ULFeyMV",
    "name": "get_stock_price",
    "input": { "ticker": "^GSPC" }
  }
]

You might then run your get_stock_price tool with {"ticker": "^GSPC"} as an input, and return the following back to the model in a subsequent user message:

[
  {
    "type": "tool_result",
    "tool_use_id": "toolu_01D7FLrfh4GYq7yT1ULFeyMV",
    "content": "259.75 USD"
  }
]

Tools can be used for workflows that include running client-side tools and functions, or more generally whenever you want the model to produce a particular JSON structure of output.

See Anthropic's guide for more details.

Anzeigen untergeordnete attribute

Beispiel:

[
  {
    "description": "Get the current weather in a given location",
    "input_schema": {
      "properties": {
        "location": {
          "description": "The city and state, e.g. San Francisco, CA",
          "type": "string"
        },
        "unit": {
          "description": "Unit for the output - one of (celsius, fahrenheit)",
          "type": "string"
        }
      },
      "required": ["location"],
      "type": "object"
    },
    "name": "get_weather"
  }
]

top_k

integer

Only sample from the top K options for each subsequent token.

Used to remove "long tail" low probability responses. Learn more technical details here.

Recommended for advanced use cases only. You usually only need to use temperature.

Erforderlicher Bereich: x >= 0

Beispiel:

[5]

top_p

number

Use nucleus sampling.

In nucleus sampling, we compute the cumulative distribution over all the options for each subsequent token in decreasing probability order and cut it off once it reaches a particular probability specified by top_p. You should either alter temperature or top_p, but not both.

Recommended for advanced use cases only. You usually only need to use temperature.

Erforderlicher Bereich: 0 <= x <= 1

Beispiel:

[0.7]

thinking

Thinking · object

Configuration for enabling Claude's extended thinking. When enabled, responses include thinking content blocks showing Claude's reasoning process before the final answer.

See extended thinking for details.

Anzeigen untergeordnete attribute

Antwort

Message object.

string

erforderlich

Unique object identifier.

The format and length of IDs may change over time.

Beispiel:

["msg_013Zva2CMHLNnXjNJJKqJ2EF"]

type

enum<string>

Standard:message

erforderlich

Object type.

For Messages, this is always "message".

Verfügbare Optionen:

message

role

enum<string>

Standard:assistant

erforderlich

Conversational role of the generated message.

This will always be "assistant".

Verfügbare Optionen:

agent

content

(Text · object | Tool Use · object | object)[]

erforderlich

Content generated by the model.

This is an array of content blocks, each of which has a type that determines its shape.

Example:

[{"type": "text", "text": "Hi, I'm Claude."}]

If the request input messages ended with an agent turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.

For example, if the input messages were:

[
  {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
  {"role": "assistant", "content": "The best answer is ("}
]

Then the response content might be:

[{"type": "text", "text": "B)"}]

Text
Tool Use
Option 3

Anzeigen untergeordnete attribute

Beispiel:

[
  [
    {
      "text": "Hi! My name is Claude.",
      "type": "text"
    }
  ]
]

model

string

erforderlich

The model that will complete your prompt. See models for additional details and options.

stop_reason

enum<string>

erforderlich

The reason that we stopped.

This may be one the following values:

"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools

In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.

Verfügbare Optionen:

end_turn,

max_tokens,

stop_sequence,

tool_use

stop_sequence

string

erforderlich

Which custom stop sequence was generated, if any.

This value will be a non-null string if one of your custom stop sequences was generated.

usage

Usage · object

erforderlich

Input and output token counts, representing the underlying cost to our systems.

Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.

For example, output_tokens will be non-zero, even for an empty string response from Claude.

Anzeigen untergeordnete attribute

Beispiel:

[
  {
    "input_tokens": 2095,
    "output_tokens": 503
  }
]

Überblick

Completion API

Embedding API

Agents API

Assistants API (Deprecating)

Integrations API

Knowledge Folder API

Usage Export API

User Management API

Audit Logs API

Anthropic Messages

Rate Limits

Verwendung von Anthropic-kompatiblen Bibliotheken

Beispiel mit der Anthropic Python-Bibliothek

Beispiel mit dem Vercel AI SDK in Node.js

Autorisierungen

Pfadparameter

Body

Antwort

Überblick

Completion API

Embedding API

Agents API

Assistants API (Deprecating)

Integrations API

Knowledge Folder API

Usage Export API

User Management API

Audit Logs API

​Rate Limits

​Verwendung von Anthropic-kompatiblen Bibliotheken

​Beispiel mit der Anthropic Python-Bibliothek

​Beispiel mit dem Vercel AI SDK in Node.js

Autorisierungen

Pfadparameter

Body

Antwort

Rate Limits

Verwendung von Anthropic-kompatiblen Bibliotheken

Beispiel mit der Anthropic Python-Bibliothek

Beispiel mit dem Vercel AI SDK in Node.js