
# Assistants Completions API

> Creates a model response for a given Assistant.

<Warning>
  **The Assistants API will be deprecated on 16 August.**

  For new systems, we recommend using the [Agents API](/en/developer/agents-api/agent). The Agents API provides native Vercel AI SDK compatibility and removes custom transformations.

  See the [migration guide](/en/developer/assistants-api/assistant-to-agent-migration) to learn about the differences.
</Warning>

Creates a model response for an existing assistant, identified by its ID, or for a temporary assistant whose configuration you pass in the request.

<Info>
  To share an assistant with an API key, follow [this guide](/en/developer/assistants-api/assistant-api-guide).
</Info>

## Request Parameters

| Parameter     | Type    | Required                              | Description                                    |
| ------------- | ------- | ------------------------------------- | ---------------------------------------------- |
| `assistantId` | string  | One of assistantId/assistant required | ID of an existing assistant to use             |
| `assistant`   | object  | One of assistantId/assistant required | Configuration for a new assistant              |
| `messages`    | array   | Yes                                   | Array of message objects with role and content |
| `stream`      | boolean | No                                    | Enable streaming responses (default: false)    |
| `output`      | object  | No                                    | Structured output format specification         |
| `maxSteps`    | integer | No                                    | Maximum number of steps (1-20, default: 10)    |
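
The table implies that a request carries exactly one of `assistantId` and `assistant`. A minimal sketch of a builder that enforces this before the request is sent; `buildRequestBody` is a hypothetical helper, not part of any Langdock SDK, and the client-side checks are an assumption about what is useful to validate locally.

```javascript
// Hypothetical helper: builds a request body for the completions endpoint.
// It enforces the "one of assistantId/assistant" rule from the table above.
function buildRequestBody({ assistantId, assistant, messages, stream = false, output }) {
  const hasId = typeof assistantId === "string" && assistantId.length > 0;
  const hasConfig = assistant !== null && typeof assistant === "object";
  if (hasId === hasConfig) {
    throw new Error("Provide exactly one of assistantId or assistant");
  }
  if (!Array.isArray(messages)) {
    throw new Error("messages must be an array");
  }

  const body = hasId
    ? { assistantId, messages, stream }
    : { assistant, messages, stream };
  if (output !== undefined) body.output = output; // optional structured output spec
  return body;
}

const exampleBody = buildRequestBody({
  assistantId: "asst_123",
  messages: [{ role: "user", content: "Hello" }],
});
console.log(JSON.stringify(exampleBody));
```

The returned object can be passed as the request body to `axios.post` or serialized with `JSON.stringify` for `fetch`.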

### Message Format

Each message in the `messages` array should contain:

* `role` (required) - One of: "user", "assistant", or "tool"
* `content` (required) - The message content as a string
* `attachmentIds` (optional) - Array of UUID strings identifying attachments for this message
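
As an illustration of the message shape described above, the following sketch validates and builds a single entry of the `messages` array. `buildMessage` is a hypothetical helper, and the UUID pattern is a generic format check assumed for this example rather than the server's own validation.

```javascript
// Hypothetical helper: validates and builds one entry of the `messages` array.
const ALLOWED_ROLES = ["user", "assistant", "tool"];
// Generic 8-4-4-4-12 hex pattern; it does not check the UUID version.
const UUID_PATTERN = /^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$/i;

function buildMessage(role, content, attachmentIds) {
  if (!ALLOWED_ROLES.includes(role)) {
    throw new Error(`Invalid role: ${role}`);
  }
  if (typeof content !== "string") {
    throw new Error("content must be a string");
  }
  const message = { role, content };
  if (attachmentIds !== undefined) {
    const valid =
      Array.isArray(attachmentIds) && attachmentIds.every((id) => UUID_PATTERN.test(id));
    if (!valid) {
      throw new Error("attachmentIds must be an array of UUID strings");
    }
    message.attachmentIds = attachmentIds;
  }
  return message;
}

console.log(
  buildMessage("user", "Summarize this file", ["550e8400-e29b-41d4-a716-446655440000"])
);
```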

### Assistant Configuration

When creating a temporary assistant, you can specify:

* `name` (required) - Name of the assistant (max 64 chars)
* `instructions` (required) - System instructions (max 16384 chars)
* `description` - Optional description (max 256 chars)
* `temperature` - Sampling temperature between 0 and 1
* `model` - Model ID to use (see [Available Models](/en/developer/assistants-api/assistant-models) for options)
* `capabilities` - Enable features like web search and image generation
* `actions` - Custom API integrations
* `vectorDb` - Vector database connections
* `knowledgeFolderIds` - IDs of Knowledge bases to use
* `attachmentIds` - Array of UUID strings identifying attachments to use
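
The limits listed above can be checked locally before a request is sent. The sketch below is a hypothetical client-side validator (`validateAssistantConfig` is not part of the API). It covers only the string lengths and the temperature range, and it measures length with JavaScript's `String.length`, which may count some characters differently than the server does.

```javascript
// Hypothetical client-side check mirroring the limits listed above.
function validateAssistantConfig(config) {
  const errors = [];
  const checkString = (field, max, required) => {
    const value = config[field];
    if (value === undefined) {
      if (required) errors.push(`${field} is required`);
      return;
    }
    if (typeof value !== "string") {
      errors.push(`${field} must be a string`);
    } else if (value.length > max) {
      errors.push(`${field} exceeds ${max} characters`);
    }
  };

  checkString("name", 64, true);
  checkString("instructions", 16384, true);
  checkString("description", 256, false);

  const t = config.temperature;
  if (t !== undefined && (typeof t !== "number" || t < 0 || t > 1)) {
    errors.push("temperature must be a number between 0 and 1");
  }
  return errors; // an empty array means the config passed these checks
}

console.log(
  validateAssistantConfig({
    name: "Document Analyzer",
    instructions: "You analyze documents",
    temperature: 0.7,
  })
); // prints []
```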

<Info>
  You can retrieve a list of available models using the [Models API](/en/developer/assistants-api/assistant-models). This is useful when you want to see which models you can use in your assistant configuration.
</Info>

## Using Tools via API

When an assistant has tools configured (called "Actions" in the Langdock UI), it will automatically use them to respond to API requests when appropriate.

The connection must be set to "preselected connection" (shared with other users) for tool authentication to work.

<Frame>
  <img src="https://mintcdn.com/langdock-34/I2XuDU3TEaQ5DrB6/images/preselectedConnectionEng.png?fit=max&auto=format&n=I2XuDU3TEaQ5DrB6&q=85&s=8d848d7a44f93079d79a9f0d7d7f418f" alt="Preselected connection setting in assistant configuration" width="3840" height="2160" data-path="images/preselectedConnectionEng.png" />
</Frame>

<Warning>
  Tools with **"Require human confirmation"** enabled do not work via API—they require manual approval in the Langdock UI. To use a tool via API, disable this setting in the assistant configuration.
</Warning>

## Structured Output

You can specify a structured output format using the optional `output` parameter:

| Field    | Type                          | Description                                                    |
| -------- | ----------------------------- | -------------------------------------------------------------- |
| `type`   | "object" \| "array" \| "enum" | The type of structured output                                  |
| `schema` | object                        | JSON Schema definition for the output (for object/array types) |
| `enum`   | string\[]                     | Array of allowed values (for enum type)                        |

The `output` parameter behavior depends on the specified type:

* `type: "object"` with no schema: Forces the response to be a single JSON object (no specific structure)
* `type: "object"` with schema: Forces the response to match the provided JSON Schema
* `type: "array"` with schema: Forces the response to be an array of objects matching the provided schema
* `type: "enum"`: Forces the response to be one of the values specified in the `enum` array
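
The combinations above can be captured in a small builder. This is a sketch only: `buildOutputSpec` is a hypothetical helper, and rejecting an empty `enum` array is an assumption made for this example rather than documented API behavior.

```javascript
// Hypothetical helper: builds the `output` parameter for each supported type.
function buildOutputSpec(type, options = {}) {
  if (type === "enum") {
    const values = options.enum;
    const valid =
      Array.isArray(values) &&
      values.length > 0 &&
      values.every((v) => typeof v === "string");
    if (!valid) throw new Error("enum output needs a non-empty array of strings");
    return { type, enum: values };
  }
  if (type === "object" || type === "array") {
    // Without a schema, no specific structure is enforced.
    // For "array", the schema describes a single item of the array.
    return options.schema ? { type, schema: options.schema } : { type };
  }
  throw new Error(`Unsupported output type: ${type}`);
}

console.log(buildOutputSpec("enum", { enum: ["positive", "neutral", "negative"] }));
console.log(buildOutputSpec("object"));
console.log(
  buildOutputSpec("array", {
    schema: { type: "object", properties: { city: { type: "string" } } },
  })
);
```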

<Info>
  You can use tools like [easy-json-schema](https://easy-json-schema.github.io/) to generate JSON Schemas from example JSON objects.
</Info>

## Streaming Responses

When `stream` is set to `true`, the API will return a stream of server-sent events (SSE) instead of waiting for the complete response. This allows you to display responses to users progressively as they are generated.

<Warning>
  Non-streaming requests are terminated with an HTTP 524 error after 100 seconds. If your assistant runs tools, generates long responses, or uses slower models, requests can exceed this limit. Set `stream: true` to keep the connection open and avoid timeouts.
</Warning>

### Stream Format

Each event in the stream follows the SSE format with JSON data:

```
data: {"type":"message","content":"Hello"}
data: {"type":"message","content":" world"}
data: {"type":"done"}
```
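
To show how these events combine into a full reply, the sketch below parses a complete block of SSE text and concatenates the `content` of each `message` event until `done` arrives. It assumes only the two event shapes shown above; `collectStreamText` is a hypothetical helper name.

```javascript
// Sketch: turn raw SSE text into the concatenated message content.
// Assumes the event shapes shown above ("message" with `content`, then "done").
function collectStreamText(rawSse) {
  let text = "";
  for (const line of rawSse.split(/\r?\n/)) {
    if (!line.startsWith("data: ")) continue; // skip blank lines and other SSE fields
    const event = JSON.parse(line.slice("data: ".length));
    if (event.type === "done") break;
    if (event.type === "message") text += event.content;
  }
  return text;
}

const sampleStream = [
  'data: {"type":"message","content":"Hello"}',
  'data: {"type":"message","content":" world"}',
  'data: {"type":"done"}',
].join("\n");

console.log(collectStreamText(sampleStream)); // prints: Hello world
```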

### Handling Streams in JavaScript

```javascript theme={null}
const response = await fetch('https://api.langdock.com/assistant/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_API_KEY',
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    assistantId: 'asst_123',
    messages: [{ role: 'user', content: 'Hello' }],
    stream: true
  }),
});

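if (!response.ok) {
  throw new Error(`Request failed with status ${response.status}`);
}

const reader = response.body.getReader();
const decoder = new TextDecoder();
let buffer = ''; // Holds a partial line until the rest of it arrives

while (true) {
  const { done, value } = await reader.read();
  if (done) break;

  // { stream: true } keeps multi-byte characters intact across chunk boundaries
  buffer += decoder.decode(value, { stream: true });
  const lines = buffer.split('\n');
  buffer = lines.pop(); // The last element may be an incomplete line

  for (const line of lines) {
    if (line.startsWith('data: ')) {
      const data = JSON.parse(line.slice(6));
      if (data.type === 'message') {
        process.stdout.write(data.content); // Node.js; in other runtimes, append to your UI
      }
    }
  }
}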
```

## Obtaining Attachment IDs

To use attachments in your assistant conversations, you first need to upload the files using the [Upload Attachment API](/en/developer/assistants-api/upload-attachments). This will return an `attachmentId` for each file, which you can then include in the `attachmentIds` array in your assistant or message configuration.

## Examples

### Using an Existing Assistant

```javascript theme={null}
const axios = require("axios");

async function chatWithAssistant() {
  const response = await axios.post(
    "https://api.langdock.com/assistant/v1/chat/completions",
    {
      assistantId: "asst_123",
      messages: [
        {
          role: "user",
          content: "Can you analyze this document for me?",
          attachmentIds: ["550e8400-e29b-41d4-a716-446655440000"], // Obtain attachmentIds from upload attachment endpoint
        },
      ],
      stream: false, // Non-streaming: the complete JSON response is read from response.data below
    },
    {
      headers: {
        Authorization: "Bearer YOUR_API_KEY",
      },
    }
  );

  console.log(response.data.result);
}
```

### Using a Temporary Assistant Configuration

```javascript theme={null}
const axios = require("axios");

async function chatWithNewAssistant() {
  const response = await axios.post(
    "https://api.langdock.com/assistant/v1/chat/completions",
    {
      assistant: {
        name: "Document Analyzer",
        instructions:
          "You are a helpful assistant who analyzes documents and answers questions about them",
        temperature: 0.7,
        model: "gpt-5",
        capabilities: {
          webSearch: true,
        },
        attachmentIds: ["550e8400-e29b-41d4-a716-446655440000"], // Obtain attachmentIds from upload attachment endpoint
      },
      messages: [
        {
          role: "user",
          content: "What are the key points in the document?",
        },
      ],
    },
    {
      headers: {
        Authorization: "Bearer YOUR_API_KEY",
      },
    }
  );

  console.log(response.data.result);
}
```

### Using Structured Output with Schema

```javascript theme={null}
const axios = require("axios");

async function getStructuredWeather() {
  const response = await axios.post(
    "https://api.langdock.com/assistant/v1/chat/completions",
    {
      assistant: {
        name: "Weather Agent",
        instructions: "You are a helpful weather assistant",
        model: "gpt-5.1",
        capabilities: {
          webSearch: true,
        },
      },
      messages: [
        {
          role: "user",
          content: "What's the weather in Paris, Berlin and London today?",
        },
      ],
      output: {
        type: "array",
        schema: {
          type: "object",
          properties: {
            weather: {
              type: "object",
              properties: {
                city: {
                  type: "string",
                },
                tempInCelsius: {
                  type: "number",
                },
                tempInFahrenheit: {
                  type: "number",
                },
              },
              required: ["city", "tempInCelsius", "tempInFahrenheit"],
            },
          },
        },
      },
    },
    {
      headers: {
        Authorization: "Bearer YOUR_API_KEY",
      },
    }
  );

  // Access the structured data directly from output
  console.log(response.data.output);
  // Output:
  // [
  //   { "weather": { "city": "Paris", "tempInCelsius": 1, "tempInFahrenheit": 33 } },
  //   { "weather": { "city": "Berlin", "tempInCelsius": 1, "tempInFahrenheit": 35 } },
  //   { "weather": { "city": "London", "tempInCelsius": 7, "tempInFahrenheit": 45 } }
  // ]
}
```

### Using Structured Output with Object

```javascript theme={null}
const axios = require("axios");

async function extractContactInfo() {
  const response = await axios.post(
    "https://api.langdock.com/assistant/v1/chat/completions",
    {
      assistant: {
        name: "Contact Extractor",
        instructions: "You extract contact information from text",
      },
      messages: [
        {
          role: "user",
          content:
            "Extract the contact info: John Smith is our new sales lead. You can reach him at john.smith@example.com or call +1-555-123-4567.",
        },
      ],
      output: {
        type: "object",
        schema: {
          type: "object",
          properties: {
            name: {
              type: "string",
            },
            email: {
              type: "string",
            },
            phone: {
              type: "string",
            },
            role: {
              type: "string",
            },
          },
          required: ["name", "email"],
        },
      },
    },
    {
      headers: {
        Authorization: "Bearer YOUR_API_KEY",
      },
    }
  );

  // Access the structured data directly from output
  console.log(response.data.output);
  // Output:
  // {
  //   "name": "John Smith",
  //   "email": "john.smith@example.com",
  //   "phone": "+1-555-123-4567",
  //   "role": "sales lead"
  // }
}
```

### Using Structured Output with Enum

```javascript theme={null}
const axios = require("axios");

async function getSentimentAnalysis() {
  const response = await axios.post(
    "https://api.langdock.com/assistant/v1/chat/completions",
    {
      assistant: {
        name: "Sentiment Analyzer",
        instructions: "You analyze the sentiment of text",
      },
      messages: [
        {
          role: "user",
          content:
            "How would you rate this review: 'This product exceeded my expectations!'",
        },
      ],
      output: {
        type: "enum",
        enum: ["positive", "neutral", "negative"],
      },
    },
    {
      headers: {
        Authorization: "Bearer YOUR_API_KEY",
      },
    }
  );

  // Access the enum result directly from output
  console.log(response.data.output);
  // Output: "positive"
}
```

## Rate Limits

The rate limit for the Assistant Completion endpoint is **500 RPM (requests per minute)** and **60,000 TPM (tokens per minute)**. Rate limits are defined at the workspace level, not per API key. Each model has its own rate limit. If you exceed your rate limit, you will receive a `429 Too Many Requests` response.

Rate limits are subject to change. Refer to this documentation for the most up-to-date information.
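
One common way to handle a `429` response is to retry with exponential backoff. The sketch below is an illustration under assumptions: `withRateLimitRetry` is a hypothetical helper, the retry count and delays are arbitrary defaults, and the error shape (`error.response.status`) follows the axios convention used elsewhere on this page.

```javascript
// Sketch: retry a request when it fails with HTTP 429, waiting longer each time.
// `sendRequest` is any function that returns a promise, for example an axios call.
async function withRateLimitRetry(sendRequest, { maxRetries = 3, baseDelayMs = 1000 } = {}) {
  for (let attempt = 0; ; attempt++) {
    try {
      return await sendRequest();
    } catch (error) {
      const status = error.response && error.response.status;
      // Only rate-limit errors are retried; everything else is rethrown at once.
      if (status !== 429 || attempt >= maxRetries) throw error;
      const delayMs = baseDelayMs * 2 ** attempt; // doubles on every retry
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
}

// Example with a stand-in request that is rate limited twice before succeeding.
let demoCalls = 0;
const fakeRequest = async () => {
  demoCalls += 1;
  if (demoCalls < 3) {
    throw Object.assign(new Error("Too Many Requests"), { response: { status: 429 } });
  }
  return { data: { result: [] } };
};

withRateLimitRetry(fakeRequest, { baseDelayMs: 10 }).then(() => {
  console.log(`succeeded after ${demoCalls} attempts`); // prints: succeeded after 3 attempts
});
```

In real use, `sendRequest` would wrap the `axios.post` call, so each retry issues a fresh request.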

## Response Format

The API returns an object containing:

```typescript theme={null}
{
  // Standard message results - always present
  result: Array<{
    id: string;
    role: "tool" | "assistant";
    content: Array<{
      type: string;
      toolCallId?: string;
      toolName?: string;
      result?: object;
      args?: object;
      text?: string;
    }>;
  }>;

  // Structured output - present only when the request includes `output`
  output?: object | unknown[] | string;
}
```

### Standard Result

The `result` array contains the messages generated for the request: the assistant's replies and any tool calls and tool results. It is always present in the response.
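
Since each message's `content` is an array of parts, reading the assistant's reply means collecting the parts that carry `text`. A minimal sketch follows. `extractAssistantText` is a hypothetical helper, and the ids and `type` values in the sample data are made up for illustration.

```javascript
// Sketch: collect the text parts of all assistant messages in `result`.
function extractAssistantText(result) {
  return result
    .filter((message) => message.role === "assistant")
    .flatMap((message) => message.content)
    .filter((part) => typeof part.text === "string")
    .map((part) => part.text)
    .join("");
}

// Sample data shaped like the response type above; ids and `type` values are invented.
const sampleResult = [
  {
    id: "msg_1",
    role: "assistant",
    content: [{ type: "tool-call", toolCallId: "call_1", toolName: "search", args: {} }],
  },
  {
    id: "msg_2",
    role: "tool",
    content: [{ type: "tool-result", toolCallId: "call_1", toolName: "search", result: {} }],
  },
  {
    id: "msg_3",
    role: "assistant",
    content: [{ type: "text", text: "Here is the summary." }],
  },
];

console.log(extractAssistantText(sampleResult)); // prints: Here is the summary.
```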

### Structured Output

When the request includes an `output` parameter, the response will automatically include an `output` field containing the formatted structured data. The type of this field depends on the requested output format:

* If `output.type` was "object": Returns a JSON object (with schema validation if schema was provided)
* If `output.type` was "array": Returns an array of objects matching the provided schema
* If `output.type` was "enum": Returns a string matching one of the provided enum values

For example, when requesting weather data with structured output:

```javascript theme={null}
// Request
{
  "output": {
    "type": "array",
    "schema": {
      "type": "object",
      "properties": {
        "weather": {
          "type": "object",
          "properties": {
            "city": { "type": "string" },
            "tempInCelsius": { "type": "number" },
            "tempInFahrenheit": { "type": "number" }
          },
          "required": ["city", "tempInCelsius", "tempInFahrenheit"]
        }
      }
    }
  }
}

// Response
{
  "result": [
    // Full conversation including tool calls (e.g., web searches)
    { "role": "assistant", "content": [...], "id": "..." },
    { "role": "tool", "content": [...], "id": "..." },
    { "role": "assistant", "content": [...], "id": "..." }
  ],
  "output": [
    { "weather": { "city": "Paris", "tempInCelsius": 1, "tempInFahrenheit": 33 } },
    { "weather": { "city": "Berlin", "tempInCelsius": 1, "tempInFahrenheit": 35 } },
    { "weather": { "city": "London", "tempInCelsius": 7, "tempInFahrenheit": 45 } }
  ]
}
```

<Info>
  The `output` field is automatically populated with the formatted results based on the assistant's response and your schema definition. You can use this directly in your application without parsing the full conversation in `result`.
</Info>
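
Before using `output`, a client may want to confirm that its top-level shape matches what was requested. The sketch below is a hypothetical check (`matchesOutputSpec` is not part of the API). It only inspects the top-level type and enum membership, and does not validate against the JSON Schema, which would need a schema validator library.

```javascript
// Hypothetical check: does `output` have the top-level shape implied by the request?
function matchesOutputSpec(spec, output) {
  switch (spec.type) {
    case "object":
      return output !== null && typeof output === "object" && !Array.isArray(output);
    case "array":
      return Array.isArray(output);
    case "enum":
      return typeof output === "string" && spec.enum.includes(output);
    default:
      return false;
  }
}

const sentimentSpec = { type: "enum", enum: ["positive", "neutral", "negative"] };
console.log(matchesOutputSpec(sentimentSpec, "positive")); // prints: true
console.log(matchesOutputSpec(sentimentSpec, "great")); // prints: false
console.log(matchesOutputSpec({ type: "array" }, [{ weather: { city: "Paris" } }])); // prints: true
```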

## Error Handling

```javascript theme={null}
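// Placeholder request body; replace with your own assistantId and messages
const payload = {
  assistantId: 'asst_123',
  messages: [{ role: 'user', content: 'Hello' }],
};

try {
  const response = await axios.post(
    'https://api.langdock.com/assistant/v1/chat/completions',
    payload,
    { headers: { Authorization: 'Bearer YOUR_API_KEY' } }
  );
  console.log(response.data.result);
} catch (error) {
  if (error.response) {
    switch (error.response.status) {
      case 400:
        console.error('Invalid parameters:', error.response.data.message);
        break;
      case 429:
        console.error('Rate limit exceeded');
        break;
      case 500:
        console.error('Server error');
        break;
      default:
        console.error('Unexpected status:', error.response.status);
    }
  } else {
    // No response was received (network failure, timeout, etc.)
    console.error('Request failed:', error.message);
  }
}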
```

## Migrating to Agents API

The new Agents API offers improved compatibility with modern AI SDKs, including native support for the Vercel AI SDK. The main difference is in the chat completions endpoint format.

See the equivalent endpoint in the Agents API:

* [Agents Completions API](/en/developer/agents-api/agent) - Uses Vercel AI SDK message format

<Info>
  Langdock intentionally blocks browser-origin requests to protect your API key and ensure your applications remain secure. For more information, please see our guide on [API Key Best Practices](/en/admin/ai-adoption-and-rollout/best-practices/api-key-best-practices).
</Info>


## OpenAPI

````yaml POST /assistant/v1/chat/completions
openapi: 3.0.0
info:
  title: Langdock API
  version: 3.0.0
servers:
  - url: https://api.langdock.com
    description: Production
security:
  - bearerAuth: []
paths:
  /assistant/v1/chat/completions:
    post:
      tags:
        - Assistant
      summary: '[Deprecated] Creates a chat completion with an assistant'
      description: >-
        This endpoint is deprecated. Please use /agent/v1/chat/completions for
        new integrations.
      parameters: []
      requestBody:
        required: true
        content:
          application/json:
            examples:
              streamingEnabled:
                summary: Request with streaming enabled
                value:
                  assistantId: asst_123
                  messages:
                    - role: user
                      content: Hello, how can you help me?
                  stream: true
              streamingDisabled:
                summary: Request with streaming disabled (default)
                value:
                  assistantId: asst_123
                  messages:
                    - role: user
                      content: Hello, how can you help me?
                  stream: false
              defaultBehavior:
                summary: Request without stream parameter (defaults to false)
                value:
                  assistantId: asst_123
                  messages:
                    - role: user
                      content: Hello, how can you help me?
            schema:
              type: object
              oneOf:
                - type: object
                  required:
                    - assistantId
                    - messages
                  properties:
                    assistantId:
                      type: string
                      description: ID of an existing assistant to use
                    messages:
                      type: array
                      items:
                        type: object
                        required:
                          - role
                          - content
                        properties:
                          role:
                            type: string
                            enum:
                              - user
                              - assistant
                              - tool
                          content:
                            type: string
                          attachmentIds:
                            type: array
                            items:
                              type: string
                              format: uuid
                            description: >-
                              Array of UUID strings identifying attachments for
                              this message
                    stream:
                      type: boolean
                      default: false
                      description: >-
                        Enable or disable streaming responses. When true,
                        returns server-sent events. When false, returns complete
                        JSON response.
                      example: true
                    output:
                      $ref: '#/components/schemas/StructuredOutputConfig'
                    maxSteps:
                      type: integer
                      minimum: 1
                      maximum: 20
                      default: 10
                      description: >-
                        Maximum number of steps the agent can take during the
                        conversation
                - type: object
                  required:
                    - assistant
                    - messages
                  properties:
                    assistant:
                      $ref: '#/components/schemas/Assistant'
                    messages:
                      type: array
                      items:
                        type: object
                        required:
                          - role
                          - content
                        properties:
                          role:
                            type: string
                            enum:
                              - user
                              - assistant
                              - tool
                          content:
                            type: string
                          attachmentIds:
                            type: array
                            items:
                              type: string
                              format: uuid
                            description: >-
                              Array of UUID strings identifying attachments for
                              this message
                    stream:
                      type: boolean
                      default: false
                      description: >-
                        Enable or disable streaming responses. When true,
                        returns server-sent events. When false, returns complete
                        JSON response.
                      example: true
                    output:
                      $ref: '#/components/schemas/StructuredOutputConfig'
                    maxSteps:
                      type: integer
                      minimum: 1
                      maximum: 20
                      default: 10
                      description: >-
                        Maximum number of steps the agent can take during the
                        conversation
      responses:
        '200':
          description: Successful chat completion
          content:
            application/json:
              schema:
                type: object
                required:
                  - result
                properties:
                  result:
                    type: array
                    items:
                      type: object
                      required:
                        - id
                        - role
                        - content
                      properties:
                        id:
                          type: string
                        role:
                          type: string
                          enum:
                            - tool
                            - assistant
                        content:
                          type: array
                          items:
                            type: object
                            required:
                              - type
                            properties:
                              type:
                                type: string
                              toolCallId:
                                type: string
                              toolName:
                                type: string
                              result:
                                type: object
                              args:
                                type: object
                              text:
                                type: string
                  output:
                    description: Present when output parameter was specified in the request
                    oneOf:
                      - type: object
                        description: When output.type is "object"
                      - type: array
                        description: When output.type is "array"
                      - type: string
                        description: >-
                          When output.type is "enum" (one of the provided enum
                          values)
            text/event-stream:
              schema:
                type: string
                description: Server-sent events stream when stream=true
        '400':
          description: Invalid request parameters
          content:
            application/json:
              schema:
                type: object
                properties:
                  message:
                    oneOf:
                      - type: string
                      - type: array
                        items:
                          type: object
        '429':
          description: Rate limit exceeded
          content:
            application/json:
              schema:
                type: object
                properties:
                  message:
                    type: string
        '500':
          description: Internal server error
          content:
            application/json:
              schema:
                type: object
                properties:
                  message:
                    type: string
      deprecated: true
components:
  schemas:
    StructuredOutputConfig:
      type: object
      description: >-
        Specification for structured output format. When type is object/array
        and no schema is provided, the response will be JSON but can have any
        structure. When the type is enum, you must provide an enum parameter
        with an array of strings as options.
      properties:
        type:
          type: string
          enum:
            - object
            - array
            - enum
          description: The type of structured output
        schema:
          type: object
          description: >-
            JSON Schema definition for the output (required for object/array
            types with specific structure). Search for "JSON to JSON Schema" on
            the web to find a tool to convert any JSON into the required JSON
            Schema format.
        enum:
          type: array
          items:
            type: string
          description: >-
            Array of allowed values (required for enum type). Values must be of
            type string.
      oneOf:
        - properties:
            type:
              enum:
                - enum
            enum:
              type: array
              items:
                type: string
        - properties:
            type:
              enum:
                - object
                - array
            schema:
              type: object
        - properties:
            type:
              enum:
                - object
                - array
    Assistant:
      type: object
      required:
        - name
        - instructions
      properties:
        name:
          type: string
          maxLength: 64
        description:
          type: string
          maxLength: 256
        instructions:
          type: string
          maxLength: 16384
        temperature:
          type: number
          minimum: 0
          maximum: 1
        model:
          type: string
          maxLength: 64
        capabilities:
          type: object
          properties:
            webSearch:
              type: boolean
            dataAnalyst:
              type: boolean
              deprecated: true
              description: Deprecated. Accepted for compatibility and ignored.
            imageGeneration:
              type: boolean
        actions:
          type: array
          items: 7d19e95b-21e8-417f-995e-24a8f5651c16
        vectorDb:
          type: array
          items: 48adcef5-fb6f-434d-894d-ad23cdeb3651
        knowledgeFolderIds:
          type: array
          items:
            type: string
        attachmentIds:
          type: array
          items:
            type: string
            format: uuid
          description: Array of UUID strings identifying attachments for this assistant
  securitySchemes:
    bearerAuth:
      type: http
      scheme: bearer
      bearerFormat: API Key
      description: API key as Bearer token. Format "Bearer YOUR_API_KEY"

````