# openai_transcription

> For the complete documentation index, see [llms.txt](https://docs.redpanda.com/llms.txt). Component-specific: [cloud-data-platform-full.txt](https://docs.redpanda.com/cloud-data-platform-full.txt)

---
title: openai_transcription
latest-operator-version: v26.1.4
latest-console-tag: v3.7.3
latest-connect-version: 4.93.0
latest-redpanda-tag: v26.1.9
docname: connect/components/processors/openai_transcription
page-component-name: cloud-data-platform
page-version: master
page-component-version: master
page-component-title: Cloud
page-relative-src-path: connect/components/processors/openai_transcription.adoc
page-edit-url: https://github.com/redpanda-data/cloud-docs/edit/main/modules/develop/pages/connect/components/processors/openai_transcription.adoc
page-git-created-date: "2024-09-09"
page-git-modified-date: "2026-05-26"
---

<!-- Source: https://docs.redpanda.com/cloud-data-platform/develop/connect/components/processors/openai_transcription.md -->

**Available in:** Cloud, [Self-Managed](https://docs.redpanda.com/connect/components/processors/openai_transcription/%20%22View%20the%20Self-Managed%20version%20of%20this%20component%22)

Generates a transcription of spoken audio in the input language, using the OpenAI API.

#### Common

```yml
processors:
  label: ""
  openai_transcription:
    server_address: https://api.openai.com/v1
    api_key: "" # No default (required)
    model: "" # No default (required)
    file: "" # No default (required)
```

#### Advanced

```yml
processors:
  label: ""
  openai_transcription:
    server_address: https://api.openai.com/v1
    api_key: "" # No default (required)
    model: "" # No default (required)
    file: "" # No default (required)
    language: "" # No default (optional)
    prompt: "" # No default (optional)
```

This processor sends an audio file object along with the input language to OpenAI API to generate a transcription. By default, the processor submits the entire payload of each message as a string, unless you use the `file` configuration field to customize it.

To learn more about audio transcription, see the: [OpenAI API documentation](https://platform.openai.com/docs/guides/speech-to-text).

## [](#fields)Fields

### [](#api_key)`api_key`

The API key for OpenAI API.

> ⚠️ **CAUTION**
>
> This field contains sensitive information that usually shouldn’t be added to a configuration directly. For more information, see [Manage Secrets](https://docs.redpanda.com/cloud-data-platform/develop/connect/configuration/secret-management/) before adding it to your configuration.

**Type**: `string`

### [](#file)`file`

The audio file object (not file name) to transcribe, in one of the following formats: `flac`, `mp3`, `mp4`, `mpeg`, `mpga`, `m4a`, `ogg`, `wav`, or `webm`.

**Type**: `string`

### [](#language)`language`

The language of the input audio. Supplying the input language in ISO-639-1 format improves accuracy and latency. This field supports [interpolation functions](https://docs.redpanda.com/cloud-data-platform/develop/connect/configuration/interpolation/#bloblang-queries).

**Type**: `string`

```yaml
# Examples:
language: en

# ---

language: fr

# ---

language: de

# ---

language: zh
```

### [](#model)`model`

The name of the OpenAI model to use.

**Type**: `string`

```yaml
# Examples:
model: whisper-1
```

### [](#prompt)`prompt`

Optional text to guide the model’s style or continue a previous audio segment. The prompt should match the audio language. This field supports [interpolation functions](https://docs.redpanda.com/cloud-data-platform/develop/connect/configuration/interpolation/#bloblang-queries).

**Type**: `string`

### [](#server_address)`server_address`

The Open API endpoint that the processor sends requests to. Update the default value to use another OpenAI compatible service.

**Type**: `string`

**Default**: `[https://api.openai.com/v1](https://api.openai.com/v1)`