# azure_cosmosdb


---
title: azure_cosmosdb
latest-connect-version: 4.93.0
latest-operator-version: v26.1.4
latest-console-tag: v3.7.3
latest-redpanda-tag: v26.1.9
docname: inputs/azure_cosmosdb
page-component-name: connect
page-version: master
page-component-version: master
page-component-title: Connect
page-relative-src-path: inputs/azure_cosmosdb.adoc
page-edit-url: https://github.com/redpanda-data/rp-connect-docs/edit/main/modules/components/pages/inputs/azure_cosmosdb.adoc
page-git-created-date: "2024-05-24"
page-git-modified-date: "2026-05-26"
---

<!-- Source: https://docs.redpanda.com/connect/components/inputs/azure_cosmosdb.md -->

**Type:** Input

Also available as an [Output](https://docs.redpanda.com/connect/components/outputs/azure_cosmosdb/) and a [Processor](https://docs.redpanda.com/connect/components/processors/azure_cosmosdb/).

**Available in:** [Cloud](https://docs.redpanda.com/cloud-data-platform/develop/connect/components/inputs/azure_cosmosdb/), Self-Managed

Executes a SQL query against [Azure CosmosDB](https://learn.microsoft.com/en-us/azure/cosmos-db/introduction) and creates a batch of messages from each page of items.

Introduced in version v4.25.0.

#### Common

```yml
input:
  label: ""
  azure_cosmosdb:
    endpoint: "" # No default (optional)
    account_key: "" # No default (optional)
    connection_string: "" # No default (optional)
    database: "" # No default (required)
    container: "" # No default (required)
    partition_keys_map: "" # No default (required)
    query: "" # No default (required)
    args_mapping: "" # No default (optional)
    auto_replay_nacks: true
```

#### Advanced

```yml
input:
  label: ""
  azure_cosmosdb:
    endpoint: "" # No default (optional)
    account_key: "" # No default (optional)
    connection_string: "" # No default (optional)
    database: "" # No default (required)
    container: "" # No default (required)
    partition_keys_map: "" # No default (required)
    query: "" # No default (required)
    args_mapping: "" # No default (optional)
    batch_count: -1
    auto_replay_nacks: true
```

## [](#cross-partition-queries)Cross-partition queries

Cross-partition queries are currently not supported by the underlying driver. For every query, the PartitionKey values must be known in advance and specified in the config through the `partition_keys_map` field. [See details](https://github.com/Azure/azure-sdk-for-go/issues/18578#issuecomment-1222510989).
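
When the set of partition key values is small and known in advance, one possible workaround is to run one `azure_cosmosdb` input per partition key inside a `broker` input. The following is a minimal sketch under that assumption, not a recommended pattern from the component authors. The endpoint, database, and container names are placeholders, and `ContinentalSlope` is a hypothetical second partition key value.

```yaml
input:
  broker:
    inputs:
      # One azure_cosmosdb input per known partition key value
      - azure_cosmosdb:
          endpoint: https://localhost:8081
          database: testdb
          container: testcontainer
          partition_keys_map: root = "AbyssalPlain"
          query: SELECT * FROM testcontainer
      # "ContinentalSlope" is a hypothetical partition key value
      - azure_cosmosdb:
          endpoint: https://localhost:8081
          database: testdb
          container: testcontainer
          partition_keys_map: root = "ContinentalSlope"
          query: SELECT * FROM testcontainer
```

Each child input still issues a single-partition query, so the driver limitation is respected while the pipeline receives items from both partitions.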

## [](#credentials)Credentials

You can use one of the following authentication mechanisms:

-   Set the `endpoint` field and the `account_key` field

-   Set only the `endpoint` field to use [DefaultAzureCredential](https://pkg.go.dev/github.com/Azure/azure-sdk-for-go/sdk/azidentity#DefaultAzureCredential)

-   Set the `connection_string` field
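
As a rough illustration, the three options above might look like the following fragments. Set only one of them in a real config. The environment variable names `COSMOSDB_ACCOUNT_KEY` and `COSMOSDB_CONNECTION_STRING` are assumptions chosen for this sketch, and the endpoint is the emulator address used elsewhere on this page.

```yaml
# Option 1: endpoint plus account key
# (COSMOSDB_ACCOUNT_KEY is a hypothetical environment variable)
endpoint: https://localhost:8081
account_key: ${COSMOSDB_ACCOUNT_KEY}

# ---

# Option 2: endpoint only, which uses DefaultAzureCredential
endpoint: https://localhost:8081

# ---

# Option 3: connection string
# (COSMOSDB_CONNECTION_STRING is a hypothetical environment variable)
connection_string: ${COSMOSDB_CONNECTION_STRING}
```

Reading the key and connection string from the environment keeps the sensitive values out of the configuration file itself.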


## [](#metadata)Metadata

This component adds the following metadata fields to each message:

```none
- activity_id
- request_charge
```

You can access these metadata fields using [function interpolation](https://docs.redpanda.com/connect/configuration/interpolation/#bloblang-queries).
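
For example, a `log` processor could interpolate both fields into its message. This is a minimal sketch that assumes the input above is already configured; the message format is arbitrary.

```yaml
pipeline:
  processors:
    # Logs the activity ID and request charge attached to each message
    - log:
        level: INFO
        message: 'activity_id=${! metadata("activity_id") } request_charge=${! metadata("request_charge") }'
```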

## [](#examples)Examples

### [](#query-container)Query container

Execute a parametrized SQL query to select documents from a container.

```yaml
input:
  azure_cosmosdb:
    endpoint: http://localhost:8080
    account_key: C2y6yDjf5/R+ob0N8A7Cgv30VRDJIWEHLM+4QDU5DE2nQ9nDuVTqobD4b8mGGyPMbIZnqyMsEcaGQy67XIw/Jw==
    database: blobbase
    container: blobfish
    partition_keys_map: root = "AbyssalPlain"
    query: SELECT * FROM blobfish AS b WHERE b.species = @species
    args_mapping: |
      root = [
          { "Name": "@species", "Value": "smooth-head" },
      ]
```

## [](#fields)Fields

### [](#account_key)`account_key`

Account key.

> ⚠️ **CAUTION**
>
> This field contains sensitive information that usually shouldn’t be added to a configuration directly. For more information, see [Secrets](https://docs.redpanda.com/connect/configuration/secrets/).

**Type**: `string`

```yaml
# Examples:
account_key: C2y6yDjf5/R+ob0N8A7Cgv30VRDJIWEHLM+4QDU5DE2nQ9nDuVTqobD4b8mGGyPMbIZnqyMsEcaGQy67XIw/Jw==
```

### [](#args_mapping)`args_mapping`

A [Bloblang mapping](https://docs.redpanda.com/connect/guides/bloblang/about/) that, for each message, creates a list of arguments to use with the query.

**Type**: `string`

```yaml
# Examples:
args_mapping: |-
  root = [
    { "Name": "@name", "Value": "benthos" },
  ]
```

### [](#auto_replay_nacks)`auto_replay_nacks`

Whether messages that are rejected (nacked) at the output level should be automatically replayed indefinitely, eventually resulting in back pressure if the cause of the rejections is persistent. If set to `false` these messages will instead be deleted. Disabling auto replays can greatly improve memory efficiency of high throughput streams as the original shape of the data can be discarded immediately upon consumption and mutation.

**Type**: `bool`

**Default**: `true`

### [](#batch_count)`batch_count`

The maximum number of messages that should be accumulated into each batch. Use `-1` to specify dynamic page size.

**Type**: `int`

**Default**: `-1`
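
As a sketch, the following config caps each batch at 100 messages instead of using the dynamic page size. The value `100` is arbitrary, and the endpoint, database, and container names are placeholders taken from the field examples on this page.

```yaml
input:
  azure_cosmosdb:
    endpoint: https://localhost:8081
    database: testdb
    container: testcontainer
    partition_keys_map: root = "blobfish"
    query: SELECT * FROM testcontainer
    # Accumulate at most 100 messages into each batch (arbitrary example value)
    batch_count: 100
```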

### [](#connection_string)`connection_string`

Connection string.

> ⚠️ **CAUTION**
>
> This field contains sensitive information that usually shouldn’t be added to a configuration directly. For more information, see [Secrets](https://docs.redpanda.com/connect/configuration/secrets/).

**Type**: `string`

```yaml
# Examples:
connection_string: AccountEndpoint=https://localhost:8081/;AccountKey=C2y6yDjf5/R+ob0N8A7Cgv30VRDJIWEHLM+4QDU5DE2nQ9nDuVTqobD4b8mGGyPMbIZnqyMsEcaGQy67XIw/Jw==;
```

### [](#container)`container`

Container.

**Type**: `string`

```yaml
# Examples:
container: testcontainer
```

### [](#database)`database`

Database.

**Type**: `string`

```yaml
# Examples:
database: testdb
```

### [](#endpoint)`endpoint`

CosmosDB endpoint.

**Type**: `string`

```yaml
# Examples:
endpoint: https://localhost:8081
```

### [](#partition_keys_map)`partition_keys_map`

A [Bloblang mapping](https://docs.redpanda.com/connect/guides/bloblang/about/) which should evaluate to a single partition key value or an array of partition key values of type string, integer or boolean. Currently, hierarchical partition keys are not supported so only one value may be provided.

**Type**: `string`

```yaml
# Examples:
partition_keys_map: root = "blobfish"

# ---

partition_keys_map: root = 41

# ---

partition_keys_map: root = true

# ---

partition_keys_map: root = null

# ---

partition_keys_map: root = now().ts_format("2006-01-02")
```

### [](#query)`query`

The query to execute.

**Type**: `string`

```yaml
# Examples:
query: SELECT c.foo FROM testcontainer AS c WHERE c.bar = "baz" AND c.timestamp < @timestamp
```

## [](#cosmosdb-emulator)CosmosDB emulator

To run the [CosmosDB emulator](https://learn.microsoft.com/en-us/azure/cosmos-db/linux-emulator) locally, you can use the following Docker command:

```bash
docker run --rm -it -p 8081:8081 --name=cosmosdb -e AZURE_COSMOS_EMULATOR_PARTITION_COUNT=10 -e AZURE_COSMOS_EMULATOR_ENABLE_DATA_PERSISTENCE=false mcr.microsoft.com/cosmosdb/linux/azure-cosmos-emulator
```

Note: `AZURE_COSMOS_EMULATOR_PARTITION_COUNT` controls the number of partitions that will be supported by the emulator. The bigger the value, the longer it takes for the container to start up.

Additionally, instead of installing the container's self-signed certificate, which is exposed via `https://localhost:8081/_explorer/emulator.pem`, you can run [mitmproxy](https://mitmproxy.org/) like so:

```bash
mitmproxy -k --mode "reverse:https://localhost:8081"
```

Then you can access the CosmosDB UI via `http://localhost:8080/_explorer/index.html` and use `http://localhost:8080` as the CosmosDB endpoint.