# hdfs

> For the complete documentation index, see [llms.txt](https://docs.redpanda.com/llms.txt). Component-specific: [connect-full.txt](https://docs.redpanda.com/connect-full.txt)

---
title: hdfs
latest-connect-version: 4.93.0
latest-operator-version: v26.1.4
latest-console-tag: v3.7.3
latest-redpanda-tag: v26.1.9
docname: inputs/hdfs
page-component-name: connect
page-version: master
page-component-version: master
page-component-title: Connect
page-relative-src-path: inputs/hdfs.adoc
page-edit-url: https://github.com/redpanda-data/rp-connect-docs/edit/main/modules/components/pages/inputs/hdfs.adoc
page-git-created-date: "2024-05-24"
page-git-modified-date: "2026-05-26"
---

<!-- Source: https://docs.redpanda.com/connect/components/inputs/hdfs.md -->

**Type:** Input ▼

[Input](https://docs.redpanda.com/connect/components/inputs/hdfs/)[Output](https://docs.redpanda.com/connect/components/outputs/hdfs/)

**Available in:** Self-Managed

Reads files from a HDFS directory, where each discrete file will be consumed as a single message payload.

```yml
inputs:
  label: ""
  hdfs:
    hosts: [] # No default (required)
    user: ""
    directory: "" # No default (required)
```

## [](#metadata)Metadata

This input adds the following metadata fields to each message:

-   hdfs\_name

-   hdfs\_path


You can access these metadata fields using [function interpolation](https://docs.redpanda.com/connect/configuration/interpolation/#bloblang-queries).

## [](#fields)Fields

### [](#directory)`directory`

The directory to consume from.

**Type**: `string`

### [](#hosts)`hosts[]`

A list of target host addresses to connect to.

**Type**: `array`

```yaml
# Examples:
hosts: localhost:9000
```

### [](#user)`user`

A user ID to connect as.

**Type**: `string`

**Default**: `""`