Cloud

group_by

Available in: Cloud, Self-Managed

Splits a batch of messages into N batches, where each resulting batch contains a group of messages determined by a Bloblang query.

# Config fields, showing default values
label: ""
group_by: [] # No default (required)

Once the groups are established a list of processors are applied to their respective grouped batch, which can be used to label the batch as per their grouping. Messages that do not pass the check of any specified group are placed in their own group.

The functionality of this processor depends on being applied across messages that are batched. You can find out more about batching in this doc.

Fields

`check`

A Bloblang query that should return a boolean value indicating whether a message belongs to a given group.

Type: string

# Examples:
check: this.type == "foo"

# ---

check: this.contents.urls.contains("https://benthos.dev/")

# ---

check: true

`processors[]`

A list of processors to execute on the newly formed group.

Type: processor

Default: []

Examples

Grouped Processing

Imagine we have a batch of messages that we wish to split into a group of foos and everything else, which should be sent to different output destinations based on those groupings. We also need to send the foos as a tar gzip archive. For this purpose we can use the group_by processor with a switch output:

pipeline:
  processors:
    - group_by:
      - check: content().contains("this is a foo")
        processors:
          - archive:
              format: tar
          - compress:
              algorithm: gzip
          - mapping: 'meta grouping = "foo"'

output:
  switch:
    cases:
      - check: meta("grouping") == "foo"
        output:
          gcp_pubsub:
            project: foo_prod
            topic: only_the_foos
      - output:
          gcp_pubsub:
            project: somewhere_else
            topic: no_foos_here

Was this helpful?

group Ask in the community

mail Share your feedback

group_add Make a contribution

What do you think of this page?

Let us know more:

Let us contact you about your feedback:

group_by

Fields

check

processors[]

Examples

Grouped Processing

Simple online edits

Contribution guide

`check`

`processors[]`