Scale Remote MCP Server Resources

You can scale your MCP server’s resources up or down to match your workload needs. Resource allocation affects your billing costs, which are charged per compute unit hour.

Prerequisites

You must have an existing MCP server. If you do not have one, see Remote MCP Server Quickstart.

Scale resources

  • Cloud UI

  • Data Plane API

  1. In the Redpanda Cloud Console, navigate to Remote MCP.

  2. Find the MCP server you want to scale and click its name.

  3. Click Edit configuration.

  4. Under Resources, select a new size:

    • XSmall: Lowest cost, suitable for development or light workloads

    • Small: Light production workloads

    • Medium: Standard production workloads

    • Large: High-throughput workloads

    • XLarge: Highest performance for demanding workloads

  5. Click Save to apply the new resource allocation.

    The specified resources are available immediately.

  1. Authenticate and get the base URL for the Data Plane API.

  2. Make a request to GET /v1/redpanda-connect/mcp-servers/{mcp_server_id} to retrieve the current configuration.

  3. Make a request to PATCH /v1/redpanda-connect/mcp-servers/{mcp_server_id} to update the resources:

    curl -X PATCH "https://<dataplane-api-url>/v1/redpanda-connect/mcp-servers/<mcp-server-id>?update_mask=resources" \
      -H "Authorization: Bearer <token>" \
      -H "Content-Type: application/json" \
      -d '{
        "mcp_server": {
          "resources": {
            "memory_shares": "2Gi",
            "cpu_shares": "1000m"
          }
        }
      }'

    The updated resources are available immediately.

Monitor your MCP server’s performance and adjust resources as needed. You can scale up during peak usage periods and scale down during quieter times to optimize costs. For compute unit definitions and pricing, see MCP billing metrics.