Scale Remote MCP Server Resources

After creating an MCP server, you can scale its resources up or down to match your workload needs. Resource allocation affects your billing costs, which are charged per compute unit hour.

After reading this page, you will be able to:

  • Scale MCP server resources up or down

  • Choose appropriate resource sizes for workloads

  • Optimize costs through resource management

Prerequisites

You must have an existing MCP server. If you do not have one, see Remote MCP Server Quickstart.

Scale resources

  • Cloud Console

  • Data Plane API

  1. In the Redpanda Cloud Console, navigate to Remote MCP.

  2. Find the MCP server you want to scale and click its name.

  3. Click Edit configuration.

  4. Under Resources, select a new size:

    • XSmall: Lowest cost, suitable for development or light workloads

    • Small: Light production workloads

    • Medium: Standard production workloads

    • Large: High-throughput workloads

    • XLarge: Highest performance for demanding workloads

  5. Click Save to apply the new resource allocation.

    Redpanda makes the specified resources available immediately.

  1. Authenticate and get the base URL for the Data Plane API.

  2. Make a request to GET /v1/redpanda-connect/mcp-servers/{mcp_server_id} to retrieve the current configuration.

  3. Make a request to PATCH /v1/redpanda-connect/mcp-servers/{mcp_server_id} to update the resources:

    curl -X PATCH "https://<dataplane-api-url>/v1/redpanda-connect/mcp-servers/<mcp-server-id>?update_mask=resources" \
      -H "Authorization: Bearer <token>" \
      -H "Content-Type: application/json" \
      -d '{
        "mcp_server": {
          "resources": {
            "memory_shares": "2Gi",
            "cpu_shares": "1000m"
          }
        }
      }'

    Redpanda makes the updated resources available immediately.

Monitor your MCP server’s performance and adjust resources as needed. You can scale up during peak usage periods and scale down during quieter times to optimize costs. For compute unit definitions and pricing, see MCP billing metrics.