Performance & Capacity Practical By Samson Tanimawo, PhD Published Aug 5, 2025 4 min read

Connection Multiplexing

HTTP/2 advantage.

Overview

HTTP/2 multiplexing runs multiple concurrent requests over a single TCP connection. HTTP/1.1 limited concurrency to one request per connection (browsers worked around this by opening 6+ connections per origin); HTTP/2 lifts the limit and adds header compression, stream prioritization, and lower per-request overhead. The discipline is in deploying HTTP/2 end-to-end (origin, CDN, client) and monitoring for the failure modes that survive even with multiplexing in place.

The approach

The practical approach is HTTP/2 on origin (nginx and ALB support it natively), modern HTTP client libraries that negotiate HTTP/2, monitor max concurrent streams to catch saturation, watch for TCP-level head-of-line blocking (HTTP/3 over QUIC fixes it where it matters), and document the per-tier protocol so the chain from origin to client is reviewable.

Why this compounds

HTTP/2 multiplexing compounds across services. Each connection serves many requests; the per-request overhead drops durably; the team builds intuition for HTTP performance that pays off when HTTP/3 over QUIC becomes the next migration. Without the discipline, services run on HTTP/1.1 long past the point where the concurrency cost matters.

HTTP/2 multiplexing discipline is an operational discipline that pays off across years. Nova AI Ops integrates with HTTP telemetry, surfaces stream patterns, and supports the team’s performance discipline.