
HTTP/2 Flow Control Deadlock

Plagued by indefinitely hanging HTTP requests, starved connection pools, and widespread SLA breaches, we put on our troubleshooting hats and reported a Reactor Netty issue.

The hangs primarily affected a high-volume, scheduled background process responsible for delivering payment acknowledgments to clients, but the impact spread far beyond it. Consumers across critical flows saw their latencies spike as they were forced to retry and fail over to healthier instances—those with remaining available server sockets.

The Offending Process

The process involved two services, A and B, and two separate endpoint calls, each handling reactive streams of data:

  1. A -> B to retrieve data S (c1)
  2. A -> B to write data S' (c2)

First, A calls B to fetch S, applies business logic to transform it into S’, and then sends S’ back to B. To balance the load across multiple instances of B, we split S’ into windows of 50 elements. The number of concurrent windows was capped at four.

int windowConcurrency = 4;
int windowSize = 50;

@Scheduled
public Flux<Data> process() {
  return retrieve()                                 // c1: stream data S from B (transformation into S' omitted for brevity)
        .window(windowSize)                         // split the stream into windows of 50 elements
        .flatMap(this::write, windowConcurrency);   // c2: write at most 4 windows back to B concurrently
}

private Flux<Data> retrieve() {}                    // GET call to B's /retrieve endpoint

private Flux<Data> write(Flux<Data> dataWindow) {}  // POST call to B's /write endpoint

Conceptual code of the scheduled process

The system marks data as processed in /write, ensuring that each call to /retrieve fetches only new data added since the last request.
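
For context, here is a minimal sketch of how B's two endpoints could look as a Spring WebFlux controller. The /retrieve and /write paths come from the description above, but the DataRepository type and its method names are hypothetical, not our actual implementation:

import org.springframework.http.MediaType;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RestController;
import reactor.core.publisher.Flux;

@RestController
public class DataController {

    private final DataRepository repository; // hypothetical reactive repository

    public DataController(DataRepository repository) {
        this.repository = repository;
    }

    // c1: stream only the data not yet marked as processed
    @GetMapping(value = "/retrieve", produces = MediaType.APPLICATION_NDJSON_VALUE)
    public Flux<Data> retrieve() {
        return repository.findUnprocessed();
    }

    // c2: persist the transformed data and mark it as processed
    @PostMapping(value = "/write", consumes = MediaType.APPLICATION_NDJSON_VALUE)
    public Flux<Data> write(@RequestBody Flux<Data> data) {
        return repository.markProcessed(data);
    }
}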

What We Knew

The failures were not intermittent. Once the first hang occurred, every subsequent run of the scheduled process exhibited the same behaviour. This was a valuable data point when combined with the knowledge that the process selects all data since the last successful execution: once a run hung, the backlog could only grow, so every later run faced at least as much data as the one that first failed.

What We Tried

We built a basic two-service test harness, with a parameterised endpoint for simulating different dataset sizes, to quickly test hypotheses and pinpoint a deterministically failing case. On my local machine, the test would reliably fail once the dataset reached a certain size.

After numerous failed experiments, we switched the harness to HTTP/1.1 and re-ran the test suite. It passed every time.
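
A protocol switch like this is a small client-side change in Reactor Netty. A minimal sketch, assuming the harness's WebClient is configured through a customizer (the bean name is illustrative; HttpProtocol.H2 or H2C selects HTTP/2):

@Bean
public WebClientCustomizer http11Customizer() {
    return builder -> builder
        .clientConnector(new ReactorClientHttpConnector(HttpClient.create()
                .protocol(HttpProtocol.HTTP11)));   // HttpProtocol.H2 / H2C to go back to HTTP/2
}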

This pointed to an issue specific to HTTP/2, likely tied to stream multiplexing, which allows multiple requests to share a single TCP connection. Other tests produced data points we struggled to connect, but one key observation stood out: using separate WebClient instances—and thus separate connection pools—for calls c1 and c2 prevented stalls, even on HTTP/2.

Removing the concurrency limit (set to 4 in the example) eliminated stalls across all input sizes.

We had gathered useful clues, but we needed another perspective.

The Cause

Before we dive into the details, it is useful to know that HTTP/2 implements flow control (RFC) at both the stream and the connection level, as a mechanism to 'protect endpoints that are operating under resource constraints'. In other words, it lets endpoints that produce and consume data at different rates coexist on the same connection.

An essential aspect of flow control in HTTP/2 is the WINDOW_UPDATE frame, which allows the receiver to specify how much data it can accept on both a per-stream and a per-connection basis. The default window size is 65,535 bytes. In a single request from A to B, both A and B act as receivers (depending on the direction of data flow) and announce a window size to the sender. Once a window is fully utilised, no additional DATA frames can be sent until the receiver replenishes it with further WINDOW_UPDATE frames.

Problem Statement

(Huge props to chemicL for the diagnosis)

💡
In the specific scenario of cyclic calls, a deadlock occurs when Client A is waiting for a response from Server B on one stream (Stream 1), while Server B is waiting for Client A to consume more of the response on another stream (Stream 2). Since neither party can make progress, the system enters an infinite hang.

Let's break this down...

Consider an HTTP/2 flow-control window of 64KB and a client connection pool of size 1, so both calls share a single connection.

Step 1: Client A sends a request on Stream 1

Client A sends a small GET request on Stream 1 to Server B.

Server B prepares a large response (64KB).

Flow Control State:
• Connection window: 64KB (fully available).
• Stream 1 window: 64KB (fully available).

Step 2: Server B sends a large response on Stream 1

Server B sends 64KB of DATA frames on Stream 1, consuming the entire 
connection-level and stream-level flow control windows.

Server B is now blocked and cannot send more data until Client A sends a 
WINDOW_UPDATE.

Flow Control State:
• Connection window: 0KB (fully used).
• Stream 1 window: 0KB (fully used).

Step 3: Client A consumes part of the response and starts Stream 2

Client A consumes 32KB of data from Stream 1 and sends a WINDOW_UPDATE for 
32KB to Server B.

Client A initiates a second request to Server B on Stream 2, using some of 
the data from Stream 1.

HEADERS are not subject to flow control, so the request on Stream 2 is sent 
successfully.

Flow Control State:
• Connection window: 32KB (available).
• Stream 1 window: 32KB (available).
• Stream 2: Request headers sent, ready to receive data.

Step 4: Server B sends a response on Stream 2

Server B processes the request on Stream 2 and sends 32KB of DATA frames, 
consuming the remaining connection window.

The connection-level window is now 0KB again, so Server B is fully blocked 
from sending data on both Stream 1 and Stream 2.

Flow Control State:
• Connection window: 0KB (fully used).
• Stream 1 window: 32KB (available, but blocked by the connection).
• Stream 2 window: 32KB (available, but blocked by the connection).

Step 5: Deadlock (No Progress Possible)

Client A is waiting for Server B to respond on Stream 2.

Server B is waiting for Client A to consume more data from Stream 1 and 
send a WINDOW_UPDATE.

Since Client A doesn’t consume more data from Stream 1 (due to concurrency 
limits, in our case), the connection window remains blocked, and both 
parties are stuck.

Flow Control State:
• Connection window: 0KB (fully blocked).
• Stream 1 & Stream 2: Blocked.

Key Points

  • The connection-level flow control window is shared across all streams, meaning one stream’s usage can block others.
  • This deadlock happens because Client A and Server B are waiting on each other:
    • Server B waits for a WINDOW_UPDATE to free up connection flow control, but Client A's concurrency constraints make this impossible.
    • Client A waits for a response on Stream 2, which would free up a concurrency slot, but Server B cannot send it.
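
To make the window accounting concrete, here is a small, self-contained toy model that replays steps 2-5 with plain integers (treating 64KB as 65,536 bytes). It models only the book-keeping, not real HTTP/2 framing:

// Toy model of the shared connection-level window and two stream windows.
public class FlowControlDeadlockDemo {

    static int connectionWindow = 64 * 1024; // shared across all streams
    static int stream1Window = 64 * 1024;
    static int stream2Window = 64 * 1024;

    // Server B may send at most min(bytes, stream window, connection window).
    static int send(int streamWindow, int bytes) {
        int sendable = Math.min(bytes, Math.min(streamWindow, connectionWindow));
        connectionWindow -= sendable;
        return streamWindow - sendable;
    }

    public static void main(String[] args) {
        // Step 2: B sends 64KB on Stream 1, exhausting both windows.
        stream1Window = send(stream1Window, 64 * 1024);

        // Step 3: A consumes 32KB from Stream 1 and sends WINDOW_UPDATEs.
        connectionWindow += 32 * 1024;
        stream1Window += 32 * 1024;

        // Step 4: B sends 32KB on Stream 2, exhausting the connection window again.
        stream2Window = send(stream2Window, 32 * 1024);

        // Step 5: the shared connection window is 0, so B cannot send on either
        // stream until A consumes more of Stream 1 - which A will not do. Deadlock.
        System.out.println("connection=" + connectionWindow
                + " stream1=" + stream1Window + " stream2=" + stream2Window);
        // prints: connection=0 stream1=32768 stream2=32768
    }
}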

The Solutions


A number of workarounds and protections exist, each with its own implications. Let's consider a few:

Option 1: Avoid Cyclical Calls on the Same Connection

  • Separate Connection Pools:
    • Use different WebClient instances (each with its own connection pool) for each call, as sketched below.
    • Pros: Eliminates contention at the connection level; straightforward setup if additional connections are acceptable.
    • Cons: More TCP connections; slightly higher resource usage.
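
A minimal sketch of Option 1, assuming Spring WebFlux with Reactor Netty; the bean and pool names are illustrative:

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.http.client.reactive.ReactorClientHttpConnector;
import org.springframework.web.reactive.function.client.WebClient;
import reactor.netty.http.client.HttpClient;
import reactor.netty.resources.ConnectionProvider;

@Configuration
public class WebClientConfig {

    // Dedicated pool and client for the /retrieve call (c1).
    @Bean
    public WebClient retrieveClient() {
        return WebClient.builder()
                .clientConnector(new ReactorClientHttpConnector(
                        HttpClient.create(ConnectionProvider.create("retrieve-pool"))))
                .build();
    }

    // Dedicated pool and client for the /write call (c2), so c1 and c2 never
    // share a connection-level flow-control window.
    @Bean
    public WebClient writeClient() {
        return WebClient.builder()
                .clientConnector(new ReactorClientHttpConnector(
                        HttpClient.create(ConnectionProvider.create("write-pool"))))
                .build();
    }
}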

Option 2: Increase the HTTP/2 Flow Control Window

  • Adjust initialWindowSize in Reactor Netty (see the configuration snippet further down):
    • Raising the default window from 65,535 bytes can reduce the likelihood of hitting a deadlock.
    • Pros: Easy configuration change; good intermediate workaround.
    • Cons: Not a permanent fix if data volumes or concurrency keep growing; may only delay a potential deadlock.

Option 3: Fully Consume One Call’s Data Before Making the Next

  • Buffer All Data from the First Call:
    • Ensure the first response is fully read (and potentially buffered) before initiating the second call.
    • Pros: No chance of overlapping streams competing for flow control on the same connection.
    • Cons: Requires buffering large payloads, losing streaming benefits; increased memory usage.

Option 4: Increase or Remove Concurrency Limits

  • If your reactive pipeline uses operators like flatMap(this::write, concurrency), consider raising the concurrency significantly or removing the limit. By allowing more simultaneous consumption, you're less likely to stall a particular stream, and thus you keep issuing WINDOW_UPDATE frames more regularly.
  • Pros
    • Reduces Partial Consumption: More concurrency means the client promptly reads from all in-flight streams, freeing up window space
  • Cons
    • Not a Silver Bullet: If your application logic inherently delays reading a stream (or you deal with extremely large payloads), deadlocks can still occur under certain conditions.

@Bean
public WebClientCustomizer webClientCustomizer() {
    return builder -> builder
        .clientConnector(new ReactorClientHttpConnector(HttpClient.create()
                // raise the HTTP/2 initial window size from the 65,535-byte default
                .http2Settings(spec -> spec.initialWindowSize(3000 * 120))));
}

Configuring the window size in Reactor Netty

What We Did


Option 3 (reading all data from the initial call) suited our needs because our services are not memory constrained, and the flow in question is asynchronous with no stringent latency requirements. This notably allowed us to keep our concurrency limits whilst fully protecting against deadlock.
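
Concretely, in the conceptual pipeline from earlier this meant draining the /retrieve response before opening any /write streams. A minimal sketch, using collectList as one way to buffer (this assumes the dataset fits comfortably in memory):

@Scheduled
public Flux<Data> process() {
  return retrieve()
        .collectList()                      // fully consume c1 so its stream no longer holds window space
        .flatMapMany(Flux::fromIterable)    // replay the buffered elements as a new Flux
        .window(windowSize)
        .flatMap(this::write, windowConcurrency);
}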

Regardless of the chosen solution, we also recommend configuring sensible timeouts for all HTTP requests as an additional layer of protection against deadlock (and to fail fast, but that's a different topic!).
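
For example, a response timeout on the Reactor Netty client ensures a hung request eventually fails instead of holding its connection forever. A minimal sketch in the same customizer style as above, with an arbitrary illustrative value:

@Bean
public WebClientCustomizer timeoutCustomizer() {
    return builder -> builder
        .clientConnector(new ReactorClientHttpConnector(HttpClient.create()
                .responseTimeout(Duration.ofSeconds(30))));   // illustrative value
}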

Will HTTP/3 help?


(Thanks to Lucas Pardue on X for the heads up)

HTTP/3, like HTTP/2, is a stream-based protocol that multiplexes multiple streams over a single connection. Unfortunately, it can suffer from the same issue for similar reasons. While flow control in HTTP/3 is handled by the underlying QUIC transport, which gives each stream its own window and avoids TCP-level head-of-line blocking (a notable improvement over HTTP/2), the connection-level flow control window still exists and can lead to a deadlock scenario.

Lucas, who is debugging a similar issue on behalf of Cloudflare, highlighted an intriguing open Chromium issue. This issue describes a deadlock that occurs when 13 <video> elements are loaded concurrently within an HTML webpage. The problem manifests in both HTTP/2 and HTTP/3 due to the exhaustion of the connection-level flow control window.

TL;DR


In HTTP/2, all streams share one connection-level flow-control window. When a single client (A) makes two separate calls (A→B and A→B again) on the same connection, the first call can consume the shared window and block the second call. Meanwhile, the second call can also partially consume (or demand) window space before the first call has fully finished. If neither call’s data is consumed quickly enough (and WINDOW_UPDATE frames aren’t sent), each call ends up waiting on the other to free the shared connection window. The result is a deadlock where no progress is possible.

HTTP/3, which similarly shares a single connection-level flow-control window, suffers the same fate.

Links