The TCP "challenge ACK" side channel

By Jake Edge
August 10, 2016

Side-channel attacks against various kinds of protocols (typically networking or cryptographic) are both dangerous and often hard for developers and reviewers to spot. They are generally passive attacks, which makes them hard to detect as well. A recent paper [PDF] describes in detail one such attack against the kernel's TCP networking stack; the bug (CVE-2016-5696) has existed since Linux 3.6, which was released in 2012. Ironically, the bug was introduced because Linux has implemented a countermeasure against another type of attack.

There are a number of pieces of information that an attacker needs to interfere with a TCP connection between two hosts. To start with, the so-called four-tuple, which consists of the source IP address, source port number, destination IP address, and destination port number, is needed. Several of those values can be guessed or inferred (e.g. destination IP and port), but there is another piece of information needed to actually interfere with a connection.

TCP has 32-bit sequence numbers that are used to order the packets in the connection stream. They are also an important part of the packets used to establish and break down connections. A packet that could interfere with a connection must have a sequence number that is within the receive window of the target. That window effectively determines the range of sequence numbers that are acceptable.

Once upon a time, in a far more trusting era, sequence numbers were fairly easily predicted, but those days are long gone. These days, sequence numbers are randomized to thwart various kinds of packet-injection and connection-spoofing attacks. An eavesdropper can still observe the sequence numbers in a conversation, but an "off-path" attacker must guess. By randomizing the initial sequence number (ISN) used by a connection, network stacks make guessing difficult enough to stop most off-path attacks.

But if a way can be found to more quickly narrow in on the sequence numbers used in a connection, off-path attackers can be more efficient in their probing—to the point where they can inject packets into an established connection. That is effectively what the researchers found.

But first, there is another obstacle to overcome: according to the paper, off-path attacks have generally been limited by the need to get unprivileged malware running on one of the endpoints to determine whether two hosts are actually communicating. But the researchers found a way to quickly determine whether two hosts are communicating and what port numbers they are using, without any assistance from malware.

Linux is the only operating system vulnerable to this attack because it is the only one that has faithfully implemented RFC 5961, which was proposed to avoid a different kind of packet injection attack. It uses "challenge ACKs" to avoid resetting a connection when a spoofed connection request (SYN) or connection termination (RST) packet with a sequence number within the receive window is received. The challenge ACK will allow long-lived connections to be more resistant to these spoofed packets that are meant to close the connection.

The challenge ACKs require that the original sender reply with the exact sequence number expected for the next packet, not just one within the receive window, which is more difficult for an off-path attacker to arrange. But challenge ACKs also consume resources, so the RFC recommends that a limit be imposed on the number of challenge ACKs sent over a given time frame (Linux used 100/second by default). Since challenge ACKs were expected to be rare occurrences, the counter for rate-limiting them was global for all TCP connections on the system—and the RFC specifically directed that regular ACKs should not be counted. Because of this, challenge ACKs provide a side channel:

At a very high level, the vulnerability allows an attacker to create contention on a shared resource, i.e., the global rate limit counter on the target system by sending spoofed packets. The attacker can then subsequently observe the effect on the counter changes, measurable through probing packets.

Through extensive experimentation, we demonstrate that the attack is extremely effective and reliable. Given any two arbitrary hosts, it takes only 10 seconds to successfully infer whether they are communicating. If there is a connection, subsequently, it takes also only tens of seconds to infer the TCP sequence numbers used on the connection.

The general outlines of an attack are as follows. The attacker establishes an ordinary connection to the server, then sends a stream of bogus RST and SYN packets to force the target to generate the maximum number of challenge ACKs. Some spoofed packets "from" the client of interest are also sent. If all of the expected 100 challenge ACKs are received on the regular connection, then the four-tuple in the spoofed packet does not represent an active connection, but if some are missing, they must have been sent as challenge ACKs in response to spoofed packets, indicating that a connection exists.

Once that is established, in-window sequence numbers need to be determined—challenge ACKs can help there too. Once again, challenge ACKs are provoked using a normal connection and the number received are counted. Spoofed RST packets with a guessed sequence number are also sent; the number of challenge ACKs received on the regular connection allow the attacker to infer whether the sequence number is within the window or not. Further probing with spoofed ACKs can narrow things down to the exact sequence number expected for the next packet.

Two kinds of attacks are described in the paper. The easiest is to simply reset an in-progress connection. The other hijacks the connection to inject content of the attacker's choosing. The paper describes the former being reliably deployed against SSH and Tor connections, while it mentions the latter being targeted at long-lived connections for data like video streams, advertisements, or news sites.

There are some more wrinkles to the attack, of course, including synchronizing with the host's clock so that the one-second boundary can be reliably determined. That, too, uses challenge ACKs. Other hurdles are also discussed in the paper. But the attack can have far-reaching effects as the team's short YouTube video demonstrates. It injects some JavaScript into a web session to display attacker-controlled content.

The research was also highlighted in an article in a University of California, Riverside (UCR) publication as most of the researchers are students or faculty there. Yue Cao, Zhiyun Qian, Zhongjie Wang, Tuan Dao and Srikanth V. Krishnamurthy of UCR were joined by Lisa M. Marvel from the United States Army Research Laboratory in writing the paper, which was presented at the USENIX Security Symposium on August 10.

It is an interesting and clever attack that sadly only lacks a catchy name, colorful logo, and hype-filled web site. Cao did alert kernel developers to the problem, which was fixed in the mainline in July (and appears in the 4.7 kernel). The fix raises the limit to 1000 challenge ACKs per second, but also adds some randomization to the value so that counting will be less effective. In addition, the patch notes per-socket rate-limiting is available, which could lead to the removal of the global challenge ACK count down the road; some work toward that end has been merged as well.

The fix has not made it to the stable kernels yet, but there is a mitigation available in the form of the tcp_challenge_ack_limit sysctl knob. Setting that value to something enormous (e.g. 999999999) will make it much harder for attackers to exploit the flaw.

Spoofing source IP addresses is not technically difficult, though it may be hard to get the packet through routers and the like in some cases. A Center for Applied Internet Data Analysis study shows that nearly half of the autonomous systems on the internet are at least partly spoofable, though. There are, as yet, no reports of attacks using this technique in the wild, though one would guess it won't be long before we do see some.

In the end, challenge ACKs seem a reasonable solution to a real problem, but Linux played the role of a guinea pig here. There are upsides to doing that, such as providing a platform where the researchers could discover the problems in the RFC. There are downsides, as well; Linux is currently getting some bad press about its networking implementation, for example. On the whole, though, these problems needed to be found—and now they are.

Index entries for this article
Security	Linux kernel/Networking
Security	Networking

The TCP "challenge ACK" side channel

Posted Aug 10, 2016 22:55 UTC (Wed) by smoogen (subscriber, #97) [Link] (3 responses)

Another thing this example can be used for is as the example to point out when someone says "Why aren't you RFC compliant" as if RFC's are holy grails that are written securely, clearly and without problems.

The TCP "challenge ACK" side channel

Posted Aug 11, 2016 14:57 UTC (Thu) by ssmith32 (subscriber, #72404) [Link] (2 responses)

Noooo... note that they fixed this without violating the RFC. "Why aren't you RFC compliant?" doesn't mean it's a holy grail. RFC's are generally very boring, detailed and clear. In other words, "You had ONE job, it was simple, straightforward and clear - what went wrong?". Following it doesn't mean it is secure or awesome. You still need to look for bugs, and be diligent. Otherwise we would just have RFC compilers, and no need for skilled engineers to implement them *well*.

Please don't pretend this is an excuse to violate RFC's... we'll end up with IE and all other kinds of 1990s Microsoft wonderfulness.

The TCP "challenge ACK" side channel

Posted Aug 11, 2016 16:08 UTC (Thu) by smoogen (subscriber, #97) [Link] (1 responses)

I was talking about people who think that RFC's are holy grails and that you can implement them purely like they were compilers. I have dealt with way too many engineers who have dealt with some aspect of one RFC which is written in such a way and thus think that all of them are or should be.

As a side comment, while IE did vary off from various standards some of the WORST things people complained about in the IE1->4 days was where it was actually following the standard to the letter but Netscape didn't because to follow the standard to the letter gave crappy viewing (or subpar performance or a thousand other things.) I know this because I worked on the upstream browser and we spend most of our time having to implement 'this does not follow the HTML standard but it makes it work with Netscape' quite often. [Do not take this as Microsoft or Spyglass was right in putting in the various RFC breaks.. just that sometimes compliant and complaint are only 2 letters different]

The TCP "challenge ACK" side channel

Posted Aug 11, 2016 17:22 UTC (Thu) by flussence (guest, #85566) [Link]

The HTML standard agrees with you these days — in the same spec with a volume dedicated to a “how to parse 10 billion pages of broken crap consistently” algorithm, there used to be gems like “comments must be valid SGML with balanced -- delimiters”. Thankfully the WHATWG saw some common sense and changed it (the W3C has yet to...)

The TCP "challenge ACK" side channel

Posted Aug 10, 2016 23:20 UTC (Wed) by dowdle (subscriber, #659) [Link]

Red Hat has a page on it (CVE-2016-5696) that says it will be addressed with an upcoming RHEL6 and RHEL7 kernel update:

https://access.redhat.com/security/cve/cve-2016-5696

The TCP "challenge ACK" side channel

Posted Aug 11, 2016 8:14 UTC (Thu) by epa (subscriber, #39769) [Link] (1 responses)

This shows how having a hard limit such as 100 challenge ACKs sent per second can leak information. It might be better to have a fuzzy limit where you count the number of such ACKs sent in the past second, and the probability of skipping the next one is N / 200, in other words skipping becomes steadily more likely as N approaches 200, with a new challenge ACK always skipped when N >= 200. Or some similar randomized rule, perhaps using an exponentially weighted probability so there is no exact upper limit.

The TCP "challenge ACK" side channel

Posted Aug 11, 2016 8:16 UTC (Thu) by epa (subscriber, #39769) [Link]

Ah, and I see that the fix does randomize a bit, but in a different way. I wanted to make a general point that any fixed upper limit could leak information in this way - and it may apply to leakage between processes on the same system as well as across the network.

Patches landed in the stable queue

Posted Aug 12, 2016 15:03 UTC (Fri) by hmh (subscriber, #3838) [Link]

The patches addressing CVE-2016-5696 are available in the public stable queue tree, and are very likely to be present in the next round of stable releases.

One catchy name coming right up

Posted Aug 14, 2016 13:28 UTC (Sun) by fratti (guest, #105722) [Link] (3 responses)

>It is an interesting and clever attack that sadly only lacks a catchy name, colorful logo, and hype-filled web site.

Oh! Oh! I've got one! "ACK Overflow". And the logo should be something cool, like a dinosaur with laser guns. Just copypaste the heartbleed website.

In all seriousness though, this is a beautiful attack, and yet another reason for websites to move towards HTTPS (since injection would be mitigated that way).

One catchy name coming right up

Posted Aug 16, 2016 10:01 UTC (Tue) by b1r63r (guest, #110634) [Link]

Another one.

UCR ACK attack just begs for the name UCRACK

One catchy name coming right up

Posted Aug 21, 2016 8:59 UTC (Sun) by dirtyepic (guest, #30178) [Link]

I think every CVE number assignment should just come with a randomly generated name that you're required to reference in all communications about the vulnerability. No one would make a hype-filled website for the Butterscotch Narwhal attack (though the logo could be entertaining).

One catchy name coming right up

Posted Aug 23, 2016 11:08 UTC (Tue) by Lennie (subscriber, #49641) [Link]

HTTPS can have it's own problems too. How about a Same Origin Policy violation !:

http://arstechnica.com/security/2016/08/new-attack-steals...

The TCP "challenge ACK" side channel

Posted Aug 18, 2016 11:11 UTC (Thu) by jpritikin73 (guest, #107608) [Link]

Here's how to add a work-around to your system manually, https://www.mail-archive.com/debian-user@lists.debian.org...

The TCP "challenge ACK" side channel

Posted Aug 20, 2016 5:44 UTC (Sat) by toyotabedzrock (guest, #88005) [Link]

In one of the network man pages a note talks about assuming someone would never send an invalid value. I can't recall where I read it but this type of assumption has probably left vulnerable code throughout.

The TCP "challenge ACK" side channel

Posted Aug 26, 2016 20:11 UTC (Fri) by timmotheusglorien (guest, #110868) [Link]

Hello,
i am not happy with "maximum warp" setting of tcp_challenge_ack_limit. I programmed a daemon which periodically changes the value inside the proc file as described in the paper. The proc handler of this parameter should handle it and it might work quite well now. However i need some support to get this daemon "community standard" maybe some people like to work and help on this. This is my first open-source project, so please be patient and polite.

https://github.com/timmotheusglorien/challack-daemon.git

Kind Regards