What are TCP sequence numbers?
A TCP connection is a method of transmitting two byte streams,
one stream in each direction.
To map the unordered, unreliable bytes in IP packets to the ordered bytes in this stream,
each byte in each stream is identified by a sequence number.
Each TCP packet contains a segment of the stream as its payload.
The TCP header contains the sequence number of the first byte in this segment.
TCP packets can contain an acknowledgement,
which is the sequence number of the next byte the sender expects to receive
(and thus, an acknowledgement of receiving all bytes prior to that).
The sequence number field is 32 bits.
Naively, this means each byte stream can be up to 232 bytes,
or 4.3 gigabytes, long.
Does this mean you can’t use a plain TCP connection to download a file larger than 4.3 gigabytes?
No: when the sequence number 232 is reached,
the sequence number wraps around to zero again!
Because of this wrap-around,
bytes in the stream don’t actually have a unique identifying sequence number.
How then can we identify which byte a packet is talking about?
The answer strictly is “we can’t”.
(To mitigate this, TCP can use timestamps to identify old packets,
so as to discard them.
More about that attack another time.)
You might expect the first byte in the stream to have index 0, or 1.
But it does not!
Instead, the sender chooses a random “initial sequence number” (ISN)
during the connection handshake.
(This complication is apparently to mitigate an attack
whereby an attacker can impersonate another IP address,
if the attacker is able to forge the source IP address of the IP packets it sends.
Again, I’ll cover that another time.)
tcpdump
can show the initial sequence number in a TCP header.
In the following, we see the first packet sent by the TCP client,
with seq 3112279261
(The four bytes b981 9cdd
in the hexdump output).
22:09:04.387241 IP (tos 0x0, ttl 64, id 33056, offset 0, flags [DF], proto TCP (6), length 60)
127.0.0.1.56742 > 127.0.0.2.12345: Flags [S], cksum 0xfe31 (incorrect -> 0x212f), seq 3112279261, win 43690, options [mss 65495,sackOK,TS val 29629949 ecr 0,nop,wscale 6], length 0
0x0000: 4500 003c 8120 4000 4006 bb98 7f00 0001 E..<..@.@.......
0x0010: 7f00 0002 dda6 3039 b981 9cdd 0000 0000 ......09........
0x0020: a002 aaaa fe31 0000 0204 ffd7 0402 080a .....1..........
0x0030: 01c4 1dfd 0000 0000 0103 0306 ............
We see next that the server sends its own random ISN (seq 3504942089
),
and acknowledges the client’s initial sequence number by sending ack 3112279262
.
Notice that this is one more than the ISN!
Thus, the initial sequence number is not actually the identifier for the first byte in the sequence;
it’s the number just prior to that.
22:09:04.387254 IP (tos 0x0, ttl 64, id 0, offset 0, flags [DF], proto TCP (6), length 60)
127.0.0.2.12345 > 127.0.0.1.56742: Flags [S.], cksum 0xfe31 (incorrect -> 0x046a), seq 3504942089, ack 3112279262, win 43690, options [mss 65495,sackOK,TS val 29629949 ecr 29629949,nop,wscale 6], length 0
0x0000: 4500 003c 0000 4000 4006 3cb9 7f00 0002 E..<..@.@.<.....
0x0010: 7f00 0001 3039 dda6 d0e9 2c09 b981 9cde ....09....,.....
0x0020: a012 aaaa fe31 0000 0204 ffd7 0402 080a .....1..........
0x0030: 01c4 1dfd 01c4 1dfd 0103 0306 ............
22:09:04.387266 IP (tos 0x0, ttl 64, id 33057, offset 0, flags [DF], proto TCP (6), length 52)
Similar posts
More by Jim
What does the dot do in JavaScript?
foo.bar
, foo.bar()
, or foo.bar = baz
- what do they mean? A deep dive into prototypical inheritance and getters/setters. 2020-11-01
Smear phishing: a new Android vulnerability
Trick Android to display an SMS as coming from any contact. Convincing phishing vuln, but still unpatched. 2020-08-06
A probabilistic pub quiz for nerds
A “true or false” quiz where you respond with your confidence level, and the optimal strategy is to report your true belief. 2020-04-26
Time is running out to catch COVID-19
Simulation shows it’s rational to deliberately infect yourself with COVID-19 early on to get treatment, but after healthcare capacity is exceeded, it’s better to avoid infection. Includes interactive parameters and visualizations. 2020-03-14
The inception bar: a new phishing method
A new phishing technique that displays a fake URL bar in Chrome for mobile. A key innovation is the “scroll jail” that traps the user in a fake browser. 2019-04-27
The hacker hype cycle
I got started with simple web development, but because enamored with increasingly esoteric programming concepts, leading to a “trough of hipster technologies” before returning to more productive work. 2019-03-23
Project C-43: the lost origins of asymmetric crypto
Bob invents asymmetric cryptography by playing loud white noise to obscure Alice’s message, which he can cancel out but an eavesdropper cannot. This idea, published in 1944 by Walter Koenig Jr., is the forgotten origin of asymmetric crypto. 2019-02-16
How Hacker News stays interesting
Hacker News buried my post on conspiracy theories in my family due to overheated discussion, not censorship. Moderation keeps the site focused on interesting technical content. 2019-01-26
My parents are Flat-Earthers
For decades, my parents have been working up to Flat-Earther beliefs. From Egyptology to Jehovah’s Witnesses to theories that human built the Moon billions of years in the future. Surprisingly, it doesn’t affect their successful lives very much. For me, it’s a fun family pastime. 2019-01-20
The dots do matter: how to scam a Gmail user
Gmail’s “dots don’t matter” feature lets scammers create an account on, say, Netflix, with your email address but different dots. Results in convincing phishing emails. 2018-04-07
The sorry state of OpenSSL usability
OpenSSL’s inadequate documentation, confusing key formats, and deprecated interfaces make it difficult to use, despite its importance. 2017-12-02
I hate telephones
I hate telephones. Some rational reasons: lack of authentication, no spam filtering, forced synchronous communication. But also just a visceral fear. 2017-11-08
The Three Ts of Time, Thought and Typing: measuring cost on the web
Businesses often tout “free” services, but the real costs come in terms of time, thought, and typing required from users. Reducing these “Three Ts” is key to improving sign-up flows and increasing conversions. 2017-10-26
Granddad died today
Granddad died. The unspoken practice of death-by-dehydration in the NHS. The Liverpool Care Pathway. Assisted dying in the UK. The importance of planning in end-of-life care. 2017-05-19
How do I call a program in C, setting up standard pipes?
A C function to create a new process, set up its standard input/output/error pipes, and return a struct containing the process ID and pipe file descriptors. 2017-02-17
Your syntax highlighter is wrong
Syntax highlighters make value judgments about code. Most highlighters judge that comments are cruft, and try to hide them. Most diff viewers judge that code deletions are bad. 2014-05-11
Want to build a fantastic product using LLMs? I work at
Granola where we're building the future IDE for knowledge work. Come and work with us!
Read more or
get in touch! This page copyright James Fisher 2018. Content is not associated with my employer. Found an error? Edit this page.