Block Coding & Hamming Distance

Block coding adds structured error-checking bits to a data word. The reliability of a block code relies heavily on its Minimum Hamming Distance ($d_{min}$), which defines the minimum number of bit flips required to accidentally turn one valid codeword into another valid codeword.

Learning Goals

Introduction to Block Coding: Mapping $k$-bit datawords into $n$-bit codewords ($n > k$).
Error Detection vs. Error Correction mechanisms in block codes.
Hamming Distance: Definition, calculating distance between two binary words, and Minimum Hamming Distance ($d_{min}$).
Geometric representation and conditions for detecting ($s$ errors) and correcting ($t$ errors).
Compute the Hamming distance between any two given binary sequences.
Mathematically prove how many errors a code can detect or correct using the formulas:$$d_{min} = s + 1$$$$d_{min} = 2t + 1$$

In the Data Link Layer of computer networks, block coding adds controlled redundancy so that transmission errors can be detected and, in some schemes, corrected before data is delivered upward.2 A source message is divided into $k$ -bit datawords and each dataword is mapped to an $n$ -bit codeword with $n>k$ . The added $r=n-k$ bits carry no new payload information, but they create structure in the code space that helps the receiver recognize corruption.2

A central metric in this structure is the Hamming distance, defined as the number of differing bit positions between two binary words of equal length.2 For binary vectors $x$ and $y$ , it can be computed by XOR and bit counting:

d(x,y)=w(x\oplus y),

where $w(\cdot)$ is the number of $1$ s in the result, also called the Hamming weight.2 The design quantity that matters most is the minimum Hamming distance $d_{min}$ , because it determines how many errors the code can reliably detect or correct.3

In network-theory context, block coding is not merely abstract algebra: it expresses the tradeoff among reliability, bandwidth efficiency, and decoder complexity. A higher code rate $k/n$ preserves more bandwidth, while a larger $d_{min}$ improves resilience to noise. This section develops the mapping from datawords to codewords, the difference between error detection and error correction, the geometry of Hamming space, and the formal results

d_{min}=s+1 \qquad\text{and}\qquad d_{min}=2t+1,

which characterize guaranteed detection of $s$ errors and guaranteed correction of $t$ errors.3

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩ ↩² ↩³ ↩⁴ ↩⁵
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩ ↩² ↩³ ↩⁴ ↩⁵
Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩
Minimum Hamming Distance - GeeksforGeeks - Accessible explanation of Hamming weight, Hamming distance, and examples. ↩ ↩²
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩ ↩² ↩³

Hamming Distance and Minimum Hamming Distance

Why this matters in the Data Link Layer

Frames can be corrupted by noise, interference, attenuation, or synchronization issues. Block coding adds structured redundancy so a receiver can detect invalid bit patterns and, in stronger codes, infer the most likely transmitted codeword.2

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩

Block Coding Model

A block code can be described as an $(n,k)$ scheme: each $k$ -bit dataword is transformed into an $n$ -bit codeword, where $n=k+r$ and $r$ is the number of redundant bits. Because there are $2^k$ possible datawords, an encoder must assign one valid codeword to each of those $2^k$ inputs. Not every $n$ -bit word is valid; only the selected set of codewords belongs to the code. This restriction is what makes error detection possible.2

For example, consider a simple $(3,2)$ code:

Dataword	Codeword
$00$	$000$
$01$	$011$
$10$	$101$
$11$	$110$

Only four of the eight possible $3$ -bit patterns are valid. If the receiver gets $001$ , it can immediately declare an error because $001$ is not one of the legal codewords. This is the essence of error detection.

A stronger code may also support error correction by choosing the valid codeword nearest to the received word in Hamming distance.2 In that case, codewords must be separated more widely so that the “regions of influence” around them do not overlap.2

A useful rate measure is

\text{Code rate}=\frac{k}{n},

which quantifies efficiency. Higher redundancy lowers rate but can increase $d_{min}$ and therefore reliability.

Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩ ↩² ↩³ ↩⁴
Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩ ↩² ↩³
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩ ↩²
Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩

How to Compute Hamming Distance Between Two Binary Sequences

1
Step 1
The two binary sequences must have the same number of bits; otherwise the Hamming distance is not defined in the standard block-coding sense.2

Footnotes

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩

Minimum Hamming Distance - GeeksforGeeks - Accessible explanation of Hamming weight, Hamming distance, and examples. ↩
2
Step 2
Inspect each bit pair from left to right and note every position at which the bits differ.

Footnotes

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
3
Step 3
XOR the two sequences. Every resulting $1$ marks a differing position, while every $0$ marks a matching position.2

Footnotes

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩

Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩
4
Step 4
The total number of differing positions is the Hamming distance.2

Footnotes

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩

Minimum Hamming Distance - GeeksforGeeks - Accessible explanation of Hamming weight, Hamming distance, and examples. ↩
5
Step 5
A larger distance means the two words are more separated in Hamming space, which generally improves distinguishability under noise.2

Footnotes

Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩

Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Worked Examples of Hamming Distance

Let us compute distances directly.

Example 1
For $x=000$ and $y=011$ :

x\oplus y = 000\oplus 011 = 011

The XOR result has two $1$ s, so

d(000,011)=2.

This example is standard in block-coding discussions.2

Example 2
For $x=10101$ and $y=11110$ :

10101\oplus 11110 = 01011

The XOR result contains three $1$ s, hence

d(10101,11110)=3.

Again, the distance equals the number of differing bit positions.2

Example 3
For $x=1100101$ and $y=1001100$ :

1100101\oplus 1001100 = 0101001

The XOR result has three $1$ s, so

d(1100101,1001100)=3.

For an entire code, we calculate all pairwise distances among valid codewords and choose the smallest. That smallest value is $d_{min}$ .2 If a code has codewords $\{000,011,101,110\}$ , the pairwise distances are all $2$ , so

d_{min}=2.

In this code, every valid codeword is separated from every other valid codeword by at least two bit changes, so any single-bit corruption moves a transmitted codeword to an invalid word rather than another valid one.

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩ ↩² ↩³
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩ ↩² ↩³ ↩⁴

Fast exam shortcut

To compute Hamming distance quickly, XOR the two bit strings and count the $1$ s. This avoids manual comparison and directly matches the formal definition $d(x,y)=w(x\oplus y)$ .2

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Minimum Hamming Distance - GeeksforGeeks - Accessible explanation of Hamming weight, Hamming distance, and examples. ↩

Minimum Hamming Distance and Its Meaning

The minimum distance $d_{min}$ is the most important design parameter of a block code because the worst-separated pair of valid codewords determines the code’s guaranteed performance.2 Formally, if $C$ is the set of valid codewords, then

d_{min}=\min_{\substack{x,y\in C\\x\ne y}} d(x,y).

Interpretation:

If valid codewords are too close, a small number of bit flips may transform one valid codeword into another, making reliable detection impossible.
If valid codewords are far apart, more corruption is needed before ambiguity arises.2

A classic relation states:

To detect up to $s$ errors in all cases, a code must satisfy $d_{min}\ge s+1.$
To correct up to $t$ errors in all cases, a code must satisfy $d_{min}\ge 2t+1.$

These are often written in exact guaranteed-capability form as

s=d_{min}-1 \qquad\text{and}\qquad t=\left\lfloor\frac{d_{min}-1}{2}\right\rfloor.

The equalities $d_{min}=s+1$ and $d_{min}=2t+1$ describe the threshold values for exact guaranteed detection and correction limits.2

This explains familiar cases:

$d_{min}$	Guaranteed detection	Guaranteed correction
$1$	$0$ errors	$0$ errors
$2$	$1$ error	$0$ errors
$3$	$2$ errors	$1$ error
$4$	$3$ errors	$1$ error
$5$	$4$ errors	$2$ errors

The correction count grows more slowly because correction requires not only noticing that an error occurred, but deciding which original codeword was sent.2

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩ ↩² ↩³ ↩⁴
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩ ↩² ↩³
Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩

Guaranteed Capability vs. Minimum Hamming Distance

Comparison of detectable and correctable error counts implied by $d_{min}$ .2

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Geometric View: Hamming Space, Spheres, and Decoding Regions

All binary words of length $n$ can be viewed as points in an $n$ -dimensional Hamming space.2 Valid codewords occupy selected points in this space, and the Hamming distance measures how many coordinate changes are needed to move from one point to another.

For correction, we imagine a Hamming sphere of radius $t$ around each valid codeword. Every received word inside that sphere is decoded to the center codeword.2 To guarantee correction of $t$ errors, spheres of radius $t$ around distinct codewords must not overlap. If they overlapped, some received word would be within distance $t$ of two different codewords, causing ambiguity.2

For detection only, overlap is not the issue. Instead, we need every pattern of up to $s$ bit errors to move the transmitted codeword outside the set of all valid codewords.2 Therefore, no two codewords may be closer than $s+1$ .

This geometric picture is especially valuable because it turns an algebraic rule into an intuitive one:

Detection means corrupted words should not land on another legal codeword.
Correction means corrupted words should remain closest to the true codeword.2

Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩ ↩² ↩³ ↩⁴ ↩⁵
Hamming Metric and the Minimum Distance - UCSD notes providing proof ideas for correction capability via triangle inequality and Hamming spheres. ↩ ↩² ↩³
Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩ ↩²
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Proof Idea for Error Detection: Why $d_{min}=s+1$

1
Step 1
This means every pair of distinct valid codewords differs in at least $d_{min}$ bit positions.2

Footnotes

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩

Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩
2
Step 2
Suppose a codeword $x$ is sent and at most $s$ bit errors occur during transmission.
3
Step 3
The received word $y$ differs from $x$ in at most $s$ positions, so $d(x,y)\le s$ .
4
Step 4
If $y$ were some other valid codeword $z e x$ , then $d(x,z)$ would have to be at most $s$ , contradicting the definition that all distinct codewords are at least $d_{min}$ apart.
5
Step 5
Thus guaranteed detection of all patterns of up to $s$ errors requires $s<d_{min}$ , equivalently $d_{min}\ge s+1$ .2

Footnotes

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩

Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Proof Idea for Error Correction: Why $d_{min}=2t+1$

1
Step 1
The decoder selects the valid codeword with smallest Hamming distance from the received word, which is the standard minimum-distance decoding rule.2

Footnotes

Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Hamming Metric and the Minimum Distance - UCSD notes providing proof ideas for correction capability via triangle inequality and Hamming spheres. ↩
2
Step 2
If codeword $x$ is sent and the received word is $y$ , then $d(x,y)\le t$ .
3
Step 3
For successful correction, no other valid codeword $z$ can also lie within distance $t$ of $y$ .
4
Step 4
If both $d(x,y)\le t$ and $d(z,y)\le t$ , then $d(x,z)\le d(x,y)+d(y,z)\le 2t$ .
5
Step 5
But distinct codewords must be separated by at least $d_{min}$ . Therefore ambiguity is impossible only when $d_{min}>2t$ , equivalently $d_{min}\ge 2t+1$ .2

Footnotes

Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Hamming Metric and the Minimum Distance - UCSD notes providing proof ideas for correction capability via triangle inequality and Hamming spheres. ↩
6
Step 6
Correction is harder than detection because the decoder must identify the original word, not just flag inconsistency.

Common Questions and Edge Cases

For binary words $x$ and $y$ of equal length,

d(x,y)=w(x\oplus y)

where $w(\cdot)$ counts the number of $1$ s in the XOR result.2

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Minimum Hamming Distance - GeeksforGeeks - Accessible explanation of Hamming weight, Hamming distance, and examples. ↩

Conceptual Roadmap for Learning Block Coding and Hamming Distance

Represent data in fixed-size blocks

Stage 1

Divide the bit stream into $k$ -bit datawords so encoding can be applied systematically."

Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩

Add redundancy

Stage 2

Map each dataword to an $n$ -bit codeword with $n>k$ , introducing structure into the set of valid words."

Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩

Measure pairwise separation

Stage 3

Use Hamming distance to quantify how far apart codewords are in binary space.2"

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Minimum Hamming Distance - GeeksforGeeks - Accessible explanation of Hamming weight, Hamming distance, and examples. ↩

Find $d_{min}$

Stage 4

Compute the smallest pairwise distance among all valid codewords; this determines guaranteed capability.2"

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩

Apply detection and correction conditions

Stage 5

Use $d_{min}\ge s+1$ for detection and $d_{min}\ge 2t+1$ for correction.3"

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩
Hamming Metric and the Minimum Distance - UCSD notes providing proof ideas for correction capability via triangle inequality and Hamming spheres. ↩

Interpret geometrically

Stage 6

View valid codewords as centers of Hamming spheres whose overlap properties determine correctability.2"

Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩
Hamming Metric and the Minimum Distance - UCSD notes providing proof ideas for correction capability via triangle inequality and Hamming spheres. ↩

Frequent misconception

A code that can detect $2$ errors does not automatically correct $1$ error and detect $2$ errors simultaneously under all decoding strategies. For example, correcting $1$ error requires $d_{min}\ge 3$ , but correcting $1$ and also safely distinguishing some larger patterns may need stronger constraints depending on the decoder and objective.

Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩

Network-Theory Interpretation

Within the data link layer, block coding supports reliable frame delivery across imperfect physical media.2 Although many practical systems use specialized error-detecting codes such as CRC for frame checking, the theory of block coding and Hamming distance provides the mathematical foundation for understanding why redundancy works at all.2 The key ideas transfer broadly:

legal patterns are separated in code space,
channel noise perturbs transmitted patterns,
receiver logic exploits structure to detect or correct perturbations.2

From a design perspective, block coding creates a tradeoff:

\text{More redundancy} \Rightarrow \text{lower rate } \frac{k}{n} \Rightarrow \text{potentially larger } d_{min} \Rightarrow \text{better reliability}.

This is why coding theory sits naturally inside network performance analysis: it connects bandwidth, noise tolerance, and decoding certainty.

A concise summary is:

Concept	Meaning	Key formula
Block coding	Map $k$ information bits to $n$ coded bits	$n=k+r$
Hamming distance	Number of differing positions between two words	$d(x,y)=w(x\oplus y)$
Minimum distance	Worst-case separation among valid codewords	$d_{min}=\min d(x,y)$
Error detection	Know an error occurred	$d_{min}\ge s+1$
Error correction	Recover original codeword	$d_{min}\ge 2t+1$

For this module, you should be able to compute distances between binary sequences, determine $d_{min}$ from a codebook, and prove or apply the thresholds for detection and correction.3

Data Link Layer - Akshay Jain - Lecture notes covering block coding, Hamming distance, minimum distance, and detection/correction conditions. ↩ ↩² ↩³
Data Link Layer - Data link layer notes explaining datawords, codewords, redundancy, and minimum-distance rules. ↩ ↩² ↩³
Introduction to binary block codes - MIT material on Hamming space, spheres, and geometric interpretation of block codes. ↩
Codewords and Hamming Distance • Error Detection: parity - MIT - MIT notes summarizing how minimum distance determines detectable and correctable errors. ↩ ↩² ↩³

Knowledge Check

Question 1 of 5

Q1Single choice

In a block code, what does the notation $(n,k)$ mean?

Each $k$ -bit dataword is mapped to an $n$ -bit codeword

Each $n$ -bit dataword is mapped to a $k$ -bit codeword

There are $n$ codewords and $k$ parity checks

The code can detect $n-k$ errors

Fundamentals

Cyclic Redundancy Check (CRC)

Block Coding & Hamming Distance

Learning Goals

Footnotes

Hamming Distance and Minimum Hamming Distance

Why this matters in the Data Link Layer

Footnotes

Block Coding Model

Footnotes

How to Compute Hamming Distance Between Two Binary Sequences

Footnotes

Footnotes

Footnotes

Footnotes

Footnotes

Worked Examples of Hamming Distance

Footnotes

Fast exam shortcut

Footnotes

Minimum Hamming Distance and Its Meaning

Footnotes

Guaranteed Capability vs. Minimum Hamming Distance

Footnotes

Geometric View: Hamming Space, Spheres, and Decoding Regions

Footnotes

Proof Idea for Error Detection: Why $d_{min}=s+1$

Footnotes

Footnotes

Proof Idea for Error Correction: Why $d_{min}=2t+1$

Footnotes

Footnotes

Common Questions and Edge Cases

Footnotes

Conceptual Roadmap for Learning Block Coding and Hamming Distance

Represent data in fixed-size blocks

Footnotes

Add redundancy

Footnotes

Measure pairwise separation

Footnotes

Find $d_{min}$

Footnotes

Apply detection and correction conditions

Footnotes

Interpret geometrically

Footnotes

Frequent misconception

Footnotes

Network-Theory Interpretation

Footnotes

Knowledge Check