Fast division by 2ⁿ-1

Michael Schmidt / 24 min read / 2026 Jan 25

I recently found this code snippet in the source code of pic-scale-safe:

Rust

fn div_round_by_1023(v: u32) -> u32 {
    let round = 1 << 9;
    let w = v + round;
    ((w >> 10) + w) >> 10
}

This function computes $round (v /1023)$ for inputs $v < 2^{20} + 2^{9} - 1$ with just a few bit shifts and additions in 32-bit arithmetic. Quite efficient.
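The claimed input range can be checked by brute force. The following is a minimal sketch, assuming `(v + 511) / 1023` as the reference for rounded division (valid because 1023 is odd, so no ties occur). The helper names `round_div_1023_reference` and `first_mismatch_1023` are made up for this illustration and are not part of pic-scale-safe.

```rust
/// The snippet from above, repeated so this sketch is self-contained.
fn div_round_by_1023(v: u32) -> u32 {
    let round = 1 << 9;
    let w = v + round;
    ((w >> 10) + w) >> 10
}

/// Reference: round(v / 1023) using an actual integer division.
/// 1023 is odd, so v / 1023 never has a fractional part of exactly 0.5;
/// adding floor(1023 / 2) = 511 before dividing rounds to nearest.
fn round_div_1023_reference(v: u32) -> u32 {
    (v + 511) / 1023
}

/// Returns the smallest input below `limit` where both functions disagree.
fn first_mismatch_1023(limit: u32) -> Option<u32> {
    (0..limit).find(|&v| div_round_by_1023(v) != round_div_1023_reference(v))
}
```

Calling `first_mismatch_1023` with a limit of $2^{20} + 2^{9} - 1$ should find no mismatch, while any larger limit should report $2^{20} + 2^{9} - 1$ itself.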

But that's not all. This trick only needs 21 bits for the intermediate results. Other approaches like the multiply-add method require 31 bits for intermediate results to perform the same rounded division by 1023 in the range $v < 2^{20} + 2^{9} - 1$ . While not the case for division by 1023, in general this trick needs at most one additional bit, which can be the difference between being able to use 32-bit arithmetic or having to resort to 64-bit arithmetic. This is especially important for auto-vectorization and manual SIMD code.

As I hinted, this trick doesn't work just for division by 1023. In general, it works for any divisor of the form $2^{n} - 1$ for inputs $v < 2^{2 n} + 2^{n - 1} - 1$ . Here is the generalized version:

Rust

fn div_round_by_2pn_m1(v: u32, n: u32) -> u32 {
    let round = 1 << (n - 1);
    let w = v + round;
    ((w >> n) + w) >> n
}

This function returns exactly $round (v / (2^{n} - 1))$ for all inputs $v < 2^{2 n} + 2^{n - 1} - 1$ . For larger inputs, the results are typically close, but not exact. This makes it an approximation for rounded division by $2^{n} - 1$ .
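As a usage example, here is a sketch of one place where such a division shows up: normalizing the product of two 8-bit values by 255. The function `mul_div_255` is hypothetical and only meant to illustrate how the input bound is checked against a concrete use case.

```rust
/// The generalized function from above, repeated for self-containment.
fn div_round_by_2pn_m1(v: u32, n: u32) -> u32 {
    let round = 1 << (n - 1);
    let w = v + round;
    ((w >> n) + w) >> n
}

/// Hypothetical use case: round(a * b / 255) for two 8-bit values.
/// The product is at most 255 * 255 = 65025, which is below the
/// exactness bound 2^16 + 2^7 - 1 = 65663 for n = 8, so the result
/// is exact and always fits into a u8.
fn mul_div_255(a: u8, b: u8) -> u8 {
    let product = u32::from(a) * u32::from(b);
    div_round_by_2pn_m1(product, 8) as u8
}
```

Since the largest possible product stays below the bound for $n = 8$, no extra iterations or wider integers are needed here.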

Unfortunately, it's not obvious at all why this approximation is exact for $v < 2^{2 n} + 2^{n - 1} - 1$ , and why it stops working at $v = 2^{2 n} + 2^{n - 1} - 1$ .

In this article, I will answer both questions and generalize the trick to (1) increase the input range and (2) support floor and ceil division as well.

Results
Code Generation
Proving correctness
Showing failure points
Extending the input range
Proving correctness for extended ranges
Other rounding modes
Proving correctness for floor division
Proving correctness for ceiling division
Interesting extra: Division by $2^{n} + 1$

Results

Before I start with the derivations and proofs, here is a summary of the results.

$round (\frac{v}{2 ^{n} - 1})$ can be approximated using:
$R_{1} := ⌊ \frac{v + 2^{n - 1}}{2^{n}} ⌋, \qquad R_{i + 1} := ⌊ \frac{R_{i} + v + 2^{n - 1}}{2^{n}} ⌋$
The approximation $R_{i}$ is exact for all inputs $v < 2^{in} + 2^{n - 1} - 1$ .
$⌊ \frac{v}{2 ^{n} - 1} ⌋$ can be approximated using:
$F_{1} := ⌊ \frac{v + 1}{2^{n}} ⌋, \qquad F_{i + 1} := ⌊ \frac{F_{i} + v + 1}{2^{n}} ⌋$
The approximation $F_{i}$ is exact for all inputs $v < 2^{in} + 2^{n} - 2$ .
$⌈ \frac{v}{2 ^{n} - 1} ⌉$ can be approximated using:
$C_{1} := ⌊ \frac{v + 2^{n} - 1}{2^{n}} ⌋, \qquad C_{i + 1} := ⌊ \frac{C_{i} + v + 2^{n} - 1}{2^{n}} ⌋$
The approximation $C_{i}$ is exact for all inputs $v < 2^{in}$ .

Note

These are theoretical results. In practice, integer overflow has to be carefully considered in order to determine the true range of inputs for which a particular implementation is exact.

The below tool determines bounds for correctness automatically based on the given settings.

Code Generation

I also implemented a little code gen tool that uses these results to generate Rust code. You can set $n$ , the iteration count, the rounding mode, and the integer type all operations will be performed in. Everything (except for the integer type) can also be made variable at runtime by checking the Parameter box.


Rust

/// Returns `round(v / (2^n - 1))`.
///
/// Returned values are correct if both:
/// 1. the approximation is exact: v < 2^(2n) + 2^(n-1) - 1
/// 2. no overflow occurs:         v < 2^32 - 2^(n-1) - round(v/(2^n-1)).
fn div_round_by_2pn_m1(v: u32, n: u32) -> u32 {
    debug_assert!(n != 0, "Division by zero");
    let round = 1 << (n - 1);
    let w = v + round;
    ((w >> n) + w) >> n
}
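Both conditions from the doc comment can also be enforced at runtime. The following is a minimal sketch and not output of the code gen tool. The name `checked_div_round_by_2pn_m1` and the choice to return an `Option` are assumptions made for illustration.

```rust
/// Returns `Some(round(v / (2^n - 1)))` if the result is guaranteed to be
/// correct, and `None` otherwise.
fn checked_div_round_by_2pn_m1(v: u32, n: u32) -> Option<u32> {
    // 2^n must be representable in u32, and n = 0 would divide by zero.
    if n == 0 || n >= 32 {
        return None;
    }
    // Condition 1: the approximation is exact, v < 2^(2n) + 2^(n-1) - 1.
    // The bound is computed in u64 because 2^(2n) may not fit into u32.
    let exact_bound = (1u64 << (2 * n)) + (1u64 << (n - 1)) - 1;
    if u64::from(v) >= exact_bound {
        return None;
    }
    // Condition 2: no intermediate result overflows u32.
    let round = 1u32 << (n - 1);
    let w = v.checked_add(round)?;
    let sum = (w >> n).checked_add(w)?;
    Some(sum >> n)
}
```

The checks cost more than the division itself, so a sketch like this is mostly useful in tests or debug builds.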

Limitation

If $2^{n}$ cannot be represented by the chosen integer type, generated code may fail to compile or panic at runtime (if $n$ is a parameter). Such cases are mostly nonsensical anyway, so just avoid them.

Proving correctness

Formally, the approximation is defined as follows: Let $v \in N, n \in N_{1}$ , then:

round (\frac{v}{2 ^{n} - 1}) \approx ⌊ \frac{⌊ \frac{v + 2 ^{n - 1}}{2 ^{n}} ⌋ + v + 2 ^{n - 1}}{2 ^{n}} ⌋

Info

This looks more complicated than it really is. $⌊ x / 2^{n} ⌋$ is just x >> n in code, and $⌊(x + 2^{n - 1}) / 2^{n} ⌋$ is the same as $round (x / 2^{n})$ . So really, the approximation is just two nested rounded divisions (implemented as bit shifts in code). I will keep it in floor division form for the rest of the article, since it makes the proofs easier.
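To make the "two nested rounded divisions" reading concrete, here is a small sketch. The helper names `round_shr` and `div_round_nested` are made up for this illustration.

```rust
/// round(x / 2^n) with ties rounded up: floor((x + 2^(n-1)) / 2^n).
fn round_shr(x: u32, n: u32) -> u32 {
    (x + (1 << (n - 1))) >> n
}

/// The approximation written as two nested rounded divisions by 2^n.
fn div_round_nested(v: u32, n: u32) -> u32 {
    round_shr(round_shr(v, n) + v, n)
}

/// The same approximation in the shift-and-add form used earlier.
fn div_round_by_2pn_m1(v: u32, n: u32) -> u32 {
    let round = 1 << (n - 1);
    let w = v + round;
    ((w >> n) + w) >> n
}
```

Both forms perform the same additions and shifts, so they agree for every input that does not overflow, not just for inputs in the exact range.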

To capture how good the approximation is, I will introduce an error term $δ$ defined as follows:

δ := ⌊ \frac{⌊ \frac{v + 2 ^{n - 1}}{2 ^{n}} ⌋ + v + 2 ^{n - 1}}{2 ^{n}} ⌋ - round (\frac{v}{2 ^{n} - 1})

Consequently, the approximation is exact for an input $v$ iff $δ = 0$ .

Now, I will write $v$ a bit differently. Let $v = a (2^{n} - 1) + b + 2^{n - 1}$ for $a \in Z, b \in N_{0}, b < 2^{n} - 1$ . While a bit unusual, it should be obvious that all integers can be uniquely expressed in this form. I choose this form because it has the nice property that $round (v / (2^{n} - 1)) = a + 1$ .

Proof:

round (\frac{v}{2^{n} - 1}) = ⌊ \frac{a (2^{n} - 1) + b + 2^{n - 1} + ⌊ (2^{n} - 1) / 2 ⌋}{2^{n} - 1} ⌋ = ⌊ \frac{a (2^{n} - 1) + b + 2^{n - 1} + 2^{n - 1} - 1}{2^{n} - 1} ⌋ = ⌊ \frac{a (2^{n} - 1) + b + (2^{n} - 1)}{2^{n} - 1} ⌋ = ⌊ \frac{b}{2^{n} - 1} + a + 1 ⌋ = ⌊ \underbrace{\frac{b}{2^{n} - 1}}_{\in [0, 1)} ⌋ + a + 1 = a + 1

(Note: $⌊ b / (2^{n} - 1)⌋ = 0$ since $b$ is defined as $0 \leq b < 2^{n} - 1$ .)

Info

All proofs that use this form of $v$ will make heavy use of the following properties:

$round (v / (2^{n} - 1)) = a + 1$
$⌊ b / (2^{n} - 1)⌋ = 0$
$⌊ b / 2^{n} ⌋ = 0$
$⌊(b + 1) / 2^{n} ⌋ = 0$
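The decomposition is easy to compute in code. The following is a sketch under the assumption that signed 64-bit integers are used, so that $a = -1$ is representable. The function name `split` is made up for this illustration.

```rust
/// Splits v into (a, b) such that v = a * (2^n - 1) + b + 2^(n-1)
/// and 0 <= b < 2^n - 1. Note that a is -1 for v < 2^(n-1).
fn split(v: i64, n: u32) -> (i64, i64) {
    let d = (1i64 << n) - 1;
    let shifted = v - (1i64 << (n - 1));
    // Euclidean division keeps the remainder non-negative,
    // even when `shifted` is negative.
    (shifted.div_euclid(d), shifted.rem_euclid(d))
}
```

For example, $n = 3$ and $v = 0$ give $a = -1, b = 3$, and $a + 1 = 0$ is indeed $round (0 / 7)$.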

This makes it possible to simplify the error term:

δ := ⌊ \frac{⌊ \frac{v + 2 ^{n - 1}}{2 ^{n}} ⌋ + v + 2 ^{n - 1}}{2 ^{n}} ⌋ - round (\frac{v}{2 ^{n} - 1}) = ⌊ \frac{⌊ \frac{a ( 2 ^{n} - 1 ) + b + 2 ^{n - 1} + 2 ^{n - 1}}{2 ^{n}} ⌋ + a ( 2 ^{n} - 1 ) + b + 2 ^{n - 1} + 2 ^{n - 1}}{2 ^{n}} ⌋ - a - 1 = ⌊ \frac{⌊ \frac{b - a + a 2 ^{n} + 2 ^{n}}{2 ^{n}} ⌋ + b - a + a 2 ^{n} + 2 ^{n}}{2 ^{n}} ⌋ - a - 1 = ⌊ \frac{⌊ \frac{b - a}{2 ^{n}} ⌋ + a + 1 + b - a}{2 ^{n}} + a + 1 ⌋ - a - 1 = ⌊ \frac{⌊ \frac{b - a}{2 ^{n}} ⌋ + b + 1}{2 ^{n}} ⌋

Now it is easy to show that $- 2^{n} \leq b - a < 2^{n} ⟹ δ = 0$ .

0 \leq b - a < 2^{n} ⟹ δ = ⌊ \frac{\underbrace{⌊ \frac{b - a}{2^{n}} ⌋}_{= 0} + b + 1}{2^{n}} ⌋ = ⌊ \frac{b + 1}{2^{n}} ⌋ = 0

- 2^{n} \leq b - a < 0 ⟹ δ = ⌊ \frac{\underbrace{⌊ \frac{b - a}{2^{n}} ⌋}_{= - 1} + b + 1}{2^{n}} ⌋ = ⌊ \frac{b}{2^{n}} ⌋ = 0

All that is left to do is to show that for all inputs $v < 2^{2 n} + 2^{n - 1} - 1$ , it holds that $- 2^{n} \leq b - a < 2^{n}$ . This is done in two cases:

1. $a \in {0, ..., 2^{n}}$ : This corresponds to $v \in {2^{n - 1}, ..., 2^{2 n} + 2^{n - 1} - 2}$ . There are two sub-cases:
   - $a \leq b ⟹ 0 \leq b - a < 2^{n} - 1 ⟹ - 2^{n} \leq b - a < 2^{n} ⟹ δ = 0$ .
   - $a > b ⟹ - 2^{n} \leq b - a < 2^{n - 1} ⟹ - 2^{n} \leq b - a < 2^{n} ⟹ δ = 0$ .
2. $a = - 1$ : This corresponds to $v \in {- 2^{n - 1} + 1, ..., 2^{n - 1} - 1}$ .
   $a = - 1 ⟹ 1 \leq b - a < 2^{n} ⟹ - 2^{n} \leq b - a < 2^{n} ⟹ δ = 0$ .

   Note: While this case algebraically includes negative values for $v$ , the bound $v \geq 0$ is implied by $v \in N$ . So this cannot be taken as proof that the approximation works for negative inputs.

Taken together, this proves $0 \leq v < 2^{2 n} + 2^{n - 1} - 1 ⟹ δ = 0$ , which means that the approximation is exactly equal to rounded division by $2^{n} - 1$ for those inputs.

Further, proving that the approximation is not exact for $v = 2^{2 n} + 2^{n - 1} - 1$ is easy. This number corresponds to $a = 2^{n} + 1, b = 0$ , which results in a non-zero error term:

δ = ⌊ \frac{⌊ \frac{b - a}{2 ^{n}} ⌋ + b + 1}{2 ^{n}} ⌋ = ⌊ \frac{⌊ \frac{- 2 ^{n} - 1}{2 ^{n}} ⌋ + 1}{2 ^{n}} ⌋ = ⌊ \frac{- 2 + 1}{2 ^{n}} ⌋ = - 1

Therefore, $v = 2^{2 n} + 2^{n - 1} - 1$ is the smallest (non-negative) input for which the approximation is not exact.
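The simplified error term can be cross-checked against a direct computation. Below is a sketch assuming signed 64-bit arithmetic. The names `delta_direct` and `delta_simplified` are made up for this illustration.

```rust
/// δ computed directly: the approximation minus the true rounded quotient.
fn delta_direct(v: i64, n: u32) -> i64 {
    let d = (1i64 << n) - 1;
    let w = v + (1i64 << (n - 1));
    let approx = ((w >> n) + w) >> n;
    // d is odd, so adding floor(d / 2) before dividing rounds to nearest.
    approx - (v + d / 2) / d
}

/// δ from the simplified form floor((floor((b - a) / 2^n) + b + 1) / 2^n).
fn delta_simplified(v: i64, n: u32) -> i64 {
    let d = (1i64 << n) - 1;
    let shifted = v - (1i64 << (n - 1));
    let (a, b) = (shifted.div_euclid(d), shifted.rem_euclid(d));
    // `>>` on i64 is an arithmetic shift, i.e. floor division by 2^n,
    // which matters because b - a can be negative.
    let inner = (b - a) >> n;
    (inner + b + 1) >> n
}
```

Both functions should agree for all non-negative inputs, including inputs beyond the exact range, since the simplification never used the bound on $v$.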

Showing failure points

After proving that the approximation fails, I thought it might also be interesting to see it fail for a few values of $n$ . So here are the smallest inputs where the approximation starts to differ from the actual result for $n$ from 1 to 15:

n	First non-exact input v	v rewritten	Approximation	$round (\frac{v}{2 ^{n} - 1})$
1	4	$2^{2} + 0$	3	4
2	17	$2^{4} + 1$	5	6
3	67	$2^{6} + 3$	9	10
4	263	$2^{8} + 7$	17	18
5	1039	$2^{10} + 15$	33	34
6	4127	$2^{12} + 31$	65	66
7	16447	$2^{14} + 63$	129	130
8	65663	$2^{16} + 127$	257	258
9	262399	$2^{18} + 255$	513	514
10	1049087	$2^{20} + 511$	1025	1026
11	4195327	$2^{22} + 1023$	2049	2050
12	16779263	$2^{24} + 2047$	4097	4098
13	67112959	$2^{26} + 4095$	8193	8194
14	268443647	$2^{28} + 8191$	16385	16386
15	1073758207	$2^{30} + 16383$	32769	32770

These numbers were found experimentally using brute-force search.
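A brute-force search of this kind could look like the following sketch. It is not the code used to produce the table. It uses `u64` so that overflow is not a concern for the values of $n$ shown, and the name `first_failure` is made up for this illustration.

```rust
/// Returns the smallest v for which the approximation differs from
/// round(v / (2^n - 1)).
fn first_failure(n: u32) -> u64 {
    let d = (1u64 << n) - 1;
    let round = 1u64 << (n - 1);
    (0u64..)
        .find(|&v| {
            let w = v + round;
            let approx = ((w >> n) + w) >> n;
            // d is odd, so (v + d / 2) / d is the exact rounded quotient.
            approx != (v + d / 2) / d
        })
        .expect("a failing input exists for every n >= 1")
}
```

The search space grows with $2^{2 n}$, so larger values of $n$ take noticeably longer.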

Extending the input range

Depending on the use case, an input range of $v < 2^{2 n} + 2^{n - 1} - 1$ might be too small. However, the trick can be extended to support larger inputs.

The main insight here is this equality:

\frac{v}{2 ^{n} - 1} = \frac{\frac{v}{2 ^{n} - 1} + v}{2 ^{n}}

Derived from:

\frac{v}{2 ^{n} - 1} = \frac{\frac{2 ^{n}}{2 ^{n} - 1} \cdot v}{2 ^{n}} = \frac{\frac{1 + ( 2 ^{n} - 1 )}{2 ^{n} - 1} \cdot v}{2 ^{n}} = \frac{\frac{v}{2 ^{n} - 1} + v}{2 ^{n}}

This makes it possible to recursively rewrite the division by $2^{n} - 1$ into a series of divisions by $2^{n}$ .
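Ignoring rounding for a moment, the recursion can be illustrated with floating-point numbers: starting from 0 and repeatedly applying $x \mapsto (x + v) / 2^{n}$ converges to $v / (2^{n} - 1)$ . The function `iterate_division` is a hypothetical helper for this illustration only.

```rust
/// Applies x -> (x + v) / 2^n `steps` times, starting from x = 0.
/// The fixed point of this map is v / (2^n - 1).
fn iterate_division(v: f64, n: u32, steps: u32) -> f64 {
    let pow = f64::from(1u32 << n);
    let mut x = 0.0;
    for _ in 0..steps {
        x = (x + v) / pow;
    }
    x
}
```

Each step shrinks the distance to the fixed point by a factor of $2^{n}$ , which matches the observation that every additional iteration extends the exact input range by $n$ bits.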

Let $R_{i}$ be the approximation of rounded division by $2^{n} - 1$ after $i$ iterations, defined as follows:

R_{1} := ⌊ \frac{v + 2^{n - 1}}{2^{n}} ⌋, \qquad R_{i + 1} := ⌊ \frac{R_{i} + v + 2^{n - 1}}{2^{n}} ⌋

The trick from above then corresponds to $R_{2}$ .

Let's see the smallest inputs $v$ the approximations $R_{i}$ start to fail for:

n	$R_{1}$	$R_{2}$	$R_{3}$	$R_{4}$	$R_{5}$
1	$2^{1} + 0$	$2^{2} + 0$	$2^{3} + 0$	$2^{4} + 0$	$2^{5} + 0$
2	$2^{2} + 1$	$2^{4} + 1$	$2^{6} + 1$	$2^{8} + 1$	$2^{10} + 1$
3	$2^{3} + 3$	$2^{6} + 3$	$2^{9} + 3$	$2^{12} + 3$	$2^{15} + 3$
4	$2^{4} + 7$	$2^{8} + 7$	$2^{12} + 7$	$2^{16} + 7$	$2^{20} + 7$
5	$2^{5} + 15$	$2^{10} + 15$	$2^{15} + 15$	$2^{20} + 15$	$2^{25} + 15$
6	$2^{6} + 31$	$2^{12} + 31$	$2^{18} + 31$	$2^{24} + 31$	-
7	$2^{7} + 63$	$2^{14} + 63$	$2^{21} + 63$	-	-
8	$2^{8} + 127$	$2^{16} + 127$	$2^{24} + 127$	-	-

These numbers were found experimentally using a brute-force search. I aborted the search when it took too long, so some entries are missing.

The pattern is very clear. It seems that $R_{i}$ is exact for all inputs $v < 2^{in} + 2^{n - 1} - 1$ . So the trick can be extended to support arbitrarily large input ranges by increasing the number of iterations.

In code, this is implemented as follows:

Rust

fn div_round_2pn_m1_iters(v: u32, n: u32, iters: u8) -> u32 {
    let round = 1 << (n - 1);
    let w = v + round;
    let mut r = w >> n; // R_1
    for _ in 1..iters {
        r = (r + w) >> n; // R_{i+1}
    }
    r
}
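Given a maximum input, the required iteration count follows directly from the observed bound $v < 2^{in} + 2^{n - 1} - 1$ . Here is a sketch, assuming `u64` arithmetic and that the bound itself fits into a `u64` (that is, $in < 64$ ). The name `min_iters` is made up for this illustration.

```rust
/// Same as the function above, but in u64 to leave headroom for overflow.
fn div_round_2pn_m1_iters(v: u64, n: u32, iters: u32) -> u64 {
    let round = 1u64 << (n - 1);
    let w = v + round;
    let mut r = w >> n; // R_1
    for _ in 1..iters {
        r = (r + w) >> n; // R_{i+1}
    }
    r
}

/// Smallest iteration count i such that all inputs up to and including
/// `max_v` satisfy v < 2^(i*n) + 2^(n-1) - 1.
/// Assumption: i * n stays below 64, otherwise the shift overflows.
fn min_iters(max_v: u64, n: u32) -> u32 {
    let mut i = 1;
    while max_v >= (1u64 << (i * n)) + (1u64 << (n - 1)) - 1 {
        i += 1;
    }
    i
}
```

For $n = 8$ , inputs up to 65025 need two iterations, while inputs up to one million need three.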

Proving correctness for extended ranges

Let $v, n, a, b$ be defined as before: $v = a (2^{n} - 1) + b + 2^{n - 1}$ . Further, let $i \in N_{1}$ be the i-th iteration of the approximation $R_{i}$ as defined above and $δ_{i} := R_{i} - round (v / (2^{n} - 1))$ . As before, $R_{i}$ is exact for inputs $v$ iff $δ_{i} = 0$ .

I will start by simplifying the error term $δ_{i}$ :

δ_{1} = R_{1} - round (\frac{v}{2 ^{n} - 1}) = ⌊ \frac{v + 2 ^{n - 1}}{2 ^{n}} ⌋ - round (\frac{v}{2 ^{n} - 1}) = ⌊ \frac{a ( 2 ^{n} - 1 ) + b + 2 ^{n - 1} + 2 ^{n - 1}}{2 ^{n}} ⌋ - a - 1 = ⌊ \frac{b - a}{2 ^{n}} + a + 1 ⌋ - a - 1 = ⌊ \frac{b - a}{2 ^{n}} ⌋

δ_{i + 1} = R_{i + 1} - round (\frac{v}{2 ^{n} - 1}) = ⌊ \frac{R _{i} + v + 2 ^{n - 1}}{2 ^{n}} ⌋ - round (\frac{v}{2 ^{n} - 1}) = ⌊ \frac{δ _{i} + round ( \frac{v}{2 ^{n} - 1} ) + v + 2 ^{n - 1}}{2 ^{n}} ⌋ - round (\frac{v}{2 ^{n} - 1}) = ⌊ \frac{δ _{i} + a + 1 + v + 2 ^{n - 1}}{2 ^{n}} ⌋ - a - 1 = ⌊ \frac{δ _{i} + a + 1 + a ( 2 ^{n} - 1 ) + b + 2 ^{n - 1} + 2 ^{n - 1}}{2 ^{n}} ⌋ - a - 1 = ⌊ \frac{δ _{i} + b + 1}{2 ^{n}} + a + 1 ⌋ - a - 1 = ⌊ \frac{δ _{i} + b + 1}{2 ^{n}} ⌋

With the error term in a nicer form, it's now easy to show by induction that $a = - 1 ⟹ δ_{i} = 0$ :

Base case ( $i = 1$ ): $a = - 1 ⟹ δ_{1} = ⌊(b - a) / 2^{n} ⌋ = ⌊(b + 1) / 2^{n} ⌋ = 0$ .
Induction step: $δ_{i + 1} = ⌊(δ_{i} + b + 1) / 2^{n} ⌋ = ⌊(b + 1) / 2^{n} ⌋ = 0$ .

Since $a = - 1$ corresponds to $v \in {- 2^{n - 1} + 1, ..., 2^{n - 1} - 1}$ , this shows that the approximations (for any number of iterations) are exact for this range. (Again, the bounds $v \geq 0$ are implied by $v \in N$ .)

To make things simpler going forward, I derived an explicit formula for $δ_{i}$ by unrolling the recursion:

δ_{i} = ⌊ \frac{b - a}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} ⌋

Proof for the explicit error term:

The correctness of the explicit error term formula can be shown using induction over $i$ :

Base case ( $i = 1$ ):
$δ_{1} = ⌊ \frac{b - a}{2^{n}} ⌋ = ⌊ \frac{b - a}{2^{1 \cdot n}} + \underbrace{\sum_{l = 1}^{0} \frac{b + 1}{2^{l n}}}_{= 0} ⌋$
Induction step (from $i$ to $i + 1$ ):
$δ_{i + 1} = ⌊ \frac{δ_{i} + b + 1}{2^{n}} ⌋ = ⌊ \frac{⌊ \frac{b - a}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} ⌋ + b + 1}{2^{n}} ⌋ = ⌊ \frac{\frac{b - a}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} + b + 1}{2^{n}} ⌋ = ⌊ \frac{b - a}{2^{(i + 1) n}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{(l + 1) n}} + \frac{b + 1}{2^{n}} ⌋ = ⌊ \frac{b - a}{2^{(i + 1) n}} + \sum_{l = 2}^{i} \frac{b + 1}{2^{l n}} + \frac{b + 1}{2^{n}} ⌋ = ⌊ \frac{b - a}{2^{(i + 1) n}} + \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} ⌋$
$□$
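The explicit formula can also be checked numerically against the recursion. The sketch below avoids fractions by putting everything over the common denominator $2^{in}$ , which is my own rearrangement and not part of the proof above. The function names are made up for this illustration.

```rust
/// δ_i via the recursion:
/// δ_1 = floor((b - a) / 2^n), δ_{i+1} = floor((δ_i + b + 1) / 2^n).
fn delta_recursive(a: i64, b: i64, n: u32, i: u32) -> i64 {
    // `>>` on i64 is an arithmetic shift, i.e. floor division by 2^n.
    let mut delta = (b - a) >> n;
    for _ in 1..i {
        delta = (delta + b + 1) >> n;
    }
    delta
}

/// δ_i via the explicit formula, with both terms written over 2^(i*n).
fn delta_explicit(a: i64, b: i64, n: u32, i: u32) -> i64 {
    // sum_{l=1}^{i-1} 2^(i*n) / 2^(l*n) = sum_{m=1}^{i-1} 2^(m*n)
    let geometric: i64 = (1..i).map(|m| 1i64 << (m * n)).sum();
    ((b - a) + (b + 1) * geometric) >> (i * n)
}
```

For $n = 3, i = 2$ , the values $a = 9, b = 0$ give a numerator of $- 9 + 8 = - 1$ and therefore $δ_{2} = - 1$ in both versions.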

Using the explicit formula for $δ_{i}$ , it's easy to show that $v = 2^{in} + 2^{n - 1} - 1$ results in $δ_{i} = - 1$ , making the approximation non-exact for that input. $v = 2^{in} + 2^{n - 1} - 1$ corresponds to $a = \sum_{l = 0}^{i - 1} 2^{l n}, b = 0$ . Plugging this into the explicit formula for $δ_{i}$ gives:

δ_{i} = ⌊ \frac{b - a}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} ⌋ = ⌊ \frac{- \sum_{l = 0}^{i - 1} 2^{l n}}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{1}{2^{l n}} ⌋ = ⌊ - \sum_{l = 1}^{i} \frac{1}{2^{l n}} + \sum_{l = 1}^{i - 1} \frac{1}{2^{l n}} ⌋ = ⌊ - \frac{1}{2^{in}} ⌋ = - 1

Before I can prove the rest, I need to split $a$ similar to how I split $v$ into $a, b$ . Let $a = j_{i} 2^{in} + k_{i}$ for $j_{i} \in Z, j_{i} := ⌊ a / 2^{in} ⌋$ and $k_{i} \in N, k_{i} < 2^{in}, k_{i} := a mod 2^{in}$ at iteration $i$ .

Plugging this into the explicit formula for $δ_{i}$ gives:

δ_{i} = - j_{i} + \underbrace{⌊ \frac{b - k_{i}}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} ⌋}_{=: γ_{i}}

This way of representing $δ_{i}$ reveals that $δ_{i}$ is just $- j_{i}$ plus some small correction term $γ_{i}$ that depends on $k_{i}$ and $b$ but not $j_{i}$ .

It's rather easy to show that $γ_{i} \in {- 1, 0}$ is always the case, which implies that $δ_{i} \in {- j_{i} - 1, - j_{i}}$ .

Proof:

γ_{i} = ⌊ \frac{b - k_{i}}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} ⌋ = ⌊ \frac{(b + 1) + (- 1 - k_{i})}{2^{in}} + \sum_{l = 1}^{i - 1} \frac{b + 1}{2^{l n}} ⌋ = ⌊ - \frac{k_{i} + 1}{2^{in}} + \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} ⌋

Now I will determine the range for the two terms inside the floor function:

Range for the first term:
$0 \leq k_{i} \leq 2^{in} - 1 ⟹ \frac{1}{2^{in}} \leq \frac{k_{i} + 1}{2^{in}} \leq \frac{2^{in}}{2^{in}} = 1 ⟹ \frac{k_{i} + 1}{2^{in}} \in (0, 1]$
Range for the second term:

$0 \leq b \leq 2^{n} - 2 ⟹ \sum_{l = 1}^{i} \frac{1}{2^{l n}} \leq \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} \leq \sum_{l = 1}^{i} \frac{2^{n} - 1}{2^{l n}}$

Note that:
- $i \geq 1 ⟹ \frac{1}{2 ^{n}} \leq \sum_{l = 1}^{i} \frac{1}{2 ^{l n}}$ .
- $\sum_{l = 0}^{\infty} \frac{1}{2 ^{l n}} = \frac{1}{1 - \frac{1}{2 ^{n}}} = \frac{2 ^{n}}{2 ^{n} - 1}$ , so $\sum_{l = 1}^{\infty} \frac{1}{2 ^{l n}} = \frac{1}{2 ^{n} - 1}$ . Therefore, $0 < i < \infty ⟹ \sum_{l = 1}^{i} \frac{2 ^{n} - 1}{2 ^{l n}} = (2^{n} - 1) \sum_{l = 1}^{i} \frac{1}{2 ^{l n}} < (2^{n} - 1) \sum_{l = 1}^{\infty} \frac{1}{2 ^{l n}} = 1$ .
Therefore:

$\sum_{l = 1}^{i} \frac{1}{2^{l n}} \leq \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} \leq \sum_{l = 1}^{i} \frac{2^{n} - 1}{2^{l n}} ⟹ \frac{1}{2^{n}} \leq \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} < 1 ⟹ \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} \in (0, 1)$

From those two ranges, it follows that $γ_{i} \in {- 1, 0}$ .

However, that's not all that can be shown about $γ_{i}$ . There is more structure to it. This structure becomes obvious by looking at a few concrete values. Here are the possible values of $γ_{2}$ (with $n = 2$ ) depending on $k_{2}$ :

$k_{2}$	$γ_{2}$ values	Note
0	${0}$	┐
1	${0}$	┃
2	${0}$	┃ always zero
3	${0}$	┃
4	${0}$	┘
5	${- 1, 0}$	┐
6	${- 1, 0}$	┃
...	...	┃ always -1 or 0
14	${- 1, 0}$	┃
15	${- 1, 0}$	┘

The table shows that $γ_{2}$ starts being always zero for $k_{2} < 5$ and then switches to being either $- 1$ or $0$ for $k_{2} \geq 5$ .

This pattern persists across all values for $i, n$ , but the switch occurs at different values $k_{i}$ . I will call the value $k_{i}$ where the switch occurs the critical value, denoted as $c_{i}$ . Here are a few values of $c_{i}$ I determined experimentally (missing values were too slow to compute):

n	$c_{1}$	$c_{2}$	$c_{3}$	$c_{4}$	$c_{5}$
1	1	3	7	15	31
2	1	5	21	85	341
3	1	9	73	585	4681
4	1	17	273	4369	-
5	1	33	1057	-	-

The pattern is $c_{i} = \sum_{l = 0}^{i - 1} 2^{l n} = (2^{in} - 1) / (2^{n} - 1)$ .
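The critical values can be recomputed with a small exhaustive search. This is a sketch of one way to do it, not the code used for the table above. It evaluates $γ_{i}$ over the common denominator $2^{in}$ (my own rearrangement), and the names `gamma` and `critical_value` are made up for this illustration.

```rust
/// γ_i for given k_i and b, with both terms written over 2^(i*n).
fn gamma(k: i64, b: i64, n: u32, i: u32) -> i64 {
    // sum_{l=1}^{i-1} 2^(i*n) / 2^(l*n) = sum_{m=1}^{i-1} 2^(m*n)
    let geometric: i64 = (1..i).map(|m| 1i64 << (m * n)).sum();
    // Arithmetic shift on i64 = floor division by 2^(i*n).
    ((b - k) + (b + 1) * geometric) >> (i * n)
}

/// Smallest k_i in 0..2^(i*n) for which some valid b gives γ_i = -1.
fn critical_value(n: u32, i: u32) -> Option<i64> {
    let b_end = (1i64 << n) - 1; // b ranges over 0..2^n - 1 (exclusive)
    (0..(1i64 << (i * n))).find(|&k| (0..b_end).any(|b| gamma(k, b, n, i) == -1))
}
```

For $n = 2, i = 2$ this search should stop at $k_{2} = 5$ , in line with the table for $γ_{2}$ above.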

Showing that $k_{i} < c_{i} ⟹ γ_{i} = 0$ is easy.

Proof:

γ_{i} = ⌊ - \frac{k_{i} + 1}{2^{in}} + \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} ⌋

Since $γ_{i} \in {- 1, 0}$ has already been proven, I just have to show that the lower bound of the expression inside the floor function is at least $0$ . That expression has its minimum when $b$ is minimal ( $⟹ b = 0$ ) and $k_{i}$ is maximal ( $⟹ k_{i} = c_{i} - 1$ ). Plugging in those values gives:

- \frac{k_{i} + 1}{2^{in}} + \sum_{l = 1}^{i} \frac{b + 1}{2^{l n}} = - \frac{c_{i}}{2^{in}} + \sum_{l = 1}^{i} \frac{1}{2^{l n}} = - \frac{\sum_{l = 0}^{i - 1} 2^{l n}}{2^{in}} + \sum_{l = 1}^{i} \frac{1}{2^{l n}} = - \sum_{l = 1}^{i} \frac{1}{2^{l n}} + \sum_{l = 1}^{i} \frac{1}{2^{l n}} = 0

$□$

With this proven, 2 facts about $δ_{i}$ are now known:

1. $δ_{i} \in {- j_{i} - 1, - j_{i}}$ .
2. $k_{i} < c_{i} ⟹ δ_{i} = - j_{i}$ .

Since $v = 2^{in} + 2^{n - 1} - 1$ corresponds to $a = \sum_{l = 0}^{i - 1} 2^{l n} = c_{i}$ and $b = 0$ , it follows from (2) that $a \in {0, ..., c_{i} - 1} ⟹ δ_{i} = 0$ . This range of $a$ corresponds to inputs $v \in {2^{n - 1}, ..., 2^{in} + 2^{n - 1} - 2}$ . Together with the earlier result for $a = - 1$ , this proves that the approximation $R_{i}$ is exact for all inputs $v < 2^{in} + 2^{n - 1} - 1$ . $□$

Other rounding modes

As it turns out, other rounding modes can also be implemented by making a small modification to the approximation. Similar to how $R_{i}$ was defined, $F_{i}$ and $C_{i}$ can be defined for floor and ceiling division, respectively:

F_{1} := ⌊ \frac{v + 1}{2^{n}} ⌋, \qquad F_{i + 1} := ⌊ \frac{F_{i} + v + 1}{2^{n}} ⌋

C_{1} := ⌊ \frac{v + 2^{n} - 1}{2^{n}} ⌋, \qquad C_{i + 1} := ⌊ \frac{C_{i} + v + 2^{n} - 1}{2^{n}} ⌋

where:

F_{i} \approx ⌊ \frac{v}{2^{n} - 1} ⌋, \qquad C_{i} \approx ⌈ \frac{v}{2^{n} - 1} ⌉

$F_{i}$ is exact for all inputs $v < 2^{in} + 2^{n} - 2$ , and $C_{i}$ is exact for all inputs $v < 2^{in}$ .
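In code, only the constant added to $v$ changes compared to rounded division. The following sketch uses `u64` to sidestep overflow questions. The function names are made up for this illustration.

```rust
/// F_i: approximates floor(v / (2^n - 1)) using `iters` iterations.
fn div_floor_2pn_m1_iters(v: u64, n: u32, iters: u32) -> u64 {
    let w = v + 1;
    let mut f = w >> n; // F_1
    for _ in 1..iters {
        f = (f + w) >> n; // F_{i+1}
    }
    f
}

/// C_i: approximates ceil(v / (2^n - 1)) using `iters` iterations.
fn div_ceil_2pn_m1_iters(v: u64, n: u32, iters: u32) -> u64 {
    let w = v + (1u64 << n) - 1;
    let mut c = w >> n; // C_1
    for _ in 1..iters {
        c = (c + w) >> n; // C_{i+1}
    }
    c
}
```

For $n = 4$ and two iterations, the floor variant should first fail at $v = 2^{8} + 2^{4} - 2 = 270$ and the ceiling variant at $v = 2^{8} = 256$ .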

Proving correctness for floor division

The proof for floor division is very similar to the proof for rounded division. The main difference is that $v$ is split differently. Let $v = a_{F} (2^{n} - 1) + b_{F} + 2^{n} - 1$ with $a_{F}, b_{F}$ defined similarly to before. Let $δ_{i}^{F} := F_{i} - ⌊ v / (2^{n} - 1)⌋$ be the error term for floor division after $i$ iterations. Simplifying the error term gives:

δ_{1}^{F} = F_{1} - ⌊ \frac{v}{2 ^{n} - 1} ⌋ = ⌊ \frac{v + 1}{2 ^{n}} ⌋ - ⌊ \frac{v}{2 ^{n} - 1} ⌋ = ⌊ \frac{a _{F} ( 2 ^{n} - 1 ) + b _{F} + 2 ^{n} - 1 + 1}{2 ^{n}} ⌋ - ⌊ \frac{a _{F} ( 2 ^{n} - 1 ) + b _{F} + 2 ^{n} - 1}{2 ^{n} - 1} ⌋ = ⌊ \frac{b _{F} - a _{F}}{2 ^{n}} + a_{F} + 1 ⌋ - a_{F} - 1 = ⌊ \frac{b _{F} - a _{F}}{2 ^{n}} ⌋

δ_{i + 1}^{F} = F_{i + 1} - ⌊ \frac{v}{2 ^{n} - 1} ⌋ = ⌊ \frac{F _{i} + v + 1}{2 ^{n}} ⌋ - ⌊ \frac{v}{2 ^{n} - 1} ⌋ = ⌊ \frac{δ _{i}^{F} + ⌊ \frac{v}{2 ^{n} - 1} ⌋ + v + 1}{2 ^{n}} ⌋ - ⌊ \frac{v}{2 ^{n} - 1} ⌋ = ⌊ \frac{δ _{i}^{F} + a _{F} + 1 + v + 1}{2 ^{n}} ⌋ - a_{F} - 1 = ⌊ \frac{δ _{i}^{F} + a _{F} + 1 + a _{F} ( 2 ^{n} - 1 ) + b _{F} + 2 ^{n} - 1 + 1}{2 ^{n}} ⌋ - a_{F} - 1 = ⌊ \frac{δ _{i}^{F} + b _{F} + 1}{2 ^{n}} + a_{F} + 1 ⌋ - a_{F} - 1 = ⌊ \frac{δ _{i}^{F} + b _{F} + 1}{2 ^{n}} ⌋

Note that $δ_{i}^{F}$ has the same recursive structure as $δ_{i}$ (rounded division). Therefore, the rest of the proof follows the same steps as before, leading to the conclusion that $F_{i}$ is exact for all inputs $v < 2^{in} + 2^{n} - 2$ . Writing out this proof will be left as an exercise to the reader. $□$

Proving correctness for ceiling division

Same game again. Let $v = a_{C} (2^{n} - 1) + b_{C} + 1$ with $a_{C}, b_{C}$ defined similarly to before. Let $δ_{i}^{C} := C_{i} - ⌈ v / (2^{n} - 1)⌉$ be the error term for ceiling division after $i$ iterations. Simplifying the error term gives:

δ_{1}^{C} = C_{1} - ⌈ \frac{v}{2 ^{n} - 1} ⌉ = ⌊ \frac{v + 2 ^{n} - 1}{2 ^{n}} ⌋ - ⌊ \frac{v + 2 ^{n} - 2}{2 ^{n} - 1} ⌋ = ⌊ \frac{a _{C} ( 2 ^{n} - 1 ) + b _{C} + 1 + 2 ^{n} - 1}{2 ^{n}} ⌋ - ⌊ \frac{a _{C} ( 2 ^{n} - 1 ) + b _{C} + 1 + 2 ^{n} - 2}{2 ^{n} - 1} ⌋ = ⌊ \frac{b _{C} - a _{C}}{2 ^{n}} + a_{C} + 1 ⌋ - a_{C} - 1 = ⌊ \frac{b _{C} - a _{C}}{2 ^{n}} ⌋

δ_{i + 1}^{C} = C_{i + 1} - ⌈ \frac{v}{2 ^{n} - 1} ⌉ = ⌊ \frac{C _{i} + v + 2 ^{n} - 1}{2 ^{n}} ⌋ - ⌈ \frac{v}{2 ^{n} - 1} ⌉ = ⌊ \frac{δ _{i}^{C} + ⌈ \frac{v}{2 ^{n} - 1} ⌉ + v + 2 ^{n} - 1}{2 ^{n}} ⌋ - ⌈ \frac{v}{2 ^{n} - 1} ⌉ = ⌊ \frac{δ _{i}^{C} + a _{C} + 1 + v + 2 ^{n} - 1}{2 ^{n}} ⌋ - a_{C} - 1 = ⌊ \frac{δ _{i}^{C} + a _{C} + 1 + a _{C} ( 2 ^{n} - 1 ) + b _{C} + 1 + 2 ^{n} - 1}{2 ^{n}} ⌋ - a_{C} - 1 = ⌊ \frac{δ _{i}^{C} + b _{C} + 1}{2 ^{n}} + a_{C} + 1 ⌋ - a_{C} - 1 = ⌊ \frac{δ _{i}^{C} + b _{C} + 1}{2 ^{n}} ⌋

Same as before, $δ_{i}^{C}$ has the same recursive structure as $δ_{i}$ (rounded division). Therefore, the rest of the proof follows the same steps as before, leading to the conclusion that $C_{i}$ is exact for all inputs $v < 2^{in}$ . Writing out this proof will be left as an exercise to the reader. $□$

Interesting extra: Division by $2^{n} + 1$

Similar to how:

\frac{v}{2 ^{n} - 1} = \frac{v + \frac{v}{2 ^{n} - 1}}{2 ^{n}}

something similar is also true for division by $2^{n} + 1$ :

\frac{v}{2 ^{n} + 1} = \frac{v - \frac{v}{2 ^{n} + 1}}{2 ^{n}}

So, is turning one plus into a minus enough to make the trick work for division by $2^{n} + 1$ ? No. Well, it is almost enough.

As it turns out, the number added to $v$ (called round in the code) has to be changed for floor and ceiling division. Furthermore, the evenness of the iteration count $i$ also matters.

Rust

enum RoundingMode {
    Floor,
    Round,
    Ceil,
}
fn div_2pn_p1(v: u32, n: u32, i: u32, mode: RoundingMode) -> u32 {
    let round = match mode {
        RoundingMode::Floor => 0,
        RoundingMode::Round => 1 << (n - 1),
        RoundingMode::Ceil => 1 << n,
    };
    let w = v + round - i % 2;
    let mut r = w >> n;
    for _ in 1..i {
        r = (w - r) >> n;
    }
    r
}

As before, here are tables for the smallest inputs $v$ the approximations start to fail for. First for rounded division by $2^{n} + 1$ :

n	$i = 1$	$i = 2$	$i = 3$	$i = 4$	$i = 5$	$i = 6$	$i = 7$	$i = 8$
1	$2^{1} + 2$	$2^{2} + 1$	$2^{3} + 2$	$2^{4} + 1$	$2^{5} + 2$	$2^{6} + 1$	$2^{7} + 2$	$2^{8} + 1$
2	$2^{2} + 3$	$2^{4} + 2$	$2^{6} + 3$	$2^{8} + 2$	$2^{10} + 3$	$2^{12} + 2$	$2^{14} + 3$	$2^{16} + 2$
3	$2^{3} + 5$	$2^{6} + 4$	$2^{9} + 5$	$2^{12} + 4$	$2^{15} + 5$	$2^{18} + 4$	$2^{21} + 5$	$2^{24} + 4$
4	$2^{4} + 9$	$2^{8} + 8$	$2^{12} + 9$	$2^{16} + 8$	$2^{20} + 9$	$2^{24} + 8$	$2^{28} + 9$	-
5	$2^{5} + 17$	$2^{10} + 16$	$2^{15} + 17$	$2^{20} + 16$	$2^{25} + 17$	-	-	-
6	$2^{6} + 33$	$2^{12} + 32$	$2^{18} + 33$	$2^{24} + 32$	-	-	-	-
7	$2^{7} + 65$	$2^{14} + 64$	$2^{21} + 65$	$2^{28} + 64$	-	-	-	-
8	$2^{8} + 129$	$2^{16} + 128$	$2^{24} + 129$	-	-	-	-	-

Second for ceiling division by $2^{n} + 1$ :

n	$i = 1$	$i = 2$	$i = 3$	$i = 4$	$i = 5$	$i = 6$	$i = 7$	$i = 8$
1	$2^{1} + 1$	$2^{2}$	$2^{3} + 1$	$2^{4}$	$2^{5} + 1$	$2^{6}$	$2^{7} + 1$	$2^{8}$
2	$2^{2} + 1$	$2^{4}$	$2^{6} + 1$	$2^{8}$	$2^{10} + 1$	$2^{12}$	$2^{14} + 1$	$2^{16}$
3	$2^{3} + 1$	$2^{6}$	$2^{9} + 1$	$2^{12}$	$2^{15} + 1$	$2^{18}$	$2^{21} + 1$	$2^{24}$
4	$2^{4} + 1$	$2^{8}$	$2^{12} + 1$	$2^{16}$	$2^{20} + 1$	$2^{24}$	$2^{28} + 1$	-
5	$2^{5} + 1$	$2^{10}$	$2^{15} + 1$	$2^{20}$	$2^{25} + 1$	-	-	-
6	$2^{6} + 1$	$2^{12}$	$2^{18} + 1$	$2^{24}$	-	-	-	-
7	$2^{7} + 1$	$2^{14}$	$2^{21} + 1$	$2^{28}$	-	-	-	-
8	$2^{8} + 1$	$2^{16}$	$2^{24} + 1$	-	-	-	-	-

And lastly for floor division by $2^{n} + 1$ :

n	$i = 1$	$i = 2$	$i = 3$	$i = 4$	$i = 5$	$i = 6$	$i = 7$	$i = 8$
1	$0$	$2^{2} + 2$	$0$	$2^{4} + 2$	$0$	$2^{6} + 2$	$0$	$2^{8} + 2$
2	$0$	$2^{4} + 4$	$0$	$2^{8} + 4$	$0$	$2^{12} + 4$	$0$	$2^{16} + 4$
3	$0$	$2^{6} + 8$	$0$	$2^{12} + 8$	$0$	$2^{18} + 8$	$0$	$2^{24} + 8$
4	$0$	$2^{8} + 16$	$0$	$2^{16} + 16$	$0$	$2^{24} + 16$	$0$	-
5	$0$	$2^{10} + 32$	$0$	$2^{20} + 32$	$0$	-	$0$	-
6	$0$	$2^{12} + 64$	$0$	$2^{24} + 64$	$0$	-	$0$	-
7	$0$	$2^{14} + 128$	$0$	$2^{28} + 128$	$0$	-	$0$	-
8	$0$	$2^{16} + 256$	$0$	-	$0$	-	$0$	-

Floor division fails at $v = 0$ for odd iteration counts, because it tries to compute v - 1. This either (1) panics or (2) underflows to a huge number. If (3) the backing number type is signed, so that w = -1 can be represented, then the function returns -1, which is also incorrect. No matter what, it's wrong.

If $v = 0$ is ignored, the table for floor division looks like this:

n	$i = 1$	$i = 2$	$i = 3$	$i = 4$	$i = 5$	$i = 6$	$i = 7$	$i = 8$
1	$2^{1} + 3$	$2^{2} + 2$	$2^{3} + 3$	$2^{4} + 2$	$2^{5} + 3$	$2^{6} + 2$	$2^{7} + 3$	$2^{8} + 2$
2	$2^{2} + 5$	$2^{4} + 4$	$2^{6} + 5$	$2^{8} + 4$	$2^{10} + 5$	$2^{12} + 4$	$2^{14} + 5$	$2^{16} + 4$
3	$2^{3} + 9$	$2^{6} + 8$	$2^{9} + 9$	$2^{12} + 8$	$2^{15} + 9$	$2^{18} + 8$	$2^{21} + 9$	$2^{24} + 8$
4	$2^{4} + 17$	$2^{8} + 16$	$2^{12} + 17$	$2^{16} + 16$	$2^{20} + 17$	$2^{24} + 16$	$2^{28} + 17$	-
5	$2^{5} + 33$	$2^{10} + 32$	$2^{15} + 33$	$2^{20} + 32$	$2^{25} + 33$	-	-	-
6	$2^{6} + 65$	$2^{12} + 64$	$2^{18} + 65$	$2^{24} + 64$	-	-	-	-
7	$2^{7} + 129$	$2^{14} + 128$	$2^{21} + 129$	$2^{28} + 128$	-	-	-	-
8	$2^{8} + 257$	$2^{16} + 256$	$2^{24} + 257$	-	-	-	-	-

Better. However, this makes the trick less useful for floor division by $2^{n} + 1$ , since it requires an even iteration count or special handling for $v = 0$ .

In any case, this is all I have for division by $2^{n} + 1$ . I haven't formally analyzed the error terms, so I can't explain why it works and why the weirdness around odd iteration counts exists.
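For experimenting with these tables, a brute-force harness along the following lines may be a useful starting point. It is a sketch, not the code used to produce the tables. It switches to signed 64-bit integers so that $w = - 1$ is representable, and the names `div_2pn_p1_signed`, `reference`, and `first_failure` are made up for this illustration.

```rust
#[derive(Clone, Copy)]
enum RoundingMode {
    Floor,
    Round,
    Ceil,
}

/// Signed variant of `div_2pn_p1`, so that v + round - 1 cannot underflow.
fn div_2pn_p1_signed(v: i64, n: u32, i: u32, mode: RoundingMode) -> i64 {
    let round = match mode {
        RoundingMode::Floor => 0,
        RoundingMode::Round => 1i64 << (n - 1),
        RoundingMode::Ceil => 1i64 << n,
    };
    let w = v + round - i64::from(i % 2);
    let mut r = w >> n; // arithmetic shift, floors negative values
    for _ in 1..i {
        r = (w - r) >> n;
    }
    r
}

/// Exact quotient v / (2^n + 1) for v >= 0 in the given rounding mode.
fn reference(v: i64, n: u32, mode: RoundingMode) -> i64 {
    let d = (1i64 << n) + 1;
    match mode {
        RoundingMode::Floor => v / d,
        RoundingMode::Round => (v + d / 2) / d, // d is odd, so no ties
        RoundingMode::Ceil => (v + d - 1) / d,
    }
}

/// Smallest v >= `start` where the approximation is not exact.
fn first_failure(n: u32, i: u32, mode: RoundingMode, start: i64) -> i64 {
    (start..)
        .find(|&v| div_2pn_p1_signed(v, n, i, mode) != reference(v, n, mode))
        .expect("the approximation eventually fails")
}
```

Passing `start = 1` skips the $v = 0$ failure of floor division with odd iteration counts.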

For those inclined, this might be a fun exercise to spend the weekend on.
