PE 0001: Multiples of 3 or 5
If we list all the natural numbers below
Find the sum of all the multiples of
Sequential Approach
One very simple approach would be to look at every number, one by one until reaching 1000, adding up each number that is either a multiple of 3 or multiple of 5:
Because only 1000 values have to be inspected, this is actually not an unreasonable approach. And it's a simple property – integer divisibility – something that we can easily check with the modulus operator (%
).
sum(i for i in range(N) if (i%3 == 0 or i%5 == 0))
sum := 0
for i := range n {
if i%3 == 0 || i%5 == 0 {
sum += i
}
}
% Copyright (c) 2024 Kevin Damm
%
% Permission is hereby granted, free of charge, to any person obtaining a copy
% of this software and associated documentation files (the "Software"), to deal
% in the Software without restriction, including without limitation the rights
% to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
% copies of the Software, and to permit persons to whom the Software is
% furnished to do so, subject to the following conditions:
%
% The above copyright notice and this permission notice shall be included in all
% copies or substantial portions of the Software.
%
% THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
% IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
% FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
% AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
% LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
% OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
% SOFTWARE.
% Sum is the sum of multiples of {3, 5} below N.
sum_multiples(N, Sum) :-
sum_multiples_naive(N-1, 0, Sum).
% Base case, Sum is total accumulated when N is 0.
sum_multiples_naive(0, Acc, Acc).
% Sum is accumulated for all values > 0 divisible by 3 or 5.
sum_multiples_naive(N, Acc, Sum) :-
N > 0, (N mod 3 =:= 0 ; N mod 5 =:= 0),
!,
NewAcc is Acc + N,
Nminus is N - 1,
sum_multiples_naive(Nminus, NewAcc, Sum).
% Rule for carrying accumulated value on non-divisible values.
sum_multiples_naive(N, Acc, Sum) :-
N > 0,
Nminus is N - 1,
sum_multiples_naive(Nminus, Acc, Sum).
main :-
sum_multiples(1000, Sum),
write(Sum).
A direct and brute-force approach like this will still complete in less than a second, even on a small device.
Under a minute, really, even if you include the time for typing the code out.
Selection Approach
That naïve solution inspects all natural values below
To save having to div/mod each of these excluded values, we can count upwards in increments of these numbers, making sure to only count the multiples of three or five.
def gen_multiples(n: int) -> Iterator[int]:
"""Generate all integers between 1..n that are divisible by either 3 or 5."""
for i3 in range(3,n,3):
yield i3
for i5 in range(5,n,15):
# skip each third multiple of 5,
yield i5
i5 += 5
if i5 < n:
# ...careful not to exceed n.
yield i5
sum(gen_multiples(100))
sum := 0
for i := 3; i < n; i += 3 {
sum += i
}
for j := 5; j < n; j += 15 {
sum += j
if j+5 < n {
sum += j + 5
}
}
% Copyright (c) 2024 Kevin Damm
%
% Permission is hereby granted, free of charge, to any person obtaining a copy
% of this software and associated documentation files (the "Software"), to deal
% in the Software without restriction, including without limitation the rights
% to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
% copies of the Software, and to permit persons to whom the Software is
% furnished to do so, subject to the following conditions:
%
% The above copyright notice and this permission notice shall be included in all
% copies or substantial portions of the Software.
%
% THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
% IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
% FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
% AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
% LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
% OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
% SOFTWARE.
% Sum is the sum of multiples of {3, 5} below N.
sum_multiples(N, Sum) :-
sum_by(3, N, Sum3),
sum_by(5, N, Sum5),
sum_by(15, N, Sum15),
Sum is Sum3 + Sum5 - Sum15.
sum_by(Increment, N, Sum) :-
sum_by(Increment, 0, N, 0, Sum).
% Accumulate each multilpe of Inc less than N.
sum_by(Increment, I, N, Accumulator, Sum) :-
I < N,
!,
SubSum is Accumulator + I,
NextI is I + Increment,
sum_by(Increment, NextI, N, SubSum, Sum).
% Resolve total sum when incremented past the limit.
sum_by(Inc, I, N, Acc, Acc) :- I >= N.
main :-
sum_multiples(1000, Sum),
write(Sum).
This skips a lot of unnecessary computation by only looking at the numbers that we know will be included in the sum. You could simplify the second loop to look at every five-multiple and skip when the number also divides
The order that these numbers are added together is not strictly increasing. Since that doesn't affect the final result, and it's a little easier to read this way anyway, I've left it as two separate for-loops. An example of interleaving the series so that the numbers are in sorted order can be found in the github repo for this site.
Constant-time Approach
As
Realigning the previous solution will reveal a pattern that emerges:
The components of the above are the arithmetic sum, in steps of
Another way to look at this is to separate out the common parts of each column and row in the above grid, resulting in one part that increases vertically and another part that is the same for every row.
This is in terms of
n := limit - 1
k := n / 15 // number of rows (except partial final row)
rowcount := 5 /*3s*/ + 3 /*5s*/ - 1 /*common*/
rowsum := 0 + 3 + 5 + 6 + 9 + 10 + 12
sum := rowcount * 15 * k * (k - 1) / 2
sum += k * rowsum
The last row may be partial, a few values remaining between
// Add remaining multiples of 3 and 5 (excluding n).
delta := n - k*15
div3, div5 := delta/3, delta/5
sum += k*15*(div3+div5+1) +
div3*(div3+1)*3/2 +
div5*(div5+1)*5/2
To understand the last three lines, it helps to think of the last (partial) row as its own arithmetic series for the threes and fives separately, offset by the div3
and div5
are the limits on these smaller series, arrived at by counting the numbers between rowsum
, and this can be generalized as well.
Generalized Approach
We could further generalize this to any pair of numbers
function SumOfPairsLessThan(a: number, b: number, n: number): number {
const width = lcm(a, b);
const k = Math.floor(n / width);
const diva = width / a;
const divb = width / b;
const rowcount = diva + divb - 1;
const rowsum = (diva * (diva + 1) * a + divb * (divb + 1) * b) / 2;
var sum = rowcount * width * k * (k - 1) / 2;
sum += k * rowsum;
const delta = n - 1 - k*width;
const rema = delta / a;
const remb = delta / b;
sum += k * width * (rema + remb + 1);
sum += rema * (rema + 1) * a / 2;
sum += remb * (remb + 1) * b / 2;
return sum
}
function lcm(a: number, b: number): number {
// Javascript and floating-point error, let's round to be sure, though it's
// only needed when a*b is many orders of magnitude larger than gcd(a, b).
return Math.round((a * b) / gcd(a, b));
}
function gcd(a: number, b: number): number {
if (b === 0) {
return a;
}
return gcd(b, a % b);
}
See Also
Arithmetic Series
Central to this problem is the idea of summing over a series of integers, and the extension to this – summing over all multiples of an integer.
In some places we had to convert instead to
If
Try it yourself with some pencil and paper if you're not seeing it yet!
Euclid's Algorithm (GCD and LCM)
After understanding the series part of the problem, further investigation shows that the Least Common Multiple decides what the period of interference between the two divisors is. This is the series that needs to be subtracted out if it gets double counted between the 3s and the 5s.
Since we already knew the pair of constants is
There are actually some interesting identities bewtween the LCM and GCD, and entire algebras can be derived from their operations. The computation needed to find the GCD is also interesting, and the pairs of values which require the most computation are found along the Fibonacci Sequence. That is, values which follow the line
Solution overview on projecteuler.net
This doc will only be available if you've already submitted the correct answer. I'm not going to give you the answer directly, but you can execute the code examples here, with only minor edits, to find the answer.
The official overview stops at calculating the
It is also valuable to read through the forums for any problems you solve, you may see an elegant solution in a language you are less familiar with.
FizzBuzz (programming challenge)
This puzzle is reminiscent of FizzBuzz, but the ProjectEuler.net problem appeard a few years before it became popular as a test of basic programming ability.
Originally it was a children's game, then suggested by Imran Ghory in "Using FizzBuzz to Find Developers who Grok Coding". A month later it was also featured in a codinghorror blog commentary by Jeff Atwood. It isn't meant to be hard, it's meant to be a minimum evaluation function, after conventional wisdom that 99% of interview candidates cannot actually code even the simplest of functions, while knowing enough terminology to give the impression that they can.
The question usually implies a serial printing of "fizz" (at 3s), "buzz" (at 5s) and "fizzbuzz" (at 15s) with the numbers themselves printed otherwies. This means the constant-time algorithms (and even the selection-based one) wouldn't qualify, but I think I would ask if we could just print how many fizzes, buzzes and fizzbuzzes there are (pointing to this blog post if necessary) and maybe get a pass on the interview question. What I'm saying is, I'm not above doing fizzbuzz but give me something challenging.
If you want to take this problem to the next level, especially if you're using it as a starter question in your own interviews, I suggest asking next "what is the probability that an integer chosen at random in range 1..1000 is divisible by 3 or 5?", especially if you are looking for someone with a (slightly) deeper mathematical comprehension. There are numerous topics branching out from there.
Exercises
Where is the break-even point, i.e. when the constant-time approach goes from being worse than the linear-time solution to being better? This may vary by platform based on the cost of a div/mod and chosen values for
and . Benchmark both solutions on your chosen platform and measure for yourself. You can find three example benchmarks in the tests of my golang solution.
In the constant-time solution there is an expression
rowcount * 15 * k * (k - 1) / 2
. How do we know that this division bywill not give us a remainder that gets truncated off? Further generalize the solution to any three divisors. In doing so, you should be able to see a straightforward approach for any
divisors. What is the computational complexity of finding this sum (with respect to
) for any divisors? If is not fixed? If is larger than ?