CSE 102 – Tutorial 4 – Backtracking
General instructions
Sections tagged with a green marginal line are mandatory.
Sections tagged with an orange marginal line are either more advanced or supplementary, and are therefore considered optional. They will help you deepen your understanding of the material in the course. You may skip these sections on a first pass through the tutorial, and come back to them after finishing the mandatory exercises.
After you are finished with the tutorial, please upload your submission to Moodle by completing “Tutorial 4 assignment” before the indicated deadline.
When submitting a solution, the system will do some basic validation of your files and run some automated tests on your solutions. A green tick tells you that your file compiles and has passed all of the automated tests. A red tick tells you either that your file is badly broken (e.g., a syntax error) or that your solutions fail one or more of the automated tests. You can see the result of the tests by clicking on “See my uploads” in the upper-left menu, and then clicking on the right arrow button next to each file you submitted.
Although passing all of our tests provides evidence that your solutions are correct, it might not guarantee it, as typically we would need to run infinitely many tests! You are encouraged to write your own tests, and more generally to think about why your code works and not just rely on the automated tests.
Discussion of the tutorials with friends and classmates is allowed, with acknowledgment, but please read the Moodle page on “Collaboration, plagiarism, and AI”.
Warmup: maze escape!
We expect you to write your solutions to this problem in a file named
maze.py
and to upload it using the form below. You can
resubmit your solution as many times as you like.
In this exercise you are navigating through a square grid maze, such as the one below:

Your goal is to go from the upper left corner of the maze to the lower right corner without entering a gray cell. For example, this maze has two possible solutions:


We will represent square grid mazes in Python by \(n \times n\) matrices with 1s and 0s indicating whether a cell is free or blocked respectively. For example, the \(6 \times 6\) maze above will be encoded like so:
= [ [1 , 0 , 0 , 1 , 0 , 0] ,
example_maze 1 , 1 , 1 , 1 , 1 , 0] ,
[0 , 0 , 1 , 0 , 1 , 1] ,
[1 , 1 , 1 , 1 , 1 , 0] ,
[1 , 0 , 0 , 1 , 0 , 0] ,
[1 , 0 , 1 , 1 , 1 , 1] ] [
We will represent paths through the maze as lists of (row, column) pairs, where rows are numbered from top to bottom and columns from left to right using 0-indexing. For example, the two paths above will be encoded as follows:
= [(0,0), (1,0), (1,1), (1,2), (1,3), (1,4), (2,4), (3,4), (3,3), (4,3), (5,3), (5,4), (5,5)]
red_path = [(0,0), (1,0), (1,1), (1,2), (2,2), (3,2), (3,3), (4,3), (5,3), (5,4), (5,5)] purple_path
In general, we say that a list of coordinates \(p = [(r_1,c_1),\dots,(r_n,c_n)]\) constitutes a path in a maze \(M\) just in case it satisfies four conditions:
- (\(p\) is contiguous:) \((r_{i+1} = r_i \wedge c_{i+1} = c_i \pm 1) \vee (r_{i+1} = r_i\pm 1 \wedge c_{i+1} = c_i)\) for all \(1 \le i < n\);
- (\(p\) stays within the maze:) \(0 \le r_i < n\) and \(0 \le c_i < n\) for all \(1 \le i \le n\);
- (\(p\) only visits free cells:) \(M[r_i][c_i] = 1\) for all \(1 \le i \le n\);
- (\(p\) doesn’t cross itself:) \((r_i,c_i) \ne (r_j,c_j)\) for all \(i \ne j\).
To qualify as a solution to a maze of dimension \(n\), a path must satisfy two additional conditions:
- (start in the upper left corner:) \((r_1,c_1) = (0,0)\);
- (end in the lower right corner:) \((r_n,c_n) = (n-1,n-1)\);
Our aim is to solve mazes using backtracking, for which it’s important to have a notion of “partial” solution. In this case, we can simply drop condition (6). In other words, we consider paths that start at the upper left corner \((0,0)\) and end somewhere inside the maze, at some white cell \((r,c)\). This leads to a simple recursive algorithm that attempts to solve a maze starting from any partial solution \(p\):
- try extending the current path by adding a cell \((r\pm 1,c)\) or \((r,c \pm 1)\) (assuming it doesn’t violate conditions 2-4), to form an extended path \(p'\);
- recursively look for a solution extending \(p'\);
- if you found a solution extending any of the \(p'\), then it is a solution extending \(p\);
- otherwise, you have to backtrack and try a different partial solution \(p\).
Your first task is to write this recursive backtracking function.
Write a function
escape_bt(n, M, p)
which takes as input:- a positive integer \(n\)
- a maze \(M\) of dimension \(n\) (i.e., an \(n\times n\) matrix with values 0 or 1)
- a partial solution \(p\) (i.e., a path from \((0,0)\) to some free cell of \(M\) satisfying conditions 1-5 described above)
and returns a solution to the maze extending \(p\), if it exists, or else
None
.
Now that you’ve written this backtracking solver that starts from an arbitrary partial solution, it is then an easy task to apply it to solve any (solvable) maze.
- Write a function
escape(n, M)
which takes as input a positive integer \(n\) and a maze \(M\) of dimension \(n\), and returns a solution to the maze if it exists, or elseNone
. (There may be many solutions to \(M\), but you only have to return one if there are any.)
Cyclic dice
We expect you to write your solution to this problem in a file named
dice.py
and to upload it using the form below. You can
resubmit your solution as many times as you like.
A sequence of dice \(A_1, A_2, \dots, A_n\) is said to be cyclic if \(A_1\) rolls higher than \(A_2\) more than half the time, \(A_2\) rolls higher than \(A_3\) more than half the time, and so on up to \(A_n\), which rolls higher than \(A_1\) more than half the time. This is a special case of non-transitive dice and tends to go against most people’s ideas about probability, making them convenient for tricks or bets.
An example of a cyclic sequence of 6-sided dice, due to Bradley Efron, is the following:
A: 4, 4, 4, 4, 0, 0
B: 3, 3, 3, 3, 3, 3
C: 6, 6, 2, 2, 2, 2
D: 5, 5, 5, 1, 1, 1
which leads to \(\Pr(A>B) = \Pr(B>C) = \Pr(C>D) = \Pr(D>A) = \frac{2}{3}\)
Representing the dice sequence
One can prove that it is equivalent to consider all sides of all dice
different or to allow some repetitions. For simplicity we will therefore
use always different sides, starting at \(0\). We will represent the sequence of dice
as a single list of length \(n \cdot
s\) of distinct numbers between 0 and \(n\cdot s-1\), where \(n\) is the number of dice and \(s\) the number of sides. Hence each
permutation of list(range(n*s))
represents a list of
dice.
For example, the three 3-sided dice
[[0, 4, 8], [2, 3, 7], [1, 5, 6]]
(which are a cyclic dice
sequence with each die beating the next one with probability \(5/9\)) are represented by the list
[0, 4, 8, 2, 3, 7, 1, 5, 6]
. One can slice such a list back
into individual dice as follows:
def slice_dice(n, s, dice):
return [dice[i * s:i * s + s] for i in range(n)]
That is, we have
slice_dice(3, 3, [0, 4, 8, 2, 3, 7, 1, 5, 6]) = [[0, 4, 8], [2, 3, 7], [1, 5, 6]]
.
Partial solutions for backtracking
Our goal is to build a list of \(s\)-sided cyclic dice of length \(n\). To do this through backtracking, we must once again come-up with a notion of partial solution. Here, we will consider a sequence of length \(k\) of integers smaller than \(n\cdot s\), where moreover:
- all the elements of the sequence should be different;
- inside each subsequence corresponding to an individual die, the elements should be strictly increasing; and
- each complete pair of successive dice \((A, B)\) contained in the sequence should have \(\Pr(A > B) > \frac{1}{2}\).
Generating all solutions
In contrast to the maze escape problem where it was sufficient to
find a single path through the maze, now we want to generate
all solutions to the cyclic dice problem. This can be done very
conveniently in Python using generators, where we yield
as
soon as we’ve found a solution, and backtrack when the user calls the
generator again to ask for another solution.
However, we only want to generate all solutions up to symmetry. In order to suppress the symmetry that (for example) if \((A, B, C)\) is a cyclic sequence of dice, \((B, C, A)\) is also such a sequence, we will enforce that our single list starts with \(0\). In other words we enforce that \(A\) is lexicographically the smallest die.
Write a function
win_probability(D1, D2)
that takes two lists of equal length representing two dice and computes the winning probability of \(D_1\) over \(D_2\), as well as a functionbeats(D1, D2)
that checks if \(\Pr(D_1 > D_2) > \frac{1}{2}\).Write a function
get_dice(n, s, dice)
that takes the number of dicen
their number of sidess
and a partial solutiondice
as a list of integers respecting the constraints we stated above, and that generates (by callingyield
) all possible extensions ofdice
to a full \(n\times s\) cyclic dice sequence.In particular
get_dice(n, s, [0])
should generate all cyclic dice sequences of \(n\) \(\times\) \(s\)-sided dice. For example,get_dice(3, 3, [0])
should generate the following lists (the order in which you generate the lists does not matter):[0, 4, 8, 2, 3, 7, 1, 5, 6] [0, 5, 7, 2, 4, 6, 1, 3, 8] [0, 5, 7, 3, 4, 6, 1, 2, 8] [0, 6, 7, 2, 4, 5, 1, 3, 8] [0, 6, 7, 3, 4, 5, 1, 2, 8]
Hint: When considering a new element to add to the partial solution there are conditions you should always check (that it does not appear already) and others that are specific when the element is not the first of a die or when it finishes a die.
Note: It is also possible to add a whole die at a time. This might be a little more difficult but feel free to do it that way if you prefer. However do make sure that
get_dice(n, s, [0])
still does what it is supposed to do.
Optimization (optional)
If one intends to bet using such dice, it is useful to have the largest chance to beat whatever die your opponent will choose. Hence we will consider the score of a cyclic sequence as the minimum of the \(n\) winning probabilities between one die and the next in the cyclic order.
To optimize the score we will use a branch and bound procedure around our backtracking search. The idea is to iteratively search for one solution, but with a new constraint each time: that this solution is better than the best we have found so far.
Modify your function
beats
such that it can take a third argument (defaulting to \(0.5\)) providing the probability \(p\) such that \(\Pr(A > B) > p\).Write a refined version of the
get_dice
generator calledget_better_dice(n, s, partial, minscore=0.5)
, that yields all solutions with a score better thanminscore
.Write a function
score(n, s, dice)
computing the score of a list of dice.Implement a branch and bound optimization: instead of trying to find all solutions by backtracking search, implement a function
optimal_dice(n, s)
that returns an optimal solution (orNone
if there is none) by iteratively trying to find one single solution (by backtracking) but with increasingly high score constraints.Hint: You might want to rely on a generator. Note that when a generator does not provide any more solution, it throws a
StopIteration
exception. One can therefore either wrap the call tonext
in atry/except
block, or provide a second argument tonext
that it will return if the generator given as first argument has reached its end:try: …except StopIteration: # one called next(g) on a generator that has no more solutions
= next(g, None) n if n is None: # one called next(g) on a generator that has no more solutions
In order to avoid revisiting already explored parts, we can add another constraint to our iterated backtracking search: that a solution is (lexicographically) higher than a given reference. Implement this in
get_better_dice
by adding yet another argument: that reference solution.Hint: The lexicographic order constraint can be enforced each time we extend a partial solution (unless it has been satisfied already).
Long live the queens (optional)
We expect you to write your solutions to both sets of problems in a
file named queens.py
and to upload it using the form below.
You can resubmit your solution as many times as you like.
The famous “\(n\) queens problem” asks whether \(n\) chess queens can be placed on an \(n \times n\) chessboard so that no two queens are attacking each other.
In this part of the tutorial we will be considering two variations of the \(n\) queens problems, one where the chessboard is replaced by a honeycomb and the chess queens are replaced by queen bees, and the other where the chess queens are grouped in two opposing teams.
The \(n\) queen bees problem
To illustrate the first problem, consider the following rendering (in ASCII art) of a \(4 \times 4\) honeycomb on which 4 queen bees have been placed, in such a way that no two bees are on the same (vertical or diagonal) row:
__ __/ \__ __/@@\__/ \__ __/ \__/ \__/ \__ __/ \__/ \__/ \__/@@\__ # a valid arrangement of 4 queen bees \__/@@\__/@@\__/ \__/ # on a 4x4 honeycomb \__/ \__/ \__/ \__/ \__/ \__/
The following is an invalid arrangement of queen bees, since two bees are on the same (positively-sloped) diagonal row:
__ __/ \__ __/@@\__/ \__ __/ \__/ \__/@@\__ __/ \__/ \__/ \__/ \__ # an invalid arrangement of 4 queen bees \__/@@\__/@@\__/ \__/ # on a 4x4 honeycomb \__/ \__/ \__/ \__/ \__/ \__/
The general problem is to place \(n\) queen bees on an \(n \times n\) honeycomb, in such a way that no two bees are on the same row. Actually, it is not hard to come up with a rather boring solution for any \(n\):
__ __/ \__ __/ \__/ \__ __/ \__/ \__/ \__ __/@@\__/@@\__/@@\__/@@\__ # a boring arrangement of 4 queen bees \__/ \__/ \__/ \__/ # on a 4x4 honeycomb \__/ \__/ \__/ \__/ \__/ \__/
However, we are interested in finding all solutions.
In order to solve the \(n\) queen bees problem using backtracking, we will use the following notion of partial solution: for some \(k \le n\), place \(k\) bees on each of the first \(k\) negative(ly sloped) diagonal rows of the \(n \times n\) honeycomb, in such a way that no two bees are on the same vertical or positive diagonal row. The condition is trivially satisfied when \(k=0\), and when \(k = n\) it gives a full solution to the \(n\) queen bees problem.
As an example, here is a partial solution for the case \(n = 4\) with \(k=3\):
__ __/ \__ __/@@\__/ \__ __/ \__/ \__/ \__ __/ \__/ \__/ \__/ \__ # a valid arrangement of 3 queen bees \__/@@\__/@@\__/ \__/ # on the first 3 negative diagonal rows \__/ \__/ \__/ # of a 4x4 honeycomb \__/ \__/ \__/
In this case, we see that there is a unique way to extend the partial solution to a valid arrangement of 4 queen bees, corresponding to the first arrangement given above.
Representing the honeycomb
A partially-filled honeycomb such as the above can be represented in
Python as a list of \(k\) numbers
between \(0\) and \(n-1\), where each number gives the positive
offset of the corresponding bee along its negative diagonal. For
instance, the example above is represented by the list
[1,2,0]
. For your convenience, we have provided the
following utility function for printing out partially-filled honeycombs
(this is a bit involved, since we have to convert between positions on
the line and “bee coordinates”):
def comb_print(n, comb):
def occupied(i,j):
return i<len(comb) and comb[i] == j
print(' '*3*n + '__')
for line in range(n):
print(' '*(n-line-1)*3 + '__/', end='')
for cell in range(line+1):
= n-1-line+cell
i = cell
j = '@@' if occupied(i,j) else ' '
s print(s + '\\__' + ('/' if cell<line else ''), end='')
print()
for line in range(n,2*n):
print(' '*(line-n)*3 + ' \\__/', end='')
for cell in range(2*n-1-line):
= cell
i = cell+line-n+1
j = '@@' if occupied(i,j) else ' '
s print(s + '\\__/', end='')
print()
print()
Implementation
Write a function
invalid_offset(k, comb, j)
that takes a numberk
, a list of numberscomb
such thatlen(comb) = k
representing a partially-filled honeycomb, and another numberj
, and decides whether the arrangement of bees can be extended by placing a bee at positionj
in the next negative diagonal row. The function should returnTrue
just in case there is already a bee occupying the same vertical or positive diagonal row (i.e., the proposed extension is invalid), andFalse
otherwise.You might find it helpful to consider the following atlas of “bee coordinates”, where each cell of the honeycomb is labelled by a pair of numbers
ij
giving the positive offsetj
within the negative diagonal rowi
.__ __/30\__ __/20\__/31\__ __/10\__/21\__/32\__ __/00\__/11\__/22\__/33\__ \__/01\__/12\__/23\__/ \__/02\__/13\__/ \__/03\__/ \__/
Write a function
queen_bees(n, k, comb)
that generates all (full) solutions to then
queen bees problem extending a valid partial solutioncomb
of lengthk <= n
. In particular,queen_bees(n, 0, [])
should generate all solutions to then
queen bees problems. You should use theyield
primitive, making exactly one call toyield
for every valid solution extendingcomb
.For debugging (or “debee-ing”?), you might find use for the following function, which is meant to print out all solutions to the
n
queen bees problem (assuming that your implementation ofqueen_bees
is correct):def show_combs(n): for comb in queen_bees(n, 0, []): comb_print(n, comb)
Use
queen_bees
to write a functioncount_combs(n)
that counts the number of solutions to then
queen bees problem, and check that it agrees with the first few values of sequence A099152 in the Online Encyclopedia of Integer Sequences. Add timing statistics, and try to optimize yourqueen_bees
function so that you are able to computecount_combs(n)
more quickly and for higher values ofn
.
Peaceable queens
The second variation of the \(n\) queens problem is known as peaceable queens, as described in this Numberphile video. Briefly, the goal is to place two opposing camps of \(m\) black queens and \(m\) white queens on an \(n \times n\) chessboard, in such a way that no two queens of opposing camps are attacking each other. Moreover, we want \(m\) to be as large as possible.
For example, the following is one possible “peaceable” arrangement of two teams of \(5\) queens on a \(6 \times 6\) chessboard:
+-------------+ | B . B . . . | | . . . . W W | | . . B . . . | | B B . . . . | | . . . W . W | | . . . . W . | +-------------+
It turns out that this is the best possible, in the sense that there is no peaceable arrangement of two teams of size \(6\) on a \(6 \times 6\) board.
Representing the board, and ordering moves for backtracking
Once again, to setup a backtracking problem it is always important to
identify a notion of partial solution. In this case, to a first
approximation, we can simply consider a peaceable arrangement of \(k\) queens from each team, for some \(k \le m\), represented in Python as a pair
of lists black
and white
containing the
(row,col) coordinates of the black queens and the white queens. We will
use the following function for printing out such partial board
assignments:
def board_print(n, black, white):
def square(i,j):
if (i,j) in black:
return 'B'
elif (i,j) in white:
return 'W'
return '.'
print('+' + '-' * (2*n+1) + '+')
for i in range(n):
print('|', end = '')
for j in range(n):
print(' ' + square(i,j), end = '')
print(' |')
print('+' + '-' * (2*n+1) + '+')
For example, calling
board_print(6, [(0,0),(0,2)], [(1,4),(1,5)])
prints out the
following board:
+-------------+ | B . B . . . | | . . . . W W | | . . . . . . | | . . . . . . | | . . . . . . | | . . . . . . | +-------------+
However, we still have to be a bit more careful before setting up the
problem for backtracking, in order to avoid exploring redundant
possibilities. For example, although the partial solution above could
have been reached by placing the first pair of black and white queens at
squares (0,0)
and (1,4)
+-------------+ | B . . . . . | | . . . . W . | | . . . . . . | | . . . . . . | | . . . . . . | | . . . . . . | +-------------+
and the second pair at squares (0,2)
and
(1,5)
, at least naively, it could have just as easily been
reached by placing the first pair at (0,2)
and
(1,5)
+-------------+ | . . B . . . | | . . . . . W | | . . . . . . | | . . . . . . | | . . . . . . | | . . . . . . | +-------------+
and the second pair at (0,0)
and (1,4)
, or
the first pair at (0,2)
and (1,4)
and the
second pair at (0,0)
and (1,5)
, etc. This
artificially increases the size of the search space, since there is no
point in considering these different paths to the same position.
The way we remedy this is by placing an ordering on the
squares of the board, and enforcing the rule that once a team has placed
a queen on a square, that team can no longer place a queen on an
“earlier” square. For instance, we could take the lexicographic
order on (row,col) coordinates, which would mean that the first move
in the example above could only have been placing the black and white
queens at squares (0,0)
and (1,4)
. The
ordering one uses can actually make a difference for the efficiency of
backtracking search, and a heuristic is to prioritize moves that cut
down the search space as much as possible in the beginning. For our
purposes, though, we just care that some ordering exists, since it is
what enables us to avoid going down different paths towards the same
destination.
Implementation
Write a function
peaceable_queens(n, m)
that generates all peaceable arrangements ofm
black queens andm
white queens on an \(n \times n\) chessboard. You should make exactly one call of the formyield (black, white)
for every pair of listsblack
andwhite
representing such a peaceable arrangement.Hint: introduce an auxiliary function
solve_peace(n, m, black, white, bmoves, wmoves)
which takes as extra parameters a partial solution for the positions of the black and white teams, together with the (ordered) lists of remaining possible moves for black and white.Use
peaceable_queens
to write a functionpax_regina_number(n)
that computes the maximal number \(m\) such that there exists a peaceable arrangement of two teams of \(m\) queens on an \(n \times n\) board. Check that it agrees with the first few values of sequence A250000 in the Online Encyclopedia of Integer Sequences. Add timing statistics, and try to optimize yourpeaceable_queens
function so that you are able to computepax_regina_number(n)
more quickly and for higher values ofn
.Abstract your backtracking solver so that it is parameterized by an attack function telling whether or not two positions are in an attacking relation. Use this to write functions
peaceable_rooks(n, m)
,peaceable_bishops(n, m)
, etc., or whatever suits your fancy.
Gambling queens
Note that there are no automated tests for this sequence of optional exercises, it is up to you to ensure that your code is correct!
In class, we saw how to solve the \(n\)-queens problem using backtracking. It is also possible to take a randomized approach:
Start by placing the first queen in the first column, choosing its position (= row) uniformly at random.
Then place the second queen in the second column, choosing its position uniformly at random among all positions that do not create a conflict with the first queen.
Continue in the same manner until either you’ve placed all \(n\) queens on the board, or you’ve placed \(k < n\) and there are no positions to place the \((k+1)\)st queen. In that case, rather than backtracking to the \(k\)th queen, restart the whole process from step (1) and try again until you manage to build a complete, random solution.
This is a “Las Vegas algorithm”, in the sense that it always returns a correct solution, although it may get unlucky and require many iterations to find one.
- Implement the above algorithm as a function
vegas_queens(n)
, taking as input a natural numbern
and returning a solution to the \(n\)-queens problem (represented as an appropriate permutation oflist(range(n))
). Compare the performance ofvegas_queens(n)
to the generatorqueens([], n)
defined using backtracking that we saw in Lecture 3, for different values of \(n\), and write down your observations as a comment.
One nice feature of backtracking is that it allows us to exhaustively enumerate all solutions. On the other hand, depending on our available computing power the full backtracking tree might be too large to exhaust. The Las Vegas algorithm described above can be adapted to quickly obtain a Monte Carlo estimate of the size of the full backtracking tree, which we can use to decide whether it is worth performing a full enumeration.
The idea is to keep track of the number of choices \(c_1,\dots,c_k\) we had at each iteration, from the initial choice (in this case, a choice of an arbitrary row for the first queen, so that \(c_1 = n\)) up until the point where either the search failed (with \(c_k = 0\)) or we found a complete solution (with \(k = n\) and \(c_n > 0\)). From each such trial, we infer an estimate of
\[ 1 + c_1 + c_1c_2 + \dots + c_1c_2\cdots c_n \]
for the number of nodes in the backtracking tree, and by performing repeated trials we can get a pretty good estimate for the actual number of nodes.
- Implement the Monte Carlo algorithm sketched above as a function
qbt_estimate(n, trials=100)
, taking as input a natural numbern
and an optional number of trials, and returning an estimate for the total size of the backtracking tree for the \(n\)-queens problem. Compare this estimate to the exact answer computed using the functionqueen_stats
in Lecture 3, and write down your observations as a comment. (You can also try varying the number of trials.)