CSE 102 – Tutorial 3 – Dynamic programming
General instructions
Sections tagged with a green marginal line are mandatory.
Sections tagged with an orange marginal line are either more advanced or supplementary, and are therefore considered optional. They will help you deepen your understanding of the material in the course. You may skip these sections on a first pass through the tutorial, and come back to them after finishing the mandatory exercises.
After you are finished with the tutorial, please upload your submission to Moodle by completing “Tutorial 3 assignment” before the indicated deadline.
When submitting a solution, the system will do some basic validation of your files and run some automated tests on your solutions. A green tick tells you that your file compiles and has passed all of the automated tests. A red tick tells you either that your file is badly broken (e.g., a syntax error) or that your solutions fail one or more of the automated tests. You can see the result of the tests by clicking on “See my uploads” in the upper-left menu, and then clicking on the right arrow button next to each file you submitted.
Although passing all of our tests provides evidence that your solutions are correct, it might not guarantee it, as typically we would need to run infinitely many tests! You are encouraged to write your own tests, and more generally to think about why your code works and not just rely on the automated tests.
Discussion of the tutorials with friends and classmates is allowed, with acknowledgment, but please read the Moodle page on “Collaboration, plagiarism, and AI”.
Fruit flight of fantasy
We expect you to write your solution to these problems in a file
named flight.py
and to upload it using the form below. You
can resubmit your solution as many times as you like.
For the sake of this exercise, imagine that you are a fruit bat flying eastwards across a triangular grid. At each unit of time, you can either fly due East or North-East or South-East. Some locations of the grid contain fruit (indicated by yellow circles below), and your goal is to eat as many of these as possible before making it to the end of the grid, after which you’ll go back into your cave to hibernate. For example, the grid might look like this:

In this case, there is a single optimal flight path travelling East, then North-East, then North-East, and finally South-East:

Your first task will be to implement a brute-force solution to this problem by exhaustive enumeration. By “exhaustive enumeration”, we mean the approach of generating all possible flight paths, computing their fruit value, and picking the one with the maximum value. Although this brute-force approach is very inefficient, it provides a baseline and will help us to get a better understanding of the problem specification.
To start, we need to pick a representation for bat flight paths and for grids. In general, a bat path visits a series of nodes \((x_0,y_0),(x_1,y_1),\dots,(x_n,y_n)\), where \(x_i - x_{i-1} = 1\) and \(y_i - y_{i-1} \in \{-1,0,1\}\). We represent such a bat path as a list of vertical displacements \([d_1,\dots,d_n]\) where \(d_1 = y_1-y_0\), \(d_2 = y_2-y_1\), and so on. Moreover, to simplify notation, we take the bat’s starting coordinate to be the origin \((x_0,y_0) = (0,0)\), so that \(x_i = i\) and \(y_i\) is always between \(-i\) and \(i\). Thus the flight path in red above, visiting nodes \((0,0)-(1,0)-(2,1)-(3,2)-(4,1)\), is represented by the length 4 list \([0,1,1,-1]\).
We assume a fixed geometry for the grid (always a triangular grid, as above) but the positions of the fruits can vary. We represent the fruit positions as a list of sets \([f_0,f_1,\dots,f_n]\), where each \(f_i\) is a subset of values \(f_i \subset \{-i,\dots,i\}\), with \(y \in f_i\) just in case there is fruit at node \((i,y)\). Thus the fruits in the grid above are represented by the length 5 list of sets \([\{\}, \{-1,0\},\{-1,1,2\},\{-3,2\},\{-2,1,4\}]\). (Note that in this example there is no fruit at the origin, but if there were that would be represented by the set \(\{0\}\) at the head of the list.)
For testing purposes, you may find it convenient to copy these examples into your Python file:
= [0,1,1,-1]
example_flight = [{}, {-1,0},{-1,1,2},{-3,2},{-2,1,4}] example_fruits
Write a function
flight_value(fruits, flight)
which takes a grid (represented as a length \(n+1\) list of sets of fruit positions, as explained above) and a flight path (represented as a length \(n\) list of displacements) and returns the number of pieces of fruit encountered by the bat along the path. For example,flight_value(example_fruits, example_flight)
should evaluate to 4.The function below recursively constructs a list of all flight paths of length
n
:def flights_list(n): if n == 0: return [[]] return [[i] + w for w in flights_list(n-1) for i in [-1,0,1]]
Using this function as well as the function
flight_value
you wrote above, write a functionoptimal_brute(fruits)
that computes the value of an optimal flight by running through the list of all flights, computing their values, and returning the maximum. For example,optimal_brute(example_fruits)
should evaluate to 4.
- Rewrite the function
flights_list(n)
as a generatorflights_gen(n)
, so that rather than returning a list of all flight paths it insteadyield
s each path successively. It is okay if you generate the paths in any order as long as you generate each path exactly once, but you should not callflights_list
, nor should you ever store the list of all paths in memory. Once you’ve written the generator, modify your functionoptimal_brute
so that it callsflights_gen
instead offlights_list
.
Since there are \(3^n\) possible paths through a triangular grid of length \(n\), the brute force approach quickly becomes intractable (whether or not we use a generator). We can do much better with dynamic programming. Our first step will be to establish a recursive formula for the value of an optimal flight path. To that end, we need to consider paths starting at arbitrary locations on the triangular grid. Let us write \(V(x,y)\) for the value of an optimal flight path starting at node \((x,y)\) and ending at \((n,y')\) for some \(y'\). If \(F(x,y)\) denotes the value of the fruit at a given node (either 1 or 0 depending on whether or not there is fruit at \((x,y)\)), then we can easily verify that the values \(V(x,y)\) satisfy the recurrence
\[ V(x,y) = F(x,y) + \max_{d \in \{-1,0,1\}} V(x+1,y+d) \]
for all \(0 \le x \le n\), with \(V(x,y) = 0\) when \(x > n\).
- Translate the above recurrence into a recursive function
optimal_rec(fruits, x=0, y=0)
which computes the value \(V(x,y)\) of an optimal path starting at location \((x,y)\) on an \(n\)-column triangular grid. Note that we take the arguments \(x\) and \(y\) to have default value 0 for convenience. In particular,optimal_rec(fruits)
should compute the same value asoptimal_brute(fruits)
.
In terms of efficiency, the recursive approach is not much better than the brute-force approach since it still considers all \(3^n\) paths through the grid implicitly, in the sense that they are represented implicitly in the call stack. The key property that allows us to obtain a more efficient algorithm via dynamic programming is that the value of an optimal path from \((x,y)\) is independent of the path taken to \((x,y)\). This means that the values \(V(x,y)\) only have to be computed once, and can then be reused when computing the values of \(V(x',y')\) for any \(x' < x\). We can see this better if we label the nodes of our example grid with the values of \(V(x,y)\) for each \((x,y)\):

In general, a triangular grid of length \(n\) has \(1 + 3 + 5 + \dots + (2n+1) = (n+1)^2\) nodes, so there are only \((n+1)^2\) values of \(V(i,j)\) to compute.
Modify your recursive solution so that it performs memoization to solve the optimal path problem using top-down dynamic programming. It will be convenient to define a recursive function
optimal_td(fruits, x=0, y=0, memo=None)
taking an extra “memo table” as an optional argument, which is initialized to be an empty dictionary on the first call to the function, as indicated below.def optimal_td(fruits, x=0, y=0, memo=None): if memo is None: memo = {} # ...
Your function optimal_td(fruits)
should compute the same
value as optimal_rec(fruits)
and
optimal_brute(fruits)
, but much more quickly. Of course the
difference will be negligible for the tiny example grid above, but
should be noticeable even on relatively small grids since \(3^n\) grows much more quickly than \((n+1)^2\). For testing purposes, you can
use the functions below to construct some larger grids of fruits:
def line_fruits(n):
return [{j} for j in range(n+1)]
def cyclic_fruits(n,k):
return [{i for i in range(-j,j+1) if (j*j+j+i)%k == 0} for j in range(0,n+1)]
For example, try to reproduce the interaction below, observing how long it takes to compute each answer.
1]: optimal_brute(line_fruits(12)), optimal_brute(cyclic_fruits(13,5))
In [1]: (13, 9)
Out[
2]: optimal_rec(line_fruits(12)), optimal_rec(cyclic_fruits(13,5))
In [2]: (13, 9)
Out[
3]: optimal_td(line_fruits(12)), optimal_td(cyclic_fruits(13,5))
In [3]: (13, 9) Out[
Perform some experiments with the different versions of the optimal path value computation, and record your observations as a comment in your submission file. You can use the function below to get more precise timing statistics:
import time
def timed(f, x):
'''Runs f(x) and returns a pair of the resulting value and the elapsed time.'''
= time.time()
t0 = f(x)
v = time.time()
t1 return (v, t1-t0)
Once you’ve implemented the bottom-up version of the computation
(optimal_bu
below), repeat your experiments and extend your
observations.
Having successfully implemented a recursive top-down solution to the problem of computing the values of optimal paths, your next goal will be to implement an iterative, bottom-up approach. For this it may be helpful to look again at the diagram above where each node is labelled by the corresponding value of \(V(x,y)\). Notice that the values in the last column are either 1 or 0 depending on whether there is fruit at the node, while the values in each inner column only depend on the values in the column to its immediate right. This means that we can compute the value of the optimal path by working from right to left, starting with the rightmost column \(x = n\) and running down to the trivial leftmost column \(x = 0\), at each iteration computing the values of \(V(x,y)\) for all \(y\) between \(-x\) and \(x\).
- Write a function
optimal_bu(fruits)
that computes the value of the optimal path through the grid using a bottom-up dynamic programming approach, as described above. You should not use recursion, only definite iteration (i.e., for-loops).
Finally, hibernation time is approaching! Knowing the theoretical value of an optimal flight path is not good enough – you need the actual path itself. Luckily, finding one is also possible using dynamic programming.
Write a function
optimal_path(fruits)
that computes an optimal path through the grid, returning the corresponding list of displacements. (In general there could be multiple paths that are optimal, and your function only has to return one of them.) Some examples:4]: optimal_path(example_fruits) In [4]: [0, 1, 1, -1] Out[ 5]: optimal_path(line_fruits(12)) In [5]: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1] Out[ 6]: optimal_path(cyclic_fruits(13,5)) In [6]: [-1, 0, -1, -1, 0, 1, 1, -1, -1, 0, 1, 1, -1] Out[
Hint: You can use either an iterative bottom-up or recursive top-down approach. With a bottom-up approach, it is enough to compute a displacement for every node indicating the direction of an optimal path, and then use that to reconstruct a path at the end of the right-to-left sweep. If using a top-down approach, a good strategy is to adapt your function
optimal_td
to compute both the optimal value and the optimal path at the same time, and then extract the path.
Adapt your solution to the previous exercise to write a generator
optimal_paths(fruits)
that successively yields all optimal paths through the grid in an arbitrary order. Some examples:7]: list(optimal_paths(example_fruits)) In [7]: [[0, 1, 1, -1]] Out[ 8]: modified_example_fruits = [{}, {0,-1}, {1,2,-1}, {2,-2}, {1,4,-2}] In [9]: list(optimal_paths(modified_example_fruits)) In [9]: [[-1, 0, -1, 0], [0, -1, -1, 0], [0, 1, 1, -1]] Out[ 10]: g = optimal_paths(cyclic_fruits(13,5)) In [11]: next(g) In [11]: [-1, 0, -1, -1, 0, 1, 1, -1, -1, 0, 1, 1, -1] Out[ 12]: next(g) In [12]: [-1, 0, -1, -1, 0, 1, 1, -1, -1, 1, 0, 1, -1] Out[
Optimal typesetting (optional)
We expect you to write your solution to these problems in a file
named text.py
and to upload it using the form below. You
can resubmit your solution as many times as you like.
For this exercise, imagine that you are the CEO of a startup company whose business model is based on providing a service that formats texts into highly-aesthetic fixed-width blocks. The user provides a text \(T\) in the form of a list of words and specifies an upper limit \(L\) on the number of characters per line (including spaces), and you automatically format the text into a list of lines \(\ell_1,\dots,\ell_n\) so as to minimize the defect defined by the formula
\[ D = \sum_{i=1}^n (L - |\ell_i|)^3 \]
corresponding to the sum of cubes of numbers of trailing characters. For example, consider the following text:
= ['Lorem', 'ipsum', 'dolor', 'sit', 'amet', 'consectetur', \
lorem 'adipiscing', 'elit', 'sed', 'do', 'eiusmod', 'tempor', \
'incididunt', 'ut', 'labore', 'et', 'dolore', 'magna', 'aliqua']
With a limit of \(L = 20\) characters per line, a naive way of formatting the text into lines is as below:
Lorem ipsum dolor | sit amet consectetur| adipiscing elit sed | do eiusmod tempor | incididunt ut labore| et dolore magna | aliqua |
Unfortunately this looks awful, with a defect of 3³ + 0³ + 1³ + 3³ + 0³ + 5³ + 14³ = 2744. In contrast, this is how your service would format the text:
Lorem ipsum dolor | sit amet consectetur| adipiscing elit | sed do eiusmod | tempor incididunt | ut labore et | dolore magna aliqua |
With a defect of only 3³ + 0³ + 5³ + 6³ + 3³ + 8³ + 1³ = 908, the text is now beautifully typeset. The business prospects for your startup are looking rosy, if only you can get the service to work correctly!
Write a function
line_defect(L, T)
which given the parameter \(L\) and a list of words \(T\), computes the defect \((L - |\ell|)^3\) of the line \(\ell\) composed of the list of words \(T\) separated by space characters. If \(|\ell| > L\) (i.e., if the list of words does not fit in a line), you should returnmath.inf
.Some examples:
13]: line_defect(20, ['Lorem', 'ipsum', 'dolor']) In [13]: 27 Out[ 14]: line_defect(20, ['Lorem', 'ipsum', 'dolor', 'sit']) In [14]: inf Out[
Write a function
best_defect(L, T)
, which given the parameter \(L\) and a list of words \(T\), returns the minimum overall defect \(D = \sum_{i=1}^n (L - |\ell_i|)^3\) of formatting the list of words into a list of lines \(\ell_1,\dots,\ell_n\).Some examples:
15]: best_defect(20, lorem) In [15]: 908 Out[ 16]: best_defect(22, lorem) In [16]: 293 Out[
Hint: Using dynamic programming, compute the minimum defect of the subtext \(T[0:i]\) for each \(0 \le i \le n\), where \(n\) is the number of words in \(T\). In particular, figure out how to compute the minimum defect of \(T[0:i]\) given the minimum defects of \(T[0:j]\) for all \(j < i\). Finally return the minimum defect of \(T[0:n]\).
Write a function
best_format
, which given \(L\) and \(T\) as above, returns a list of lists of words representing an optimal formatting of \(T\) into lines of length \(\le L\).For testing purposes, you may find it convenient to use the following function for printing line-formatted messages:
def print_message(L, msg): print('-' * L) print('\n'.join([' '.join(blk).ljust(L) + '|' for blk in msg])) print('-' * L)
Some examples:
17]: best_format(20, lorem) In [17]: Out['Lorem', 'ipsum', 'dolor'], [['sit', 'amet', 'consectetur'], ['adipiscing', 'elit'], ['sed', 'do', 'eiusmod'], ['tempor', 'incididunt'], ['ut', 'labore', 'et'], ['dolore', 'magna', 'aliqua']] [ 18]: for L in [20,22,18]: print_message(L, best_format(L, lorem)) In [-------------------- | Lorem ipsum dolor | sit amet consectetur| adipiscing elit | sed do eiusmod | tempor incididunt | ut labore et | dolore magna aliqua -------------------- ---------------------- | Lorem ipsum dolor | sit amet consectetur | adipiscing elit sed | do eiusmod tempor | incididunt ut labore | et dolore magna aliqua---------------------- ------------------ | Lorem ipsum | dolor sit amet | consectetur | adipiscing | elit sed do | eiusmod tempor | incididunt ut | labore et dolore | magna aliqua ------------------
Hint: refine your solution for
best_defect
.
Crocodile sequences (optional)
We expect you to write your solution to these problems in a file
named crocs.py
and to upload it using the form below. You
can resubmit your solution as many times as you like.
You will recall from your biology classes that crocodiles can lay three kinds of eggs – green, red, or blue – where:
- a green egg contains either a green baby crocodile or a pair of a red egg and a blue egg, arranged left to right;
- a red egg contains either a red baby crocodile or a pair of a blue egg and a green egg, arranged left to right;
- a blue egg contains either a blue baby crocodile or a pair of a green egg and a red egg, arranged left to right.
Hence a single crocodile egg can hatch a potentially unbounded number of baby crocodiles. Moreover, these crocodile siblings always come out arranged in an orderly line according to the nesting structure of the eggs. For example, the green egg
ultimately hatches the following line of siblings:
For this exercise, imagine that you are applying for a job at a crocodile nursery, and that as part of the interview process, you will be asked to look at a line of crocodile babies and quickly determine whether they could possibly be a complete group of siblings.
Write a function
crocsibs(bcs)
which given a stringbcs
describing a line of baby crocodiles, determines whether or not it could possibly be a complete group of siblings. The line is described by a color-coded string with the capital letters G, B, and R representing a green baby, a blue baby, and a red baby respectively.For example,
crocsibs
should returnTrue
on the string"RBRGRBR"
since there is a green egg producing a red-blue-red-green-red-blue-red sequence of baby crocodiles (as shown above). Likewise, it should returnTrue
on"RBR"
, since such a sequence of babies can be produced by a blue egg (B -> GR -> RBR
). On the other hand, it should returnFalse
on"RR"
, since there is no type of crocodile egg that will hatch just two red siblings.Hint: Let \(G(i,j)\), \(B(i,j)\), \(R(i,j)\) be the logical assertions that the subsequence
bcs[i:j]
can possibly be hatched by a green egg, a blue egg, or a red egg respectively. Find mutually recursive equations for \(G(i,j)\), \(B(i,j)\), and \(R(i,j)\), and then solve them using dynamic programming.