Programming Assignment 6 - Advanced Shortest Paths
Programming Project:
Advanced Shortest Paths
Revision: April 12, 2017
Introduction
Welcome to the Advanced Shortest Paths programming project! This project is organized as a series of
coding problems in which you will implement the algorithms from the lectures, apply them to real-world
road networks and social networks, and come up with your own ideas on how to speed up the algorithms even
more. Due to time limitations, the test inputs in the grader range up to maps of New York and Colorado, but
don’t include the maps of the USA or Europe. However, here you can find the graphs in the same format
and compare the running time of your algorithms on those graphs on your own computer.
We also encourage you to create a forum thread and compete to see whose solution to the problem “Contraction
Hierarchies Large” is the fastest in terms of both preprocessing time and query time, and also which solution
is the most memory efficient. You can post the screenshots of the grader feedback in the comments, and the
thread owner can keep a table with the top results in different nominations updated in the initial post
itself.
The grader will show you the input data in the coding problems only if your solution fails on one of the
first few tests (please review the questions 7.4 and 7.5 in the FAQ section for a more detailed explanation of
this behavior of the grader). Also, in the problems with preprocessing phase (starting from the third problem,
“Compute Distance with Preprocessing”), there are two separate inputs — the graph itself (for preprocessing)
and the queries for computing the distance. You will be shown both inputs in case your solution fails on one
of the first few tests. If your solution fails on further tests, you won’t see the inputs.
Learning Outcomes
Upon completing this programming assignment you will be able to:
1. implement Bidirectional Dijkstra and use it to compute distances in social networks very quickly;
2. implement A* search algorithm and apply it to compute distances in road networks faster than the
classic algorithms do;
3. implement Contraction Hierarchies algorithm and apply it to preprocess large road networks and then
compute distances in them much faster;
4. implement even more heuristics speeding up Contraction Hierarchies and solve even bigger road
networks;
5. use your Contraction Hierarchies implementation to go further and solve the classical logistics problem
(called Travelling Salesman Problem): find the optimal path for a truck leaving the depot to visit a list
of stores, deliver the goods to them and return to the depot.
Passing Criteria: 3 out of 5
Passing this programming assignment requires passing at least 3 out of 5 code problems from this assignment.
In turn, passing a code problem requires implementing a solution that passes all the tests for this problem
in the grader and does so under the time and memory limits specified in the problem statement.
Contents
1 Problem: Friend Suggestion
1 Problem: Friend Suggestion
Problem Introduction
Social networks live on the connections between people, so friend
suggestions are one of the most important features of Facebook. One of
the most important inputs of the algorithm for friend suggestion is most
probably the current distance between you and the suggested person
in the graph of friend connections. Your task is to implement efficient
computation of this distance. The grader will test your algorithm against
different real-world networks, such as a part of the Internet, a network of
scientific citations or coauthorship, a social network of jazz musicians or
even a social network of dolphins :) You need to compute the distance
between two nodes in such a network. We took some of the graphs from
here to use in the grader, and you can play with more of them on your
own computer.
Note that Python, Ruby and Javascript are too slow to solve
the largest tests in time, so solutions in these languages won’t
be tested against some of the largest tests. Solutions in C++,
Java, C#, Haskell and Scala will be tested against all the tests.
Note that we only guarantee (as usual) that there exists a solution
under the given time and memory constraints for C++,
Java and Python3. For other languages, the solution may not
exist.
Problem Description
Task. Compute the distance between several pairs of nodes in the network.
Input Format. The first line contains two integers 𝑛 and 𝑚 — the number of nodes and edges in the
network, respectively. The nodes are numbered from 1 to 𝑛. Each of the following 𝑚 lines contains
three integers 𝑢, 𝑣 and 𝑙 describing a directed edge (𝑢, 𝑣) of length 𝑙 from the node number 𝑢 to the
node number 𝑣. (Note that some social networks are represented by directed graphs while others
correspond naturally to undirected graphs. For example, Twitter is a directed graph (with a directed
edge (𝑢, 𝑣) meaning that 𝑢 follows 𝑣), while Facebook is an undirected graph (where an undirected
edge {𝑢, 𝑣} means that 𝑢 and 𝑣 are friends). In this problem, we work with directed graphs only, for a
simple reason: it is easy to turn an undirected graph into a directed one by replacing each undirected
edge {𝑢, 𝑣} with a pair of directed edges (𝑢, 𝑣) and (𝑣, 𝑢).)
The next line contains an integer 𝑞 — the number of queries for computing the distance. Each of the
following 𝑞 lines contains two integers 𝑢 and 𝑣 — the numbers of the two nodes to compute the distance
from 𝑢 to 𝑣.
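The input format above can be read into adjacency lists as in the following minimal Python sketch. The function names read_graph and read_queries, the 0-based node indices, and the decision to also keep a reversed copy of the graph (convenient for a search running backward from the target) are assumptions of this example and are not taken from the starter files. A real solution would pass sys.stdin instead of the in-memory sample.

```python
import io


def read_graph(stream):
    """Read n, m and m directed edges; return forward and reverse adjacency lists.

    Nodes are converted to 0-based indices. adj[u] holds (v, l) pairs for
    edges u -> v; adj_rev[v] holds (u, l) pairs for the same edges reversed.
    """
    n, m = map(int, stream.readline().split())
    adj = [[] for _ in range(n)]
    adj_rev = [[] for _ in range(n)]
    for _ in range(m):
        u, v, l = map(int, stream.readline().split())
        adj[u - 1].append((v - 1, l))
        adj_rev[v - 1].append((u - 1, l))
    return adj, adj_rev


def read_queries(stream):
    """Read q and then q pairs (u, v), converted to 0-based indices."""
    q = int(stream.readline())
    return [tuple(int(x) - 1 for x in stream.readline().split()) for _ in range(q)]


# A small hypothetical input: 3 nodes, 2 edges, 1 query.
sample = io.StringIO("3 2\n1 2 4\n2 3 7\n1\n1 3\n")
adj, adj_rev = read_graph(sample)
queries = read_queries(sample)
print(adj)      # [[(1, 4)], [(2, 7)], []]
print(adj_rev)  # [[], [(0, 4)], [(1, 7)]]
print(queries)  # [(0, 2)]
```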
Constraints. 1 ≤ 𝑛 ≤ 1 000 000; 1 ≤ 𝑚 ≤ 6 000 000; 1 ≤ 𝑢, 𝑣 ≤ 𝑛; 1 ≤ 𝑙 ≤ 1 000; 1 ≤ 𝑞 ≤ 1 000. For
Python2, Python3, Ruby and Javascript, 1 ≤ 𝑚 ≤ 2 000 000.
Output Format. For each query, output one integer on a separate line. If there is no path from 𝑢 to 𝑣,
output −1. Otherwise, output the distance from 𝑢 to 𝑣.
Time Limits.
language    C    C++  Java  Python  C#    Haskell  JavaScript  Ruby  Scala
time (sec)  25   25   125   150     37.5  50       150         150   150
Sample 1.
Input:
2 1
1 2 1
4
1 1
2 2
1 2
2 1
Output:
0
0
1
-1
Explanation:
[Figure: nodes 1 and 2 joined by a directed edge of length 1 from node 1 to node 2.]
The distance from a node to itself is always 0. The distance from 1 to 2 is 1, and there is no path from
2 to 1.
Sample 2.
Input:
4 4
1 2 1
4 1 2
2 3 2
1 3 5
1
1 3
Output:
3
Explanation:
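[Figure: a graph on nodes 1 to 4 with directed edges 1 → 2 of length 1, 2 → 3 of length 2, 1 → 3 of length 5, and 4 → 1 of length 2.]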
There is a direct edge from node 1 to node 3 of length 5, but there is a shorter path 1 → 2 → 3 of
length 1 + 2 = 3.
Starter Files
The starter solutions for this problem read the input data from the standard input, pass it to a blank
procedure, and then write the result to the standard output. You are supposed to implement your algorithm
in this blank procedure if you are using C++, Java, or Python3. For other programming languages, you need
to implement a solution from scratch. Filename: friend_suggestion
What To Do
Implement the Bidirectional Dijkstra algorithm from the lectures.
Need Help?
Ask a question or see the questions asked by other learners at this forum thread.
2 Problem: Compute Distance Faster Using Coordinates
Problem Introduction
In this task you will be given a description of a real-world road network
with not just edges and their lengths, but also with the coordinates of the
nodes. Your task is still to find the distance between some pairs of nodes,
but you will need to use the additional information about coordinates to
speed up your search.
Note that Python, Ruby and Javascript are too slow to solve
the largest tests in time, so solutions in these languages won’t
be tested against some of the largest tests. Solutions in C++,
Java, C#, Haskell and Scala will be tested against all the tests.
Note that we only guarantee (as usual) that there exists a solution
under the given time and memory constraints for C++,
Java and Python3. For other languages, the solution may not
exist.
Problem Description
Task. Compute the distance between several pairs of nodes in the network.
Input Format. The first line contains two integers 𝑛 and 𝑚 — the number of nodes and edges in the
network, respectively. The nodes are numbered from 1 to 𝑛. Each of the following 𝑛 lines contains the
coordinates 𝑥 and 𝑦 of the corresponding node. Each of the following 𝑚 lines contains three integers
𝑢, 𝑣 and 𝑙 describing a directed edge (𝑢, 𝑣) of length 𝑙 from the node number 𝑢 to the node number 𝑣.
It is guaranteed that 𝑙 ≥ √((𝑥(𝑢) − 𝑥(𝑣))² + (𝑦(𝑢) − 𝑦(𝑣))²), where (𝑥(𝑢), 𝑦(𝑢)) are the coordinates of 𝑢
and (𝑥(𝑣), 𝑦(𝑣)) are the coordinates of 𝑣. The next line contains an integer 𝑞 — the number of queries
for computing the distance. Each of the following 𝑞 lines contains two integers 𝑢 and 𝑣 — the numbers
of the two nodes to compute the distance from 𝑢 to 𝑣.
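Because every edge is at least as long as the straight-line distance between its endpoints, the triangle inequality implies that any path from 𝑢 to 𝑣 is at least as long as the straight-line distance between 𝑢 and 𝑣. The following Python sketch computes that lower bound and checks the guarantee on the edges of Sample 2 below. The helper names euclid and edges_respect_bound and the 0-based indices are assumptions of this example.

```python
import math


def euclid(coords, u, v):
    """Straight-line distance between nodes u and v (0-based indices)."""
    (xu, yu), (xv, yv) = coords[u], coords[v]
    return math.hypot(xu - xv, yu - yv)


def edges_respect_bound(coords, edges):
    """Check that l >= Euclidean distance holds for every edge (u, v, l)."""
    return all(l >= euclid(coords, u, v) for u, v, l in edges)


# Coordinates and edges of Sample 2, with nodes shifted to 0-based indices.
coords = [(0, 0), (0, 1), (2, 1), (2, 0)]
edges = [(0, 1, 1), (3, 0, 2), (1, 2, 2), (0, 2, 6)]

print(edges_respect_bound(coords, edges))  # True
print(euclid(coords, 0, 2))                # sqrt(5), a lower bound on the distance 3
```

The bound is a floating-point number while edge lengths are integers, so a solution that mixes the two should take care with rounding.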
Constraints. 1 ≤ 𝑛 ≤ 110 000; 1 ≤ 𝑚 ≤ 250 000; −10⁹ ≤ 𝑥, 𝑦 ≤ 10⁹; 1 ≤ 𝑢, 𝑣 ≤ 𝑛; 0 ≤ 𝑙 ≤ 100 000;
1 ≤ 𝑞 ≤ 10 000. For Python2, Python3, Ruby and Javascript, 1 ≤ 𝑛 ≤ 11 000, 1 ≤ 𝑚 ≤ 30 000.
Output Format. For each query, output one integer. If there is no path from 𝑢 to 𝑣, output −1. Otherwise,
output the distance from 𝑢 to 𝑣.
Time Limits.
language    C    C++  Java  Python  C#    Haskell  JavaScript  Ruby  Scala
time (sec)  50   50   100   50      75    100      50          50    100
Sample 1.
Input:
2 1
0 0
0 1
1 2 1
4
1 1
2 2
1 2
2 1
Output:
0
0
1
-1
Explanation:
[Figure: nodes 1 and 2 plotted at their coordinates, with a directed edge of length 1 from node 1 to node 2.]
The distance from a node to itself is always 0. The distance from 1 to 2 is 1, and there is no path from
2 to 1.
Sample 2.
Input:
4 4
0 0
0 1
2 1
2 0
1 2 1
4 1 2
2 3 2
1 3 6
1
1 3
Output:
3
Explanation:
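[Figure: nodes 1 to 4 plotted at their coordinates, with directed edges 1 → 2 of length 1, 2 → 3 of length 2, 1 → 3 of length 6, and 4 → 1 of length 2.]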
There is a direct edge from node 1 to node 3 of length 6, but there is a shorter path 1 → 2 → 3 of
length 1 + 2 = 3.
Starter Files
The starter solutions for this problem read the input data from the standard input, pass it to a blank
procedure, and then write the result to the standard output. You are supposed to implement your algorithm
in this blank procedure if you are using C++, Java, or Python3. For other programming languages, you need
to implement a solution from scratch. Filename: dist_with_coords
What To Do
Implement the A* algorithm from the lectures.
Need Help?
Ask a question or see the questions asked by other learners at this forum thread.
3 Problem: Compute Distance with Preprocessing
Problem Introduction
In this task you will be first given a graph of a real road network, and
you can preprocess it as you wish under the preprocessing time limit.
Then you will get a set of queries for computing distance, and you will
need to answer all of them under the separate time limit for queries. You
will have to respond to queries much faster than in the previous problem.
Note that Python, Ruby and Javascript are too slow to solve
the largest tests in time, so solutions in these languages won’t
be tested against some of the largest tests. Solutions in C++,
Java, C#, Haskell and Scala will be tested against all the tests.
Note that we only guarantee (as usual) that there exists a solution
under the given time and memory constraints for C++,
Java and Python3. For other languages, the solution may not
exist.
Problem Description
Task. Compute the distance between several pairs of nodes in the network.
Input Format. You will be given the input for this problem in two parts. The first part contains the
description of a road network, the second part contains the queries. You have a separate time limit for
preprocessing the graph. Under this time limit, you need to read the graph and preprocess it. After
you’ve preprocessed the graph, you need to output the string “Ready” (without quotes) and flush the
output buffer (the starter files for C++, Java and Python3 do that for you; if you use another language,
you will have to find out how to do this). Only after you output the string “Ready” will you be given
the queries. You will have a time limit for the querying part, and under this time limit you will need
to input all the queries and output the results for each of the queries.
The first line of the road network description contains two integers 𝑛 and 𝑚 — the number of nodes
and edges in the network, respectively. The nodes are numbered from 1 to 𝑛. Each of the following
𝑚 lines contains three integers 𝑢, 𝑣 and 𝑙 describing a directed edge (𝑢, 𝑣) of length 𝑙 from the node
number 𝑢 to the node number 𝑣.
The first line of the queries description contains an integer 𝑞 — the number of queries for computing
the distance. Each of the following 𝑞 lines contains two integers 𝑢 and 𝑣 — the numbers of the two
nodes to compute the distance from 𝑢 to 𝑣.
Constraints. 1 ≤ 𝑛 ≤ 110 000; 1 ≤ 𝑚 ≤ 250 000; 1 ≤ 𝑢, 𝑣 ≤ 𝑛; 1 ≤ 𝑙 ≤ 200 000; 1 ≤ 𝑞 ≤ 10 000. It is
guaranteed that the correct distances are less than 1 000 000 000. For Python2, Python3, Ruby
and Javascript, 1 ≤ 𝑛 ≤ 11 000, 1 ≤ 𝑚 ≤ 25 000, 1 ≤ 𝑞 ≤ 1 000.
Output Format. After you’ve read the description of the road network and done your preprocessing,
output one string “Ready” (without quotes) on a separate line and flush the output buffer. Then read
the queries, and for each query, output one integer on a separate line. If there is no path from 𝑢 to 𝑣,
output −1. Otherwise, output the distance from 𝑢 to 𝑣.
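The two-phase protocol described above can be sketched in Python as follows. The function run and its query parameter are hypothetical names used only for illustration, and the lambda passed in the demo is a placeholder rather than a real distance computation. The point of the sketch is the ordering: read the graph, preprocess, print “Ready”, flush, and only then read the queries.

```python
import io


def run(inp, out, query):
    """Read the graph, announce readiness, then answer the queries.

    `query(u, v)` is a caller-supplied function returning the distance
    from u to v, or -1 if v is unreachable (hypothetical interface).
    """
    n, m = map(int, inp.readline().split())
    edges = [tuple(map(int, inp.readline().split())) for _ in range(m)]

    # ... preprocessing of the graph (using n and edges) would happen here ...

    print("Ready", file=out)
    out.flush()  # the queries arrive only after "Ready" has actually been sent

    q = int(inp.readline())
    for _ in range(q):
        u, v = map(int, inp.readline().split())
        print(query(u, v), file=out)
    out.flush()


# Demo on in-memory streams; a real solution would pass sys.stdin and sys.stdout.
demo_in = io.StringIO("2 1\n1 2 1\n2\n1 1\n2 1\n")
demo_out = io.StringIO()
run(demo_in, demo_out, lambda u, v: 0 if u == v else -1)  # placeholder query
print(demo_out.getvalue())
```

Without the flush, “Ready” may sit in the output buffer while the program waits for queries that the grader has not sent yet.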
Time Limits.
language             C    C++  Java  Python  C#    Haskell  JavaScript  Ruby  Scala
preprocessing (sec)  10   10   45    50      15    20       50          50    90
query time (sec)     2    2    9     10      3     4        10          10    18
Sample 1.
Input:
2 1
1 2 1
4
1 1
2 2
1 2
2 1
Output:
Ready
0
0
1
-1
Explanation:
[Figure: nodes 1 and 2 joined by a directed edge of length 1 from node 1 to node 2.]
The distance from a node to itself is always 0. The distance from 1 to 2 is 1, and there is no path from
2 to 1.
Sample 2.
Input:
4 4
1 2 1
4 1 2
2 3 2
1 3 5
1
1 3
Output:
Ready
3
Explanation:
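[Figure: a graph on nodes 1 to 4 with directed edges 1 → 2 of length 1, 2 → 3 of length 2, 1 → 3 of length 5, and 4 → 1 of length 2.]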
There is a direct edge from node 1 to node 3 of length 5, but there is a shorter path 1 → 2 → 3 of
length 1 + 2 = 3.
Starter Files
The starter solutions for this problem read the input data from the standard input, pass it to a blank
procedure, and then write the result to the standard output. You are supposed to implement your algorithm
in this blank procedure if you are using C++, Java, or Python3. For other programming languages, you need
to implement a solution from scratch. Filename: dist_preprocess_small
What To Do
Implement the Contraction Hierarchies algorithm from the lectures. It is not necessary to implement all the
heuristics to pass, only some of them.
Need Help?
Ask a question or see the questions asked by other learners at this forum thread.
4 Advanced Problem: Compute Distance with Preprocessing on
Larger Road Networks
We strongly recommend you start solving advanced problems only when you are done with the basic problems
(for some advanced problems, algorithms are not covered in the video lectures and require additional ideas
to be solved; for some other advanced problems, algorithms are covered in the lectures, but implementing
them is a more challenging task than for other problems).
Problem Introduction
This problem is the same as the previous one, but you will have to solve
larger road networks under the same time limits.
Note that Python, Ruby and Javascript are too slow to solve
the largest tests in time, so solutions in these languages won’t
be tested against some of the largest tests. Solutions in C++,
Java, C#, Haskell and Scala will be tested against all the tests.
Note that we only guarantee (as usual) that there exists a solution
under the given time and memory constraints for C++,
Java and Python3. For other languages, the solution may not
exist.
Problem Description
Task. Compute the distance between several pairs of nodes in the network.
Input Format. See the input format for the previous problem.
Constraints. 1 ≤ 𝑛 ≤ 500 000; 1 ≤ 𝑚 ≤ 1 100 000; 1 ≤ 𝑢, 𝑣 ≤ 𝑛; 1 ≤ 𝑙 ≤ 200 000; 1 ≤ 𝑞 ≤ 10 000. It is
guaranteed that the correct distances are less than 1 000 000 000. For Python2, Python3, Ruby
and Javascript, 1 ≤ 𝑛 ≤ 11 000, 1 ≤ 𝑚 ≤ 25 000, 1 ≤ 𝑞 ≤ 1 000.
Output Format. See the output format for the previous problem.
Time Limits.
language             C    C++  Java  Python  C#    Haskell  JavaScript  Ruby  Scala
preprocessing (sec)  55   55   220   55      82.5  110      55          55    220
query time (sec)     5    5    20    5       7.5   10       5           5     20
Starter Files
The starter solutions for this problem read the input data from the standard input, pass it to a blank
procedure, and then write the result to the standard output. You are supposed to implement your algorithm
in this blank procedure if you are using C++, Java, or Python3. For other programming languages, you need
to implement a solution from scratch. Filename: dist_preprocess_large
What To Do
Implement the Contraction Hierarchies algorithm from the lectures. It will be necessary to implement more
of the heuristics and tricks than for the previous problem. You can also come up with your own ideas and
techniques for speeding up the solution. We encourage you to compete on the forums to see whose solution is the
fastest (both in terms of preprocessing time and query time) and uses the least memory!
Need Help?
Ask a question or see the questions asked by other learners at this forum thread.
5 Advanced Problem: Travelling Salesman Problem
We strongly recommend you start solving advanced problems only when you are done with the basic problems
(for some advanced problems, algorithms are not covered in the video lectures and require additional ideas
to be solved; for some other advanced problems, algorithms are covered in the lectures, but implementing
them is a more challenging task than for other problems).
Problem Introduction
In this task you will solve the classical logistics problem called Travelling
Salesman Problem: you are given the location of a depot and the locations
of a list of stores on a road network, and you need to find the shortest
path for a truck to start in the depot, visit each of the stores to deliver
the goods there, and return to the depot.
Note that Python, Ruby and Javascript are too slow to solve
the largest tests in time, so solutions in these languages won’t
be tested against some of the largest tests. Solutions in C++,
Java, C#, Haskell and Scala will be tested against all the tests.
Note that we only guarantee (as usual) that there exists a solution
under the given time and memory constraints for C++,
Java and Python3. For other languages, the solution may not
exist.
Problem Description
Task. Compute the length of the shortest path starting in the depot, visiting each store at least once and
returning to the depot.
Input Format. You will be given the input for this problem in two parts. The first part contains the
description of a road network, the second part contains the queries. You have a separate time limit for
preprocessing the graph. Under this time limit, you need to read the graph and preprocess it. After
you’ve preprocessed the graph, you need to output the string “Ready” (without quotes) and flush the
output buffer (the starter files for C++, Java and Python3 do that for you; if you use another language,
you will have to find out how to do this). Only after you output the string “Ready” will you be given
the queries. You will have a time limit for the querying part, and under this time limit you will need
to input all the queries and output the results for each of the queries.
The first line of the road network description contains two integers 𝑛 and 𝑚 — the number of nodes
and edges in the network, respectively. The nodes are numbered from 1 to 𝑛. Each of the following
𝑚 lines contains three integers 𝑢, 𝑣 and 𝑙 describing a directed edge (𝑢, 𝑣) of length 𝑙 from the node
number 𝑢 to the node number 𝑣.
The first line of the queries description contains an integer 𝑞 — the number of queries for computing
the distance. Each of the following 𝑞 lines starts with the integer 𝑘 — the number of points the truck
must visit, including all the stores and the depot. There are 𝑘 more integers on the same line. The first
of them is the number of the node corresponding to the depot location. The next 𝑘 − 1 integers are the
numbers of the nodes corresponding to the store locations.
depot, visiting each store at least once and returning to the depot, output −1. Otherwise, output the
length of the shortest path starting at a depot, visiting each store at least once and returning to the
depot.
Time Limits.
language             C    C++  Java  Python  C#    Haskell  JavaScript  Ruby  Scala
preprocessing (sec)  20   20   50    50      30    40       50          50    50
query time (sec)     16   16   40    40      24    32       40          40    40
[Figure: the sample road network on nodes 1 to 4, with edges of length 1.]
For the first query, we need to start in the node 1, visit node 2 and return to node 1. The shortest
path for that is to get directly from 1 to 2 and then directly back from 2 to 1. The length is 2.
For the second query, we need to start in the node 1, visit node 3 and return to node 1. A shortest
path for that is to get from 1 to 2, then from 2 to 3, then return from 3 to 2, then return from 2 to 1.
Another shortest path would be to go from 1 to 2, then from 2 to 3, then from 3 to 4, then from 4 to
1. The length is 4.
For the third query, we need to start in the node 1, visit all the other nodes and return to node 1. The
shortest path for that is to go from 1 to 2, then from 2 to 3, then from 3 to 4, then from 4 to 1. The
length is 4.
Starter Files
The starter solutions for this problem read the input data from the standard input, pass it to a blank
procedure, and then write the result to the standard output. You are supposed to implement your algorithm
in this blank procedure if you are using C++, Java, or Python3. For other programming languages, you need
to implement a solution from scratch. Filename: travelling_salesman_problem
What To Do
First you need to compute the pairwise distances between all the stores and between the depot and all the
stores, to and from. To do that quickly, you will need to use your solution to the previous problem (preprocess
the graph using Contraction Hierarchies preprocessing, then find the distance between each pair of interesting
locations using Contraction Hierarchies querying). After that, you have the following problem: given a graph
with 𝑘 ≤ 20 nodes and weighted edges, find the shortest path starting in some node, visiting all the nodes
at least once and returning to the initial node. This is the classical Travelling Salesman Problem which can
be solved for small values of 𝑘 using dynamic programming. You can try to come up with this dynamic
programming yourself, or watch this lecture about it in the next course of our Specialization, Advanced
Algorithms and Complexity.
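When developing the dynamic programming solution, it can help to have a slow but obviously correct baseline to compare against on tiny inputs. The sketch below is such a baseline and not the intended solution: it tries every visiting order, so it is only practical for very small 𝑘. The function name tsp_brute_force and the convention that node 0 is the depot are assumptions of this example. It expects a matrix of pairwise shortest-path distances, which is why visiting each location exactly once in the matrix corresponds to visiting it at least once in the road network.

```python
from itertools import permutations

INF = float("inf")


def tsp_brute_force(dist):
    """Length of the shortest closed tour from node 0 through all nodes.

    dist[i][j] is the shortest-path distance from i to j (INF if unreachable).
    Returns -1 if no such tour exists.
    """
    k = len(dist)
    if k == 1:
        return 0  # only the depot: nothing to visit
    best = INF
    for order in permutations(range(1, k)):
        total, prev = 0, 0
        for node in order:
            total += dist[prev][node]
            prev = node
        total += dist[prev][0]  # return to the depot
        best = min(best, total)
    return -1 if best == INF else best


# Hypothetical example: a directed cycle 0 -> 1 -> 2 -> 3 -> 0 with unit edges,
# so the shortest-path distance from i to j is (j - i) mod 4.
dist = [[(j - i) % 4 for j in range(4)] for i in range(4)]
print(tsp_brute_force(dist))  # 4
```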
Need Help?
Ask a question or see the questions asked by other learners at this forum thread.
6 General Instructions and Recommendations on Solving Algorithmic Problems
Your main goal in an algorithmic problem is to implement a program that solves a given computational
problem in just a few seconds even on massive datasets. Your program should read a dataset from the standard
input and write an answer to the standard output.
Below we provide general instructions and recommendations on solving such problems. Before reading
them, go through readings and screencasts in the first module that show a step by step process of solving
two algorithmic problems: link.
same way on your machine and on the testing machine (note that a buggy program may behave differently
when compiled by different compilers, or even by the same compiler with different flags).
If your C/C++ compiler does not recognize -std=c++14 flag, try replacing it with -std=c++0x flag
or compiling without this flag at all (all starter solutions can be compiled without it). On Linux
and MacOS, you most probably have the required compiler. On Windows, you may use your favorite
compiler or install, e.g., cygwin.
∙ Python 2 (CPython 2.7). File extensions: .py2 or .py (a file ending in .py needs to have a first line
which is a comment containing “python2”). No flags:
python2
∙ Python 3 (CPython 3.4). File extensions: .py3 or .py (a file ending in .py needs to have a first line
which is a comment containing “python3”). No flags:
python3
6.5 Testing Your Program
When your program is ready, you start testing it. It makes sense to start with small datasets (for example,
sample tests provided in the problem description). Ensure that your program produces a correct result.
You then proceed to checking how long it takes your program to process a massive dataset. For
this, it makes sense to implement your algorithm as a function like solve(dataset) and then implement an
additional procedure generate() that produces a large dataset. For example, if an input to a problem is a
sequence of integers of length 1 ≤ 𝑛 ≤ 10⁵, then generate a sequence of length exactly 10⁵, pass it to your
solve() function, and ensure that the program outputs the result quickly.
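A minimal Python sketch of this workflow is shown below. The body of solve() is only a stand-in (it returns the maximum element), and the parameters of generate() are assumptions of this example; replace both with whatever your problem requires.

```python
import random
import time


def generate(n, max_value=10**6, seed=0):
    """Produce a reproducible random sequence of n integers in [0, max_value]."""
    rng = random.Random(seed)
    return [rng.randint(0, max_value) for _ in range(n)]


def solve(dataset):
    # Stand-in for your algorithm; here it just returns the maximum element.
    return max(dataset)


dataset = generate(10**5)
start = time.perf_counter()
result = solve(dataset)
elapsed = time.perf_counter() - start
print(f"n = {len(dataset)}, solved in {elapsed:.3f} s")
```

Fixing the seed makes a slow run reproducible, so the same dataset can be used again after each optimization.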
Also, check the boundary values. Ensure that your program correctly processes sequences of size 𝑛 =
1, 2, 10⁵. If a sequence of integers from 0 to, say, 10⁶ is given as an input, check how your program behaves
when it is given a sequence 0, 0, . . . , 0 or a sequence 10⁶, 10⁶, . . . , 10⁶. Check also on randomly generated
data. For each such test, check that your program produces a correct result (or at least a reasonable-looking
result).
In the end, we encourage you to stress test your program to make sure it passes in the system at the first
attempt. See the readings and screencasts from the first week to learn about testing and stress testing: link.
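For the shortest-path problems in this assignment, one possible stress-testing setup is sketched below in Python: a plain Dijkstra search serves as the slow reference, small random graphs are generated, and the first input on which a candidate solution disagrees is returned. The names dijkstra, random_graph, and stress, the graph sizes, and the candidate interface candidate(adj, s, t) are all assumptions of this example.

```python
import heapq
import random


def dijkstra(adj, s, t):
    """Reference: plain Dijkstra; returns the distance from s to t, or -1."""
    dist = {s: 0}
    heap = [(0, s)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == t:
            return d
        if d > dist[u]:
            continue  # stale heap entry
        for v, l in adj[u]:
            nd = d + l
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return -1


def random_graph(rng, n, m, max_len=10):
    """Random directed graph as adjacency lists of (target, length) pairs."""
    adj = [[] for _ in range(n)]
    for _ in range(m):
        adj[rng.randrange(n)].append((rng.randrange(n), rng.randint(1, max_len)))
    return adj


def stress(candidate, rounds=200, seed=1):
    """Return the first (adj, s, t) on which candidate disagrees, else None."""
    rng = random.Random(seed)
    for _ in range(rounds):
        n = rng.randint(1, 8)
        adj = random_graph(rng, n, rng.randint(0, 15))
        s, t = rng.randrange(n), rng.randrange(n)
        if candidate(adj, s, t) != dijkstra(adj, s, t):
            return adj, s, t
    return None


print(stress(dijkstra))  # None: the reference trivially agrees with itself
```

Small graphs are deliberate: a failing case with a handful of nodes can be traced by hand.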
7 Frequently Asked Questions
7.1 I submit the program, but nothing happens. Why?
You need to create a submission and upload the file with your solution in one of the programming languages C,
C++, Java, or Python (see Subsections 6.3 and 6.4). Make sure that after uploading the file with your solution
you press the blue “Submit” button at the bottom. After that, the grading starts, and the submission
being graded is enclosed in an orange rectangle. After the testing is finished, the rectangle disappears, and
the results of the testing of all problems are shown to you.
7.2 I submit the solution only for one problem, but all the problems in the
assignment are graded. Why?
Each time you submit any solution, the last uploaded solution for each problem is tested. Don’t worry: this
doesn’t affect your score even if the submissions for the other problems are wrong. As soon as you pass the
sufficient number of problems in the assignment (see the PDF with instructions), you pass the assignment.
After that, you can improve your result if you successfully pass more problems from the assignment. We
recommend working on one problem at a time, checking whether your solution for any given problem passes
in the system as soon as you are confident in it. However, it is better to test it first; please refer to the
reading about stress testing: link.
7.3 What are the possible grading outcomes, and how to read them?
Your solution may either pass or not. To pass, it must work without crashing and return the correct answers
on all the test cases we prepared for you, and do so under the time limit and memory limit constraints
specified in the problem statement. If your solution passes, you get the corresponding feedback "Good job!"
and get a point for the problem. If your solution fails, it can be because it crashes, returns a wrong answer,
runs for too long or uses too much memory for some test case. The feedback will contain the number of
the test case on which your solution fails and the total number of test cases in the system. The tests for the
problem are numbered from 1 to the total number of test cases for the problem, and the program is always
tested on all the tests in the order from the test number 1 to the test with the biggest number.
Here are the possible outcomes:
Good job! Hurrah! Your solution passed, and you get a point!
Wrong answer. Your solution has output an incorrect answer for some test case. If it is a sample test case from
the problem statement, or if you are solving Programming Assignment 1, you will also see the input
data, the output of your program and the correct answer. Otherwise, you won’t know the input, the
output, and the correct answer. Check that you consider all the cases correctly, avoid integer overflow,
output the required white space, output the floating point numbers with the required precision, don’t
output anything in addition to what you are asked to output in the output specification of the problem
statement. See this reading on testing: link.
Time limit exceeded. Your solution worked longer than the allowed time limit for some test case. If it
is a sample test case from the problem statement, or if you are solving Programming Assignment 1,
you will also see the input data and the correct answer. Otherwise, you won’t know the input and the
correct answer. Check again that your algorithm has a good enough running time estimate. Test your
program locally on the test of maximum size allowed by the problem statement and see how long it
runs. Check that your program doesn’t wait for some input from the user, which would make it wait
forever. See this reading on testing: link.
Memory limit exceeded. Your solution used more than the allowed memory limit for some test case. If it
is a sample test case from the problem statement, or if you are solving Programming Assignment 1,
you will also see the input data and the correct answer. Otherwise, you won’t know the input and the
correct answer. Estimate the amount of memory that your program is going to use in the worst case
and check that it is less than the memory limit. Check that you don’t create too large arrays or data
structures. Check that you don’t create large arrays or lists or vectors consisting of empty arrays or
empty strings, since those in some cases still eat up memory. Test your program locally on the test of
maximum size allowed by the problem statement and look at its memory consumption in the system.
Cannot check answer. Perhaps output format is wrong. This happens when you output something
completely different from what is expected. For example, you are required to output the word “Yes” or “No”, but
you output the number 1 or 0, or vice versa. Or your program has empty output. Or your program outputs
not only the correct answer, but also some additional information (this is not allowed, so please follow
exactly the output format specified in the problem statement). Maybe your program doesn’t output
anything, because it crashes.
Unknown signal 6 (or 7, or 8, or 11, or some other). This happens when your program crashes. It
can be because of division by zero, accessing memory outside of the array bounds, using uninitialized
variables, too deep recursion that triggers stack overflow, sorting with a contradictory comparator,
removing elements from an empty data structure, trying to allocate too much memory, and many other
reasons. Look at your code and think about all those possibilities. Make sure that you use the same
compilers and the same compiler options as we do. Try different testing techniques from this reading:
link.
Internal error: exception... Most probably, you submitted a compiled program instead of source
code.
Grading failed. Something very wrong happened with the system. Contact Coursera for help or write in
the forums to let us know.
make one’s programs work, one must test them really well. Sometimes, the programs still don’t work although
you tried really hard to test them, and you need to be both skilled and creative to fix your bugs. Solutions
to algorithmic problems are among the hardest to implement correctly. That’s why in this Specialization you
will gain this important experience which will be invaluable in the future when you write programs which
you really need to get right.
It is crucial for you to learn to test and fix your programs yourself. In real life, often there will be no
or only partial information about the failure of your program or service. Still, you will have to reproduce the
failure to fix it (or just guess what it is, but that’s rare, and you will still need to reproduce the failure to
make sure you have really fixed it). When you solve algorithmic problems, it is very frequent to make subtle
mistakes. That’s why you should apply the testing techniques described in this reading to find the failing
test case and fix your program.
7.6 My solution does not pass the tests. May I post it in the forum and ask
for help?
No, please do not post any solutions in the forum or anywhere on the web, even if a solution does not
pass the tests (as in this case you are still revealing parts of a correct solution). Recall the third item
of the Coursera Honor Code: “I will not make solutions to homework, quizzes, exams, projects, and other
assignments available to anyone else (except to the extent an assignment explicitly permits sharing solutions).
This includes both solutions written by me, as well as any solutions provided by the course staff or others”
(link).
7.7 My implementation always fails in the grader, though I already tested and
stress tested it a lot. Would it not be better if you gave me a solution to
this problem or at least the test cases that you use? I will then be able to
fix my code and will learn how to avoid making mistakes. Otherwise, I do
not feel that I learn anything from solving this problem. I am just stuck.
First of all, you always learn from your mistakes.
The process of trying to invent new test cases that might break your program, and then checking whether
they actually do, is often enlightening. Thinking about the invariants which you expect your loops, ifs, etc. to keep, and
proving them wrong (or right), gives you a much better understanding of what happens inside your program and in the general
algorithm you’re studying.
Also, it is important to be able to find a bug in your implementation without knowing a test case and
without having a reference solution. Assume that you designed an application and an annoyed user reports
that it crashed. Most probably, the user will not tell you the exact sequence of operations that led to a crash.
Moreover, there will be no reference application. Hence, once again, it is important to be able to locate a
bug in your implementation yourself, without a magic oracle giving you either a test case that your program
fails or a reference solution. We encourage you to use programming assignments in this class as a way of
practicing this important skill.
If you have already tested a lot (considered all corner cases that you can imagine, constructed a set of
manual test cases, applied stress testing), but your program still fails and you are stuck, try to ask for help
on the forum. We encourage you to do this by first explaining what kind of corner cases you have already
considered (it may happen that when writing such a post you will realize that you missed some corner cases!)
and only then asking other learners to give you more ideas for test cases.