0% found this document useful (0 votes)

146 views21 pages

TSP - Infrastructure For The Traveling Salesperson Problem: Michael Hahsler Kurt Hornik

The document introduces the R package TSP, which provides tools for handling and solving traveling salesperson problems (TSP). TSP is an important combinatorial optimization problem where the goal is to find the shortest route to visit each city in a list once and return to the starting city. The package includes classes to specify TSP instances and solutions, as well as heuristics to find good solutions. It also interfaces with Concorde, one of the best exact TSP solvers. The package provides an infrastructure for working with and solving TSP problems in R.

Uploaded by

Sri Hari Charan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

146 views21 pages

TSP - Infrastructure For The Traveling Salesperson Problem: Michael Hahsler Kurt Hornik

Uploaded by

Sri Hari Charan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

TSP – Infrastructure for the Traveling Salesperson

Problem

Michael Hahsler Kurt Hornik

Southern Methodist University Wirtschaftsuniversität Wien

Abstract
The traveling salesperson (or, salesman) problem (TSP) is a well known and important
combinatorial optimization problem. The goal is to find the shortest tour that visits
each city in a given list exactly once and then returns to the starting city. Despite this
simple problem statement, solving the TSP is difficult since it belongs to the class of
NP-complete problems. The importance of the TSP arises besides from its theoretical
appeal from the variety of its applications. Typical applications in operations research
include vehicle routing, computer wiring, cutting wallpaper and job sequencing. The main
application in statistics is combinatorial data analysis, e.g., reordering rows and columns
of data matrices or identifying clusters. In this paper we introduce the R package TSP
which provides a basic infrastructure for handling and solving the traveling salesperson
problem. The package features S3 classes for specifying a TSP and its (possibly optimal)
solution as well as several heuristics to find good solutions. In addition, it provides an
interface to Concorde, one of the best exact TSP solvers currently available.

Keywords: combinatorial optimization, traveling salesman problem, R.

1. Introduction
The traveling salesperson problem (TSP; Lawler, Lenstra, Rinnooy Kan, and Shmoys 1985;
Gutin and Punnen 2002) is a well known and important combinatorial optimization problem.
The goal is to find the shortest tour that visits each city in a given list exactly once and
then returns to the starting city. Formally, the TSP can be stated as follows. The distances
between n cities are stored in a distance matrix D with elements dij where i, j = 1 . . . n and
the diagonal elements dii are zero. A tour can be represented by a cyclic permutation π
of {1, 2, . . . , n} where π(i) represents the city that follows city i on the tour. The traveling
salesperson problem is then the optimization problem to find a permutation π that minimizes
the length of the tour denoted by

n
X
diπ(i) . (1)
i=1

For this minimization task, the tour length of (n − 1)! permutation vectors have to be com-
pared. This results in a problem which is very hard to solve and in fact known to be NP-
complete (Johnson and Papadimitriou 1985a). However, solving TSPs is an important part
of applications in many areas including vehicle routing, computer wiring, machine sequencing
2 Infrastructure for the TSP

and scheduling, frequency assignment in communication networks (Lenstra and Kan 1975;
Punnen 2002). Applications in statistical data analysis include ordering and clustering ob-
jects. For example, data analysis applications in psychology ranging from profile smoothing to
finding an order in developmental data are presented by Hubert and Baker (1978). Clustering
and ordering using TSP solvers is currently becoming popular in biostatistics. For example,
Ray, Bandyopadhyay, and Pal (2007) describe an application for ordering genes and Johnson
and Liu (2006) use a TSP solver for clustering proteins.
In this paper we give a very brief overview of the TSP and introduce the R package TSP
which provides an infrastructure for handling and solving TSPs. The paper is organized as
follows. In Section 2 we briefly present important aspects of the TSP including different
problem formulations and approaches to solve TSPs. In Section 3 we give an overview of the
infrastructure implemented in TSP and the basic usage. In Section 4, several examples are
used to illustrate the package’s capabilities. Section 5 concludes the paper.
A previous version of this manuscript was published in the Journal of Statistical Software
(Hahsler and Hornik 2007).

2. Theory
In this section, we briefly summarize some aspects of the TSP which are important for the
implementation of the TSP package described in this paper. For a complete treatment of all
aspects of the TSP, we refer the interested reader to the classic book edited by Lawler et al.
(1985) and the more modern book edited by Gutin and Punnen (2002).
It has to be noted that in this paper, following the origin of the TSP, the term distance is
used. Distance is used here interchangeably with dissimilarity or cost and, unless explicitly
stated, no restrictions to measures which obey the triangle inequality are made. An important
distinction can be made between the symmetric TSP and the more general asymmetric TSP.
For the symmetric case (normally referred to as just TSP ), for all distances in D the equality
dij = dji holds, i.e., it does not matter if we travel from i to j or the other way round, the
distance is the same. In the asymmetric case (called ATSP ), the distances are not equal for
all pairs of cities. Problems of this kind arise when we do not deal with spatial distances
between cities but, e.g., with the cost or necessary time associated with traveling between
locations, where the price for the plane ticket between two cities may be different depending
on which way we go.

2.1. Different formulations of the TSP

Other than the permutation problem in the introduction, the TSP can also be formulated
as a graph theoretic problem. Here the TSP is formulated by means of a complete graph
G = (V, E), where the cities correspond to the node set V = {1, 2, . . . , n} and each edge ei ∈ E
has an associated weight wi representing the distance between the nodes it connects. If the
graph is not complete, the missing edges can be replaced by edges with very large distances.
The goal is to find a Hamiltonian cycle, i.e., a cycle which visits each node in the graph
exactly once, with the least weight in the graph (Hoffman and Wolfe 1985). This formulation
naturally leads to procedures involving minimum spanning trees for tour construction or edge
exchanges to improve existing tours.
TSPs can also be represented as integer and linear programming problems (see, e.g., Punnen
Michael Hahsler, Kurt Hornik 3

2002). The integer programming (IP) formulation is based on the assignment problem with
additional constraint of no sub-tours:
Pn Pn
Minimize i=1 j=1 dij xij
Pn
Subject to Pni=1 xij = 1, j = 1, . . . , n,
j=1 xij = 1, i = 1, . . . , n,
xij = 0 or 1
no sub-tours allowed

The solution matrix X = (xij ) of the assignment problem represents a tour or a collection
of sub-tour (several unconnected cycles) where only edges which corresponding to elements
xij = 1 are on the tour or a sub-tour. The additional restriction that no sub-tours are
allowed (called sub-tour elimination constraints) restrict the solution to only proper tours.
Unfortunately, the number of sub-tour elimination constraints grows exponentially with the
number of cities which leads to an extremely hard problem.
The linear programming (LP) formulation of the TSP is given by:
Pm T
Minimize i=1 wi xi = w x

Subject to x∈S

where m is the number of edges ei in G, wi ∈ w is the weight of edge ei and x is the incidence
vector indicating the presence or absence of each edge in the tour. Again, the constraints
given by x ∈ S are problematic since they have to contain the set of incidence vectors of all
possible Hamiltonian cycles in G which amounts to a direct search of all (n − 1)! possibilities
and thus in general is infeasible. However, relaxed versions of the linear programming problem
with removed integrality and sub-tour elimination constraints are extensively used by modern
TSP solvers where such a partial description of constraints is used and improved iteratively
in a branch-and-bound approach.

2.2. Useful manipulations of the distance matrix

Sometimes it is useful to transform the distance matrix D = (dij ) of a TSP into a different
matrix D0 = (d0ij ) which has the same optimal solution. Such a transformation requires that
for any Hamiltonian cycle H in a graph represented by its distance matrix D the equality
X X
dij = α d0ij + β
i,j∈H i,j∈H

holds for suitable α > 0 and β ∈ R. From the equality we see that additive and multiplicative
constants leave the optimal solution invariant. This property is useful to rescale distances,
e.g., for many solvers, distances in the interval [0, 1] have to be converted into integers from 1
to a maximal value.
A different manipulation is to reformulate an asymmetric TSP as a symmetric TSP. This is
possible by doubling the number of cities (Jonker and Volgenant 1983). For each city a dummy
city is added. Between each city and its corresponding dummy city a very small value (e.g.,
−∞) is used. This makes sure that each city always occurs in the solution together with its
dummy city. The original distances are used between the cities and the dummy cities, where
4 Infrastructure for the TSP

each city is responsible for the distance going to the city and the dummy city is responsible for
the distance coming from the city. The distances between all cities and the distances between
all dummy cities are set to a very large value (e.g., ∞) which makes these edges infeasible.
An example for equivalent formulations as an asymmetric TSP (to the left) and a symmetric
TSP (to the right) for three cities is:

∞ ∞ −∞ d21 d31
 
0
  ∞ 0 ∞ d12 −∞ d31 
0 d12 d13 
∞

∞ 0 d13 d23 −∞ 
d21 0 d23  ⇐⇒ 
−∞ d12 d13

0 ∞ ∞
d31 d32 0 
 d21 −∞ d23

∞ 0 ∞
d31 d32 −∞ ∞ ∞ 0

Instead of the infinity values suitably large negative and positive values can be used. The new
symmetric TSP can be solved using techniques for symmetric TSPs which are currently far
more advanced than techniques for ATSPs. Removing the dummy cities from the resulting
tour gives the solution for the original ATSP.

2.3. Finding exact solutions for the TSP

Finding the exact solution to a TSP with n cities requires to check (n − 1)! possible tours. To
evaluate all possible tours is infeasible for even small TSP instances. To find the optimal tour
Held and Karp (1962) presented the following dynamic programming formulation: Given a
subset of city indices (excluding the first city) S ⊂ {2, 3, . . . , n} and l ∈ S, let d∗ (S, l) denote
the length of the shortest path from city 1 to city l, visiting all cities in S in-between. For
S = {l}, d∗ (S, l) is defined as d1l . The shortest path for larger sets with |S| > 1 is

d∗ (S, l) = minm∈S\{l} d∗ (S \ {l}, m) + dml . (2)

Finally, the minimal tour length for a complete tour which includes returning to city 1 is

d∗∗ = minl∈{2,3,...,n} d∗ ({2, 3, . . . , n}, l) + dl1 . (3)

Using the last two equations, the quantities d∗ (S, l) can be computed recursively and the
minimal tour length d∗∗ can be found. In a second step, the optimal permutation π =
{1, i2 , i3 , . . . , in } of city indices 1 through n can be computed in reverse order, starting with
in and working successively back to i2 . The procedure exploits the fact that a permutation π
can only be optimal, if

d∗∗ = d∗ ({2, 3, . . . , n}, in ) + din 1 (4)

and, for 2 ≤ p ≤ n − 1,

d∗ ({i2 , i3 , . . . , ip , ip+1 }, ip+1 ) = d∗ ({i2 , i3 , . . . , ip }, ip ) + dip ip+1 . (5)

The space complexity of storing the values for all d∗ (S, l) is (n−1)2n−2 which severely restricts
the dynamic programming algorithm to TSP problems of small sizes. However, for very small
TSP instances this approach is fast and efficient.
Michael Hahsler, Kurt Hornik 5

A different method, which can deal with larger instances, uses a relaxation of the linear
programming problem presented in Section 2.1 and iteratively tightens the relaxation till a
solution is found. This general method for solving linear programming problems with complex
and large inequality systems is called cutting plane method and was introduced by Dantzig,
Fulkerson, and Johnson (1954).
Each iteration begins with using instead of the original linear inequality description S the
relaxation Ax ≤ b, where the polyhedron P defined by the relaxation contains S and is
bounded. The optimal solution x∗ of the relaxed problem can be obtained using standard
linear programming solvers. If the x∗ found belongs to S, the optimal solution of the original
problem is obtained, otherwise, a linear inequality can be found which is satisfied by all points
in S but violated by x∗ . Such an inequality is called a cutting plane or cut. A family of such
cutting planes can be added to the inequality system Ax ≤ b to obtain a tighter relaxation
for the next iteration.
If no further cutting planes can be found or the improvement in the objective function due
to adding cuts gets very small, the problem is branched into two sub-problems which can
be minimized separately. Branching is done iteratively which leads to a binary tree of sub-
problems. Each sub-problem is either solved without further branching or is found to be
irrelevant because its relaxed version already produces a longer path than a solution of another
sub-problem. This method is called branch-and-cut (Padberg and Rinaldi 1990) which is a
variation of the well known branch-and-bound (Land and Doig 1960) procedure.
The initial polyhedron P used by Dantzig et al. (1954) contains all vectors x for which all
xe ∈ x satisfy 0 ≤ xe ≤ 1 and in the resulting tour each city is linked to exactly two other
cities. Various separation algorithms for finding subsequent cuts to prevent sub-tours (sub-
tour elimination inequalities) and to ensure an integer solution (Gomory cuts; Gomory 1963)
were developed over time. The currently most efficient implementation of this method is
Concorde described in Applegate, Bixby, Chvátal, and Cook (2000).

2.4. Heuristics for the TSP

The NP-completeness of the TSP already makes it more time efficient for small-to-medium
size TSP instances to rely on heuristics in case a good but not necessarily optimal solution
is sufficient. TSP heuristics typically fall into two groups, tour construction heuristics which
create tours from scratch and tour improvement heuristics which use simple local search
heuristics to improve existing tours.
In the following we will only discuss heuristics available in TSP, for a comprehensive overview
of the multitude of TSP heuristics including an experimental comparison, we refer the reader
to the book chapter by Johnson and McGeoch (2002).

Tour construction heuristics

The implemented tour construction heuristics are the nearest neighbor algorithm and the
insertion algorithms.

Nearest neighbor algorithm. The nearest neighbor algorithm (Rosenkrantz, Stearns,

and Philip M. Lewis 1977) follows a very simple greedy procedure: The algorithm starts with
a tour containing a randomly chosen city and then always adds to the last city in the tour
6 Infrastructure for the TSP

the nearest not yet visited city. The algorithm stops when all cities are on the tour.
An extension to this algorithm is to repeat it with each city as the starting point and then
return the best tour found. This heuristic is called repetitive nearest neighbor.

Insertion algorithms. All insertion algorithms (Rosenkrantz et al. 1977) start with a tour
consisting of an arbitrary city and then choose in each step a city k not yet on the tour. This
city is inserted into the existing tour between two consecutive cities i and j, such that the
insertion cost (i.e., the increase in the tour’s length)

d(i, k) + d(k, j) − d(i, j)

is minimized. The algorithms stop when all cities are on the tour.
The insertion algorithms differ in the way the city to be inserted next is chosen. The following
variations are implemented:

Nearest insertion The city k is chosen in each step as the city which is nearest to a city on
the tour.

Farthest insertion The city k is chosen in each step as the city which is farthest from any
of the cities on the tour.

Cheapest insertion The city k is chosen in each step such that the cost of inserting the
new city is minimal.

Arbitrary insertion The city k is chosen randomly from all cities not yet on the tour.

The nearest and cheapest insertion algorithms correspond to the minimum spanning tree
algorithm by Prim (1957). Adding a city to a partial tour corresponds to adding an edge to a
partial spanning tree. For TSPs with distances obeying the triangular inequality, the equality
to minimum spanning trees provides a theoretical upper bound for the two algorithms of twice
the optimal tour length.
The idea behind the farthest insertion algorithm is to link cities far outside into the tour
first to establish an outline of the whole tour early. With this change, the algorithm cannot
be directly related to generating a minimum spanning tree and thus the upper bound stated
above cannot be guaranteed. However, it can was shown that the algorithm generates tours
which approach 2/3 times the optimal tour length (Johnson and Papadimitriou 1985b).

Tour improvement heuristics

Tour improvement heuristics are simple local search heuristics which try to improve an initial
tour. A comprehensive treatment of the topic can be found in the book chapter by Rego and
Glover (2002).

k-Opt heuristics. The idea is to define a neighborhood structure on the set of all admissible
tours. Typically, a tour t0 is a neighbor of another tour t if t0 can be obtained from t by
deleting k edges and replacing them by a set of different feasible edges (a k-Opt move). In
such a structure, the tour can iteratively be improved by always moving from one tour to its
Michael Hahsler, Kurt Hornik 7

dist matrix
as.matrix()
TSP() TSP()/ATSP()
as.dist() as.TSP()/as.ATSP()
as.TSP()

TSP/ATSP solve_TSP() TOUR

write_TSPLIB() read_TSPLIB() as.integer() TOUR()

cut_tour() as.TOUR()

integer (vector)
TSPLIB
file

Figure 1: An overview of the classes in TSP.

best neighbor till no further improvement is possible. The resulting tour represents a local
optimum which is called k-optimal.
Typically, 2-Opt (Croes 1958) and 3-Opt (Lin 1965) heuristics are used in practice.

Lin-Kernighan heuristic. This heuristic (Lin and Kernighan 1973) does not use a fixed
value for k for its k-Opt moves, but tries to find the best choice of k for each move. The
heuristic uses the fact that each k-Opt move can be represented as a sequence of 2-Opt
moves. It builds up a sequence of 2-Opt moves, checking after each additional move whether
a stopping rule is met. Then the part of the sequence which gives the best improvement is
used. This is equivalent to a choice of one k-Opt move with variable k. Such moves are used
till a local optimum is reached.
By using full backtracking, the optimal solution can always be found, but the running time
would be immense. Therefore, only limited backtracking is allowed in the procedure, which
helps to find better local optima or even the optimal solution. Further improvements to the
procedure are described by Lin and Kernighan (1973).

3. Computational infrastructure: the TSP package

In package TSP, a traveling salesperson problem is defined by an object of class TSP (sym-
metric) or ATSP (asymmetric). solve_TSP() is used to find a solution, which is represented
by an object of class TOUR. Figure 1 gives an overview of this infrastructure.
TSP objects can be created from a distance matrix (a dist object) or a symmetric matrix using
the creator function TSP() or coercion with as.TSP(). Similarly, ATSP objects are created
by ATSP() or as.ATSP() from square matrices representing the distances. In the creation
process, labels are taken and stored as city names in the object or can be explicitly given as
arguments to the creator functions. Several methods are defined for the classes:

print() displays basic information about the problem (number of cities and the distance
measure employed).
8 Infrastructure for the TSP

n_of_cities() returns the number of cities.

labels() returns the city names.

image() produces a shaded matrix plot of the distances between cities. The order of
the cities can be specified as the argument order.

Internally, an object of class TSP is a dist object with an additional class attribute and,
therefore, if needed, can be coerced to dist or to a matrix. An ATSP object is represented
as a square matrix. Obviously, asymmetric TSPs are more general than symmetric TSPs,
hence, symmetric TSPs can also be represented as asymmetric TSPs. To formulate an
asymmetric TSP as a symmetric TSP with double the number of cities (see Section 2.2),
reformulate_ATSP_as_TSP() is provided. This function creates the necessary dummy cities
and adapts the distance matrix accordingly.
A popular format to save TSP descriptions to disk which is supported by most TSP solvers is
the format used by TSPLIB, a library of sample instances of the TSP maintained by Reinelt
(2004). The TSP package provides read_TSPLIB() and write_TSPLIB() to read and save
symmetric and asymmetric TSPs in TSPLIB format.
Class TOUR represents a solution to a TSP by an integer permutation vector containing the
ordered indices and labels of the cities to visit. In addition, it stores an attribute indicating
the length of the tour. Again, suitable print() and labels() methods are provided. The
raw permutation vector (i.e., the order in which cities are visited) can be obtained from a
tour using as.integer(). With cut_tour(), a circular tour can be split at a specified city
resulting in a path represented by a vector of city indices.
The length of a tour can always be calculated using tour_length() and specifying a TSP
and a tour. Instead of the tour, an integer permutation vector calculated outside the TSP
package can be used as long as it has the correct length.
All TSP solvers in TSP can be used with the simple common interface:

solve_TSP(x, method, control)

where x is the TSP to be solved, method is a character string indicating the method used to
solve the TSP and control can contain a list with additional information used by the solver.
The available algorithms are shown in Table 1.
All algorithms except the Concorde TSP solver and the Chained Lin-Kernighan heuristic (a
Lin-Kernighan variation described in Applegate, Cook, and Rohe (2003)) are included in the
package and distributed under the GNU Public License (GPL). For the Concorde TSP solver
and the Chained Lin-Kernighan heuristic only a simple interface (using write_TSPLIB(),
calling the executable and reading back the resulting tour) is included in TSP. The executable
itself is part of the Concorde distribution, has to be installed separately and is governed by
a different license which allows only for academic use. The interfaces are included since
Concorde (Applegate et al. 2000; Applegate, Bixby, Chvátal, and Cook 2006) is currently one
of the best implementations for solving symmetric TSPs based on the branch-and-cut method
discussed in section 2.3. In May 2004, Concorde was used to find the optimal solution for
the TSP of visiting all 24,978 cities in Sweden. The computation was carried out on a cluster
with 96 Xeon 2.8 GHz nodes and took in total almost 100 CPU years.
Michael Hahsler, Kurt Hornik 9

Table 1: Available algorithms in TSP.

Algorithm Method argument Applicable to
Nearest neighbor algorithm "nn" TSP/ATSP
Repetitive nearest neighbor algorithm "repetitive_nn" TSP/ATSP
Nearest insertion "nearest_insertion" TSP/ATSP
Farthest insertion "farthest_insertion" TSP/ATSP
Cheapest insertion "cheapest_insertion" TSP/ATSP
Arbitrary insertion "arbitrary_insertion" TSP/ATSP
Concorde TSP solver "concorde" TSP
2-Opt improvement heuristic "2-opt" TSP/ATSP
Chained Lin-Kernighan "linkern" TSP

4. Examples
In this section we provide some examples for the use of package TSP. We start with a simple
example of how to use the interface of the TSP solver to compare different heuristics. Then we
show how to solve related tasks, using the Hamiltonian shortest path problem as an example.
Finally, we give an example of clustering using the TSP package. An additional application
can be found in package seriation (Hahsler, Buchta, and Hornik 2006) which uses the TSP
solvers from TSP to order (seriate) objects given a proximity matrix.

4.1. Comparing some heuristics

In the following example, we use several heuristics to find a short path in the USCA50 data
set which contains the distances between the first 50 cities in the USCA312 data set. The
USCA312 data set contains the distances between 312 cities in the USA and Canada coded
as a symmetric TSP. The smaller data set is used here, since some of the heuristic solvers
employed are rather slow.

> library("TSP")
> data("USCA50")
> USCA50

object of class ‘TSP’

50 cities (distance ‘euclidean’)

We calculate tours using different heuristics and store the results in the list tours. As an
example, we show the first tour which displays the method employed, the number of cities
involved and the tour length. All tour lengths are compared using the dot chart in Figure 2.
For the chart, we add a point for the optimal solution which has a tour length of 14497. The
optimal solution can be found using Concorde (method = "concorde"). It is omitted here,
since Concorde has to be installed separately.

> methods <- c("nearest_insertion", "farthest_insertion", "cheapest_insertion",

+ "arbitrary_insertion", "nn", "repetitive_nn", "2-opt")
> tours <- sapply(methods, FUN = function(m) solve_TSP(USCA50,
10 Infrastructure for the TSP

optimal ●

2−opt ●

repetitive_nn ●

nn ●

arbitrary_insertion ●

cheapest_insertion ●

farthest_insertion ●

nearest_insertion ●

0 5000 10000 15000 20000

tour length

Figure 2: Comparison of the tour lengths for the USCA50 data set.

+ method = m), simplify = FALSE)

> tours[[1]]

object of class ‘TOUR’

result of method ‘nearest_insertion’ for 50 cities
tour length: 17421

> dotchart(c(sapply(tours, FUN = attr, "tour_length"), optimal = 14497),

+ xlab = "tour length", xlim = c(0, 20000))

4.2. Finding the shortest Hamiltonian path

The problem of finding the shortest Hamiltonian path through a graph (i.e., a path which
visits each node in the graph exactly once) can be transformed into the TSP with cities and
distances representing the graphs vertices and edge weights, respectively (Garfinkel 1985).
Finding the shortest Hamiltonian path through all cities disregarding the endpoints can be
achieved by inserting a ‘dummy city’ which has a distance of zero to all other cities. The
position of this city in the final tour represents the cutting point for the path. In the following
we use a heuristic to find a short path in the USCA312 data set. Inserting dummy cities is
performed in TSP by insert_dummy().

> library("TSP")
> data("USCA312")
> tsp <- insert_dummy(USCA312, label = "cut")
> tsp

object of class ‘TSP’

313 cities (distance ‘euclidean’)

The TSP now contains an additional dummy city and we can try to solve this TSP.
Michael Hahsler, Kurt Hornik 11

> tour <- solve_TSP(tsp, method = "farthest_insertion")

> tour

object of class ‘TOUR’

result of method ‘farthest_insertion’ for 313 cities
tour length: 38184

Since the dummy city has distance zero to all other cities, the path length is equal to the tour
length reported above. The path starts with the first city in the list after the ‘dummy’ city
and ends with the city right before it. We use cut_tour() to create a path and show the first
and last 6 cities on it.

> path <- cut_tour(tour, "cut")

> head(labels(path))

[1] "Lihue, HI" "Honolulu, HI" "Hilo, HI"

[4] "San Francisco, CA" "Berkeley, CA" "Oakland, CA"

> tail(labels(path))

[1] "Anchorage, AK" "Fairbanks, AK" "Dawson, YT"

[4] "Whitehorse, YK" "Juneau, AK" "Prince Rupert, BC"

The tour found in the example results in a path from Lihue on Hawaii to Prince Rupert
in British Columbia. Such a tour can also be visualized using the packages sp, maps and
maptools (Pebesma and Bivand 2005).

> library("maps")
> library("sp")
> library("maptools")

Note: polygon geometry computations in maptools

depend on the package gpclib, which has a
restricted licence. It is disabled by default;
to enable gpclib, type gpclibPermit()

Checking rgeos availability as gpclib substitute:

FALSE

> data("USCA312_map")
> plot_path <- function(path) {
+ plot(as(USCA312_coords, "Spatial"), axes = TRUE)
+ plot(USCA312_basemap, add = TRUE, col = "gray")
+ points(USCA312_coords, pch = 3, cex = 0.4, col = "red")
+ path_line <- SpatialLines(list(Lines(list(Line(USCA312_coords[path,
+ ])), ID = "1")))
12 Infrastructure for the TSP

80°N
70°N
60°N

●
50°N
40°N
30°N

●
20°N

160°W 140°W 120°W 100°W 80°W 60°W

Figure 3: A “short” Hamiltonian path for the USCA312 dataset.

+ plot(path_line, add = TRUE, col = "black")

+ points(USCA312_coords[c(head(path, 1), tail(path, 1)),
+ ], pch = 19, col = "black")
+ }
> plot_path(path)

The map containing the path is presented in Figure 3. It has to be mentioned that the path
found by the used heuristic is considerable longer than the optimal path found by Concorde
with a length of 34928, illustrating the power of modern TSP algorithms.
For the following two examples, we indicate how the distance matrix between cities can
be modified to solve related shortest Hamiltonian path problems. These examples serve as
illustrations of how modifications can be made to transform different problems into a TSP.
The first problem is to find the shortest Hamiltonian path starting with a given city. In this
case, all distances to the selected city are set to zero, forcing the evaluation of all possible
paths starting with this city and disregarding the way back from the final city in the tour.
By modifying the distances the symmetric TSP is changed into an asymmetric TSP (ATSP)
since the distances between the starting city and all other cities are no longer symmetric.
As an example, we choose New York as the starting city. We transform the data set into
an ATSP and set the column corresponding to New York to zero before solving it. Thus,
the distance to return from the last city in the path to New York does not contribute to the
path length. We use the nearest neighbor heuristic to calculate an initial tour which is then
improved using 2-Opt moves and cut at New York to create a path.
Michael Hahsler, Kurt Hornik 13

> atsp <- as.ATSP(USCA312)

> ny <- which(labels(USCA312) == "New York, NY")
> atsp[, ny] <- 0
> initial_tour <- solve_TSP(atsp, method = "nn")
> initial_tour

object of class ‘TOUR’

result of method ‘nn’ for 312 cities
tour length: 49697

> tour <- solve_TSP(atsp, method = "2-opt", control = list(tour = initial_tour))

> tour

object of class ‘TOUR’

result of method ‘2-opt’ for 312 cities
tour length: 39445

> path <- cut_tour(tour, ny, exclude_cut = FALSE)

> head(labels(path))

[1] "New York, NY" "Jersey City, NJ" "Elizabeth, NJ" "Newark, NJ"
[5] "Paterson, NJ" "Binghamtom, NY"

> tail(labels(path))

[1] "Edmonton, AB" "Saskatoon, SK" "Moose Jaw, SK" "Regina, SK"
[5] "Minot, ND" "Brandon, MB"

> plot_path(path)

The found path is presented in Figure 4. It begins with New York and cities in New Jersey
and ends in a city in Manitoba, Canada.
Concorde and many advanced TSP solvers can only solve symmetric TSPs. To use these
solvers, we can formulate the ATSP as a TSP using reformulate_ATSP_as_TSP() which
introduces a dummy city for each city (see Section 2.2).

> tsp <- reformulate_ATSP_as_TSP(atsp)

> tsp

object of class ‘TSP’

624 cities (distance ‘unknown’)

After finding a tour for the TSP, the dummy cities are removed again giving the tour for the
original ATSP. Note that the tour needs to be reversed if the dummy cities appear before and
not after the original cities in the solution of the TSP. The following code is not executed
here, since it takes several minutes to execute and Concorde has to be installed separately.
Concorde finds the optimal solution with a length of 36091.
14 Infrastructure for the TSP

80°N
70°N
60°N
50°N

●
40°N

●
30°N
20°N

160°W 140°W 120°W 100°W 80°W 60°W

Figure 4: A Hamiltonian path for the USCA312 dataset starting in New York.

> tour <- solve_TSP(tsp, method = "concorde")

> tour <- as.TOUR(tour[tour <= n_of_cities(atsp)])

Finding the shortest Hamiltonian path which ends in a given city can be achieved likewise by
setting the row in the distance matrix which corresponds to this city to zero.
For finding the shortest Hamiltonian path we can also restrict both end points. This problem
can be transformed to a TSP by replacing the two cities by a single city which contains the
distances from the start point in the columns and the distances to the end point in the rows.
Obviously this is again an asymmetric TSP.
For the following example, we are only interested in paths starting in New York and ending
in Los Angeles. Therefore, we remove the two cities from the distance matrix, create an
asymmetric TSP and insert a dummy city called "LA/NY". The distances from this dummy
city are replaced by the distances from New York and the distances towards are replaced by
the distances towards Los Angeles.

> m <- as.matrix(USCA312)

> ny <- which(labels(USCA312) == "New York, NY")
> la <- which(labels(USCA312) == "Los Angeles, CA")
> atsp <- ATSP(m[-c(ny, la), -c(ny, la)])
> atsp <- insert_dummy(atsp, label = "LA/NY")
> la_ny <- which(labels(atsp) == "LA/NY")
> atsp[la_ny, ] <- c(m[-c(ny, la), ny], 0)
> atsp[, la_ny] <- c(m[la, -c(ny, la)], 0)
Michael Hahsler, Kurt Hornik 15

We use the nearest insertion heuristic.

> tour <- solve_TSP(atsp, method = "nearest_insertion")

> tour

object of class ‘TOUR’

result of method ‘nearest_insertion’ for 311 cities
tour length: 45029

> path_labels <- c("New York, NY", labels(cut_tour(tour, la_ny)),

+ "Los Angeles, CA")
> path_ids <- match(path_labels, labels(USCA312))
> head(path_labels)

[1] "New York, NY" "North Bay, ON" "Sudbury, ON"

[4] "Timmins, ON" "Sault Ste Marie, ON" "Thunder Bay, ON"

> tail(path_labels)

[1] "Eureka, CA" "Reno, NV" "Carson City, NV"

[4] "Stockton, CA" "Santa Barbara, CA" "Los Angeles, CA"

> plot_path(path_ids)

The path jumps from New York to cities in Ontario and it passes through cities in California
and Nevada before ending in Los Angeles. The path displayed in Figure 5 contains multiple
crossings which indicate that the solution is suboptimal. The optimal solution generated by
reformulating the problem as a TSP and using Concorde has only a tour length of 38489.

4.3. Rearrangement clustering

Solving a TSP to obtain a clustering was suggested several times in the literature (see, e.g.,
Lenstra 1974; Alpert and Kahng 1997; Johnson, Krishnan, Chhugani, Kumar, and Venkata-
subramanian 2004). The idea is that objects in clusters are visited in consecutive order and
from one cluster to the next larger “jumps” are necessary. Climer and Zhang (2006) call
this type of clustering rearrangement clustering and suggest to automatically find the cluster
boundaries of k clusters by adding k dummy cities which have constant distance c to all
other cities and are infinitely far from each other. In the optimal solution of the TSP, the
dummy cities must separate the most distant cities and thus represent optimal boundaries
for k clusters.
For the following example, we use the well known iris data set. Since we know that the dataset
contains three classes denoted by the variable Species, we insert three dummy cities into the
TSP for the iris data set and perform rearrangement clustering using the default method
(nearest insertion algorithm). Note that this algorithm does not find the optimal solution
and it is not guaranteed that the dummy cities will present the best cluster boundaries.
16 Infrastructure for the TSP

80°N
70°N
60°N
50°N
40°N

●
30°N
20°N

160°W 140°W 120°W 100°W 80°W 60°W

Figure 5: A Hamiltonian path for the USCA312 dataset starting in New York and ending in
Los Angles.

> data("iris")
> tsp <- TSP(dist(iris[-5]), labels = iris[, "Species"])
> tsp_dummy <- insert_dummy(tsp, n = 3, label = "boundary")
> tour <- solve_TSP(tsp_dummy)

Next, we plot the TSP’s permuted distance matrix using shading to represent distances.
The result is displayed as Figure 6. Lighter areas represent larger distances. The additional
red lines represent the positions of the dummy cities in the tour, which mark the cluster
boundaries obtained.

> image(tsp_dummy, tour, xlab = "objects", ylab = "objects")

> abline(h = which(labels(tour) == "boundary"), col = "red")
> abline(v = which(labels(tour) == "boundary"), col = "red")

One pair of red horizontal and vertical lines exactly separates the darker from lighter areas.
The second pair occurs inside the larger dark block. We can look at how well the partitioning
obtained fits the structure in the data given by the species field in the data set. Since we used
the species as the city labels in the TSP, the labels in the tour represent the partitioning with
the dummy cities named ‘boundary’ separating groups. The result can be summarized based
on the run length encoding of the obtained tour labels:

> out <- rle(labels(tour))

> data.frame(Species = out$values, Lenghts = out$lengths, Pos = cumsum(out$lengths))
Michael Hahsler, Kurt Hornik 17

140
120
100
objects

80
60
40
20

20 40 60 80 100 120 140

objects

Figure 6: Result of rearrangement clustering using three dummy cities and the nearest inser-
tion algorithm on the iris data set.

Species Lenghts Pos

1 boundary 1 1
2 virginica 7 8
3 boundary 1 9
4 virginica 18 27
5 versicolor 5 32
6 virginica 20 52
7 versicolor 1 53
8 virginica 3 56
9 versicolor 13 69
10 virginica 1 70
11 versicolor 13 83
12 virginica 1 84
13 versicolor 18 102
14 boundary 1 103
15 setosa 50 153

One boundary perfectly splits the iris data set into a group containing only examples of species
‘Setosa’ and a second group containing examples for ‘Virginica’ and ‘Versicolor’. However, the
second boundary only separates several examples of species ‘Virginica’ from other examples
of the same species. Even in the optimal tour found by Concorde, this problem occurs.
The reason why the rearrangement clustering fails to split the data into three groups is the
closeness between the groups ‘Virginica’ and ‘Versicolor’. To inspect this problem further, we
can project the data points on the first two principal components of the data set and add the
18 Infrastructure for the TSP

1.0
●

●
0.5

● ●

● ●
●●●●●
●● ●●
●●
0.0

●
PC2

●
●
● ● ●●
● ●
●●
●
●
●●
● ●
●● ●
−0.5

●●
●●
●
●
●
●
●
−1.0

●
●

−3 −2 −1 0 1 2 3 4

PC1

Figure 7: The 3 path segments representing a rearrangement clustering of the iris data set.
The data points are projected on the set’s first two principal components. The three species
are represented by different markers and colors.

path segments which resulted from solving the TSP.

> prc <- prcomp(iris[1:4])

> plot(prc$x, pch = as.numeric(iris[, 5]), col = as.numeric(iris[,
+ 5]))
> indices <- c(tour, tour[1])
> indices[indices > 150] <- NA
> lines(prc$x[indices, ])

The result in shown in Figure 7. The three species are identified by different markers and all
points connected by a single path represent a cluster found. Clearly, the two groups to the right
side of the plot are too close to be separated correctly by using just the distances between
individual points. This problem is similar to the chaining effect known from hierarchical
clustering using the single-linkage method.

5. Conclusion
In this paper we presented the R extension package TSP which implements an infrastructure
to handle and solve TSPs. The package introduces classes for problem descriptions (TSP and
ATSP) and for the solution (TOUR). Together with a simple interface for solving TSPs, it
allows for an easy and transparent usage of the package.
Michael Hahsler, Kurt Hornik 19

With the interface to Concorde, TSP also can use a state of the art implementation which
efficiently computes exact solutions using branch-and-cut.

Acknowledgments
The authors of this paper want to thank Roger Bivand for providing the code to correctly
draw tours and paths on a projected map.

References

Alpert CJ, Kahng AB (1997). “Splitting an Ordering into a Partititon to Minimize Diameter.”
Journal of Classification, 14(1), 51–74.
Applegate D, Bixby R, Chvátal V, Cook W (2006). Concorde TSP Solver. URL http:
//www.tsp.gatech.edu/concorde/.
Applegate D, Bixby RE, Chvátal V, Cook W (2000). “TSP Cuts Which Do Not Conform to
the Template Paradigm.” In M Junger, D Naddef (eds.), “Computational Combinatorial
Optimization, Optimal or Provably Near-Optimal Solutions,” volume 2241 of Lecture Notes
In Computer Science, pp. 261–304. Springer-Verlag, London, UK.
Applegate D, Cook W, Rohe A (2003). “Chained Lin-Kernighan for Large Traveling Salesman
Problems.” INFORMS Journal on Computing, 15(1), 82–92.
Climer S, Zhang W (2006). “Rearrangement Clustering: Pitfalls, Remedies, and Applications.”
Journal of Machine Learning Research, 7, 919–943.
Croes GA (1958). “A Method for Solving Traveling-Salesman Problems.” Operations Research,
6(6), 791–812.
Dantzig G, Fulkerson D, Johnson S (1954). “Solution of a Large-scale Traveling Salesman
Problem.” Operations Research, 2, 393–410.
Garfinkel R (1985). “Motivation and Modeling.” In Lawler et al. (1985), chapter 2, pp. 17–36.
Gomory R (1963). “An algorithm for integer solutions to linear programs.” In R Graves,
P Wolfe (eds.), “Recent Advances in Mathematical Programming,” pp. 269–302. McGraw-
Hill, New York.
Gutin G, Punnen A (eds.) (2002). The Traveling Salesman Problem and Its Variations,
volume 12 of Combinatorial Optimization. Kluwer, Dordrecht.
Hahsler M, Buchta C, Hornik K (2006). seriation: Infrastructure for seriation. R package
version 0.1-1.
Hahsler M, Hornik K (2007). “TSP – Infrastructure for the Traveling Salesperson Problem.”
Journal of Statistical Software, 23(2), 1–21. ISSN 1548-7660.
Held M, Karp R (1962). “A Dynamic Programming Approach to Sequencing Problems.”
Journal of SIAM, 10, 196–210.
20 Infrastructure for the TSP

Hoffman A, Wolfe P (1985). “History.” In Lawler et al. (1985), chapter 1, pp. 1–16.

Hubert LJ, Baker FB (1978). “Applications of Combinatorial Programming to Data Analysis:

The Traveling Salesman and Related Problems.” Psychometrika, 43(1), 81–91.

Johnson D, Krishnan S, Chhugani J, Kumar S, Venkatasubramanian S (2004). “Compressing

Large Boolean Matrices Using Reordering Techniques.” In “Proceedings of the 30th VLDB
Conference,” pp. 13–23.

Johnson D, McGeoch L (2002). “Experimental Analysis of Heuristics for the STSP.” In Gutin
and Punnen (2002), chapter 9, pp. 369–444.

Johnson D, Papadimitriou C (1985a). “Computational complexity.” In Lawler et al. (1985),

chapter 3, pp. 37–86.

Johnson D, Papadimitriou C (1985b). “Performance guarantees for heuristics.” In Lawler

et al. (1985), chapter 5, pp. 145–180.

Johnson O, Liu J (2006). “A traveling salesman approach for predicting protein functions.”
Source Code for Biology and Medicine, 1(3), 1–7.

Jonker R, Volgenant T (1983). “Transforming asymmetric into symmetric traveling salesman

problems.” Operations Research Letters, 2, 161–163.

Land A, Doig A (1960). “An Automatic Method for Solving Discrete Programming Problems.”
Econometrica, 28, 497–520.

Lawler EL, Lenstra JK, Rinnooy Kan AHG, Shmoys DB (eds.) (1985). The Traveling Sales-
man Problem. Wiley, New York.

Lenstra J, Kan AR (1975). “Some simple applications of the travelling salesman problem.”
Operational Research Quarterly, 26(4), 717–733.

Lenstra JK (1974). “Clustering a Data Array and the Traveling-Salesman Problem.” Opera-
tions Research, 22(2), 413–414.

Lin S (1965). “Computer solutions of the traveling-salesman problem.” Bell System Technology
Journal, 44, 2245–2269.

Lin S, Kernighan B (1973). “An effective heuristic algorithm for the traveling-salesman prob-
lem.” Operations Research, 21(2), 498–516.

Padberg M, Rinaldi G (1990). “Facet identification for the symmetric traveling salesman
polytope.” Mathematical Programming, 47(2), 219–257. ISSN 0025-5610.

Pebesma EJ, Bivand RS (2005). “Classes and methods for spatial data in R.” R News, 5(2),
9–13. URL http://CRAN.R-project.org/doc/Rnews/.

Prim R (1957). “Shortest connection networks and some generalisations.” Bell System Tech-
nical Journal, 36, 1389–1401.

Punnen A (2002). “The Traveling Salesman Problem: Applications, Formulations and Varia-
tions.” In Gutin and Punnen (2002), chapter 1, pp. 1–28.
Michael Hahsler, Kurt Hornik 21

Ray SS, Bandyopadhyay S, Pal SK (2007). “Gene Ordering in Partitive Clustering using
Microarray Expressions.” Journal of Biosciences, 32(5), 1019–1025.

Rego C, Glover F (2002). “Local Search and Metaheuristics.” In Gutin and Punnen (2002),
chapter 8, pp. 309–368.

Reinelt G (2004). TSPLIB. Universität Heidelberg, Institut für Informatik, Im Neuen-

heimer Feld 368,D-69120 Heidelberg, Germany. URL http://www.iwr.uni-heidelberg.
de/groups/comopt/software/TSPLIB95/.

Rosenkrantz DJ, Stearns RE, Philip M Lewis I (1977). “An Analysis of Several Heuristics for
the Traveling Salesman Problem.” SIAM Journal on Computing, 6(3), 563–581.

Affiliation:
Michael Hahsler
Computer Science and Engineering
Lyle School of Engineering
Southern Methodist University
P.O. Box 750122
Dallas, TX 75275-0122
E-mail: [email protected]

Kurt Hornik
Department of Finance, Accounting and Statistics
Wirtschaftsuniversität Wien
1090 Wien, Austria
E-mail: [email protected]
URL: http://statmath.wu.ac.at/~hornik/

Operations Research
100% (3)
Operations Research
293 pages
Balas1999 New Classes of Efficiently Solvable Generalized TSP
No ratings yet
Balas1999 New Classes of Efficiently Solvable Generalized TSP
30 pages
Toolbox - Global Optimization PDF
No ratings yet
Toolbox - Global Optimization PDF
724 pages
Greco F. (Ed.) - Travelling Salesman Problem (2008)
No ratings yet
Greco F. (Ed.) - Travelling Salesman Problem (2008)
210 pages
Bmi 401-Bmsda 403-Design and Analysis of Algorithms - Lec 5
No ratings yet
Bmi 401-Bmsda 403-Design and Analysis of Algorithms - Lec 5
13 pages
1 s2.0 S0377221723005581 Main
No ratings yet
1 s2.0 S0377221723005581 Main
17 pages
M2 Full
No ratings yet
M2 Full
59 pages
Travelling Sales Person Final Report
No ratings yet
Travelling Sales Person Final Report
42 pages
4.4 Travelling Salesman Problem
No ratings yet
4.4 Travelling Salesman Problem
14 pages
The Travelling Salesman Problem Introduc
No ratings yet
The Travelling Salesman Problem Introduc
19 pages
INDG1051 - Unit 4 - Part I
No ratings yet
INDG1051 - Unit 4 - Part I
41 pages
Traveling Salesman Problem
No ratings yet
Traveling Salesman Problem
19 pages
Traveling Salesman Problem: An Overview of Applications, Formulations, and Solution Approaches
No ratings yet
Traveling Salesman Problem: An Overview of Applications, Formulations, and Solution Approaches
26 pages
Literature Review On Travelling Salesman Problem
100% (1)
Literature Review On Travelling Salesman Problem
8 pages
Örnek 1
No ratings yet
Örnek 1
17 pages
MAT 392 FINAL Pawley-Traveling Salesman
No ratings yet
MAT 392 FINAL Pawley-Traveling Salesman
18 pages
Generating Subtour Elimination Constraints For The PDF
No ratings yet
Generating Subtour Elimination Constraints For The PDF
43 pages
The Travelling Salesman Problem Applications and Solvers
No ratings yet
The Travelling Salesman Problem Applications and Solvers
9 pages
Ijgi 07 00115 PDF
No ratings yet
Ijgi 07 00115 PDF
16 pages
TSP
No ratings yet
TSP
22 pages
A Comparison of Exact and Heuristic Algorithms To Solve The Travelling Salesman Problem
No ratings yet
A Comparison of Exact and Heuristic Algorithms To Solve The Travelling Salesman Problem
39 pages
Solving The Travelling Salesman Problem With The Excel
No ratings yet
Solving The Travelling Salesman Problem With The Excel
8 pages
01 Formulations For The TSP With AMPL
No ratings yet
01 Formulations For The TSP With AMPL
21 pages
OR LPM2 CHP 2 New
No ratings yet
OR LPM2 CHP 2 New
85 pages
October 10, 2022 1 / 100
No ratings yet
October 10, 2022 1 / 100
105 pages
Judul 2
No ratings yet
Judul 2
19 pages
Travelling Salesman
No ratings yet
Travelling Salesman
13 pages
TSP Report
No ratings yet
TSP Report
5 pages
The Traveling Salesman Problem: Irina Bryan April 18, 2009
No ratings yet
The Traveling Salesman Problem: Irina Bryan April 18, 2009
17 pages
Lectures on the Coupling Method
From Everand
Lectures on the Coupling Method
Torgny Lindvall
No ratings yet
Network Models: Multiple Choice
100% (1)
Network Models: Multiple Choice
12 pages
The Traveling Salesman Problem: A Neural Network Perspective
No ratings yet
The Traveling Salesman Problem: A Neural Network Perspective
60 pages
Ant Calony Paper 2008
No ratings yet
Ant Calony Paper 2008
6 pages
TravellingSalesmanProblem PDF
No ratings yet
TravellingSalesmanProblem PDF
212 pages
Laporte - TSP Review PDF
No ratings yet
Laporte - TSP Review PDF
17 pages
Optimization Techniques and Applications
No ratings yet
Optimization Techniques and Applications
10 pages
Multiple Integrals, A Collection of Solved Problems
From Everand
Multiple Integrals, A Collection of Solved Problems
Steven Tan
No ratings yet
Module 8 - Linear Inequalities & LPP - CET
No ratings yet
Module 8 - Linear Inequalities & LPP - CET
20 pages
HW 4 Sol
No ratings yet
HW 4 Sol
21 pages
Methods For Non-Linear Least Squares Problems-2nd
No ratings yet
Methods For Non-Linear Least Squares Problems-2nd
58 pages
Literature Review On Travelling Salesman
No ratings yet
Literature Review On Travelling Salesman
4 pages
Final Report - Solving Traveling Salesman Problem by Dynamic Programming Approach in Java Program Aditya Nugroho Ht083276e
100% (5)
Final Report - Solving Traveling Salesman Problem by Dynamic Programming Approach in Java Program Aditya Nugroho Ht083276e
15 pages
Bachelorthesis Isabel Droste
No ratings yet
Bachelorthesis Isabel Droste
52 pages
A Decision Tree Application For The Development of A Novel Approach To The Traveling Salesman Issue
No ratings yet
A Decision Tree Application For The Development of A Novel Approach To The Traveling Salesman Issue
5 pages
TSP Repport
No ratings yet
TSP Repport
7 pages
Journal of Statistical Software: TSP - Infrastructure For The Traveling Salesperson Problem
No ratings yet
Journal of Statistical Software: TSP - Infrastructure For The Traveling Salesperson Problem
21 pages
An To An A That It It An: I. (L, The
No ratings yet
An To An A That It It An: I. (L, The
10 pages
Tool-Path Optimization For Minimizing Airtime During Machining
No ratings yet
Tool-Path Optimization For Minimizing Airtime During Machining
7 pages
Analyzing The Performance of Mutation Operators To Solve The Travelling Salesman Problem
No ratings yet
Analyzing The Performance of Mutation Operators To Solve The Travelling Salesman Problem
18 pages
The Maximal Flow Problem
No ratings yet
The Maximal Flow Problem
12 pages
A Survey Paper On Solving Travelling Salesman Problem Using Bee Colony Optimization
No ratings yet
A Survey Paper On Solving Travelling Salesman Problem Using Bee Colony Optimization
6 pages
Operations Research
No ratings yet
Operations Research
11 pages
Engineering Optimization Theory and Practice Fourth Edition Singiresu S. Rao PDF Download
No ratings yet
Engineering Optimization Theory and Practice Fourth Edition Singiresu S. Rao PDF Download
45 pages
P V Reddy's JNTU Conf Paper
No ratings yet
P V Reddy's JNTU Conf Paper
14 pages
Solving The Traveling Salesman Problem B PDF
No ratings yet
Solving The Traveling Salesman Problem B PDF
10 pages
Linear Programming: (Graphical Method)
No ratings yet
Linear Programming: (Graphical Method)
10 pages
Traveling Salesman Problem
No ratings yet
Traveling Salesman Problem
20 pages
Dynamic Programming Treatment of The Travelling Salesman Problem
No ratings yet
Dynamic Programming Treatment of The Travelling Salesman Problem
4 pages
Unit4-MaximumPrinciple Important
No ratings yet
Unit4-MaximumPrinciple Important
18 pages
A Survey Review On Solving Algorithms For Travelling Salesman Problem (TSP)
No ratings yet
A Survey Review On Solving Algorithms For Travelling Salesman Problem (TSP)
4 pages
Research Paper of Minor
No ratings yet
Research Paper of Minor
5 pages
Mail - Iiitdmj.ac - in Squirrelmail SRC Webmail New
No ratings yet
Mail - Iiitdmj.ac - in Squirrelmail SRC Webmail New
17 pages
Formulations For The TSP With AMPL
No ratings yet
Formulations For The TSP With AMPL
21 pages
TSP Survey
No ratings yet
TSP Survey
24 pages
Assignment#1
No ratings yet
Assignment#1
6 pages
Answer The Following Questions: Q1: Choose The Correct Answer (20 Points)
No ratings yet
Answer The Following Questions: Q1: Choose The Correct Answer (20 Points)
13 pages
Travelling Salesman and Distribution Problems: Ik Ij JK
No ratings yet
Travelling Salesman and Distribution Problems: Ik Ij JK
11 pages
TSP Hoffman Padberg Rinaldi
No ratings yet
TSP Hoffman Padberg Rinaldi
9 pages
Travelling Salesman Problem Mathematical Description: December 2016
No ratings yet
Travelling Salesman Problem Mathematical Description: December 2016
7 pages
Homework Topic 5 PDF
No ratings yet
Homework Topic 5 PDF
7 pages
Traveling Salesman Problem
No ratings yet
Traveling Salesman Problem
6 pages
Traveling Salesman Problem
No ratings yet
Traveling Salesman Problem
11 pages
Effects of A PID Controller in Closed Loop Feedback System
No ratings yet
Effects of A PID Controller in Closed Loop Feedback System
4 pages
Travelling Salesman Problem Mathematical Description
No ratings yet
Travelling Salesman Problem Mathematical Description
6 pages
TSP Indin08
No ratings yet
TSP Indin08
7 pages
Travelling Salesman Problem
No ratings yet
Travelling Salesman Problem
2 pages
Solution Quiz2 and Mid-Semester
No ratings yet
Solution Quiz2 and Mid-Semester
17 pages
15.093: Optimization Methods
No ratings yet
15.093: Optimization Methods
8 pages
Interior Point Methods: ME575 - Optimization Methods John Hedengren
No ratings yet
Interior Point Methods: ME575 - Optimization Methods John Hedengren
26 pages
Ee 37
No ratings yet
Ee 37
6 pages
A Primal-Dual Method For Solving Linear PDF
No ratings yet
A Primal-Dual Method For Solving Linear PDF
22 pages
Synopsis Of: Algorithm Analysis and Design CSE 408
No ratings yet
Synopsis Of: Algorithm Analysis and Design CSE 408
3 pages
SADCAS AP 05 - Identification and Management of Nonconformities (Issue 4)
No ratings yet
SADCAS AP 05 - Identification and Management of Nonconformities (Issue 4)
7 pages
MBA 2nd Sem
No ratings yet
MBA 2nd Sem
3 pages
Dual Simplex Method
No ratings yet
Dual Simplex Method
10 pages
Travelling Salesman Probelm
No ratings yet
Travelling Salesman Probelm
3 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Denovo: Bùi Nguyễn Quang Minh - IELSIU18081
No ratings yet
Denovo: Bùi Nguyễn Quang Minh - IELSIU18081
4 pages
Indian Institute of Management Udaipur: Operations Research: Quiz - 1
No ratings yet
Indian Institute of Management Udaipur: Operations Research: Quiz - 1
3 pages
Operations Research
No ratings yet
Operations Research
1 page

TSP - Infrastructure For The Traveling Salesperson Problem: Michael Hahsler Kurt Hornik

Uploaded by

TSP - Infrastructure For The Traveling Salesperson Problem: Michael Hahsler Kurt Hornik

Uploaded by

TSP – Infrastructure for the Traveling Salesperson

Michael Hahsler Kurt Hornik

Keywords: combinatorial optimization, traveling salesman problem, R.

2.1. Different formulations of the TSP

2.2. Useful manipulations of the distance matrix

2.3. Finding exact solutions for the TSP

d∗∗ = d∗ ({2, 3, . . . , n}, in ) + din 1 (4)

d∗ ({i2 , i3 , . . . , ip , ip+1 }, ip+1 ) = d∗ ({i2 , i3 , . . . , ip }, ip ) + dip ip+1 . (5)

2.4. Heuristics for the TSP

Tour construction heuristics

Nearest neighbor algorithm. The nearest neighbor algorithm (Rosenkrantz, Stearns,

d(i, k) + d(k, j) − d(i, j)

Tour improvement heuristics

TSP/ATSP solve_TSP() TOUR

write_TSPLIB() read_TSPLIB() as.integer() TOUR()

Figure 1: An overview of the classes in TSP.

3. Computational infrastructure: the TSP package

 n_of_cities() returns the number of cities.

 labels() returns the city names.

solve_TSP(x, method, control)

Table 1: Available algorithms in TSP.

4.1. Comparing some heuristics

object of class ‘TSP’

> methods <- c("nearest_insertion", "farthest_insertion", "cheapest_insertion",

0 5000 10000 15000 20000

+ method = m), simplify = FALSE)

object of class ‘TOUR’

> dotchart(c(sapply(tours, FUN = attr, "tour_length"), optimal = 14497),

4.2. Finding the shortest Hamiltonian path

object of class ‘TSP’

> tour <- solve_TSP(tsp, method = "farthest_insertion")

object of class ‘TOUR’

> path <- cut_tour(tour, "cut")

[1] "Lihue, HI" "Honolulu, HI" "Hilo, HI"

[1] "Anchorage, AK" "Fairbanks, AK" "Dawson, YT"

Note: polygon geometry computations in maptools

Checking rgeos availability as gpclib substitute:

160°W 140°W 120°W 100°W 80°W 60°W

Figure 3: A “short” Hamiltonian path for the USCA312 dataset.

+ plot(path_line, add = TRUE, col = "black")

> atsp <- as.ATSP(USCA312)

object of class ‘TOUR’

> tour <- solve_TSP(atsp, method = "2-opt", control = list(tour = initial_tour))

object of class ‘TOUR’

> path <- cut_tour(tour, ny, exclude_cut = FALSE)

> tsp <- reformulate_ATSP_as_TSP(atsp)

object of class ‘TSP’

160°W 140°W 120°W 100°W 80°W 60°W

> tour <- solve_TSP(tsp, method = "concorde")

> m <- as.matrix(USCA312)

We use the nearest insertion heuristic.

> tour <- solve_TSP(atsp, method = "nearest_insertion")

object of class ‘TOUR’

> path_labels <- c("New York, NY", labels(cut_tour(tour, la_ny)),

[1] "New York, NY" "North Bay, ON" "Sudbury, ON"

[1] "Eureka, CA" "Reno, NV" "Carson City, NV"

4.3. Rearrangement clustering

160°W 140°W 120°W 100°W 80°W 60°W

> image(tsp_dummy, tour, xlab = "objects", ylab = "objects")

> out <- rle(labels(tour))

20 40 60 80 100 120 140

Species Lenghts Pos

path segments which resulted from solving the TSP.

> prc <- prcomp(iris[1:4])

Hubert LJ, Baker FB (1978). “Applications of Combinatorial Programming to Data Analysis:

Johnson D, Krishnan S, Chhugani J, Kumar S, Venkatasubramanian S (2004). “Compressing

Johnson D, Papadimitriou C (1985a). “Computational complexity.” In Lawler et al. (1985),

Johnson D, Papadimitriou C (1985b). “Performance guarantees for heuristics.” In Lawler

Jonker R, Volgenant T (1983). “Transforming asymmetric into symmetric traveling salesman

Reinelt G (2004). TSPLIB. Universität Heidelberg, Institut für Informatik, Im Neuen-

You might also like

n_of_cities() returns the number of cities.

labels() returns the city names.