The OPTGRAPH Procedure

Traveling Salesman Problem

The traveling salesman problem (TSP) finds a minimum-cost tour in a graph G that has a node set, N, and link set, A. A path in a graph is a sequence of nodes, each of which has a link to the next node in the sequence. An elementary cycle is a path in which the start node and end node are the same and no node appears more than once in the sequence. A Hamiltonian cycle (or tour) is an elementary cycle that visits every node. The goal is then to find a Hamiltonian cycle of minimum total cost, where the total cost is the sum of the costs of the links in the tour. Associated with each link $(i,j) \in A$ are a binary variable $x_{ij}$, which indicates whether link $x_{ij}$ is part of the tour, and a cost $c_{ij}$. Let $\delta (S) = \{ (i,j) \in A \; |\;  i \in S, \;  j \notin S\} $. Then an integer linear programming formulation of the TSP (for an undirected graph G), is as follows:

\[  \begin{array}{llllll} \text {minimize} &  \displaystyle \sum _{(i,j) \in A} c_{ij} x_{ij} \\ \text {subject to} &  \displaystyle \sum _{(i,j) \in \delta ({i})} x_{i,j} &  = &  2 &  i \in N &  (\mr{two\_ match}) \\ &  \displaystyle \sum _{(i,j) \in \delta (S)}x_{ij} &  \geq &  2 &  S \subset N, \;  2 \leq |S| \leq |N| - 1 &  (\mr{subtour\_ elim})\\ &  x_{ij} \in \{ 0,1\}  & & &  (i,j) \in A \end{array}  \]

The equations (two_match) are the matching constraints, which ensure that each node has degree two in the subgraph. The inequalities (subtour_elim) are the subtour elimination constraints (SECs), which enforce connectivity.

For a directed graph, G, the same formulation and solution approach is used on an expanded graph $G’$, as described in Kumar and Li (1994). PROC OPTGRAPH takes care of the construction of the expanded graph and returns the solution in terms of the original input graph.

In practical terms, you can think of the TSP in the context of a routing problem in which each node is a city and the links are roads that connect cities. Given the pairwise distances between each city, the goal is to find the shortest possible route that visits each city exactly once. The TSP has applications in planning, logistics, manufacturing, genomics, and many other areas.

In PROC OPTGRAPH, you can invoke the traveling salesman problem solver by using the TSP statement. The options for this statement are described in the section TSP Statement.

The traveling salesman problem solver reports status information in a macro variable called _OPTGRAPH_TSP_. See the section Macro Variable _OPTGRAPH_TSP_ for more information about this macro variable.

The algorithm that PROC OPTGRAPH uses for solving a TSP is based on a variant of the branch-and-cut process described in Applegate et al. (2006).

The resulting tour is represented in two ways: In the data set that is specified in the OUT_NODES= option in the PROC OPTGRAPH statement, the tour is specified as a sequence of nodes. In the data set that is specified in the OUT= option of the TSP statement, the tour is specified as a list of links in the optimal tour.

Traveling Salesman Problem of a Simple Undirected Graph

As a simple example, consider the weighted undirected graph in Figure 1.125.

Figure 1.125: A Simple Undirected Graph

A Simple Undirected Graph


The links data set can be represented as follows:

data LinkSetIn;
   input from $ to $ weight @@;
   datalines;
A B 1.0 A C 1.0 A D 1.5 B C 2.0 B D 4.0
B E 3.0 C D 3.0 C F 3.0 C H 4.0 D E 1.5
D F 3.0 D G 4.0 E F 1.0 E G 1.0 F G 2.0
F H 4.0 H I 3.0 I J 1.0 C J 5.0 F J 3.0
F I 1.0 H J 1.0
;

The following statements calculate an optimal traveling salesman tour and output the results in the data sets TSPTour and NodeSetOut:

proc optgraph
   loglevel   = moderate
   data_links = LinkSetIn
   out_nodes  = NodeSetOut;
   tsp
      out     = TSPTour;
run;
%put &_OPTGRAPH_;
%put &_OPTGRAPH_TSP_;

The progress of the procedure is shown in Figure 1.126.

Figure 1.126: PROC OPTGRAPH Log: Optimal Traveling Salesman Tour of a Simple Undirected Graph

NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Running OPTGRAPH version 14.1.                                                            
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: The OPTGRAPH procedure is executing in single-machine mode.                               
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Reading the links data set.                                                               
NOTE: There were 22 observations read from the data set WORK.LINKSETIN.                         
NOTE: Data input used 0.01 (cpu: 0.00) seconds.                                                 
NOTE: Building the input graph storage used 0.00 (cpu: 0.00) seconds.                           
NOTE: The input graph storage is using 0.3 MBs of memory.                                       
NOTE: The number of nodes in the input graph is 10.                                             
NOTE: The number of links in the input graph is 22.                                             
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Processing the traveling salesman problem.                                                
NOTE: The initial TSP heuristics found a tour with cost 16 using 0.00 (cpu: 0.00) seconds.      
NOTE: The MILP presolver value NONE is applied.                                                 
NOTE: The MILP solver is called.                                                                
          Node  Active    Sols    BestInteger      BestBound      Gap    Time                   
             0       1       1     16.0000000     15.5005000    3.22%       0                   
             0       0       1     16.0000000     16.0000000    0.00%       0                   
NOTE: Objective = 16.                                                                           
NOTE: Processing the traveling salesman problem used 0.07 (cpu: 0.00) seconds.                  
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Creating nodes data set output.                                                           
NOTE: Creating traveling salesman data set output.                                              
NOTE: Data output used 0.00 (cpu: 0.02) seconds.                                                
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: The data set WORK.NODESETOUT has 10 observations and 2 variables.                         
NOTE: The data set WORK.TSPTOUR has 10 observations and 3 variables.                            
STATUS=OK  TSP=OPTIMAL                                                                          
STATUS=OPTIMAL  OBJECTIVE=16  RELATIVE_GAP=0  ABSOLUTE_GAP=0  PRIMAL_INFEASIBILITY=0            
BOUND_INFEASIBILITY=0  INTEGER_INFEASIBILITY=0  BEST_BOUND=16  NODES=1  ITERATIONS=17           
CPU_TIME=0.00  REAL_TIME=0.07                                                                   



The data set NodeSetOut now contains a sequence of nodes in the optimal tour and is shown in Figure 1.127.

Figure 1.127: Nodes in the Optimal Traveling Salesman Tour

node tsp_order
A 1
B 2
C 3
H 4
J 5
I 6
F 7
G 8
E 9
D 10



The data set TSPTour now contains the links in the optimal tour and is shown in Figure 1.128.

Figure 1.128: Links in the Optimal Traveling Salesman Tour

from to weight
A B 1.0
B C 2.0
C H 4.0
H J 1.0
I J 1.0
F I 1.0
F G 2.0
E G 1.0
D E 1.5
A D 1.5
    16.0



The minimum-cost links are shown in green in Figure 1.129.

Figure 1.129: Optimal Traveling Salesman Tour

Optimal Traveling Salesman Tour


Traveling Salesman Problem of a Simple Directed Graph

As another simple example, consider the weighted directed graph in Figure 1.130.

Figure 1.130: A Simple Directed Graph

A Simple Directed Graph


The links data set can be represented as follows:

data LinkSetIn;
   input from $ to $ weight @@;
   datalines;
A B 2.0 A C 1.0 A E 4.0 B A 1.0 B C 2.0 
B D 1.0 B E 1.0 C B 2.0 C D 3.0 D A 1.0
D C 1.0 D E 2.0 E A 2.0 E D 1.0 
;

The following statements calculate an optimal traveling salesman tour (on a directed graph) and output the results in the data sets TSPTour and NodeSetOut:

proc optgraph
   direction  = directed
   loglevel   = moderate
   data_links = LinkSetIn
   out_nodes  = NodeSetOut;
   tsp
      out     = TSPTour;
run;
%put &_OPTGRAPH_;
%put &_OPTGRAPH_TSP_;

The progress of the procedure is shown in Figure 1.131.

Figure 1.131: PROC OPTGRAPH Log: Optimal Traveling Salesman Tour of a Simple Directed Graph

NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Running OPTGRAPH version 14.1.                                                            
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: The OPTGRAPH procedure is executing in single-machine mode.                               
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Reading the links data set.                                                               
NOTE: There were 14 observations read from the data set WORK.LINKSETIN.                         
NOTE: Data input used 0.01 (cpu: 0.00) seconds.                                                 
NOTE: Building the input graph storage used 0.00 (cpu: 0.00) seconds.                           
NOTE: The input graph storage is using 0.3 MBs of memory.                                       
NOTE: The number of nodes in the input graph is 5.                                              
NOTE: The number of links in the input graph is 14.                                             
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: The TSP solver is starting using an augmented symmetric graph with 10 nodes and 19 links. 
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Processing the traveling salesman problem.                                                
NOTE: The initial TSP heuristics found a tour with cost 6 using 0.00 (cpu: 0.02) seconds.       
NOTE: The MILP presolver value NONE is applied.                                                 
NOTE: The MILP solver is called.                                                                
          Node  Active    Sols    BestInteger      BestBound      Gap    Time                   
             0       1       1      6.0000000      5.9001000    1.69%       0                   
             0       0       1      6.0000000      6.0000000    0.00%       0                   
NOTE: Objective = 6.                                                                            
NOTE: Processing the traveling salesman problem used 0.00 (cpu: 0.02) seconds.                  
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: Creating nodes data set output.                                                           
NOTE: Creating traveling salesman data set output.                                              
NOTE: Data output used 0.00 (cpu: 0.00) seconds.                                                
NOTE: ------------------------------------------------------------------------------------------
NOTE: ------------------------------------------------------------------------------------------
NOTE: The data set WORK.NODESETOUT has 5 observations and 2 variables.                          
NOTE: The data set WORK.TSPTOUR has 5 observations and 3 variables.                             
STATUS=OK  TSP=OPTIMAL                                                                          
STATUS=OPTIMAL  OBJECTIVE=6  RELATIVE_GAP=2.96E-16  ABSOLUTE_GAP=-1.77636E-15                   
PRIMAL_INFEASIBILITY=0  BOUND_INFEASIBILITY=0  INTEGER_INFEASIBILITY=0  BEST_BOUND=6  NODES=1   
ITERATIONS=8  CPU_TIME=0.02  REAL_TIME=0.00                                                     



The data set NodeSetOut now contains a sequence of nodes in the optimal tour and is shown in Figure 1.132.

Figure 1.132: Nodes in the Optimal Traveling Salesman Tour

node tsp_order
A 1
C 2
B 3
E 4
D 5



The data set TSPTour now contains the links in the optimal tour and is shown in Figure 1.133.

Figure 1.133: Links in the Optimal Traveling Salesman Tour

from to weight
A C 1
C B 2
B E 1
E D 1
D A 1
    6



The minimum-cost links are shown in green in Figure 1.134.

Figure 1.134: Optimal Traveling Salesman Tour

Optimal Traveling Salesman Tour