The Network Solver (Experimental)

Shortest Path

A shortest path between two nodes $u$ and $v$ in a graph is a path that starts at $u$ and ends at $v$ and has the lowest total link weight. The starting node is called the source node, and the ending node is called the sink node.

In the network solver, shortest paths can be calculated by invoking the SHORTPATH= option.

By default, the network solver finds shortest paths for all pairs. That is, it finds a shortest path for each possible combination of source and sink nodes. Alternatively, you can use the SOURCE= suboption to fix a particular source node and find shortest paths from the fixed source node to all possible sink nodes. Conversely, by using the SINK= suboption, you can fix a sink node and find shortest paths from all possible source nodes to the fixed sink node. Using both suboptions together, you can request one particular shortest path for a specific source-sink pair. In addition, you can use the SOURCE= and SINK= suboptions to define a list of source-sink pairs to process. The following sections show examples of these suboptions.

The algorithm that the network solver uses for finding shortest paths is a variant of Dijkstra’s algorithm (Ahuja, Magnanti, and Orlin 1993). For unweighted graphs, the network solver uses a variant of breadth-first search. Dijkstra’s algorithm on weighted graphs runs in time $O(|N| \log |N| + |A|)$ for each source node. Breadth-first search runs in time $O(|N|+|A|)$ for each source node.

For weighted graphs, the algorithm uses the parameter that is specified in the WEIGHT= suboption in the SHORTPATH= option to evaluate a path’s total weight (cost).

Outputs

The shortest path algorithm produces up to two outputs. The output set that is specified in the SPPATHS= suboption contains the links of a shortest path for each source-sink pair combination. The output parameter that is specified in the SPWEIGHTS= suboption contains the total weight for the shortest path for each source-sink pair combination.

SPPATHS= Set

This set contains the links present in the shortest path for each of the source-sink pairs. For large graphs and a large requested number of source-sink pairs, this set can be extremely large. For extremely large graphs, generating the output can sometimes take longer than computing the shortest paths. For example, using the US road network data for the state of New York, the data contain a directed graph that has 264,346 nodes. Finding the shortest path for all pairs from only one source node results in 140,969,120 observations, which is a set of size 11 GB. Finding shortest paths for all pairs from all nodes would produce an enormous set.

The SPPATHS= set contains the following tuple members:

  1. the source node of this shortest path

  2. the sink node of this shortest path

  3. for this source-sink pair, the order of this link in a shortest path

  4. the tail node of this link in a shortest path

  5. the head node of this link in a shortest path

SPWEIGHTS= Parameter

This parameter contains the total weight for the shortest path for each of the source-sink pairs.

Shortest Paths for All Pairs

This example illustrates the use of the shortest path algorithm for all source-sink pairs on the simple undirected graph $G$ that is shown in Figure 8.49.

Figure 8.49: A Simple Undirected Graph $G$


The undirected graph $G$ can be represented by the following links data set LinkSetIn:

data LinkSetIn;
   input from $ to $ weight @@;
   datalines;
A B 3  A C 2  A D 6  A E 4  B D 5
B F 5  C E 1  D E 2  D F 1  E F 4
;

The following statements calculate shortest paths for all source-sink pairs:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   read data LinkSetIn into LINKS=[from to] weight;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */
   set NODES = union{<i,j> in LINKS} {i,j};
   num path_length{NODES, NODES};

   solve with NETWORK /
      links     = (weight=weight)
      shortpath
      out       = (sppaths=PATHS spweights=path_length)
   ;

   put PATHS;
   print path_length;
   create data ShortPathP from [source sink order from to]=PATHS
      weight[from,to];
   create data ShortPathW from [source sink]
      path_weight=path_length;
quit;

The data set ShortPathP contains the shortest paths and is shown in Figure 8.50.

Figure 8.50: All-Pairs Shortest Paths

ShortPathP

source sink order from to weight
A B 1 A B 3
A C 1 A C 2
A D 1 A C 2
A D 2 C E 1
A D 3 D E 2
A E 1 A C 2
A E 2 C E 1
A F 1 A C 2
A F 2 C E 1
A F 3 D E 2
A F 4 D F 1
B A 1 A B 3
B C 1 A B 3
B C 2 A C 2
B D 1 B D 5
B E 1 A B 3
B E 2 A C 2
B E 3 C E 1
B F 1 B F 5
C A 1 A C 2
C B 1 A C 2
C B 2 A B 3
C D 1 C E 1
C D 2 D E 2
C E 1 C E 1
C F 1 C E 1
C F 2 D E 2
C F 3 D F 1
D A 1 D E 2
D A 2 C E 1
D A 3 A C 2
D B 1 B D 5
D C 1 D E 2
D C 2 C E 1
D E 1 D E 2
D F 1 D F 1
E A 1 C E 1
E A 2 A C 2
E B 1 C E 1
E B 2 A C 2
E B 3 A B 3
E C 1 C E 1
E D 1 D E 2
E F 1 D E 2
E F 2 D F 1
F A 1 D F 1
F A 2 D E 2
F A 3 C E 1
F A 4 A C 2
F B 1 B F 5
F C 1 D F 1
F C 2 D E 2
F C 3 C E 1
F D 1 D F 1
F E 1 D F 1
F E 2 D E 2


The data set ShortPathW contains the path weight for the shortest paths of each source-sink pair and is shown in Figure 8.51.

Figure 8.51: All-Pairs Shortest Paths Summary

ShortPathW

source sink path_weight
A A .
A B 3
A C 2
A D 5
A E 3
A F 6
B A 3
B B .
B C 5
B D 5
B E 6
B F 5
C A 2
C B 5
C C .
C D 3
C E 1
C F 4
D A 5
D B 5
D C 3
D D .
D E 2
D F 1
E A 3
E B 6
E C 1
E D 2
E E .
E F 3
F A 6
F B 5
F C 4
F D 1
F E 3
F F .


When you are interested only in the source-sink pair that has the longest shortest path, you can use the PATHS= suboption. This suboption affects only the output processing; it does not affect the computation. All the designated source-sink shortest paths are calculated, but only the longest ones are written to the output set.

The following statements display only the longest shortest paths:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   read data LinkSetIn into LINKS=[from to] weight;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */

   solve with NETWORK /
      links     = ( weight = weight )
      shortpath = ( paths = longest )
      out       = ( sppaths = PATHS )
   ;

   put PATHS;
   create data ShortPathLong from [source sink order from to]=PATHS
      weight[from,to];
quit;

The data set ShortPathLong now contains the longest shortest paths and is shown in Figure 8.52.

Figure 8.52: Longest Shortest Paths

ShortPathLong

source sink order from to weight
A F 1 A C 2
A F 2 C E 1
A F 3 D E 2
A F 4 D F 1
B E 1 A B 3
B E 2 A C 2
B E 3 C E 1
E B 1 C E 1
E B 2 A C 2
E B 3 A B 3
F A 1 D F 1
F A 2 D E 2
F A 3 C E 1
F A 4 A C 2


Shortest Paths for a Subset of Source-Sink Pairs

This section illustrates the use of the SOURCE= and SINK= suboptions and the shortest path algorithm for calculating shortest paths between a subset of source-sink pairs. If $S$ denotes the nodes in the SOURCE= set and $T$ denotes the nodes in the SINK= set, the network solver calculates all the source-sink pairs in the cross product of these two sets.

For example, the following statements calculate a shortest path for the four combinations of source-sink pairs in $S\times T=\{ A,C\}  \times \{ B,F\} $:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   read data LinkSetIn into LINKS=[from to] weight;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */
   set SOURCES = / A C /;
   set SINKS   = / B F /;

   solve with NETWORK /
      links     = (weight=weight)
      shortpath = (source=SOURCES sink=SINKS)
      out       = (sppaths=PATHS)
   ;

   put PATHS;
   create data ShortPath from [source sink order from to]=PATHS weight[from,to];
quit;

The data set ShortPath contains the shortest paths and is shown in Figure 8.53.

Figure 8.53: Shortest Paths for a Subset of Source-Sink Pairs

ShortPath

source sink order from to weight
A B 1 A B 3
A F 1 A C 2
A F 2 C E 1
A F 3 D E 2
A F 4 D F 1
C B 1 A C 2
C B 2 A B 3
C F 1 C E 1
C F 2 D E 2
C F 3 D F 1


Shortest Paths for a Subset of Source or Sink Pairs

This section illustrates the use of the shortest path algorithm for calculating shortest paths between a subset of source (or sink) nodes and all other sink (or source) nodes.

In this case, you designate the subset of source (or sink) nodes in the node set by specifying the SOURCE= (or SINK=) suboption. By specifying only one of the suboptions, you indicate that you want the network solver to calculate all pairs from a subset of source nodes (or to calculate all pairs to a subset of sink nodes).

For example, the following statements calculate all the shortest paths from nodes B and E.:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   read data LinkSetIn into LINKS=[from to] weight;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */
   set SOURCES = / B E /;

   solve with NETWORK /
      links     = (weight=weight)
      shortpath = (source=SOURCES)
      out       = (sppaths=PATHS)
   ;

   put PATHS;
   create data ShortPath from [source sink order from to]=PATHS weight[from,to];
quit;

The data set ShortPath contains the shortest paths and is shown in Figure 8.54.

Figure 8.54: Shortest Paths for a Subset of Source Pairs

ShortPath

source sink order from to weight
B A 1 A B 3
B C 1 A B 3
B C 2 A C 2
B D 1 B D 5
B E 1 A B 3
B E 2 A C 2
B E 3 C E 1
B F 1 B F 5
E A 1 C E 1
E A 2 A C 2
E B 1 C E 1
E B 2 A C 2
E B 3 A B 3
E C 1 C E 1
E D 1 D E 2
E F 1 D E 2
E F 2 D F 1


Conversely, the following statements calculate all the shortest paths to nodes B and E.:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   read data LinkSetIn into LINKS=[from to] weight;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */
   set SINKS = / B E /;

   solve with NETWORK /
      links     = (weight=weight)
      shortpath = (sink=SINKS)
      out       = (sppaths=PATHS)
   ;

   put PATHS;
   create data ShortPath from [source sink order from to]=PATHS weight[from,to];
quit;

The data set ShortPath contains the shortest paths and is shown in Figure 8.55.

Figure 8.55: Shortest Paths for a Subset of Sink Pairs

ShortPath

source sink order from to weight
A B 1 A B 3
A E 1 A C 2
A E 2 C E 1
B E 1 A B 3
B E 2 A C 2
B E 3 C E 1
C B 1 A C 2
C B 2 A B 3
C E 1 C E 1
D B 1 B D 5
D E 1 D E 2
E B 1 C E 1
E B 2 A C 2
E B 3 A B 3
F B 1 B F 5
F E 1 D F 1
F E 2 D E 2


Shortest Paths for One Source-Sink Pair

This section illustrates the use of the shortest path algorithm for calculating shortest paths between one source-sink pair by using the SOURCE= and SINK= suboptions.

The following statements calculate a shortest path between node $C$ and node $F$:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   read data LinkSetIn into LINKS=[from to] weight;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */
   set SOURCES = / C /;
   set SINKS   = / F /;

   solve with NETWORK /
      links     = (weight=weight)
      shortpath = (source=SOURCES sink=SINKS)
      out       = (sppaths=PATHS)
   ;

   put PATHS;
   create data ShortPath from [source sink order from to]=PATHS weight[from,to];
quit;

The data set ShortPath contains this shortest path and is shown in Figure 8.56.

Figure 8.56: Shortest Paths for One Source-Sink Pair

ShortPath

source sink order from to weight
C F 1 C E 1
C F 2 D E 2
C F 3 D F 1


The shortest path is shown graphically in Figure 8.57.

Figure 8.57: Shortest Path between Nodes $C$ and $F$


Shortest Paths with Auxiliary Weight Calculation

This section illustrates the use of the shortest path algorithm, where auxiliary weights are used for calculating the shortest paths between all source-sink pairs.

Consider a links data set in which the auxiliary weight is a counter for each link:

data LinkSetIn;
   input from $ to $ weight count @@;
   datalines;
A B 3 1  A C 2 1  A D 6 1  A E 4 1  B D 5 1
B F 5 1  C E 1 1  D E 2 1  D F 1 1  E F 4 1
;

The following statements calculate shortest paths for all source-sink pairs:

proc optmodel;
   set <str,str> LINKS;
   num weight{LINKS};
   num count{LINKS};
   read data LinkSetIn into LINKS=[from to] weight count;
   set <str,str,num,str,str> PATHS; /* source, sink, order, from, to */
   set NODES = union{<i,j> in LINKS} {i,j};
   num path_length{NODES, NODES};

   solve with NETWORK /
      links     = (weight=weight)
      shortpath
      out       = (sppaths=PATHS spweights=path_length)
   ;

   put PATHS;
   num path_weight2{source in NODES, sink in NODES} =
      sum {<(source),(sink),order,from,to> in PATHS} count[from,to];
   print path_length path_weight2;
   create data ShortPathW from [source sink]
      path_weight=path_length path_weight2;
quit;

The data set ShortPathW contains the total path weight for shortest paths in each source-sink pair and is shown in Figure 8.58. Because the variable count in LinkSetIn is 1 for all links, the value in the output data set variable path_weights2 contains the number of links in each shortest path.

Figure 8.58: Shortest Paths Including Auxiliary Weights in Calculation

ShortPathW

source sink path_weight path_weight2
A A . 0
A B 3 1
A C 2 1
A D 5 3
A E 3 2
A F 6 4
B A 3 1
B B . 0
B C 5 2
B D 5 1
B E 6 3
B F 5 1
C A 2 1
C B 5 2
C C . 0
C D 3 2
C E 1 1
C F 4 3
D A 5 3
D B 5 1
D C 3 2
D D . 0
D E 2 1
D F 1 1
E A 3 2
E B 6 3
E C 1 1
E D 2 1
E E . 0
E F 3 2
F A 6 4
F B 5 1
F C 4 3
F D 1 1
F E 3 2
F F . 0


The section Getting Started: Network Solver shows an example of using the shortest path algorithm for minimizing travel to and from work based on traffic conditions.