# Shortest path problem

In graph theory, the shortest path problem is the problem of finding a path between two vertices (or nodes) in a graph such that the sum of the weights of its constituent edges is minimized.

Shortest path (A, C, E, D, F) between vertices A and F in the weighted directed graph

The problem of finding the shortest path between two intersections on a road map may be modeled as a special case of the shortest path problem in graphs, where the vertices correspond to intersections and the edges correspond to road segments, each weighted by the length of the segment.

## Definition

The shortest path problem can be defined for graphs whether undirected, directed, or mixed. It is defined here for undirected graphs; for directed graphs the definition of path requires that consecutive vertices be connected by an appropriate directed edge.

Two vertices are adjacent when they are both incident to a common edge. A path in an undirected graph is a sequence of vertices ${\displaystyle P=(v_{1},v_{2},\ldots ,v_{n})\in V\times V\times \cdots \times V}$ such that ${\displaystyle v_{i}}$ is adjacent to ${\displaystyle v_{i+1}}$ for ${\displaystyle 1\leq i. Such a path ${\displaystyle P}$ is called a path of length ${\displaystyle n-1}$ from ${\displaystyle v_{1}}$ to ${\displaystyle v_{n}}$. (The ${\displaystyle v_{i}}$ are variables; their numbering here relates to their position in the sequence and needs not to relate to any canonical labeling of the vertices.)

Let ${\displaystyle e_{i,j}}$ be the edge incident to both ${\displaystyle v_{i}}$ and ${\displaystyle v_{j}}$. Given a real-valued weight function ${\displaystyle f:E\rightarrow \mathbb {R} }$, and an undirected (simple) graph ${\displaystyle G}$, the shortest path from ${\displaystyle v}$ to ${\displaystyle v'}$ is the path ${\displaystyle P=(v_{1},v_{2},\ldots ,v_{n})}$ (where ${\displaystyle v_{1}=v}$ and ${\displaystyle v_{n}=v'}$) that over all possible ${\displaystyle n}$ minimizes the sum ${\displaystyle \sum _{i=1}^{n-1}f(e_{i,i+1}).}$ When each edge in the graph has unit weight or ${\displaystyle f:E\rightarrow \{1\}}$, this is equivalent to finding the path with fewest edges.

The problem is also sometimes called the single-pair shortest path problem, to distinguish it from the following variations:

• The single-source shortest path problem, in which we have to find shortest paths from a source vertex v to all other vertices in the graph.
• The single-destination shortest path problem, in which we have to find shortest paths from all vertices in the directed graph to a single destination vertex v. This can be reduced to the single-source shortest path problem by reversing the arcs in the directed graph.
• The all-pairs shortest path problem, in which we have to find shortest paths between every pair of vertices v, v' in the graph.

These generalizations have significantly more efficient algorithms than the simplistic approach of running a single-pair shortest path algorithm on all relevant pairs of vertices.

## Algorithms

The most important algorithms for solving this problem are:

• Dijkstra's algorithm solves the single-source shortest path problem with non-negative edge weight.
• Bellman–Ford algorithm solves the single-source problem if edge weights may be negative.
• A* search algorithm solves for single-pair shortest path using heuristics to try to speed up the search.
• Floyd–Warshall algorithm solves all pairs shortest paths.
• Johnson's algorithm solves all pairs shortest paths, and may be faster than Floyd–Warshall on sparse graphs.
• Viterbi algorithm solves the shortest stochastic path problem with an additional probabilistic weight on each node.

Additional algorithms and associated evaluations may be found in Cherkassky, Goldberg & Radzik (1996).

## Single-source shortest paths

### Undirected graphs

WeightsTime complexityAuthor
+O(V2)Dijkstra 1959
+O((E + V) log V)Johnson 1977 (binary heap)
+O(E + V log V)Fredman & Tarjan 1984 (Fibonacci heap)
O(E)Thorup 1999 (requires constant-time multiplication)

### Unweighted graphs

AlgorithmTime complexityAuthor

### Directed acyclic graphs (DAGs)

An algorithm using topological sorting can solve the single-source shortest path problem in time Θ(E + V) in arbitrarily-weighted DAGs.[1]

### Directed graphs with nonnegative weights

The following table is taken from Schrijver (2004), with some corrections and additions. A green background indicates an asymptotically best bound in the table; L is the maximum length (or weight) among all edges, assuming integer edge weights.

WeightsAlgorithmTime complexityAuthor
O(V 2EL)Ford 1956
Bellman–Ford algorithmO(VE)Shimbel 1955, Bellman 1958, Moore 1959
O(V 2 log V)Dantzig 1960
Dijkstra's algorithm with listO(V 2)Leyzorek et al. 1957, Dijkstra 1959, Minty (see Pollack & Wiebenson 1960), Whiting & Hillier 1960
Dijkstra's algorithm with binary heapO((E + V) log V)Johnson 1977
Dijkstra's algorithm with Fibonacci heapO(E + V log V)Fredman & Tarjan 1984, Fredman & Tarjan 1987
Dial's algorithm[2] (Dijkstra's algorithm using a bucket queue with L buckets)O(E + LV)Dial 1969
O(E log log L)Johnson 1981, Karlsson & Poblete 1983
Gabow's algorithmO(E logE/V L)Gabow 1983, Gabow 1985
O(E + V log L)Ahuja et al. 1990
ThorupO(E + V log log V)Thorup 2004

### Directed graphs with arbitrary weights without negative cycles

WeightsAlgorithmTime complexityAuthor
O(V 2EL)Ford 1956
Bellman–Ford algorithmO(VE)Shimbel 1955, Bellman 1958, Moore 1959
Johnson-Dijkstra with binary heapO(V (E + log V))Johnson 1977
Johnson-Dijkstra with Fibonacci heapO(V (E + log V))Fredman & Tarjan 1984, Fredman & Tarjan 1987, adapted after Johnson 1977
Johnson's technique applied to Dial's algorithm[2]O(V (E + L))Dial 1969, adapted after Johnson 1977

## All-pairs shortest paths

The all-pairs shortest path problem finds the shortest paths between every pair of vertices v, v' in the graph. The all-pairs shortest paths problem for unweighted directed graphs was introduced by Shimbel (1953), who observed that it could be solved by a linear number of matrix multiplications that takes a total time of O(V4).

### Undirected graph

WeightsTime complexityAlgorithm
+O(V3)Floyd–Warshall algorithm
${\displaystyle \{1,\infty \}}$${\displaystyle O(V^{\omega }\log V)}$Seidel's algorithm (expected running time)
${\displaystyle O(V^{3}/2^{\Omega (\log n)^{1/2}})}$Williams 2014
+O(EV log α(E,V))Pettie & Ramachandran 2002
O(EV)Thorup 1999 applied to every vertex (requires constant-time multiplication).

### Directed graph

WeightsTime complexityAlgorithm
ℝ (no negative cycles)O(V3)Floyd–Warshall algorithm
${\displaystyle O(V^{3}/2^{\Omega (\log n)^{1/2}})}$Williams 2014
ℝ (no negative cycles)O(EV + V2 log V)Johnson–Dijkstra
ℝ (no negative cycles)O(EV + V2 log log V)Pettie 2004
O(EV + V2 log log V)Hagerup 2000

## Applications

Shortest path algorithms are applied to automatically find directions between physical locations, such as driving directions on web mapping websites like MapQuest or Google Maps. For this application fast specialized algorithms are available.[3]

If one represents a nondeterministic abstract machine as a graph where vertices describe states and edges describe possible transitions, shortest path algorithms can be used to find an optimal sequence of choices to reach a certain goal state, or to establish lower bounds on the time needed to reach a given state. For example, if vertices represent the states of a puzzle like a Rubik's Cube and each directed edge corresponds to a single move or turn, shortest path algorithms can be used to find a solution that uses the minimum possible number of moves.

In a networking or telecommunications mindset, this shortest path problem is sometimes called the min-delay path problem and usually tied with a widest path problem. For example, the algorithm may seek the shortest (min-delay) widest path, or widest shortest (min-delay) path.

A more lighthearted application is the games of "six degrees of separation" that try to find the shortest path in graphs like movie stars appearing in the same film.

Other applications, often studied in operations research, include plant and facility layout, robotics, transportation, and VLSI design.[4]

A road network can be considered as a graph with positive weights. The nodes represent road junctions and each edge of the graph is associated with a road segment between two junctions. The weight of an edge may correspond to the length of the associated road segment, the time needed to traverse the segment, or the cost of traversing the segment. Using directed edges it is also possible to model one-way streets. Such graphs are special in the sense that some edges are more important than others for long-distance travel (e.g. highways). This property has been formalized using the notion of highway dimension.[5] There are a great number of algorithms that exploit this property and are therefore able to compute the shortest path a lot quicker than would be possible on general graphs.

All of these algorithms work in two phases. In the first phase, the graph is preprocessed without knowing the source or target node. The second phase is the query phase. In this phase, source and target node are known. The idea is that the road network is static, so the preprocessing phase can be done once and used for a large number of queries on the same road network.

The algorithm with the fastest known query time is called hub labeling and is able to compute shortest path on the road networks of Europe or the US in a fraction of a microsecond.[6] Other techniques that have been used are:

• ALT (A* search, landmarks, and triangle inequality)
• Arc flags
• Contraction hierarchies
• Transit node routing
• Reach-based pruning
• Labeling
• Hub labels

For shortest path problems in computational geometry, see Euclidean shortest path.

The travelling salesman problem is the problem of finding the shortest path that goes through every vertex exactly once, and returns to the start. Unlike the shortest path problem, which can be solved in polynomial time in graphs without negative cycles, the travelling salesman problem is NP-complete and, as such, is believed not to be efficiently solvable for large sets of data (see P = NP problem). The problem of finding the longest path in a graph is also NP-complete.

The Canadian traveller problem and the stochastic shortest path problem are generalizations where either the graph isn't completely known to the mover, changes over time, or where actions (traversals) are probabilistic.

The shortest multiple disconnected path [7] is a representation of the primitive path network within the framework of Reptation theory.

The widest path problem seeks a path so that the minimum label of any edge is as large as possible.

### Strategic shortest-paths

Sometimes, the edges in a graph have personalities: each edge has its own selfish interest. An example is a communication network, in which each edge is a computer that possibly belongs to a different person. Different computers have different transmission speeds, so every edge in the network has a numeric weight equal to the number of milliseconds it takes to transmit a message. Our goal is to send a message between two points in the network in the shortest time possible. If we know the transmission-time of each computer (the weight of each edge), then we can use a standard shortest-paths algorithm. If we do not know the transmission times, then we have to ask each computer to tell us its transmission-time. But, the computers may be selfish: a computer might tell us that its transmission time is very long, so that we will not bother it with our messages. A possible solution to this problem is to use a variant of the VCG mechanism, which gives the computers an incentive to reveal their true weights.

## Linear programming formulation

There is a natural linear programming formulation for the shortest path problem, given below. It is very simple compared to most other uses of linear programs in discrete optimization, however it illustrates connections to other concepts.

Given a directed graph (V, A) with source node s, target node t, and cost wij for each edge (i, j) in A, consider the program with variables xij

minimize ${\displaystyle \sum _{ij\in A}w_{ij}x_{ij}}$ subject to ${\displaystyle x\geq 0}$ and for all i, ${\displaystyle \sum _{j}x_{ij}-\sum _{j}x_{ji}={\begin{cases}1,&{\text{if }}i=s;\\-1,&{\text{if }}i=t;\\0,&{\text{ otherwise.}}\end{cases}}}$

The intuition behind this is that ${\displaystyle x_{ij}}$ is an indicator variable for whether edge (i, j) is part of the shortest path: 1 when it is, and 0 if it is not. We wish to select the set of edges with minimal weight, subject to the constraint that this set forms a path from s to t (represented by the equality constraint: for all vertices except s and t the number of incoming and outcoming edges that are part of the path must be the same (i.e., that it should be a path from s to t).

This LP has the special property that it is integral; more specifically, every basic optimal solution (when one exists) has all variables equal to 0 or 1, and the set of edges whose variables equal 1 form an s-t dipath. See Ahuja et al.[8] for one proof, although the origin of this approach dates back to mid-20th century.

The dual for this linear program is

maximize ytys subject to for all ij, yjyi wij

and feasible duals correspond to the concept of a consistent heuristic for the A* algorithm for shortest paths. For any feasible dual y the reduced costs ${\displaystyle w'_{ij}=w_{ij}-y_{j}+y_{i}}$ are nonnegative and A* essentially runs Dijkstra's algorithm on these reduced costs.

## General algebraic framework on semirings: the algebraic path problem

Many problems can be framed as a form of the shortest path for some suitably substituted notions of addition along a path and taking the minimum. The general approach to these is to consider the two operations to be those of a semiring. Semiring multiplication is done along the path, and the addition is between paths. This general framework is known as the algebraic path problem.[9][10][11]

Most of the classic shortest-path algorithms (and new ones) can be formulated as solving linear systems over such algebraic structures.[12]

More recently, an even more general framework for solving these (and much less obviously related problems) has been developed under the banner of valuation algebras.[13]

## Shortest path in stochastic time-dependent networks

In real-life situations, the transportation network is usually stochastic and time-dependent. In fact, a traveler traversing a link daily may experiences different travel times on that link due not only to the fluctuations in travel demand (origin-destination matrix) but also due to such incidents as work zones, bad weather conditions, accidents and vehicle breakdowns. As a result, a stochastic time-dependent (STD) network is a more realistic representation of an actual road network compared with the deterministic one.[14][15]

Despite considerable progress during the course of the past decade, it remains a controversial question how an optimal path should be defined and identified in stochastic road networks. In other words, there is no unique definition of an optimal path under uncertainty. One possible and common answer to this question is to find a path with the minimum expected travel time. The main advantage of using this approach is that efficient shortest path algorithms introduced for the deterministic networks can be readily employed to identify the path with the minimum expected travel time in a stochastic network. However, the resulting optimal path identified by this approach may not be reliable, because this approach fails to address travel time variability. To tackle this issue some researchers use distribution of travel time instead of expected value of it so they find the probability distribution of total travelling time using different optimization methods such as dynamic programming and Dijkstra's algorithm .[16] These methods use stochastic optimization, specifically stochastic dynamic programming to find the shortest path in networks with probabilistic arc length.[17] The concept of travel time reliability is used interchangeably with travel time variability in the transportation research literature, so that, in general, one can say that the higher the variability in travel time, the lower the reliability would be, and vice versa.

In order to account for travel time reliability more accurately, two common alternative definitions for an optimal path under uncertainty have been suggested. Some have introduced the concept of the most reliable path, aiming to maximize the probability of arriving on time or earlier than a given travel time budget. Others, alternatively, have put forward the concept of an α-reliable path based on which they intended to minimize the travel time budget required to ensure a pre-specified on-time arrival probability.

• Bidirectional search, an algorithm that finds the shortest path between two vertices on a directed graph
• Euclidean shortest path
• Flow network
• K shortest path routing
• Min-plus matrix multiplication
• Pathfinding
• Shortest Path Bridging
• Shortest path tree

## References

### Notes

1. Cormen et al. 2001, p. 655
2. Dial, Robert B. (1969), "Algorithm 360: Shortest-Path Forest with Topological Ordering [H]", Communications of the ACM, 12 (11): 632–633, doi:10.1145/363269.363610, S2CID 6754003
3. Sanders, Peter (March 23, 2009). "Fast route planning". Google Tech Talk. Cite journal requires |journal= (help)
4. Chen, Danny Z. (December 1996). "Developing algorithms and software for geometric path planning problems". ACM Computing Surveys. 28 (4es). Article 18. doi:10.1145/242224.242246. S2CID 11761485.
5. Abraham, Ittai; Fiat, Amos; Goldberg, Andrew V.; Werneck, Renato F. "Highway Dimension, Shortest Paths, and Provably Efficient Algorithms". ACM-SIAM Symposium on Discrete Algorithms, pages 782–793, 2010.
6. Abraham, Ittai; Delling, Daniel; Goldberg, Andrew V.; Werneck, Renato F. research.microsoft.com/pubs/142356/HL-TR.pdf "A Hub-Based Labeling Algorithm for Shortest Paths on Road Networks". Symposium on Experimental Algorithms, pages 230–241, 2011.
7. Kroger, Martin (2005). "Shortest multiple disconnected path for the analysis of entanglements in two- and three-dimensional polymeric systems". Computer Physics Communications. 168 (3): 209–232. Bibcode:2005CoPhC.168..209K. doi:10.1016/j.cpc.2005.01.020.
8. Ahuja, Ravindra K.; Magnanti, Thomas L.; Orlin, James B. (1993). Network Flows: Theory, Algorithms and Applications. Prentice Hall. ISBN 978-0-13-617549-0.
9. Pair, Claude (1967), "Sur des algorithmes pour des problèmes de cheminement dans les graphes finis (On algorithms for path problems in finite graphs)", in Rosentiehl (ed.), Théorie des graphes (journées internationales d'études) -- Theory of Graphs (international symposium), Rome (Italy), July 1966: Dunod (Paris) et Gordon and Breach (New York), p. 271CS1 maint: location (link)
10. Derniame, Jean Claude; Pair, Claude (1971), Problèmes de cheminement dans les graphes (Path Problems in Graphs), Dunod (Paris)
11. Baras, John; Theodorakopoulos, George (4 April 2010). Path Problems in Networks. Morgan & Claypool Publishers. pp. 9–. ISBN 978-1-59829-924-3.
12. Gondran, Michel; Minoux, Michel (2008). Graphs, Dioids and Semirings: New Models and Algorithms. Springer Science & Business Media. chapter 4. ISBN 978-0-387-75450-5.
13. Pouly, Marc; Kohlas, Jürg (2011). Generic Inference: A Unifying Theory for Automated Reasoning. John Wiley & Sons. Chapter 6. Valuation Algebras for Path Problems. ISBN 978-1-118-01086-0.
14. Loui, R.P., 1983. Optimal paths in graphs with stochastic or multidimensional weights. Communications of the ACM, 26(9), pp.670-676.
15. Rajabi-Bahaabadi, Mojtaba; Shariat-Mohaymany, Afshin; Babaei, Mohsen; Ahn, Chang Wook (2015). "Multi-objective path finding in stochastic time-dependent road networks using non-dominated sorting genetic algorithm". Expert Systems with Applications. 42 (12): 5056–5064. doi:10.1016/j.eswa.2015.02.046.
16. Olya, Mohammad Hessam (2014). "Finding shortest path in a combined exponential – gamma probability distribution arc length". International Journal of Operational Research. 21 (1): 25–37. doi:10.1504/IJOR.2014.064020.
17. Olya, Mohammad Hessam (2014). "Applying Dijkstra's algorithm for general shortest path problem with normal probability distribution arc length". International Journal of Operational Research. 21 (2): 143–154. doi:10.1504/IJOR.2014.064541.

### Bibliography

• Frigioni, D.; Marchetti-Spaccamela, A.; Nanni, U. (1998). "Fully dynamic output bounded single source shortest path problem". Proc. 7th Annu. ACM-SIAM Symp. Discrete Algorithms. Atlanta, GA. pp. 212–221. CiteSeerX 10.1.1.32.9856.
• Dreyfus, S. E. (October 1967). An Appraisal of Some Shortest Path Algorithms (PDF) (Report). Project Rand. United States Air Force. RM-5433-PR. DTIC AD-661265.