Finish Kruskal.

[saga.git] / mst.tex
diff --git a/mst.tex b/mst.tex

index f5d935c51583889e7a60601f651d14eca9d3bfd9..5e70a45a921fd2a951e48515dc234ae96dd36aab 100644 (file)
--- a/mst.tex
+++ b/mst.tex
@@ -9,7 +9,7 @@
  The problem of finding a minimum spanning tree of a weighted graph is one of the
  best studied problems in the area of combinatorial optimization since its birth.
  Its colorful history (see \cite{graham:msthistory} and \cite{nesetril:history} for the full account)
-begins in~1926 with the pioneering work of Bor\accent23uvka
+begins in~1926 with the pioneering work of Bor\o{u}vka
  \cite{boruvka:ojistem}\foot{See \cite{nesetril:boruvka} for an English translation with commentary.},
  who studied primarily an Euclidean version of the problem related to planning
  of electrical transmission lines (see \cite{boruvka:networks}), but gave an efficient
@@ -34,7 +34,7 @@ For a given graph~$G$ with weights $w:E(G)\rightarrow {\bb R}$:
    a union of (minimum) spanning trees of its connected components.
  \endlist
  
-Bor\accent23uvka's work was further extended by Jarn\'\i{}k \cite{jarnik:ojistem}, again in
+Bor\o{u}vka's work was further extended by Jarn\'\i{}k \cite{jarnik:ojistem}, again in
  mostly geometric setting, giving another efficient algorithm. However, when
  computer science and graph theory started forming in the 1950's and the
  spanning tree problem was one of the central topics of the flourishing new
@@ -144,7 +144,7 @@ $w(f)\ge w(e')$ due to the choice of $e'$ and all other edges on~$C$ are lighter
  than~$e'$ as $e'$ was not $T$-light.
  \qed
  
-\theorem\thmid{mstthm}%
+\thm\thmid{mstthm}%
  A~spanning tree~$T$ is minimum iff there is no $T$-light edge.
  
  \proof
@@ -163,7 +163,7 @@ minimum spanning trees according to the Cayley's formula \cite{cayley:trees}).
  However, as the following theorem shows, this is possible only if the weight
  function is not injective.
  
-\theoremn{MST uniqueness}
+\thmn{MST uniqueness}
  If all edge weights are distinct, then the minimum spanning tree is unique.
  
  \proof
@@ -255,13 +255,13 @@ and $V(G)\setminus M$ contains no blue edges, therefore we can use the blue rule
  
  \figure{mst-bez.eps}{295pt}{Configurations in the proof of the Black lemma}
  
-\theoremn{Red-Blue correctness}
+\thmn{Red-Blue correctness}
  For any selection of rules, the Red-Blue procedure stops and the blue edges form
  the minimum spanning tree of the input graph.
  
  \proof
  To prove that the procedure stops, let us notice that no edge is ever recolored,
-so we must run out of black edges after at most $m$ steps. Recoloring
+so we must run out of black edges after at most~$m$ steps. Recoloring
  to the same color is avoided by the conditions built in the rules, recoloring to
  a different color would mean that the an edge would be both inside and outside~$T_{min}$
  due to our Red and Blue lemmata.
@@ -274,25 +274,70 @@ lemma all other (red) edges are outside~$T_{min}$, so the blue edges are exactly
  \section{Classical algorithms}
  
  The three classical MST algorithms can be easily stated in terms of the Red-Blue meta-algorithm.
+For each of them, we first state the general version of the algorithm, then we prove that
+it gives the correct result and finally we discuss the time complexity of various
+implementations.
  
-\algn{Kruskal \cite{kruskal:mst}}
+\algn{Bor\o{u}vka \cite{boruvka:ojistem}, Choquet \cite{choquet:mst}, Sollin \cite{sollin:mst} and others}
  \algo
  \algin A~graph~$G$ with an edge comparison oracle.
-\:Sort edges of~$G$ by their increasing weight.
-\:$T\=\emptyset$. \cmt{an empty spanning subgraph}
-\:For all edges $e$ in their sorted order:
-\::If $T+e$ is acyclic, add~$e$ to~$T$.
-\::Otherwise drop~$e$.
+\:$T\=$ a forest consisting of vertices of~$G$ and no edges.
+\:While $T$ is not connected:
+\::For each component $T_i$ of~$T$, choose the lightest edge $e_i$ from the cut
+   separating $T_i$ from the rest of~$T$.
+\::Add all $e_i$'s to~$T$.
  \algout Minimum spanning tree~$T$.
  \endalgo
  
+\lemma
+Bor\o{u}vka's algorithm returns the MST of the input graph.
+
  \proof
-During the course of the algorithm, $T$ is a forest with all edges blue. Adding~$e$ to~$T$
-in step~4 applies the Blue rule on the cut separating two components of~$T$ ($e$ is the lightest,
-because all other edges of the cut have not been considered yet). Dropping~$e$ in step~5 corresponds
-to the red rule on the cycle found ($e$~must be the heaviest, since all other edges of the
-cycle have been already processed). At the end of the algorithm, all edges have been colored,
-so~$T$ must be the~MST.
+In every iteration of the algorithm, $T$ is a blue subgraph,
+because every addition of some edge~$e_i$ to~$T$ is a straightforward
+application of the Blue rule. We stop when the blue subgraph is connected, so
+we do not need the Red rule to explicitly exclude edges.
+
+It remains to show that adding the edges simultaneously does not
+produce a cycle. Consider the first iteration of the algorithm where $T$ contains some cycle~$C$. Without
+loss of generality we can assume that $C=T_1[u_1v_1]\,v_1u_2\,T_2[u_2v_2]\,v_2u_3\,T_3[u_3v_3]\, \ldots \,T_k[u_kv_k]\,v_ku_1$.
+Each component $T_i$ has chosen its lightest incident edge~$e_i$ as either the edge $v_iu_{i+1}$
+or $v_{i-1}u_i$ (indexing cyclically). Assume that $e_1=v_1u_2$ (otherwise we reverse the orientation
+of the cycle). Then $e_2=v_2u_3$ and $w(e_2)<w(e_1)$ and so on, giving $w(e_1)>w(e_2)>\ldots>w(e_k)>w(e_1)$,
+which is a contradiction. (Note that distinctness of edge weights was crucial here.)
+\qed
+
+\lemma
+In each iteration of the algorithm, the number of trees in~$T$ drops at least twice.
+
+\proof
+Each tree gets merged with at least one neighboring trees, so each of the new trees
+consists of at least two original trees.
+\qed
+
+\cor
+The algorithm stops in $\O(\log n)$ iterations.
+
+\lemma
+Each iteration can be carried out in time $\O(m)$.
+
+\proof
+Following \cite{mm:mst},
+we assign a label to each tree and we keep a mapping from vertices to the
+labels of the trees they belong to. We scan all edges, map their endpoints
+to the particular trees and for each tree we maintain the lightest incident edge
+so far encountered. Instead of merging the trees one by one (which would be too
+slow), we build an auxilliary graph whose vertices are labels of the original
+trees and edges correspond to the chosen lightest inter-tree edges. We find connected
+components of this graph, these determine how the original labels are translated
+to the new labels.
+\qed
+
+\thm
+Bor\o{u}vka's algorithm finds the MST in time $\O(m\log n)$.
+
+\proof
+Follows from the previous lemmata.
  \qed
  
  \algn{Jarn\'\i{}k \cite{jarnik:ojistem}, Prim \cite{prim:mst}, Dijkstra \cite{dijkstra:mst}}
@@ -305,36 +350,158 @@ so~$T$ must be the~MST.
  \algout Minimum spanning tree~$T$.
  \endalgo
  
+\lemma
+Jarn\'\i{}k's algorithm returns the MST of the input graph.
+
  \proof
-In every step of the algorithm, $T$ is always a blue tree. Step~4 corresponds to applying
+During the course of the algorithm, $T$ is always a blue tree. Step~4 corresponds to applying
  the Blue rule to a cut between~$T$ and the rest of the given graph. We need not care about
  the remaining edges, since for a connected graph the algorithm always stops with the right
  number of blue edges.
  \qed
  
-\algn{Bor\accent23uvka \cite{boruvka:ojistem}, Choquet \cite{choquet:mst}, Sollin \cite{sollin:mst} and others}
+\impl
+The most important part of the algorithm is finding \em{neighboring edges,} i.e., edges
+going between $T$ and $V(G)\setminus T$. In the straightforward implementation,
+searching for the lightest neighboring edge takes $\Theta(m)$ time, so the whole
+algorithm runs in time $\Theta(mn)$.
+
+We can do much better by using a binary
+heap to hold all neighboring edges. In each iteration, we find and delete the
+minimum edge from the heap and once we expand the tree, we insert the newly discovered
+neighboring edges to the heap while deleting the neighboring edges which become
+internal to the new tree. Since there are always at most~$m$ edges in the heap,
+each heap operation takes $\O(\log m)=\O(\log n)$ time. For every edge, we perform
+at most one insertion and at most one deletion, so we spend $\O(m\log n)$ time in total.
+From this, we can conclude:
+
+\thm
+Jarn\'\i{}k's algorithm finds the MST of the graph in time $\O(m\log n)$.
+
+\rem
+We will show several faster implementations in section \secref{fibonacci}.
+
+\algn{Kruskal \cite{kruskal:mst}, the Greedy algorithm}
  \algo
  \algin A~graph~$G$ with an edge comparison oracle.
-\:$T\=$ a forest consisting of vertices of~$G$ and no edges.
-\:While $T$ is not connected:
-\::For each component $T_i$ of~$T$, choose the lightest edge $e_i$ from the cut
-   separating $T_i$ from the rest of~$T$.
-\::Add all $e_i$'s to~$T$.
+\:Sort edges of~$G$ by their increasing weight.
+\:$T\=\emptyset$. \cmt{an empty spanning subgraph}
+\:For all edges $e$ in their sorted order:
+\::If $T+e$ is acyclic, add~$e$ to~$T$.
+\::Otherwise drop~$e$.
  \algout Minimum spanning tree~$T$.
  \endalgo
  
+\lemma
+Kruskal's algorithm returns the MST of the input graph.
+
+\proof
+In every step, $T$ is a forest of blue trees. Adding~$e$ to~$T$
+in step~4 applies the Blue rule on the cut separating two components of~$T$ ($e$ is the lightest,
+because all other edges of the cut have not been considered yet). Dropping~$e$ in step~5 corresponds
+to the red rule on the cycle found ($e$~must be the heaviest, since all other edges of the
+cycle have been already processed). At the end of the algorithm, all edges have been colored,
+so~$T$ must be the~MST.
+\qed
+
+\impl
+Except for the initial sorting, which in general takes $\Theta(m\log n)$ time, the only
+other non-trivial operation is detection of cycles. What we need is a data structure
+for maintaining connected components, which supports queries and edge insertion.
+The following theorem shows that it can be done with a surprising efficiency.
+
+\thmn{Incremental connectivity}
+When only edge insertions and queries are allowed, connected components
+can be maintained in $\O(\alpha(n))$ time amortized per operation.
+
  \proof
-Again $T$ is a blue forest, because every addition of some edge~$e_i$ to~$T$ is a straightforward
-application of the Blue rule. It remains to show that adding the edges simultaneously does not
-produce a cycle.
-\FIXME{Finish}
+Proven by Tarjan and van Leeuwen in \cite{tarjan:setunion}.
  \qed
  
-\algn{Contracting version of Bor\accent23uvka's algorithm}
+\FIXME{Define Ackermann's function. Use $\alpha(m,n)$?}
+
+\rem
+The cost of the operations on components is of course dwarfed by the complexity
+of sorting, so a much simpler (at least in terms of its analysis) data
+structure would be sufficient, as long as it has $\O(\log n)$ amortized complexity
+per operation. For example, we can label vertices with identifiers of the
+corresponding components and always recolor the smaller of the two components.
+
+\thm
+Kruskal's algorithm finds the MST of a given graph in time $\O(m\log n)$
+or $\O(m\alpha(n))$ if the edges are already sorted by their weights.
+
+\proof
+Follows from the above analysis.
+\qed
+
+\section{Contractive algorithms}
+
+While the classical algorithms are based on growing suitable trees, they
+can be also reformulated in terms of edge contraction. Instead of keeping
+a forest of trees, we can keep each tree contracted to a single vertex.
+This replaces the relatively complex tree-edge incidencies by simple
+vertex-edge incidencies, potentially speeding up the calculation at the
+expense of having to perform the contractions.
+
+We will show a contractive version of the Bor\o{u}vka's algorithm
+in which these costs are carefully balanced, leading for example to
+a linear-time algorithm for MST in planar graphs.
+
+There are two definitions of edge contraction which differ when an edge of a
+triangle is contracted. Either we unify the other two edges to a single edge
+or we keep them as two parallel edges, leaving us with a multigraph. We will
+use the multigraph version and show that we can easily reduce the multigraph
+to a simple graph later. (See \thmref{contract} for the exact definitions.)
+
+We only need to be able to map edges of the contracted graph to the original
+edges, so each edge will carry a unique label $l(e)$ which will be preserved by
+contractions.
+
+\lemman{Flattening a multigraph}%
+Let $G$ be a multigraph and $G'$ its subgraph such that all loops have been
+removed and each bundle of parallel edges replaced by its lightest edge.
+Then $G'$~has the same MST as~$G$.
+
+\proof
+Loops can be never used in a spanning tree. If there is a spanning tree~$T$
+containing a removed edge~$e$ parallel to an edge~$e'\in G'$, exchaning $e'$
+for~$e$ in~$T$ makes it lighter. \qed
+
+\rem Removal of the heavier of a pair of parallel edges can be also viewed
+as an application of the Red rule on a two-edge cycle. And indeed it is, the
+Red-Blue procedure works on multigraphs as well as on simple graphs and all the
+classical algorithms also do. We only have to be more careful in the
+formulations and proofs, which we preferred to avoid. We also note that most of
+the algorithms can be run on disconnected graphs with little or no
+modifications.
+
+\algn{Contracting version of Bor\o{u}vka's algorithm}
+\algo
+\algin A~graph~$G$ with an edge comparison oracle.
+\:$T\=\emptyset$.
+\:$l(e)\=e$ for all edges~$e$. \cmt{Initialize the labels.}
+\:While $n(G)>1$:
+\::For each vertex $v_i$ of~$G$, let $e_i$ be the lightest edge incident to~$v_i$.
+\::$T\=T\cup \{ l(e_i) \}$. \cmt{Remember labels of all selected edges.}
+\::Contract $G$ along all edges $e_i$, inheriting labels and weights.
+\::Flatten $G$, removing parallel edges and loops.
+\algout Minimum spanning tree~$T$.
+\endalgo
+
+\FIXME{Show how to do contractions in linear time}
+
+\FIXME{Analyse performance on planar graphs}
+
+\section{Minor-closed graph classes}
+
+\section{Using Fibonacci heaps}
+\secid{fibonacci}
  
-% G has to be connected
+% G has to be connected, so m=O(n)
  % mention Steiner trees
  % mention matroids
  % sorted weights
+% \O(...) as a set?
  
  \endpart