opt.tex

   1 \ifx\endpart\undefined
   2 \input macros.tex
   3 \fi
   4
   5 \chapter{Approaching Optimality}\id{optchap}%
   6
   7 \section{Soft heaps}\id{shsect}%
   8
   9 A~vast majority of MST algorithms that we have encountered so far is based on
  10 the Tarjan's Blue rule (Lemma \ref{bluelemma}). The rule serves to identify
  11 edges that belong to the MST, while all other edges are left in the process. This
  12 unfortunately means that the later stages of computation spend most of
  13 their time on these edges that never enter the MSF. A~notable exception is the randomized
  14 algorithm of Karger, Klein and Tarjan. It adds an~important ingredient: it uses
  15 the Red rule (Lemma \ref{redlemma}) to filter out edges that are guaranteed to stay
  16 outside the MST, so that the graphs with which the algorithm works get smaller
  17 with time.
  18
  19 Recently, Chazelle \cite{chazelle:ackermann} and Pettie \cite{pettie:ackermann}
  20 have presented new deterministic algorithms for the MST which are also based
  21 on the combination of both rules. They have reached worst-case time complexity
  22 $\O(m\timesalpha(m,n))$ on the Pointer Machine. We will devote this chapter to their results
  23 and especially to another algorithm by Pettie and Ramachandran \cite{pettie:optimal}
  24 which is provably optimal.
  25
  26 At the very heart of all these algorithms lies the \df{soft heap} discovered by
  27 Chazelle \cite{chazelle:softheap}. It is a~meldable priority queue, roughly
  28 similar to the Vuillemin's binomial heaps \cite{vuillemin:binheap} or Fredman's
  29 and Tarjan's Fibonacci heaps \cite{ft:fibonacci}. The soft heaps run faster at
  30 the expense of \df{corrupting} a~fraction of the inserted elements by raising
  31 their values (the values are however never lowered). This allows for
  32 a~trade-off between accuracy and speed, controlled by a~parameter~$\varepsilon$.
  33 The heap operations take $\O(\log(1/\varepsilon))$ amortized time and at every
  34 moment at most~$\varepsilon n$ elements of the $n$~elements inserted can be
  35 corrupted.
  36
  37 \defnn{Soft heap interface}%
  38 The \df{soft heap} contains a~set of distinct items from a~totally ordered universe and it
  39 supports the following operations:
  40 \itemize\ibull
  41 \:$\<Create>(\varepsilon)$ --- Create an~empty soft heap with the given accuracy parameter~$\varepsilon$.
  42 \:$\<Insert>(H,x)$ --- Insert a~new item~$x$ into the heap~$H$.
  43 \:$\<Meld>(P,Q)$ --- Merge two heaps into one, more precisely move all items of a~heap~$Q$
  44   to the heap~$P$, destroying~$Q$ in the process. Both heaps must have the same~$\varepsilon$.
  45 \:$\<DeleteMin>(H)$ --- Delete the minimum item of the heap~$H$ and return its value
  46   (optionally signalling that the value has been corrupted).
  47 \:$\<Explode>(H)$ --- Destroy the heap and return a~list of all items contained in it
  48   (again optionally marking those corrupted).
  49 \endlist
  50
  51 \>For every item, we will distinguish between its \df{original} value at the time
  52 of insertion and its \df{current} value in the heap, which is possibly corrupted.
  53
  54 \examplen{Linear-time selection}
  55 We can use soft heaps to select the median (or generally the $k$-th smallest element)
  56 of a~sequence. We insert all $n$~elements to a~soft heap with error rate $\varepsilon=1/3$.
  57 Then we delete the minimum about $n/3$ times and we remember the current value~$x$
  58 of the last element deleted. This~$x$ is greater or equal than the current
  59 values of the previously deleted elements and thus also than their original values.
  60 On the other hand, the current values of the $2n/3$ elements remaining in the heap
  61 are greater than~$x$ and at most $n/3$ of them are corrupted. Therefore at least $n/3$
  62 original elements are greater than~$x$ and at least $n/3$ ones are smaller.
  63
  64 This allows us to use the~$x$ as a~pivot in the traditional divide-and-conquer algorithm
  65 for selection (cf.~Hoare's Quickselect algorithm \cite{hoare:qselect}): We split the
  66 input elements to three parts: $A$~contains those less than~$x$, $B$~those equal to~$x$
  67 and $C$~those greater than~$x$. If $k\le\vert A\vert$, the desired element lies in~$A$,
  68 so we can continue by finding the $k$-th smallest element of~$A$. If $\vert A\vert < k \le \vert
  69 A\vert + \vert B\vert$, the desired element is $x$~itself. Otherwise, we search for the
  70 $(k-\vert A\vert-\vert B\vert)$-th smallest element of~$C$. Either way, we have removed
  71 at least $n/3$ items, so the total time complexity is
  72 $\O(n+(2/3)\cdot n+(2/3)^2\cdot n+\ldots) = \O(n)$.
  73
  74 We have obtained a~nice alternative to the standard
  75 linear-time selection algorithm of Blum \cite{blum:selection}. The same trick can be used
  76 to select a~good pivot in Quicksort \cite{hoare:qsort}, leading to time complexity $\O(n\log n)$
  77 in the worst case.
  78
  79 \defnn{Soft queues}
  80 The soft heap is built from \df{soft queues} (we will usually omit the adjective soft
  81 in the rest of this section). Each queue has a~shape of a~binary tree.\foot{%
  82 Actually, Chazelle defines the queues as binomial trees, but he transforms them in ways that are
  83 somewhat counter-intuitive, albeit well-defined. We prefer describing the queues as binary
  84 trees with a~special distribution of values. In fact, the original C~code in the Chazelle's
  85 paper \cite{chazelle:softheap} uses this representation internally.}
  86 Each vertex~$v$ of the tree remembers a~doubly-linked list of items. The
  87 item list in every left son will be used only temporarily and it will be kept
  88 empty between operations. Only right sons and the root have their lists
  89 permanently occupied. The left sons will be called \df{white}, the right ones
  90 \df{black.} (See the picture.)
  91
  92 The first value in every list is called the \df{controlling key} of the vertex,
  93 denoted by $\<ckey>(v)$. If the list is empty, we keep the most recently used
  94 value or we set $\<ckey>(v)=+\infty$. The \<ckey>s obey the standard \df{heap order}
  95 --- a~\<ckey> of a~parent is always smaller than the \<ckey>s of its sons.
  96
  97 Each vertex is also assigned its \df{rank,} which is a~non-negative integer.
  98 The ranks of leaves are always zero, the rank of every internal vertex can be
  99 arbitrary, but it must be strictly greater than the ranks of its sons. Also,
 100 we will always maintain the ranks in such a~way that both sons will have the
 101 equal ranks. We define the rank of the whole queue to be equal to the rank of
 102 its root vertex and similarly for its \<ckey>.
 103
 104 A~queue is called \df{complete} if every two vertices joined by an~edge have rank
 105 difference exactly one. In other words, it is a~complete binary tree and the
 106 ranks correspond to vertex heights.
 107
 108 \figure{softheap1.eps}{\epsfxsize}{\multicap{A~complete and a~partial soft queue tree\\
 109 (black vertices contain items, numbers indicate ranks)}}
 110
 111 \obs
 112 The complete queue of rank~$k$ contains exactly~$2^{k+1}-1$ vertices, $2^k$~of which are
 113 black (by induction). Any other queue can be trivially embedded to the complete queue of the same
 114 rank, which we will call the \df{master tree} of the queue. This embedding preserves vertex
 115 ranks, colors and the ancestor relation.
 116
 117 The queues have a~nice recursive structure. We can construct a~queue of rank~$k$ by \df{joining}
 118 two queues of rank~$k-1$ under a~new root. The root will inherit the item list of one
 119 of the original roots and also its \<ckey>. To preserve the heap order, we will choose
 120 the son whose \<ckey> is smaller.
 121
 122 Sometimes, we will also need to split a~queue to smaller queues. We will call this operation
 123 \df{dismantling} the queue and it will happen only in cases when the item list in the root
 124 is empty. It suffices to remove the leftmost (all white) path going from the root. From
 125 a~queue of rank~$k$, we get queues of ranks $0,1,\ldots,k-1$, some of which may be missing
 126 if the original queue was not complete.
 127
 128 \figure{softheap2.eps}{\epsfxsize}{Joining and dismantling of soft queues}
 129
 130 We will now define the real soft heap and the operations on it.
 131
 132 \defn A~\df{soft heap} consists of:
 133 \itemize\ibull
 134 \:a~doubly linked list of soft queues of distinct ranks (in increasing order of ranks),
 135   we will call the first queue the \df{head} of the list, the last queue will be its \df{tail};
 136 \:\df{suffix minima:} each queue contains a~pointer to the queue with minimum \<ckey>
 137 of those following it in the list;
 138 \:a~global parameter~$r$: an~even integer to be set depending on~$\varepsilon$.
 139 \endlist
 140
 141 \>We will define the \df{rank} of a~heap as the highest of the ranks of its queues
 142 (that is, the rank of the heap's tail).
 143
 144 The heap always keeps the \df{Rank invariant:} When a~root of any tree has rank~$k$,
 145 its leftmost path contains at least~$k/2$ vertices.
 146
 147 \paran{Operations on soft heaps}
 148
 149 \em{Melding} of two soft heaps involves merging of their lists of queues. We disassemble
 150 the heap of the smaller rank and we insert its queues to the other heap.
 151 If two queues of the same rank~$k$ appear in both lists, we \em{join} them to
 152 a~single queue of rank~$k+1$ as already described and we propagate the new queue as
 153 a~\df{carry} to the next iteration. (This is similar to addition of binary numbers.)
 154 Finally, we have to update the suffix minima by walking the new list backwards
 155 from the last inserted item.
 156
 157 \em{Creation} of a~new soft heap is trivial, \em{insertion} is handled by creating
 158 a~single-element heap and melding it to the destination heap.
 159
 160 \algn{Creating a~new soft heap}
 161 \algo
 162 \algin The~parameter~$\varepsilon$ (the accuracy of the heap).
 163 \:Allocate memory for a~new heap structure~$H$.
 164 \:Initialize the list of queues in~$H$ to an~empty list.
 165 \:Set the parameter~$r$ to~$2\lceil\log(1/\varepsilon)\rceil+2$ (to be justified later).
 166 \algout A~newly minted soft heap~$H$.
 167 \endalgo
 168
 169 \algn{Melding of two soft heaps}
 170 \algo
 171 \algin Two soft heaps~$P$ and~$Q$.
 172 \:If $\<rank>(P) < \<rank>(Q)$, exchange the queue lists of~$P$ and~$Q$.
 173 \:$p\=\<head>(P)$.
 174 \brk\cmt{Whenever we run into an~end of a~list in this procedure, we assume that
 175 there is an~empty queue of infinite rank there.}
 176 \:While $Q$ still has some queues:
 177 \::$q\=\<head>(Q)$.
 178 \::If $\<rank>(p) < \<rank>(q)$, then $p\=$ the successor of~$p$,
 179 \::else if $\<rank>(p) > \<rank>(q)$, remove~$q$ from~$Q$ and insert it to~$P$ before~$p$,
 180 \::otherwise (the ranks are equal, we need to propagate the carry):
 181 \:::$\<carry>\=p$.
 182 \:::Remove~$p$ from~$P$ and set $p\=$ the original successor of~$p$.
 183 \:::While $\<rank>(q)=\<rank>(\<carry>)$:
 184 \::::Remove~$q$ from~$Q$.
 185 \::::$\<carry>\=\<join>(q, \<carry>)$.
 186 \::::$q\=\<head>(Q)$.
 187 \:::Insert~\<carry> before~$q$.
 188 \:Update the suffix minima: Walk with~$p$ backwards to the head of~$P$:
 189 \::$p'\=\<suffix\_min>$ of the successor of~$p$.
 190 \::If $\<ckey>(p) < \<ckey>(p')$, set $\<suffix\_min>(p)\=p$.
 191 \::Otherwise set $\<suffix\_min>(p)\=p'$.
 192 \:Destroy the heap~$Q$.
 193 \algout The merged heap~$P$.
 194 \endalgo
 195
 196 \algn{Insertion of an~element to a~soft heap}
 197 \algo
 198 \algin A~heap~$H$ and a~new element~$x$.
 199 \:Create a~new heap~$H'$ of the same parameters as~$H$. Let~$H'$ contain a~sole queue of rank~0,
 200   whose only vertex has the element~$x$ in its item list.
 201 \:$\<Meld>(H,H')$.
 202 \algout An~updated heap~$H$.
 203 \endalgo
 204
 205 \paran{Corruption}%
 206 So far, the mechanics of the soft heaps were almost identical to the binomial heaps
 207 and the reader could rightfully yawn. The things are going to get interesting
 208 now as we approach the parts where corruption of items takes place.
 209
 210 If all item lists contain at most one item equal to the \<ckey> of the particular
 211 vertex, no information is lost and the heap order guarantees that the minimum item
 212 of every queue stays in its root. We can however allow longer lists and let the items
 213 stored in a~single list travel together between the vertices of the tree, still
 214 represented by a~common \<ckey>. This data-structural analogue of car pooling will
 215 allow the items to travel at a~faster rate, but as only a~single item can be equal
 216 to the \<ckey>, all other items will be inevitably corrupted.
 217
 218 We of course have to be careful about the size of the lists, because we must avoid corrupting
 219 too many items. We will control the growth according to the vertex ranks. Vertices
 220 with rank at most~$r$ will always contain just a~single item. Above this level,
 221 the higher is the rank, the longer list will be allowed.
 222
 223 \para
 224 \em{Deletion of minimum} will be based on this principle. The minimum is easy to locate
 225 --- we follow the \<suffix\_min> of the head of the heap to the queue with the minimum
 226 \<ckey>. There we look inside the item list of the root of the queue. We remove the \em{last}
 227 item from the list (we do not want the \<ckey> to change) and we return it as the minimum.
 228 (It is not necessarily the real minimum of all items, but always the minimum of their
 229 current, possibly corrupted values.)
 230
 231 If the list becomes empty, we \em{refill} it with items from the lower levels of the
 232 same queue. This process can be best described recursively: We ask the left son to refill itself
 233 (remember that the left son is always white, so there are currently no items there). If the new
 234 \<ckey> of the left son is smaller than of the right son, we immediately move the left
 235 son's list to its parent. Otherwise, we exchange the sons and move the list from the
 236 new left son to the parent. This way we obey the heap order and at the same time we keep
 237 the white left son free of items.
 238 \looseness=1  %%HACK%%
 239
 240 Occasionally, we repeat this process once again and we concatenate the resulting lists
 241 (we append the latter list to the former, using the smaller of the two \<ckey>s). This
 242 makes the lists grow longer and we want to do that roughly on every other level of the
 243 tree. The exact condition will be that either the rank of the current vertex is odd,
 244 or the difference in ranks between this vertex and its right son is at least two.
 245
 246 If refilling of the left son fails because there are no more items in that subtree
 247 (we report this by setting the \<ckey> to $+\infty$), the current vertex is no longer
 248 needed --- the items would just pass through it unmodified. We therefore want to
 249 remove it. Instead of deleting it directly, we rather make it point to its former
 250 grandsons and we remove the (now orphaned) original son. This helps us to ensure
 251 that both sons always keep the same rank, which will be useful for the analysis.
 252
 253 When the refilling is over, we update the suffix minima by walking from the current
 254 queue to the head of the heap exactly as we did in the \<Meld> procedure.
 255
 256 Our only remaining worry is that the Rank invariant can be broken after the
 257 refilling. When the leftmost path of the tree becomes too short, we just
 258 \em{dismantle} the tree as already described and we meld the new trees back to
 259 the heap. This is easier to handle when the item list at the root vertex is
 260 empty. We will therefore move this check before the refilling of the root list.
 261 It will turn out that we have enough time to always walk the leftmost path
 262 completely, so no explicit counters are needed.
 263
 264 %%HACK%%
 265 \>Let us translate these ideas to real (pseudo)code:
 266
 267 \algn{Deleting the minimum item from a~soft heap}
 268 \algo
 269 \algin A~soft heap~$H$.
 270 \:Use \<suffix\_min> of the head queue of~$H$ to locate the queue~$q$ with the smallest \<ckey>.
 271 \:Remove the final element~$x$ of the item list in the root of~$q$.
 272 \:If the item list became empty:
 273 \::Count the vertices on the leftmost path of~$q$.
 274 \::If there are less than $\<rank>(q)$ of them:
 275 \:::Remove~$q$ from the list of queues.
 276 \:::Recalculate the suffix minima as in the \<Meld> procedure.
 277 \:::Dismantle~$q$ and create a~heap~$H'$ holding the resulting trees.
 278 \:::Meld them back: $\<Meld>(H,H')$.
 279 \::Otherwise:
 280 \:::Call \<Refill> on the root of~$q$.
 281 \:::If $\<ckey>(q)=+\infty$ (no items left), remove the tree~$q$ from~$H$.
 282 \:::Recalculate the suffix minima.
 283 \algout The deleted minimum item~$x$ (possibly corrupted).
 284 \endalgo
 285
 286 \algn{Refilling the item list of a~vertex}
 287 \algo
 288 \algin A~soft queue and its vertex~$v$ with an~empty item list.
 289 \:Handle trivial cases: If~$v$ has no children or both have $\<ckey>=+\infty$,
 290   set $\<ckey>(v)$ to~$+\infty$ and return.
 291 \:Let \<left> and~\<right> denote the respective sons of~$v$.
 292 \:Recurse: call $\<Refill>(\<left>)$.
 293 \:If $\<ckey>(\<left>) > \<ckey>(\<right>)$, swap the sons.
 294 \:Move the item list from \<left> to~$v$ (implying $\<ckey>(v)=\<ckey>(\<left>)$).
 295 \:If $\<rank>(v) > r$ and either $\<rank>(v)$ is odd or $\<rank>(v) > \<rank>(\<right>)+1$, recurse once more:
 296 \::Repeat steps 3--4.
 297 \::Append the item list from \<left> to the item list at~$v$.
 298 \:Clean up. If $\<ckey>(\<right>) = +\infty$:
 299 \::If $\<ckey>(\<left>) = +\infty$, unlink and discard both sons.
 300 \::Otherwise relink the sons of \<left> to~$v$ and discard \<left>.
 301 \algout A~modified soft queue.
 302 \endalgo
 303
 304 \para
 305 \<Explode> is trivial once we know how to recognize the corrupted items. It simply examines
 306 all queues in the heap, walks the trees and the item lists of all vertices. It records
 307 all items seen, the corrupted ones are those that different from their \<ckey>.
 308
 309 \paran{Analysis of accuracy}%
 310 The description of the operations is now complete, so let us analyse their behavior
 311 and verify that we have delivered what we promised --- first the accuracy of
 312 the structure, then the time complexity of operations. In the whole analysis,
 313 we will denote the total number of elements inserted during the history of the
 314 structure by~$n$. We will also frequently take advantage of knowing that the
 315 threshold~$r$ is even.
 316
 317 We start by bounding the sizes of the item lists.
 318
 319 \lemma
 320 For every vertex~$v$ of a~soft queue, the size $\ell(v)$ of its item list
 321 satisfies:
 322 $$\ell(v) \le \max(1, 2^{\lceil \<rank>(v)/2 \rceil - r/2}).$$
 323
 324 \proof
 325 Initially, all item lists contain at most one item, so the inequality trivially
 326 holds. Let us continue by induction. Melds can affect the inequality only in the favorable
 327 direction (they occasionally move an~item list to a~vertex of a~higher rank)
 328 and so do deletes (they only remove items from lists). The only potentially
 329 dangerous place is the \<Refill> procedure.
 330
 331 Refilling sometimes just moves items upwards, which is safe, and sometimes it
 332 joins two lists into one, which generally is not. When $\<rank>(v) \le r$,
 333 no joining takes place and~$\ell(v)$ is still~1. Otherwise we join when either
 334 $\<rank>(v)$ is odd or $\<rank>(w) < \<rank>(v)-1$ for any son~$w$ of~$v$ (remember
 335 that both sons have the same rank). In both cases, $\lceil\<rank>(w)/2\rceil \le
 336 \lceil\<rank>(v)/2\rceil - 1$. By the induction hypothesis, the size of each
 337 of the two lists being joined is at most $2^{\max(1,\lceil\<rank>(v)/2\rceil - 1 - r/2)}$,
 338 so the new list has at most $2^{\lceil\<rank>(v)/2\rceil - r/2}$ items. (The maximum
 339 has disappeared since $\<rank>(v)>r$ and therefore the desired bound is at least~2.)
 340 \qed
 341
 342 We will now sum the sizes of the lists over all vertices containing corrupted items.
 343
 344 \lemma\id{shcorrlemma}%
 345 At any given time, the heap contains at most~$n/2^{r-2}$ corrupted items.
 346
 347 \proof
 348 We first prove an~auxiliary claim: The master trees of all queues contain at most~$n$
 349 black vertices. This follows by induction: If no deletions have taken place,
 350 there are exactly~$n$ black vertices, because insertion adds one black vertex and
 351 melding preserves their number. A~deletion affects the master trees only when
 352 dismantling takes place and then it only removes a~black vertex.
 353
 354 An~obvious upper bound on the number of corrupted items is the total size of item
 355 lists in all vertices of rank greater than~$r$. We already know from the previous
 356 lemma that the list sizes are limited by a~function of the ranks. A~complete tree
 357 is obviously the worst case, so we will prove that this lemma holds for the master
 358 tree of every queue in the heap. The actual trees can be much sparser, but the
 359 above claim guarantees that the total size of the master trees is bounded by the
 360 number of insertions properly.
 361
 362 So let us consider a~complete tree of~rank~$k$. It has exactly $2^{k-i}$ vertices
 363 of rank~$i$ and each such vertex contains a~list of at most~$2^{\lceil i/2\rceil - r/2}$
 364 items by the previous lemma. Summing over all ranks greater than~$r$, we get that
 365 the total number of corrupted items in this tree is at most:
 366 $$
 367 \sum_{i=r+1}^k 2^{k-i}\cdot 2^{\lceil i/2\rceil - r/2}
 368 = 2^{k-r/2} \cdot \sum_{i=r+1}^k 2^{\lceil i/2\rceil - i}
 369 \le 2^{k-r/2+1/2} \cdot \sum_{i=r+1}^k 2^{-i/2}
 370 \le 2^{k-r} \cdot \sum_{i=0}^\infty 2^{-i/2}.
 371 $$
 372 The sum of a~geometric series with quotient $2^{-1/2}$ is less than four, so the
 373 last expression is less than $2^{k-r+2}$. Since the tree contains $n_k=2^k$ black vertices,
 374 this makes less than $n_k/2^{r-2}$ corrupted items as we asserted.
 375 \qed
 376
 377 \paran{Analysis of time complexity}%
 378 Now we will examine the amortized time complexity of the individual operations.
 379 We will show that if we charge $\O(r)$ time against every element inserted, it is enough
 380 to cover the cost of all other operations.
 381
 382 All heap operations use only pointer operations, so it will be easy to derive the time
 383 bound in the Pointer Machine model. The notable exception is however that the procedures
 384 often refer to the ranks, which are integers on the order of $\log n$, so they cannot
 385 fit in a~single memory cell. For the time being, we will assume that the ranks can
 386 be manipulated in constant time, postponing the proof for later.
 387
 388 We take a~look at the melds first.
 389
 390 \lemma\id{shmeld}%
 391 The amortized cost of a~meld is $\O(1)$, except for melds induced by dismantling
 392 which take $\O(\<rank>(q))$, where $q$~is the queue to be dismantled.
 393
 394 \proof
 395 The real cost of a~meld of heaps $P$ and~$Q$ is linear in the smaller of
 396 their ranks, plus the time spent on carry propagation. The latter is easy to
 397 dispose of: Every time there is a~carry, the total number of trees in all
 398 heaps decreases by one. So it suffices to charge $\O(1)$ against creation of
 399 a~tree. An~insert creates one tree, dismantling creates at most $\<rank>(q)$
 400 trees, and all other operations alter only the internal structure of trees.
 401
 402 As for the $\O(\min(\<rank>(P),\<rank>(Q)))$ part, let us assume for a~while that
 403 no dismantling ever takes place and consider the \df{meld forest.} It is a~forest
 404 whose leaves correspond to the $n$~single-element heaps constructed by \<Insert>
 405 and each internal vertex represents a~heap arisen from melding its sons. The left
 406 son will be the one with the greater or equal rank. We therefore want to bound
 407 the sum of ranks of all right sons.
 408
 409 For every right son, we will distribute the charge for its rank~$k$ among all leaves
 410 in its subtree. There are at least $2^k$ such leaves. No leaf ever receives the same
 411 rank twice, because the ranks of right sons on every path from the root of the
 412 tree to a~leaf are strictly decreasing. (This holds because melding of two heaps
 413 always produces a~heap of a~rank strictly greater than the smaller of the input ranks.) Hence at most~$n/2^k$
 414 right sons have rank~$k$ and the total time charged against the leaves is bounded by:
 415 $$
 416 \sum_{k=0}^{\rm max. rank}k\cdot {n\over 2^k} \le n\cdot\sum_{k=0}^\infty {k\over 2^k} = 2n.
 417 $$
 418
 419 Let us return dismantling to the game. When a~queue is dismantled, melding the parts
 420 back to the heap takes $\O(\<rank>(q))$ time. We can therefore let the dismantling pay for it
 421 and omit such induced melds from the meld forest. As the rank of a~heap is never increased
 422 by induced melds, the above calculation is still a~proper upper bound on the cost
 423 of the regular melds.
 424 \qed
 425
 426 Before we estimate the time spent on deletions, we analyse the refills.
 427
 428 \lemma
 429 Every invocation of \<Refill> takes time $\O(1)$ amortized.
 430
 431 \proof
 432 When \<Refill> is called from the \<DeleteMin> operation, it recurses on a~subtree of the
 433 queue. This subtree can be split to the ``lossless'' lower part (rank~$r$ and below)
 434 and the upper part where list concatenation and thus also corruption takes place. Whenever
 435 we visit the lower part during the recursion, we spend at worst $\O(r)$ time there.
 436 We will prove that the total time spent in the upper parts during the whole life of the
 437 data structure is $\O(n)$. Since each upper vertex can perform at most two calls to the lower
 438 part, the total time spent in the lower parts is $\O(rn)$. All this can be prepaid by the
 439 inserts.
 440
 441 Let us focus on the upper part. There are three possibilities of what can happen
 442 when we visit a~vertex:
 443
 444 \itemize\ibull
 445
 446 \:We delete it: Every vertex deleted has to have been created at some time in the past.
 447 New vertices are created only during inserts and melds (when joining two trees) and
 448 we have already shown that these operations have constant amortized complexity. Then the
 449 same must hold for deletions.
 450
 451 \:We recurse twice and concatenate the lists: The lists are disassembled only when
 452 they reach the root of the tree, otherwise they are only concatenated. We can easily
 453 model the situation by a~binary tree forest similar to the meld forest. There are~$n$
 454 leaves and every internal vertex has outdegree two, so the total number of concatenations
 455 is at most~$n$. Each of them can be performed in constant time as the list is doubly linked.
 456
 457 \:We recurse only once: This occurs only if the rank is even and the gap between the
 458 rank of this vertex and its sons is equal to~1. It therefore cannot happen twice in a~row,
 459 thus it is clearly dominated by the cost of the other possibilities.
 460
 461 \endlist
 462 \>The total cost of all steps in the upper part is therefore $\O(n)$.
 463 \qed
 464
 465 We now proceed with examining the \<DeleteMin> operation.
 466
 467 \lemma\id{shdelmin}%
 468 Every \<DeleteMin> takes $\O(1)$ time amortized.
 469
 470 \proof
 471 Aside from refilling, which is $\O(1)$ by the previous lemma, the \<DeleteMin>
 472 takes care of the Rank invariant. This happens by checking the length of the leftmost
 473 path and dismantling the tree if the length is too far from the tree's rank~$k$.
 474 When the invariant is satisfied, the leftmost path is visited by the subsequent
 475 call to \<Refill>, so we can account the check on the refilling.
 476
 477 When we are dismantling, we have to pay $\O(k)$ for the operation itself and
 478 another $\O(k)$ for melding the trees back to the heap. Since we have produced at most
 479 $k/2$ subtrees of distinct ranks, some subtree of rank $k/2$ or more must be missing.
 480 Its master tree contained at least $2^{k/2}$ vertices which are now permanently gone
 481 from the data structure, so we can charge the cost against them.
 482 A~single vertex can participate in the master trees of several dismantlings, but their
 483 ranks are always strictly increasing. By the same argument as in the proof of
 484 Lemma \ref{shmeld} (complexity of \<Meld>), each vertex pays $\O(1)$.
 485
 486 We must not forget that \<DeleteMin> also has to recalculate the suffix minima.
 487 In the worst case, it requires touching $k$~trees. Because of the Rank invariant,
 488 this is linear in the size of the
 489 leftmost path and therefore it can be also paid for by \<Refill>. (Incidentally,
 490 this was the only place where we needed the invariant.)
 491 \qed
 492
 493 \<Explode>s are easy not only to implement, but also to analyse:
 494
 495 \lemma
 496 Every \<Explode> takes $\O(1)$ time amortized.
 497
 498 \proof
 499 As all queues, vertices and items examined by \<Explode> are forever gone from the heap,
 500 we can charge the constant time spent on each of them against the operations
 501 that have created them.
 502 \qed
 503
 504 %%HACK%%
 505 \>It remains to take care of the calculation with ranks:
 506
 507 \lemma\id{shyards}%
 508 Every manipulation with ranks performed by the soft heap operations can be
 509 implemented on the Pointer Machine in constant amortized time.
 510
 511 \proof
 512 We will recycle the idea of ``yardsticks'' from Section \ref{bucketsort}.
 513 We create a~yardstick --- a~doubly linked list whose elements represent the possible
 514 values of a~rank. Every vertex of a~queue will store its rank as a~pointer to
 515 the corresponding ``tick'' of the yardstick. We will extend the list whenever necessary.
 516
 517 Comparison of two ranks for equality is then trivial, as is incrementing or decrementing
 518 the rank by~1. Testing whether a~rank is odd can be handled by storing an~odd/even
 519 flag in every tick. This covers all uses of ranks except for the comparisons for inequality
 520 when melding. In step~1 of \<Meld>, we just mark the ticks of the two ranks and walk
 521 the yardstick from the beginning until we come across a~mark. Thus we compare the ranks
 522 in time proportional to the smaller of them, which is the real cost of the meld anyway.
 523 The comparisons in steps 5 and~6 are trickier, but since the ranks of the elements put
 524 to~$P$ are strictly increasing, we can start walking the list at the rank of the previous
 525 element in~$P$. The cost is then the difference between the current and the previous rank
 526 and the sum of these differences telescopes, again to the real cost of the meld.
 527 \qed
 528
 529 We can put the bits together now and laurel our effort with the following theorem:
 530
 531 \thmn{Performance of soft heaps, Chazelle \cite{chazelle:softheap}}\id{softheap}%
 532 A~soft heap with error rate~$\varepsilon$ ($0<\varepsilon\le 1/2$) processes
 533 a~sequence of operations starting with an~empty heap and containing $n$~\<Insert>s
 534 in time $\O(n\log(1/\varepsilon))$ on the Pointer Machine. At every moment, the
 535 heap contains at most $\varepsilon n$ corrupted items.
 536
 537 \proof
 538 We set the parameter~$r$ to~$2+2\lceil\log (1/\varepsilon)\rceil$. The rest follows
 539 from the analysis above. By Lemma \ref{shcorrlemma}, there are always at most $n/2^{r-2}
 540 \le \varepsilon n$ corrupted items in the heap. By Lemma \ref{shmeld}--\ref{shyards},
 541 the time spent on all operations in the sequence can be paid for by charging $\O(r)$ time
 542 against each \<Insert>. This yields the time bound.
 543 \qed
 544
 545 \rem
 546 When we set $\varepsilon = 1/2n$, the soft heap is not allowed to corrupt any
 547 items, so it can be used like any traditional heap. By the standard lower bound on
 548 sorting it therefore requires $\Omega(\log n)$ time per operation, so the
 549 time complexity is optimal for this choice of~$\varepsilon$. Chazelle \cite{chazelle:softheap}
 550 proves that it is optimal for every choice of~$\varepsilon$.
 551
 552 The space consumed by the heap need not be linear in the \em{current} number
 553 of items, but if a~case where this matters ever occurred, we could fix it easily
 554 by rebuilding the whole data structure completely after $n/2$ deletes. This
 555 increases the number of potentially corrupted items, but at worst twice, so it
 556 suffices to decrease~$\varepsilon$ twice.
 557
 558 %--------------------------------------------------------------------------------
 559
 560 \section{Robust contractions}
 561
 562 Having the soft heaps at hand, we would like to use them in a~conventional MST
 563 algorithm in place of a~normal heap. The most efficient specimen of a~heap-based
 564 algorithm we have seen so far is the Iterated Jarn\'\i{}k's algorithm (\ref{itjar}).
 565 It is based on a~simple, yet powerful idea: Run the Jarn\'\i{}k's algorithm with
 566 limited heap size, so that it stops when the neighborhood of the tree becomes too
 567 large. Grow multiple such trees, always starting in a~vertex not visited yet. All
 568 these trees are contained in the MST, so by the Contraction lemma
 569 (\ref{contlemma}) we can contract each of them to a~single vertex and iterate
 570 the algorithm on the resulting graph.
 571
 572 We can try implanting the soft heap in this algorithm, preferably in the earlier
 573 version without active edges (\ref{jarnik}) as the soft heap lacks the \<Decrease>
 574 operation. This brave, but somewhat simple-minded attempt is however doomed to
 575 fail. The reason is of course the corruption of items inside the heap, which
 576 leads to increase of weights of some subset of edges. In presence of corrupted
 577 edges, most of the theory we have so carefully built breaks down. For example,
 578 the Blue lemma (\ref{bluelemma}) now holds only when we consider a~cut with no
 579 corrupted edges, with a~possible exception of the lightest edge of the cut.
 580 Similarly, the Red lemma (\ref{redlemma}) is true only if the heaviest edge on
 581 the cycle is not corrupted.
 582
 583 There is fortunately some light in this darkness. While the basic structural
 584 properties of MST's no longer hold, there is a~weaker form of the Contraction
 585 lemma that takes the corrupted edges into account. Before we prove this lemma,
 586 we expand our awareness of subgraphs which can be contracted.
 587
 588 \defn
 589 A~subgraph $C\subseteq G$ is \df{contractible} iff for every pair of edges $e,f\in\delta(C)$\foot{That is,
 590 of~$G$'s edges with exactly one endpoint in~$C$.} there exists a~path in~$C$ connecting the endpoints
 591 of the edges $e,f$ such that all edges on this path are lighter than either $e$ or~$f$.
 592
 593 \example\id{jarniscont}%
 594 Let us see that when we stop the Jarn\'\i{}k's algorithm at some moment and
 595 we take a~subgraph~$C$ induced by the constructed tree, this subgraph is contractible.
 596 Indeed, when we consider any two distinct edges $uv, xy$ having exactly one endpoint in~$C$
 597 (w.l.o.g.~$u,x\in C$ and $v,y\not\in C$),
 598 they enter the algorithm's heap at some time. Without loss of generality $uv$~enters it earlier.
 599 Before the algorithm reaches the vertex~$x$, it adds the whole path $ux$ to the tree.
 600 As the edge~$uv$ never leaves the heap, all edges on the path $ux$ must be lighter
 601 than this edge.
 602
 603 We can now easily reformulate the Contraction lemma (\ref{contlemma}) in the language
 604 of contractible subgraphs. We again assume that we are working with multigraphs
 605 and that they need not be connected.
 606 Furthermore, we slightly abuse the notation in the way that we omit the explicit bijection
 607 between $G-C$ and~$G/C$, so we can write $G=C \cup (G/C)$.
 608
 609 \lemman{Generalized contraction}
 610 When~$C\subseteq G$ is a~contractible subgraph, then $\msf(G)=\msf(C) \cup \msf(G/C)$.
 611
 612 \proof
 613 As both sides of the equality are forests spanning the same graph, it suffices
 614 to show that $\msf(G) \subseteq \msf(C)\cup\msf(G/C)$.
 615 Let us show that edges of~$G$ that do not belong to the right-hand side
 616 do not belong to the left-hand side either.
 617 We know that the edges that
 618 do not participate in the MSF of some graph are exactly those which are the heaviest
 619 on some cycle (this is the Cycle rule from Lemma \ref{redlemma}).
 620
 621 Whenever an~edge~$g$ lies in~$C$, but not in~$\msf(C)$, then $g$~is the heaviest edge
 622 on some cycle in~$C$. As this cycle is also contained in~$G$, the edge $g$~does not participate
 623 in $\msf(G)$ either.
 624
 625 Similarly for $g\in (G/C)\setminus\msf(G/C)$: when the cycle does not contain
 626 the vertex~$c$ to which we have contracted the subgraph~$C$, this cycle is present
 627 in~$G$, too. Otherwise we consider the edges $e,f$ incident with~$c$ on this cycle.
 628 Since $C$~is contractible, there must exist a~path~$P$ in~$C$ connecting the endpoints
 629 of~$e$ and~$f$ in~$G$, such that all edges of~$P$ are lighter than either $e$ or~$f$
 630 and hence also than~$g$. Expanding~$c$ in the cycle to the path~$P$ then produces
 631 a~cycle in~$G$ whose heaviest edge is~$g$.
 632 \qed
 633
 634 We are now ready to bring corruption back to the game and state a~``robust'' version
 635 of this lemma. A~notation for corrupted graphs will be handy:
 636
 637 \nota\id{corrnota}%
 638 When~$G$ is a~weighted graph and~$R$ a~subset of its edges, we will use $G\crpt
 639 R$ to denote an arbitrary graph obtained from~$G$ by increasing the weights of
 640 some of the edges in~$R$. As usually, we will assume that all edges of this graph
 641 have pairwise distinct weights. While this is technically not true for the corruption
 642 caused by soft heaps, we can easily make it so.
 643
 644 Whenever~$C$ is a~subgraph of~$G$, we will use $R^C$ to refer to the edges of~$R$ with
 645 exactly one endpoint in~$C$ (i.e., $R^C = R\cap \delta(C)$).
 646
 647 \lemman{Robust contraction, Chazelle \cite{chazelle:almostacker}}\id{robcont}%
 648 Let $G$ be a~weighted graph and $C$~its subgraph contractible in~$G\crpt R$
 649 for some set~$R$ of edges. Then $\msf(G) \subseteq \msf(C) \cup \msf((G/C) \setminus R^C) \cup R^C$.
 650
 651 \proof
 652 We will modify the proof of the previous lemma. We will again consider all possible types
 653 of edges that do not belong to the right-hand side and we will show that they are the
 654 heaviest edges of certain cycles. Every edge~$g$ of~$G$ lies either in~$C$, or in $H=(G/C)\setminus R^C$,
 655 or possibly in~$R^C$.
 656
 657 If $g\in C\setminus\msf(C)$, then the same argument as before applies.
 658
 659 If $g\in H\setminus\msf(H)$, we consider the cycle in~$H$ on which $g$~is the heaviest.
 660 When $c$ (the vertex to which we have contracted~$C$) is outside this cycle, we are done.
 661 Otherwise we observe that the edges $e,f$ adjacent to~$c$ on this cycle cannot be corrupted
 662 (they would be in~$R^C$ otherwise, which is impossible). By contractibility of~$C$ there exists
 663 a~path~$P$ in~$C\crpt (R\cap C)$ such that all edges of~$P$ are lighter than $e$~or~$f$ and hence
 664 also than~$g$. The weights of the edges of~$P$ in the original graph~$G$ cannot be higher than
 665 in~$G\crpt R$, so the path~$P$ is lighter than~$g$ even in~$G$ and we can again perform the
 666 trick with expanding the vertex~$c$ to~$P$ in the cycle~$C$. Hence $g\not\in\mst(G)$.
 667 \qed
 668
 669 \para
 670 We still intend to mimic the Iterated Jarn\'\i{}k's algorithm. We will partition the given graph to a~collection~$\C$
 671 of non-overlapping contractible subgraphs called \df{clusters} and we put aside all edges that got corrupted in the process.
 672 We recursively compute the MSF of those subgraphs and of the contracted graph. Then we take the
 673 union of these MSF's and add the corrupted edges. According to the previous lemma, this does not produce
 674 the MSF of~$G$, but a~sparser graph containing it, on which we can continue.
 675
 676 %%HACK%%
 677 \>We can formulate the exact partitioning algorithm and its properties as follows:
 678
 679 \algn{Partition a~graph to a~collection of contractible clusters}\id{partition}%
 680 \algo
 681 \algin A~graph~$G$ with an~edge comparison oracle, a~parameter~$t$ controlling the size of the clusters,
 682   and an~accuracy parameter~$\varepsilon$.
 683 \:Mark all vertices as ``live''.
 684 \:$\C\=\emptyset$, $R^\C\=\emptyset$. \cmt{Start with an~empty collection and no corrupted edges.}
 685 \:While there is a~live vertex~$v_0$:
 686 \::$T=\{v_0\}$. \cmt{the tree that we currently grow}
 687 \::$K=\emptyset$. \cmt{edges known to be corrupted in the current iteration}
 688 \::\<Create> a~soft heap with accuracy~$\varepsilon$ and \<Insert> the edges adjacent to~$v_0$ into it.
 689 \::While the heap is not empty and $\vert T\vert \le t$:
 690 \:::\<DeleteMin> an~edge $uv$ from the heap, assume $u\in T$.
 691 \:::If $uv$ was corrupted, add it to~$K$.
 692 \:::If $v\in T$, drop the edge and repeat the previous two steps.
 693 \:::$T\=T\cup\{v\}$.
 694 \:::If $v$ is dead, break out of the current loop.
 695 \:::Insert all edges incident with~$v$ to the heap.
 696 \::$\C\=\C \cup \{G[T]\}$. \cmt{Record the cluster induced by the tree.}
 697 \::\<Explode> the heap and add all remaining corrupted edges to~$K$.
 698 \::$R^\C\=R^\C \cup K^T$. \cmt{Record the ``interesting'' corrupted edges.}
 699 \::$G\=G\setminus K^T$. \cmt{Remove the corrupted edges from~$G$.}
 700 \::Mark all vertices of~$T$ as ``dead''.
 701 \algout The collection $\C$ of contractible clusters and the set~$R^\C$ of
 702 corrupted edges in the neighborhood of these clusters.
 703 \endalgo
 704
 705 \thmn{Partitioning to contractible clusters, Chazelle \cite{chazelle:almostacker}}\id{partthm}%
 706 Given a~weighted graph~$G$ and parameters $\varepsilon$ ($0<\varepsilon\le 1/2$)
 707 and~$t$, the Partition algorithm (\ref{partition}) constructs a~collection
 708 $\C=\{C_1,\ldots,C_k\}$ of clusters and a~set~$R^\C$ of edges such that:
 709
 710 \numlist\ndotted
 711 \:All the clusters and the set~$R^\C$ are mutually edge-disjoint.
 712 \:Each cluster contains at most~$t$ vertices.
 713 \:Each vertex of~$G$ is contained in at least one cluster.
 714 \:The connected components of the union of all clusters have at least~$t$ vertices each,
 715   except perhaps for those which are equal to a~connected component of $G\setminus R^\C$.
 716 \:$\vert R^\C\vert \le 2\varepsilon m$.
 717 \:$\msf(G) \subseteq \bigcup_i \msf(C_i) \cup \msf\bigl((G / \bigcup_i C_i) \setminus R^\C\bigr) \cup R^\C$.
 718 \:The algorithm runs in time $\O(n+m\log (1/\varepsilon))$.
 719 \endlist
 720
 721 \proof
 722 Claim~1: The Partition algorithm grows a~series of trees which induce the clusters~$C_i$ in~$G$.
 723 A~tree is built from edges adjacent to live vertices
 724 and once it is finished, all vertices of the tree die, so no edge is ever reused in another
 725 tree. The edges that enter~$R^\C$ are immediately deleted from the graph, so they never participate
 726 in any tree.
 727
 728 Claim~2: The algorithm stops when all vertices are dead, so each vertex must have
 729 entered some tree.
 730
 731 Claim~3: The trees have at most~$t$ vertices each, which limits the size of the
 732 $C_i$'s as well.
 733
 734 Claim~4: We can show that each connected component has $t$~or more vertices as we already did
 735 in the proof of Lemma \ref{ijsize}: How can a~new tree stop growing? Either it acquires
 736 $t$~vertices, or it joins one of the existing trees (this only increases the
 737 size of the component), or the heap becomes empty (which means that the tree spans
 738 a~full component of the current graph and hence also of the final~$G\setminus R^\C$).
 739
 740 Claim~5: The set~$R^\C$ contains a~subset of edges corrupted by the soft heaps over
 741 the course of the algorithm. As every edge is inserted to a~heap at most once per
 742 its endpoint, the heaps can corrupt at worst $2\varepsilon m$ edges altogether.
 743
 744 We will prove the remaining two claims as separate lemmata.
 745 \qed
 746
 747 \lemman{Correctness of Partition, Claim 6 of Theorem \ref{partthm}}
 748 $$\msf(G) \subseteq \bigcup_i \msf(C_i) \cup \msf\biggl( \bigl(G / \bigcup_i C_i \bigr) \setminus R^\C\biggr) \cup R^\C.$$
 749
 750 \proof
 751 By iterating the Robust contraction lemma (\ref{robcont}). Let $K_i$ be the set of edges
 752 corrupted when constructing the cluster~$C_i$ in the $i$-th phase of the algorithm,
 753 and similarly for the state of the other variables at that time.
 754 We will use induction on~$i$ to prove that the lemma holds at the end of every phase.
 755
 756 At the beginning, the statement is obviously true, even as an~equality.
 757 In the $i$-th phase we construct the cluster~$C_i$ by running the partial Jarn\'\i{}k's algorithm on the graph
 758 $G_i = G\setminus\bigcup_{j<i} K_{\smash j}^{C_j}$.
 759 If it were not for corruption, the cluster~$C_i$ would be contractible, as we already know from Example \ref{jarniscont}.
 760 When the edges in~$K_i$ get corrupted, the cluster is contractible in some corrupted graph
 761 $G_i \crpt K_i$. (We have to be careful as the edges are becoming corrupted gradually,
 762 but it is easy to check that it can only improve the situation.) Since $C_i$~shares no edges
 763 with~$C_j$ for any~$j<i$, we know that~$C_i$ also a~contractible subgraph of $(G_i / \bigcup_{j<i} C_j) \crpt K_i$.
 764 Now we can use the Robust contraction lemma to show that:
 765 $$
 766 \msf\biggl(\bigl(G / \bigcup_{j<i} C_j \bigr) \setminus \bigcup_{j<i} K_j^{C_j} \biggr) \subseteq
 767 \msf(C_i) \cup \msf\biggl(\bigl(G / \bigcup_{j\le i} C_j \bigr) \setminus \bigcup_{j\le i} K_j^{C_j} \biggr) \cup K_i^{C_i}.
 768 $$
 769 This completes the induction step, because $K_j^{C_j} = K_j^{\C_j}$ for all~$j$.
 770 \qed
 771
 772 \lemman{Efficiency of Partition, Claim 7 of Theorem \ref{partthm}}
 773 The Partition algorithm runs in time $\O(n+m\log(1/\varepsilon))$.
 774
 775 \proof
 776 The inner loop (steps 7--13) processes the edges of the current cluster~$C_i$ and also
 777 the edges in its neighborhood $\delta(C_i)$. Steps 6 and~13 insert every such edge to the heap
 778 at most once, so steps 8--12 visit each edge also at most once. For every edge, we spend
 779 $\O(\log(1/\varepsilon))$ time amortized on inserting it and $\O(1)$ on the other operations
 780 (by Theorem \ref{softheap} on performance of the soft heaps).
 781
 782 Each edge of~$G$ can participate in some $C_i\cup \delta(C_i)$ only twice before both its
 783 endpoints die. The inner loop therefore processes $\O(m)$ edges total, so it takes $\O(m\log(1/\varepsilon))$
 784 time. The remaining parts of the algorithm spend $\O(m)$ time on operations with clusters and corrupted edges
 785 and additionally $\O(n)$ on identifying the live vertices.
 786 \qed
 787
 788 %--------------------------------------------------------------------------------
 789
 790 \section{Decision trees}\id{dtsect}%
 791
 792 The Pettie's and Ramachandran's algorithm combines the idea of robust partitioning with optimal decision
 793 trees constructed by brute force for very small subgraphs. In this section, we will
 794 explain the basics of the decision trees and prove several lemmata which will
 795 turn out to be useful for the analysis of time complexity of the final algorithm.
 796
 797 Let us consider all computations of some comparison-based MST algorithm when we
 798 run it on a~fixed graph~$G$ with all possible permutations of edge weights.
 799 The computations can be described by a~binary tree. The root of the tree corresponds to the first
 800 comparison performed by the algorithm and depending to its result, the computation
 801 continues either in the left subtree or in the right subtree. There it encounters
 802 another comparison and so on, until it arrives to a~leaf of the tree where the
 803 spanning tree found by the algorithm is recorded.
 804
 805 Formally, the decision trees are defined as follows:
 806
 807 \defnn{Decision trees and their complexity}\id{decdef}%
 808 A~\df{MSF decision tree} for a~graph~$G$ is a~binary tree. Its internal vertices
 809 are labeled with pairs of $G$'s edges to be compared, each of the two outgoing tree edges
 810 corresponds to one possible result of the comparison.\foot{There are two possible
 811 outcomes since there is no reason to compare an~edge with itself and we, as usually,
 812 expect that the edge weights are distinct.}
 813 Leaves of the tree are labeled with spanning trees of the graph~$G$.
 814
 815 A~\df{computation} of the decision tree on a~specific permutation of edge weights
 816 in~$G$ is the path from the root to a~leaf such that the outcome of every comparison
 817 agrees with the edge weights. The result of the computation is the spanning tree
 818 assigned to its final leaf.
 819 A~decision tree is \df{correct} iff for every permutation the corresponding
 820 computation results in the real MSF of~$G$ with the particular weights.
 821
 822 The \df{time complexity} of a~decision tree is defined as its depth. It therefore
 823 bounds the number of comparisons spent on every path. (It need not be equal since
 824 some paths need not correspond to an~actual computation --- the sequence of outcomes
 825 on the path could be unsatisfiable.)
 826
 827 A~decision tree is called \df{optimal} if it is correct and its depth is minimum possible
 828 among the correct decision trees for the given graph.
 829 We will denote an~arbitrary optimal decision tree for~$G$ by~${\cal D}(G)$ and its
 830 complexity by~$D(G)$.
 831
 832 The \df{decision tree complexity} $D(m,n)$ of the MSF problem is the maximum of~$D(G)$
 833 over all graphs~$G$ with $n$~vertices and~$m$ edges.
 834
 835 \obs
 836 Decision trees are the most general deterministic comparison-based computation model possible.
 837 The only operations that count in its time complexity are comparisons. All
 838 other computation is free, including solving NP-complete problems or having
 839 access to an~unlimited source of non-uniform constants. The decision tree
 840 complexity is therefore an~obvious lower bound on the time complexity of the
 841 problem in all other comparison-based models.
 842
 843 The downside is that we do not know any explicit construction of the optimal
 844 decision trees, or at least a~non-constructive proof of their complexity.
 845 On the other hand, the complexity of any existing comparison-based algorithm
 846 can be used as an~upper bound on the decision tree complexity. For example:
 847
 848 \lemma
 849 $D(m,n) \le 4/3 \cdot n^2$.
 850
 851 \proof
 852 Let us count the comparisons performed by the Contractive Bor\o{u}vka's algorithm
 853 (\ref{contbor}), tightening up the constants in its previous analysis in Theorem
 854 \ref{contborthm}. In the first iteration, each edge participates in two comparisons
 855 (one per endpoint), so the algorithm performs at most $2m \le 2{n\choose 2} \le n^2$
 856 comparisons. Then the number of vertices drops at least by a~factor of two, so
 857 the subsequent iterations spend at most $(n/2)^2, (n/4)^2, \ldots$ comparisons, which sums
 858 to less than $n^2\cdot\sum_{i=0}^\infty (1/4)^i = 4/3 \cdot n^2$. Between the Bor\o{u}vka steps,
 859 we flatten the multigraph to a~simple graph, which also needs some comparisons,
 860 but for every such comparison we remove one of the participating edges, which saves
 861 at least one comparison in the subsequent steps.
 862 \qed
 863
 864 \para
 865 Of course we can get sharper bounds from the better algorithms, but we will first
 866 show how to find the optimal trees using brute force. The complexity of the search
 867 will be of course enormous, but as we already promised, we will need the optimal
 868 trees only for very small subgraphs.
 869
 870 \lemman{Construction of optimal decision trees}\id{odtconst}%
 871 An~optimal MST decision tree for a~graph~$G$ on~$n$ vertices can be constructed on
 872 the Pointer Machine in time $\O(2^{2^{4n^2}})$.
 873
 874 \proof
 875 We will try all possible decision trees of depth at most $2n^2$
 876 (we know from the previous lemma that the desired optimal tree is shallower). We can obtain
 877 any such tree by taking the complete binary tree of exactly this depth
 878 and labeling its $2\cdot 2^{2n^2}-1$ vertices with comparisons and spanning trees. Those labeled
 879 with comparisons become internal vertices of the decision tree, the others
 880 become leaves and the parts of the tree below them are removed. There are less
 881 than $n^4$ possible comparisons and less than $2^{n^2}$ spanning trees of~$G$,
 882 so the number of candidate decision trees is bounded by
 883 $(n^4+2^{n^2})^{2^{2n^2+1}} \le 2^{(n^2+1)\cdot 2^{2n^2+1}} \le 2^{2^{2n^2+2}} \le 2^{2^{3n^2}}$.
 884
 885 We enumerate the trees in an~arbitrary order, test each tree for correctness and
 886 find the shallowest tree among those correct. Testing can be accomplished by running
 887 through all possible permutations of edges, each time calculating the MSF using any
 888 of the known algorithms and comparing it with the result given by the decision tree.
 889 The number of permutations does not exceed $(n^2)! \le (n^2)^{n^2} \le n^{2n^2} \le 2^{n^3}$
 890 for sufficiently large~$n$ and each one can be checked in time $\O(\poly(n))$.
 891
 892 On the Pointer Machine, trees and permutations can be certainly enumerated in time
 893 $\O(\poly(n))$ per object. The time complexity of the whole algorithm is therefore
 894 $\O(2^{2^{3n^2}} \cdot 2^{n^3} \cdot \poly(n)) = \O(2^{2^{4n^2}})$.
 895 \qed
 896
 897 \paran{Basic properties of decision trees}%
 898 The following properties will be useful for analysis of algorithms based
 899 on precomputed decision trees. We will omit some technical details, referring
 900 the reader to section 5.1 of the Pettie's article \cite{pettie:optimal}.
 901
 902 \lemma\id{dtbasic}%
 903 The decision tree complexity $D(m,n)$ of the MSF satisfies:
 904 \numlist\ndotted
 905 \:$D(m,n) \ge m/2$ for $m,n > 2$.
 906 \:$D(m',n') \ge D(m,n)$ whenever $m'\ge m$ and $n'\ge n$.
 907 \endlist
 908
 909 \proof
 910 For every $m,n>2$ there is a~graph on $n$~vertices and $m$~edges such that
 911 every edge lies on a~cycle. Every correct MSF decision tree for this graph
 912 has to compare each edge at least once. Otherwise the decision tree cannot
 913 distinguish between the case when an~edge has the lowest of all weights (and
 914 thus it is forced to belong to the MSF) and when it has the highest weight (so
 915 it is forced out of the MSF).
 916
 917 Decision trees for graphs on $n'$~vertices can be used for graphs with $n$~vertices
 918 as well --- it suffices to add isolated vertices, which does not change the MSF.
 919 Similarly, we can increase $m$ to~$m'$ by adding edges parallel to an~existing
 920 edge and making them heavier than the rest of the graph, so that they can never
 921 belong to the MSF.
 922 \qed
 923
 924 \defn
 925 Subgraphs $C_1,\ldots,C_k$ of a~graph~$G$ are called the \df{compartments} of~$G$
 926 iff they are edge-disjoint, their union is the whole graph~$G$ and
 927 $\msf(G) = \bigcup_i \msf(C_i)$ for every permutation of edge weights.
 928
 929 \lemma\id{partiscomp}%
 930 The clusters $C_1,\ldots,C_k$ generated by the Partition procedure of the
 931 previous section (Algorithm \ref{partition}) are compartments of the graph
 932 $H=\bigcup_i C_i$.
 933
 934 \proof
 935 The first and second condition of the definition of compartments follow
 936 from the Partitioning theorem (\ref{partthm}), so it remains to show that $\msf(H)$
 937 is the union of the MSF's of the individual compartments. By the Cycle rule
 938 (Lemma \ref{redlemma}), an~edge $h\in H$ is not contained in $\msf(H)$ if and only if
 939 it is the heaviest edge on some cycle. It is therefore sufficient to prove that
 940 every cycle in~$H$ is contained within a~single~$C_i$.
 941
 942 Let us consider a~cycle $K\subseteq H$ and a~cluster~$C_i$ such that it contains
 943 an~edge~$e$ of~$K$ and all clusters constructed later by the procedure do not contain
 944 any. If $K$~is not fully contained in~$C_i$, we can extend the edge~$e$ to a~maximal
 945 path contained in both~$K$ and~$C_i$. Since $C_i$ shares at most one vertex with the
 946 earlier clusters, there can be at most one edge from~$K$ adjacent to the maximal path,
 947 which is impossible.
 948 \qed
 949
 950 \lemma
 951 Let $C_1,\ldots,C_k$ be compartments of a~graph~$G$. Then there exists an~optimal
 952 MSF decision tree for~$G$ that does not compare edges of distinct compartments.
 953
 954 \proofsketch
 955 Consider a~subset~$\cal P$ of edge weight permutations~$w$ that satisfy $w(e) < w(f)$
 956 whenever $e\in C_i, f\in C_j, i<j$. For such permutations, no decision tree can
 957 gain any information on relations between edge weights in a~single compartment by
 958 inter-compartment comparisons --- the results of all such comparisons are determined
 959 in advance.
 960
 961 Let us take an~arbitrary correct decision tree for~$G$ and restrict it to
 962 vertices reachable by computations on~$\cal P$. Whenever a~vertex contained
 963 an~inter-compartment comparison, it has lost one of its sons, so we can remove it
 964 by contracting its only outgoing edge. We observe that we get a~decision tree
 965 satisfying the desired condition and that this tree is correct.
 966
 967 As for the correctness, the MSF of a~single~$C_i$ is uniquely determined by
 968 comparisons of its weights and the set~$\cal P$ contains all combinations
 969 of orderings of weights inside individual compartments. Therefore every
 970 spanning tree of every~$C_i$ and thus also of~$H$ is properly recognized.
 971 \qed
 972
 973 \lemma\id{compartsum}%
 974 Let $C_1,\ldots,C_k$ be compartments of a~graph~$G$. Then $D(G) = \sum_i D(C_i)$.
 975
 976 \proofsketch
 977 A~collection of decision trees for the individual compartments can be ``glued together''
 978 to a~decision tree for~$G$. We take the decision tree for~$C_1$, replace every its leaf
 979 by a~copy of the tree for~$C_2$ and so on. Every leaf~$\ell$ of the compound tree will be
 980 labeled with the union of labels of the original leaves encountered on the path from
 981 the root to~$\ell$. This proves that $D(G) \le \sum_i D(C_i)$.
 982
 983 The other inequality requires more effort. We use the previous lemma to transform
 984 the optimal decision tree for~$G$ to another of the same depth, but without inter-compartment
 985 comparisons. Then we prove by induction on~$k$ and then on the depth of the tree
 986 that this tree can be re-arranged, so that every computation first compares edges
 987 from~$C_1$, then from~$C_2$ and so on. This means that the tree can be decomposed
 988 to decision trees for the $C_i$'s. Also, without loss of efficiency all trees for
 989 a~single~$C_i$ can be made isomorphic to~${\cal D}(C_i)$.
 990 \qed
 991
 992 \cor\id{dtpart}%
 993 If $C_1,\ldots,C_k$ are the clusters generated by the Partition procedure (Algorithm \ref{partition}),
 994 then $D(\bigcup_i C_i) = \sum_i D(C_i)$.
 995
 996 \proof
 997 Lemma \ref{partiscomp} tells us that $C_1,\ldots,C_k$ are compartments of the graph
 998 $\bigcup C_i$, so we can apply Lemma \ref{compartsum} on them.
 999 \qed
1000
1001 \cor\id{dttwice}%
1002 $2D(m,n) \le D(2m,2n)$ for every $m,n$.
1003
1004 \proof
1005 For an~arbitrary graph~$G$ with $m$~edges and $n$~vertices, we create a~graph~$G_2$
1006 consisting of two copies of~$G$ sharing a~single vertex. The copies of~$G$ are obviously
1007 compartments of~$G_2$, so by Lemma \ref{compartsum} it holds that $D(G_2) = 2D(G)$.
1008 Taking a~maximum over all choices of~$G$ yields $D(2m,2n) \ge \max_G D(G_2) = 2D(m,n)$.
1009 \qed
1010
1011 %--------------------------------------------------------------------------------
1012
1013 \section{An optimal algorithm}\id{optalgsect}%
1014
1015 Once we have developed the soft heaps, partitioning and MST decision trees,
1016 it is now simple to state the Pettie's and Ramachandran's MST algorithm
1017 and prove that it is asymptotically optimal among all MST algorithms in
1018 comparison-based models. Several standard MST algorithms from the previous
1019 chapters will also play their roles.
1020
1021 We will describe the algorithm as a~recursive procedure. When the procedure is
1022 called on a~graph~$G$, it sets the parameter~$t$ to roughly $\log^{(3)} n$ and
1023 it calls the \<Partition> procedure to split the graph into a~collection of
1024 clusters of size~$t$ and a~set of corrupted edges. Then it uses precomputed decision
1025 trees to find the MSF of the clusters. The graph obtained by contracting
1026 the clusters is on the other hand dense enough, so that the Iterated Jarn\'\i{}k's
1027 algorithm runs on it in linear time. Afterwards we combine the MSF's of the clusters
1028 and of the contracted graphs, we mix in the corrupted edges and run two iterations
1029 of the Contractive Bor\o{u}vka's algorithm. This guarantees reduction in the number of
1030 both vertices and edges by a~constant factor, so we can efficiently recurse on the
1031 resulting graph.
1032
1033 \algn{Optimal MST algorithm, Pettie and Ramachandran \cite{pettie:optimal}}\id{optimal}%
1034 \algo
1035 \algin A~connected graph~$G$ with an~edge comparison oracle.
1036 \:If $G$ has no edges, return an~empty tree.
1037 \:$t\=\lfloor\log^{(3)} n\rfloor$. \cmt{the size of clusters}
1038 \:Call \<Partition> (\ref{partition}) on $G$ and $t$ with $\varepsilon=1/8$. It returns
1039   a~collection~$\C=\{C_1,\ldots,C_k\}$ of clusters and a~set~$R^\C$ of corrupted edges.
1040 \:$F_i \= \mst(C_i)$ for all~$i$, obtained using optimal decision trees.
1041 \:$G_A \= (G / \bigcup_i C_i) \setminus R^\C$. \cmt{the contracted graph}
1042 \:$F_A \= \msf(G_A)$ calculated by the Iterated Jarn\'\i{}k's algorithm (\ref{itjar}).
1043 \:$G_B \= \bigcup_i F_i \cup F_A \cup R^\C$. \cmt{combine subtrees with corrupted edges}
1044 \:Run two Bor\o{u}vka steps (iterations of the Contractive Bor\o{u}vka's algorithm, \ref{contbor}) on~$G_B$,
1045   getting a~contracted graph~$G_C$ and a~set~$F_B$ of MST edges.
1046 \:$F_C \= \mst(G_C)$ obtained by a~recursive call to this algorithm.
1047 \:Return $F_B \cup F_C$.
1048 \algout The minimum spanning tree of~$G$.
1049 \endalgo
1050
1051 Correctness of this algorithm immediately follows from the Partitioning theorem (\ref{partthm})
1052 and from the proofs of the respective algorithms used as subroutines. Let us take a~look at
1053 the time complexity. We will be careful to use only the operations offered by the Pointer Machine.
1054
1055 \lemma\id{optlemma}%
1056 The time complexity $T(m,n)$ of the Optimal algorithm satisfies the following recurrence:
1057 $$
1058 T(m,n) \le \sum_i c_1 D(C_i) + T(m/2, n/4) + c_2 m,
1059 $$
1060 where~$c_1$ and~$c_2$ are some positive constants and $D$~is the decision tree complexity
1061 from the previous section.
1062
1063 \proof
1064 The first two steps of the algorithm are trivial as we have linear time at our
1065 disposal.
1066
1067 By the Partitioning theorem (\ref{partthm}), the call to \<Partition> with~$\varepsilon$
1068 set to a~constant takes $\O(m)$ time and it produces a~collection of clusters of size
1069 at most~$t$ and at most $m/4$ corrupted edges. It also guarantees that the
1070 connected components of the union of the $C_i$'s have at least~$t$ vertices
1071 (unless there is just a~single component).
1072
1073 To apply the decision trees, we will use the framework of topological computations developed
1074 in Section \ref{bucketsort}. We pad all clusters in~$\C$ with isolated vertices, so that they
1075 have exactly~$t$ vertices. We use a~computation that labels the graph with a~pointer to
1076 its optimal decision tree. Then we apply Theorem \ref{topothm} combined with the
1077 brute-force construction of optimal decision trees from Lemma \ref{odtconst}. Together they guarantee
1078 that we can assign the decision trees to the clusters in time:
1079 $$\O\Bigl(\Vert\C\Vert + t^{t(t+2)} \cdot \bigl(2^{2^{4t^2}} + t^2\bigr)\Bigr)
1080 = \O\Bigl(m + 2^{2^{2^t}}\Bigr)
1081 = \O(m).$$
1082 Execution of the decision tree on each cluster~$C_i$ then takes $\O(D(C_i))$ steps.
1083
1084 The contracted graph~$G_A$ has at most $n/t = \O(n / \log^{(3)}n)$ vertices and asymptotically
1085 the same number of edges as~$G$, so according to Corollary \ref{ijdens}, the Iterated Jarn\'\i{}k's
1086 algorithm runs on it in linear time.
1087
1088 The combined graph~$G_B$ has~$n$ vertices, but less than~$n$ edges from the
1089 individual spanning trees and at most~$m/4$ additional edges which were
1090 corrupted. The Bor\o{u}vka steps on~$G_B$ take $\O(m)$
1091 time by Lemma \ref{boruvkaiter} and they produce a~graph~$G_C$ with at most~$n/4$
1092 vertices and at most $n/4 + m/4 \le m/2$ edges. (The $n$~tree edges in~$G_B$ are guaranteed
1093 to be reduced by the Bor\o{u}vka's algorithm.) It is easy to verify that this
1094 graph is still connected, so we can recurse on it.
1095
1096 The remaining steps of the algorithm can be easily performed in linear time either directly
1097 or in case of the contractions by the bucket-sorting techniques of Section \ref{bucketsort}.
1098 \qed
1099
1100 \paran{Optimality}%
1101 The properties of decision tree complexity, which we have proven in the previous
1102 section, will help us show that the time complexity recurrence is satisfied by a~constant
1103 multiple of the decision tree complexity $D(m,n)$ itself. This way, we will prove
1104 the following theorem:
1105
1106 \thmn{Optimality of the Optimal algorithm}
1107 The time complexity of the Optimal MST algorithm \ref{optimal} is $\Theta(D(m,n))$.
1108
1109 \proof
1110 We will prove by induction that $T(m,n) \le cD(m,n)$ for some $c>0$. The base
1111 case is trivial, for the induction step we will expand on the previous lemma:
1112 \def\eqalign#1{\null\,\vcenter{\openup\jot
1113   \ialign{\strut\hfil$\displaystyle{##}$&$\displaystyle{{}##}$\hfil
1114       \crcr#1\crcr}}\,}
1115 $$\vcenter{\openup\jot\halign{\strut\hfil $\displaystyle{#}$&$\displaystyle{{}#}$\hfil&\quad#\hfil\cr
1116 T(m,n)
1117     &\le \sum_i c_1 D(C_i) + T(m/2, n/4) + c_2 m &(Lemma \ref{optlemma})\cr
1118     &\le c_1 D({\textstyle\bigcup}_i C_i) + T(m/2, n/4) + c_2 m &(Corollary \ref{dtpart})\cr
1119     &\le c_1 D(m,n) + T(m/2, n/4) + c_2m &(definition of $D(m,n)$)\cr
1120     &\le c_1 D(m,n) + cD(m/2, n/4) + c_2m &(induction hypothesis)\cr
1121     &\le c_1 D(m,n) + c/2\cdot D(m,n/2) + c_2m &(Corollary \ref{dttwice})\cr
1122     &\le c_1 D(m,n) + c/2\cdot D(m,n) + 2c_2 D(m,n) &(Lemma \ref{dtbasic})\cr
1123     &\le (c_1 + c/2 + 2c_2) \cdot D(m,n)&\cr
1124     &\le cD(m,n). &(by setting $c=2c_1+4c_2$)\cr
1125 }}$$
1126 The other inequality is obvious as $D(m,n)$ is an~asymptotic lower bound on
1127 the time complexity of every comparison-based algorithm.
1128 \qed
1129
1130 \paran{Complexity of MST}%
1131 As we have already noted, the exact decision tree complexity $D(m,n)$ of the MST problem
1132 is still open and so therefore is the time complexity of the optimal algorithm. However,
1133 every time we come up with another comparison-based algorithm, we can use its complexity
1134 (or more specifically the number of comparisons it performs, which can be even lower)
1135 as an~upper bound on the optimal algorithm.
1136
1137 The best explicit comparison-based algorithm known to date achieves complexity $\O(m\timesalpha(m,n))$.\foot{$\alpha(m,n)$
1138 is a~certain inverse of the Ackermann's function, $\lambda_k(n)$ is the row inverse of the same
1139 function. See \ref{ackerinv} for the exact definitions.}
1140 It has been discovered by Chazelle \cite{chazelle:ackermann} as an~improvement of his previous
1141 $\O(m\timesalpha(m,n)\cdot\log\alpha(m,n))$ algorithm \cite{chazelle:almostacker}.
1142 It is also based on the ideas of this chapter --- in particular the soft heaps and robust contractions.
1143 The algorithm is very complex and it involves a~lot of elaborate
1144 technical details, so we only refer to the original paper here. Another algorithm of the same
1145 complexity, using similar ideas, has been discovered independently by Pettie \cite{pettie:ackermann}, who,
1146 having the optimal algorithm at hand, does not take care about the low-level details and he only
1147 bounds the number of comparisons. Using any of these results, we can prove an~Ackermannian
1148 upper bound on the optimal algorithm:
1149
1150 \thmn{Upper bound on complexity of the Optimal algorithm}\id{optthm}%
1151 The time complexity of the Optimal MST algorithm is $\O(m\timesalpha(m,n))$.
1152
1153 \proof
1154 We bound $D(m,n)$ by the number of comparisons performed by the algorithm of Chazelle \cite{chazelle:ackermann}
1155 or Pettie \cite{pettie:ackermann}.
1156 \qed
1157
1158 Similarly to the Iterated Jarn\'\i{}k's algorithm, this bound is actually linear for classes of graphs
1159 that do not have density extremely close to constant:
1160
1161 \cor\id{lambdacor}%
1162 The Optimal MST algorithm runs in linear time whenever $m\ge n\cdot \lambda_k(n)$ for any fixed~$k$.
1163
1164 \proof
1165 Combine the previous theorem with Lemma \ref{alphaconst}.
1166 \qed
1167
1168 \rem
1169 It is also known from \cite{pettie:optimal} that when we run the Optimal algorithm on a~random
1170 graph drawn from either $G_{n,p}$ (random graphs on~$n$ vertices, each edge is included with probability~$p$
1171 independently on the other edges) or $G_{n,m}$ (we draw the graph uniformly at random from the
1172 set of all graphs with~$n$ vertices and $m$~edges), it runs in linear time with high probability,
1173 regardless of the edge weights.
1174
1175 \paran{Models of computation}%
1176 Another important consequence of the optimal algorithm is that when we aim for a~linear-time
1177 MST algorithm (or for proving that it does not exist), we do not need to care about computational
1178 models at all. The elaborate RAM data structures of Chapter \ref{ramchap}, which have helped us
1179 so much in the case of integer weights, cannot help if we are allowed to access the edge weights
1180 by performing comparisons only. We can even make use of non-uniform objects given by some sort
1181 of oracle. Indeed, whatever trick we employ to achieve linear time complexity, we can mimic it in the
1182 world of decision trees and thus we can use it to show that the algorithm we already knew is
1183 also linear.
1184
1185 This however applies to deterministic algorithms only --- we have shown that access to a~source
1186 of random bits allows us to compute the MST in expected linear time (the KKT algorithm, \ref{kkt}).
1187 There were attempts to derandomize the KKT algorithm, but so far the best result in this direction
1188 is the randomized algorithm also by Pettie \cite{pettie:minirand} which achieves expected linear time
1189 complexity with only $\O(\log^* n)$ random bits.
1190
1191 \endpart