Continuing with the intro to dynamic algorithms.

[saga.git] / rank.tex
diff --git a/rank.tex b/rank.tex

index 8df70caa27a13d477abe8f707f5703db723a4819..a2b5ef6e7f4943f4471af44e1d62fe933df52507 100644 (file)
--- a/rank.tex
+++ b/rank.tex
@@ -3,6 +3,7 @@
  \fi
  
  \chapter{Ranking Combinatorial Structures}
+\id{rankchap}
  
  \section{Ranking and unranking}
  
@@ -79,9 +80,10 @@ RAM data structures, yields a~linear-time algorithm for lexicographic
  \nota\id{brackets}%
  We will view permutations on a~finite set $A\subseteq {\bb N}$ as ordered $\vert A\vert$-tuples
  (in other words, arrays) containing every element of~$A$ exactly once. We will
-use square brackets to index these tuples: $\pi=(\pi[1],\ldots,\pi[\vert A\vert])$.
-The corresponding lexicographic ranking and unranking functions will be denoted by~$L(\pi,A)$
-and $L^{-1}(i,A)$ respectively.
+use square brackets to index these tuples: $\pi=(\pi[1],\ldots,\pi[\vert A\vert])$,
+and sub-tuples: $\pi[i\ldots j] = (\pi[i],\ldots,\pi[j])$.
+The lexicographic ranking and unranking functions for the permutations on~$A$
+will be denoted by~$L(\pi,A)$ and $L^{-1}(i,A)$ respectively.
  
  \obs\id{permrec}%
  Let us first observe that permutations have a simple recursive structure.
@@ -89,10 +91,10 @@ If we fix the first element $\pi[1]$ of a~permutation~$\pi$ on the set~$[n]$, th
  elements $\pi[2], \ldots, \pi[n]$ form a~permutation on $[n]-\{\pi[1]\} = \{1,\ldots,\pi[1]-1,\pi[1]+1,\ldots,n\}$.
  The lexicographic order of two permutations $\pi$ and~$\pi'$ on the original set is then determined
  by $\pi[1]$ and $\pi'[1]$ and only if these elements are equal, it is decided
-by the lexicographic comparison of permutations $(\pi[2],\ldots,\pi[n])$ and
-$(\pi'[2],\ldots,\pi'[n])$. Moreover, for fixed~$\pi[1]$ all permutations on
-the smaller set occur exactly once, so the rank of $\pi$ is $(\pi[1]-1)\cdot
-(n-1)!$ plus the rank of $(\pi[2],\ldots,\pi[n])$.
+by the lexicographic comparison of permutations $\pi[2\ldots n]$ and $\pi'[2\ldots n]$.
+Moreover, for fixed~$\pi[1]$ all permutations on the smaller set occur exactly
+once, so the rank of $\pi$ is $(\pi[1]-1)\cdot (n-1)!$ plus the rank of
+$\pi[2\ldots n]$.
  
  This gives us a~reduction from (un)ranking of permutations on $[n]$ to (un)ranking
  of permutations on a $(n-1)$-element set, which suggests a straightforward
@@ -121,7 +123,7 @@ translates to the following recursive algorithms for both ranking and unranking
  \>We can call $\<Rank>(\pi,1,n,[n])$ for ranking on~$[n]$, i.e., to calculate
  $L(\pi,[n])$.
  
-\alg $\<Unrank>(j,i,n,A)$: Return an~array~$\pi$ such that $\pi[i,\ldots,n]$ is the $j$-th permutation on~$A$.
+\alg $\<Unrank>(j,i,n,A)$: Return an~array~$\pi$ such that $\pi[i\ldots n]$ is the $j$-th permutation on~$A$.
  \id{unrankalg}
  \algo
  \:If $i>n$, return $(0,\ldots,0)$.
@@ -245,7 +247,7 @@ and $\log n^{\underline k} \ge (n/2)(\log n - 1) \ge (k/2)(\log n - 1)$.
  It remains to show how to translate the operations on~$A$ to operations on~$H$,
  again stored as a~sorted vector~${\bf h}$. Insertion to~$A$ correspond to
  deletion from~$H$ and vice versa. The rank of any~$x\in[n]$ in~$A$ is $x$ minus
-the number of holes which are smaller than~$x$, therefore $R_A(x)=x-R_H(x)$.
+the number of holes that are smaller than~$x$, therefore $R_A(x)=x-R_H(x)$.
  To calculate $R_H(x)$, we can again use the vector operation \<Rank> from Algorithm \ref{vecops},
  this time on the vector~$\bf h$.
  
@@ -290,9 +292,9 @@ constant time. The time bound follows. \qed
  
  %--------------------------------------------------------------------------------
  
-\section{Hatcheck lady and other derangements}
+\section{Restricted permutations}
  
-Another interesting class of combinatorial objects which can be counted and
+Another interesting class of combinatorial objects that can be counted and
  ranked are restricted permutations. An~archetypal member of this class are
  permutations without a~fixed point, i.e., permutations~$\pi$ such that $\pi(i)\ne i$
  for all~$i$. These are also called \df{derangements} or \df{hatcheck permutations.}\foot{%
@@ -496,7 +498,7 @@ with its $i$-th row and $j$-th column removed.
  \obs
  Let us consider a~permutation $\pi\in{\cal P}_A$ and $n=\vert A\vert$.
  When we fix the value of the element $\pi[1]$, the remaining elements form
-a~permutation $\pi'=(\pi[2],\ldots,\pi[n])$ on the set~$A'=A\setminus\{\pi[1]\}$.
+a~permutation $\pi'=\pi[2\ldots n]$ on the set~$A'=A\setminus\{\pi[1]\}$.
  The permutation~$\pi$ satisfies the restriction matrix~$M$ if and only if
  $M[1,a]=1$ for $a=R_A(\pi[1])$ and $\pi'$ satisfies a~restriction matrix~$M'=M^{1,a}$.
  This translates to the following counterparts of algorithms \ref{rankalg}
@@ -561,7 +563,28 @@ and $\O(n^2\cdot t(n))$ by the computations of the~$N_0$'s.
  \qed
  
  \rem
-This time bound is obviously very coarse, its main purpose was to demonstrate that
+In cases where the efficient evaluation of the permanent is out of our reach,
+we can consider using the fully-polynomial randomized approximation scheme
+for the permanent described by Jerrum, Sinclair and Vigoda in \cite{jerrum:permanent}.
+We then get an~approximation scheme for the ranks.
+
+\rem
+There are also deterministic algorithms for computing the number of perfect matchings
+in various special graph families (which imply polynomial-time ranking algorithms for
+the corresponding families of permutations). If the graph is planar, we can
+use the Kasteleyn's algorithm \cite{kasteleyn:crystals} based on Pfaffian
+orientations which runs in time $\O(n^3)$.
+It has been recently extended to arbitrary surfaces by Yuster and Zwick
+\cite{yuster:matching} and sped up to $\O(n^{2.19})$. The counting problem
+for arbitrary minor-closed classes (cf.~section \ref{minorclosed}) is still
+open.
+
+%--------------------------------------------------------------------------------
+
+\section{Hatcheck lady and other derangements}
+
+The time bound for ranking of general restricted permutations shown in the previous
+section is obviously very coarse. Its main purpose was to demonstrate that
  many special cases of the ranking problem can be indeed computed in polynomial time.
  For most families of restriction matrices, we can do much better. One of the possible improvements
  is to replace the matrix~$M$ by the corresponding restriction graph and instead of
@@ -571,11 +594,11 @@ matrices using information already computed for the larger matrices.
  
  These speedups are hard to state formally in general (they depend on the
  structure of the matrices), so we will concentrate on a~specific example
-instead. We will show that for derangements one can achieve linear time complexity.
+instead. We will show that for the derangements one can achieve linear time complexity.
  
-\examplen{Ranking of hatcheck permutations a.k.a.~derangements}\id{hatrank}%
+\nota\id{hatrank}%
  As we already know, the hatcheck permutations correspond to restriction
-matrices which contain zeroes only on the main diagonal and graphs which are
+matrices that contain zeroes only on the main diagonal and graphs that are
  complete bipartite with the matching $\{(i,i) : i\in[n]\}$ deleted. For
  a~given order~$n$, we will call this matrix~$D_n$ and the graph~$G_n$ and
  we will show that the submatrices of~$D_n$ share several nice properties:
@@ -598,6 +621,11 @@ same number of vertices and also the same number of missing edges. As the
  number of matchings is an~isomorphism invariant, the lemma follows.
  \qed
  
+\rem
+There is a~clear combinatorial intuition behind this lemma: if we are
+to count permutations with restrictions placed on~$z$ elements and these
+restrictions are independent, it does not matter how exactly they look like.
+
  \defn
  Let $n_0(z,d)$ be the permanent shared by all submatrices as described
  by the above lemma, which have $d\times d$ entries and exactly~$z$ zeroes.
@@ -644,11 +672,20 @@ We will count the permutations $\pi\in {\cal P}_d$ satisfying~$M_{z-1}$ in two w
  First, there are $n_0(z-1,d)$ such permutations. On the other hand, we can divide
  the them to two types depending on whether $\pi[1]=1$. Those having $\pi[1]\ne 1$
  are exactly the $n_0(z,d)$ permutations satisfying~$M_z$. The others correspond to
-permutations $(\pi[2],\ldots,\pi[d])$ on $\{2,\ldots,d\}$ which satisfy~$M_z^{1,1}$,
+permutations $(\pi[2],\ldots,\pi[d])$ on $\{2,\ldots,d\}$ that satisfy~$M_z^{1,1}$,
  so there are $n_0(z-1,d-1)$ of them.
  \qed
  
-\cor For every $0\le z<d$ we have $n_0(z,d) - n_0(z+1,d) \le n_0(z,d)/d$.
+\cor\id{nzeroprecalc}%
+For a~given~$n$, a~table of the values $n_0(z,d)$ for all $0\le z\le d\le n$
+can be precomputed in time~$\O(n^2)$.
+
+\proof
+Use either recurrence and induction on~$z+d$.
+\qed
+
+\cor\id{smalldiff}%
+For every $0\le z<d$ we have $n_0(z,d) - n_0(z+1,d) \le n_0(z,d)/d$.
  
  \proof
  According to the recurrence $(\maltese)$, the difference $n_0(z,d) - n_0(z+1,d)$ is
@@ -656,9 +693,14 @@ equal to $n_0(z,d-1)$. We can bound this by plugging the trivial inequality $n_0
  to~$(*)$, from which we obtain $n_0(z,d) \ge d\cdot n_0(z,d-1)$.
  \qed
  
-\para
-Instead of maintaining the matrix~$M$ over the course of the algorithm, it is sufficient to remember
-the number~$z$ of zeroes in this matrix and the set~$Z$ which contains the elements
+\para\id{rrankmod}%
+Let us show how to modify the ranking algorithm (\ref{rrankalg}) using the insight
+we have gained into the structure of derangements.
+
+The algorithm uses the matrix~$M$ only for computing~$N_0$ of its submatrices
+and we have shown that this value depends only on the order of the matrix and
+the number of zeroes in it. We will therefore replace maintenance of the matrix
+by remember the number~$z$ of its zeroes and the set~$Z$ that contains the elements
  $x\in A$ whose locations are restricted (there is a~zero anywhere in the $(R_A(x)+1)$-th
  column of~$M$). In other words, every $x\in Z$ can appear at all positions in the
  permutation except one (and these forbidden positions are different for different~$x$'s),
@@ -668,16 +710,36 @@ As we already observed (\ref{hatcheck}) that the number of derangements on~$[n]$
  we can again use word-sized vectors to represent the sets~$A$ and~$Z$ with insertion,
  deletion, ranking and unranking on them in constant time.
  
-When the algorithm selects a~submatrix $M'=M^{1,k}$ for an~element $x$ of~rank~$k-1$, it is described by either
-$z'=z-1$ and~$Z'=Z\setminus\{x\}$ (if $x\in Z$) or $z'=z$ and $Z'=Z$ (if $x\not\in Z$).
+When the algorithm selects a~submatrix $M'=M^{1,k}$ for an~element $x$ of~rank~$k-1$, this
+matrix it is described by either by the choice of $z'=z-1$ and~$Z'=Z\setminus\{x\}$ (if $x\in Z$)
+or $z'=z$ and $Z'=Z$ (if $x\not\in Z$).
  All computations of~$N_0$ in the algorithm can therefore be replaced by looking
-up the appropriate $n_0(z',\vert A\vert-1)$ in a~precomputed table. Moreover, we can
+up the appropriate $n_0(z',\vert A\vert-1)$ in the precomputed table. Moreover, we can
  calculate a~single~$C_a$ in constant time, because all summands are either $n_0(z,\vert A\vert-1)$
-or $n_0(z-1,\vert A\vert-1)$ depending on the set~$Z$. So we get:
+or $n_0(z-1,\vert A\vert-1)$ depending on the set~$Z$. We get:
  $$C_a = r\cdot n_0(z-1,\vert A\vert-1) + (a-r) \cdot n_0(z,\vert A\vert-1),$$
  where $r=R_Z(R^{-1}_A(a))$, that is the number of restricted elements among the $a$~smallest ones in~$A$.
  
-It remains to show how to precompute the table of the $n_0$'s efficiently.
+All operations at a~single level of the \<Rank> function now run in constant time,
+but \<Unrank> needs to search among the~$C_a$'s to find the first of them which
+exceeds the given rank. We could use binary search, but that would take $\Theta(\log n)$
+time. There is however a~clever trick: the value of~$C_a$ does not vary too much with
+the set~$Z$. Specifically, by Corollary~\ref{smalldiff} the difference between the values
+for $Z=\emptyset$ and $Z=A$ is at most $n_0(z-1,\vert A\vert -1)$. It is therefore
+sufficient to just divide the rank by $n_0(z-1,\vert A\vert-1)$ and we get either
+the correct value of~$a$ or one more. Both possibilities can be checked in constant time.
+
+We can therefore conclude this section by the following theorem:
+
+\thmn{Ranking of derangements}%
+For every~$n$, the derangements on the set~$[n]$ can be ranked and unranked according to the
+lexicographic order in time~$\O(n)$ after spending $\O(n^2)$ on initialization of auxiliary tables.
+
+\proof
+We modify the general algorithms for (un)ranking of restricted permutations (\ref{rrankalg} and \ref{runrankalg})
+as described above (\ref{rrankmod}). Each of the $n$~levels of recursion will then run in constant time. The values~$n_0$ will
+be looked up in a~table precalculated in quadratic time as shown in Corollary~\ref{nzeroprecalc}.
+\qed