Editorial for APIO '10 P2 - Patrol - DMOJ: Modern Online Judge

Editorial for APIO '10 P2 - Patrol

Remember to use this editorial only when stuck, and not to copy-paste code from it. Please be respectful to the problem author and editorialist.
Submitting an official solution before solving the problem yourself is a bannable offence.

The road network forms a tree $T$ ~T~. A tree with $N$ ~N~ nodes has $N-1$ ~N-1~ edges. In $T$ ~T~, the length of a tour that visits all edges is $2(N-1)$ ~2(N-1)~, because each edge is visited twice. Recall that adding edges into a tree creates cycles.

Simpler case

We consider a simpler case when $K = 1$ ~K = 1~. Suppose that we add edge $e$ ~e~ to $T$ ~T~. The resulting graph $T'$ ~T'~ contains exactly one cycle $C$ ~C~. The cheapest tour visiting all edges uses each edge in $C$ ~C~ once and all other edges twice. Denote $C-e$ ~C-e~ as path $P$ ~P~. The new length of the required tour is

$\displaystyle 2(N-1)-L+1$ $$\displaystyle 2(N-1)-L+1$$

where $L$ ~L~ is the length of $P$ ~P~. Thus, for $K = 1$ ~K = 1~, we need to find the maximum path length for paths in $T$ ~T~. This value is called the diameter of $T$ ~T~.

There are many ways to find the diameter. We shall use dynamic programming, which can be turned into a solution for the general case.

First, we root the tree at some node $r$ ~r~; the parent-child relation between adjacent nodes can be defined naturally. For each node $u$ ~u~, let $H[u]$ ~H[u]~ denote the length of the longest path from $u$ ~u~ to some of its descendants. We can compute $H[u]$ ~H[u]~ for each $u$ ~u~, in $\mathcal O(N)$ ~\mathcal O(N)~ time, using a simple dynamic programming.

Consider the longest path $P$ ~P~, let node $u$ ~u~ be the node on $P$ ~P~ closest to the root $r$ ~r~. By definition, $u$ ~u~ is unique. Given $u$ ~u~, the length of $P$ ~P~ must be either $H[u]$ ~H[u]~, if $u$ ~u~ has one child, or

$\displaystyle \max_{v,w : \text{different children of }u} (2+H[v]+H[w])$

when $u$ ~u~ has more than one child. The value above is important to the case where $K = 2$ ~K = 2~ as well, so let's define it as $L[u]$ ~L[u]~. Formally, $L[u]$ ~L[u]~ is the maximum length of paths containing $u$ ~u~ such that $u$ ~u~ is the closest node to root $r$ ~r~.

Thus by enumerating all nodes, one can find the length of the longest path; thus, one can compute the answer to the case where $K = 1$ ~K = 1~.

When $K = 2$ ~K = 2~

Let's call both edges $e_1$ ~e_1~ and $e_2$ ~e_2~. Let path $P_i$ ~P_i~ be a unique path that joins two endpoints of $e_i$ ~e_i~, also let's call a unique cycle induced by adding each edge $e_i$ ~e_i~ (separately) as $C_i$ ~C_i~. Note that $C_i$ ~C_i~ is a union of $P_i$ ~P_i~ and $e_i$ ~e_i~.

Figure 1: (a) Edges $e_1$ ~e_1~ and $e_2$ ~e_2~ are shown as dashed lines. Paths $P_1$ ~P_1~ and $P_2$ ~P_2~ intersect. The intersection is shown as a thick line. In the tour, these edges must still be traversed over twice. (b) The new edges $f_1$ ~f_1~ and $f_2$ ~f_2~ are shown as dashed lines. Note that the number of times each edge on the tree is traversed on is the same as before.

When $P_1$ ~P_1~ and $P_2$ ~P_2~ are disjoint, the length of the desired tour that traverses all edges is

$\displaystyle 2(N-1)-|L_1|-|L_2|+2$ $$\displaystyle 2(N-1)-|L_1|-|L_2|+2$$

where $L_i$ ~L_i~ is the length of $P_i$ ~P_i~.

It gets more complicated when $P_i$ ~P_i~'s intersect. However, since one must traverse on each $e_i$ ~e_i~ exactly once, it is not hard to prove the following claim.

Claim: If $P_1$ ~P_1~ and $P_2$ ~P_2~ intersect, there is another pair of edges $f_1$ ~f_1~ and $f_2$ ~f_2~ such that the paths joining each edge's endpoints are disjoint, and the length of the tour traverses all edges in $T+f_1+f_2$ ~T+f_1+f_2~ is the same as in $T+e_1+e_2$ ~T+e_1+e_2~.

The proof is left out, but Figure 1 illustrates the idea of the proof.

From the claim, to find how to add two edges to minimize the tour, we need to only consider finding a pair of disjoint paths whose sum of lengths is maximum. This, again, can be solved using dynamic programming in $\mathcal O(N)$ ~\mathcal O(N)~ time.

Besides $H[u]$ ~H[u]~, we need other variables. Let $T_u$ ~T_u~ be the subtree rooted at $u$ ~u~. We define:

$A[u]$ ~A[u]~ is the maximum length of paths inside $T_u$ ~T_u~.
$B[u]$ ~B[u]~ is the maximum sum of lengths of any pairs of edge-disjoint paths $P$ ~P~ and $Q$ ~Q~ in $T_u$ ~T_u~ such that one endpoint of $P$ ~P~ is $u$ ~u~.

Figure 2 shows examples of paths considered in $A[u]$ ~A[u]~ and $B[u]$ ~B[u]~.

Figure 2: (a) Paths considered in $A[u]$ ~A[u]~. (b) A pair of paths considered in $B[u]$ ~B[u]~.

Let $ch(u)$ ~ch(u)~ denote the number of children of $u$ ~u~ on the rooted tree $T$ ~T~. It takes $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~ time to compute $A[u]$ ~A[u]~ from information from its children by taking the maximum of $A[v]$ ~A[v]~ for all children $v$ ~v~ of $u$ ~u~ and $L[u]$ ~L[u]~.

To compute $B[u]$ ~B[u]~, a straightforward implementation takes $\mathcal O(ch(u)^2)$ ~\mathcal O(ch(u)^2)~ time. A careful implementation only takes $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~ time. (See discussion in the next section.)

With $A$ ~A~'s and $B$ ~B~'s of all child nodes of $u$ ~u~ at hand, one can find $D[u]$ ~D[u]~ the maximum sum of lengths of pairs of paths $P_1$ ~P_1~ and $P_2$ ~P_2~ such that

$P_1$ ~P_1~ and $P_2$ ~P_2~ are disjoint,
$P_1$ ~P_1~ contains $u$ ~u~, and
Among all nodes in $P_1$ ~P_1~ and $P_2$ ~P_2~, $u$ ~u~ is the closest to root $r$ ~r~ of $T$ ~T~.

Again, a careful implementation runs in $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~ time. Easier implementations that run in $\mathcal O(ch(u)^2)$ ~\mathcal O(ch(u)^2)~ time and $\mathcal O(ch(u)^3)$ ~\mathcal O(ch(u)^3)~ time exist. We discuss the implementations later.

After computing all $D[u]$ ~D[u]~'s, the minimum length of the desired tour is

$\displaystyle 2(N-1)-\max_u D[u]+2$ $$\displaystyle 2(N-1)-\max_u D[u]+2$$

Computing $B[u]$ ~B[u]~ and $D[u]$ ~D[u]~

We first discuss how to compute $B[u]$ ~B[u]~. Let $CH(u)$ ~CH(u)~ denote $u$ ~u~'s children. Recall that $B[u]$ ~B[u]~ is the maximum sum of the length of a pair of edge-disjoint paths $P$ ~P~ and $Q$ ~Q~ such that $u$ ~u~ is one end of $P$ ~P~.

There are many cases to consider for $P$ ~P~ and $Q$ ~Q~:

Case 1: Both $P$ ~P~ and $Q$ ~Q~ contain $u$ ~u~. In this case, we can compute $B[u]$ ~B[u]~ by finding $3$ ~3~ children with largest height.
Case 2a: $P$ ~P~ contains edge $(u, v)$ ~(u, v)~ for some child $v$ ~v~ in $CH(u)$ ~CH(u)~, and $Q$ ~Q~ also lies entirely in $T_v$ ~T_v~. In this case, we have that $B[u] = 1+B[v]$ ~B[u] = 1+B[v]~.
Case 2b: $P$ ~P~ contains edge $(u, v)$ ~(u, v)~ for some child $v$ ~v~ in $CH(u)$ ~CH(u)~, but $Q$ ~Q~ lies entirely in $T_w$ ~T_w~ for some child $w$ ~w~ not equal $v$ ~v~. In this case, $B[u] = 1+H[v]+A[w]$ ~B[u] = 1+H[v]+A[w]~.

Case 1 and Case 2a can be considered in $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~ time. By checking all pairs of children in $CH(u)$ ~CH(u)~, we can consider Case 2b in $\mathcal O(ch(u)^2)$ ~\mathcal O(ch(u)^2)~ time. The time can be reduced to linear by noticing that we can preprocess by finding a child $x$ ~x~ with maximum $A[x]$ ~A[x]~. With that, we can consider the value of $1+H[v]+A[x]$ ~1+H[v]+A[x]~ when $v$ ~v~ is not equal to $x$ ~x~, and $1+H[x]+\max_{w \ne x} A[w]$ ~1+H[x]+\max_{w \ne x} A[w]~ when $v = x$ ~v = x~. The total running time is $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~.

The same idea can be applied to computing $D[u]$ ~D[u]~. In this case, we want to find two edge-disjoint paths $P$ ~P~ and $Q$ ~Q~ in $T_u$ ~T_u~. There are 3 cases to consider:

Both $P$ ~P~ and $Q$ ~Q~ contain $u$ ~u~.
Neither $P$ ~P~ nor $Q$ ~Q~ contains $u$ ~u~.
One contains $u$ ~u~.

The first two cases are easy to implement to run in time $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~. The last one can be implemented to run in $\mathcal O(ch(u)^3)$ ~\mathcal O(ch(u)^3)~. The idea from the computation of $B[u]$ ~B[u]~ can be applied here to reduce the running time to $\mathcal O(ch(u)^2)$ ~\mathcal O(ch(u)^2)~ and $\mathcal O(ch(u))$ ~\mathcal O(ch(u))~.

Scoring

Since optimizing the computation of $B$ ~B~'s and $D$ ~D~'s are not the essential part of the task, solutions that use either $\mathcal O(ch(u)^3)$ ~\mathcal O(ch(u)^3)~ or $\mathcal O(ch(u)^2)$ ~\mathcal O(ch(u)^2)~ per node $u$ ~u~ should score the majority of the test cases.

Comments

There are no comments at the moment.