Editorial for Google Code Jam '22 Round 2 Problem D - I, O Bot

Remember to use this editorial only when stuck, and not to copy-paste code from it. Please be respectful to the problem author and editorialist.
Submitting an official solution before solving the problem yourself is a bannable offence.

There is no value in carrying balls across the origin without depositing them into the warehouse. Therefore, collecting the balls with positive coordinates $X_i$ and those with negative coordinates are two similar but independent tasks. Hence, in what follows, we assume that $X_i > 0$ for all $i$. Moreover, let us assume that the balls are sorted in ascending order by $X_i$.
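The reduction to two one-sided instances can be sketched as follows. This is only an illustration: the function name `split_sides` and the input format (a list of `(x, shape)` pairs with nonzero `x`) are assumptions, not part of the problem statement.

```python
def split_sides(balls):
    """Split (x, shape) pairs into two independent one-sided instances.

    Each side is returned as a list of (distance, shape) pairs sorted by
    ascending distance from the origin. The input format is an assumption.
    """
    right = sorted((x, s) for x, s in balls if x > 0)
    left = sorted((-x, s) for x, s in balls if x < 0)
    return left, right
```

The final answer is then the sum of the optimum times for the two sides, each solved separately.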

A solution to the problem consists of a number of passes, or round trips from the origin and back, with one or two balls collected in each pass. The time required to collect a single ball $i$ in a pass is $2X_i$. The time required to collect two balls $i$ and $j$ is $2 \times \max(X_i, X_j)$ if the balls are of different shapes and $2 \times \max(X_i, X_j)+C$ otherwise. We say that two balls $i$ and $j$ are matched (and write $(i, j)$) if they are collected in the same pass. Since the order of passes does not affect the overall time for collecting all balls, we can equivalently think of the problem as one of finding an optimal matching of balls.
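To make the matching view concrete, here is a minimal sketch of the pass cost together with an exponential brute force over all matchings. It is meant only for checking faster solutions on tiny inputs; the names `pass_cost` and `brute_force` and the 0-based list inputs are assumptions made for this illustration.

```python
from functools import lru_cache


def pass_cost(xs, shapes, c, i, j=None):
    """Time of one pass collecting ball i alone, or balls i and j together."""
    if j is None:
        return 2 * xs[i]
    extra = c if shapes[i] == shapes[j] else 0
    return 2 * max(xs[i], xs[j]) + extra


def brute_force(xs, shapes, c):
    """Minimum total time over all matchings (exponential in len(xs))."""
    n = len(xs)

    @lru_cache(maxsize=None)
    def best(mask):
        # mask holds the set of balls that still have to be collected.
        if mask == 0:
            return 0
        i = mask.bit_length() - 1          # highest-numbered remaining ball
        rest = mask & ~(1 << i)
        result = pass_cost(xs, shapes, c, i) + best(rest)
        for j in range(i):
            if (rest >> j) & 1:
                paired = pass_cost(xs, shapes, c, i, j)
                result = min(result, paired + best(rest & ~(1 << j)))
        return result

    return best((1 << n) - 1)
```

For example, with coordinates `[3, 6, 8, 10, 15]`, shapes `[0, 0, 0, 1, 1]` and `c = 10`, the best matching pairs the balls at 15 and 8, pairs the balls at 10 and 6, and collects the ball at 3 alone.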

The following observation will be useful throughout the analysis.

Observation 1: Suppose we want to collect the first $i$ balls ($i \ge 2$) and $S_i \ne S_{i-1}$. In an optimal matching, the $i$-th ball is matched with the $(i-1)$-th ball.

Proof: Consider any matching of balls in which the $i$-th ball is not matched with the $(i-1)$-th ball, and assume without loss of generality that the $i$-th ball is $0$-shaped.

If neither of the two balls is matched, we can match them with each other and save $2X_{i-1}$ seconds.
If there is a matching $(i-1,j)$, $j < i-1$, and the $i$-th ball is not matched, then we can match the $(i-1)$-th ball with the $i$-th ball instead and save at least $2 \times (X_{i-1}-X_j)$ seconds ($2 \times (X_{i-1}-X_j)+C$ if the $j$-th ball is $1$-shaped).
Similarly, if there is a matching $(i,j)$, $j < i-1$, and the $(i-1)$-th ball is not matched, we can match the $i$-th ball with the $(i-1)$-th ball instead and, again, save at least $2 \times (X_{i-1}-X_j)$ seconds.
Lastly, if there are matchings $(i, j)$ and $(i-1, k)$, $j < i-1$ and $k < i-1$, then we can rearrange the matchings as $(i, i-1)$ and $(j, k)$, saving at least $2 \times (X_{i-1}-\max(X_j, X_k))$ seconds.

Test Set 1

Observation 1 helps us match the balls if the last two balls have different shapes. But what if they have the same shape, say $0$?

Observation 2: Suppose we want to collect the first $i$ balls ($i \ge 2$) and $S_i = S_{i-1} = 0$. There is an optimal matching of balls such that one of the following conditions holds:

The last two $0$-shaped balls $i$ and $i-1$ are matched.
There is a matching $(i, j)$ with $S_j = 1$ and, for all $k \in [j+1, i]$, $S_k = 0$. In other words, the $i$-th ball is matched with the nearest $1$-shaped ball on its left.
There are no $1$-shaped balls and the $i$-th ball remains unmatched.

Proof: The full proof is a lengthy case analysis, which we omit here. The idea is that matching the $i$-th ball with the rightmost ball of a particular shape is generally at least as good as matching it with another ball of that shape. For example, suppose that the $i$-th ball is matched with a $1$-shaped ball $l$ such that there is another $1$-shaped ball $j$ with $l < j < i$. If ball $j$ is unmatched, we can match ball $i$ with $j$ instead and save $2 \times (X_j-X_l)$ seconds. Otherwise, if ball $j$ is matched with some other ball $k$, we can swap the roles of balls $l$ and $j$ and create the matchings $(i, j)$ and $(k, l)$, obtaining the same overall time (if $k > j$) or better.

This means that we can try matching the last $0$-shaped ball with the $0$-shaped ball before it or with the rightmost $1$-shaped ball (if any), and at least one of these moves will be optimal.

The image shows the last five of i balls. The last three are 0-shaped, and the remaining two are 1-shaped. The i-th ball is connected to (i-1)-th and (i-3)-th balls with lines.

The two observations lead to a dynamic programming solution. Let $dp[i][j]$ be the optimum time to collect the first $i$ $0$-shaped balls and the first $j$ $1$-shaped balls. The base case is $dp[0][0] = 0$. For $i+j > 0$, suppose again that the rightmost of these $i+j$ balls is $0$-shaped and has coordinate $x$. The case when the rightmost ball is $1$-shaped is symmetric. To handle the corner cases, we set $dp[1][0] = 2x$, $dp[i][0] = \min(dp[i-1][0], dp[i-2][0]+C)+2x$ for $i \ge 2$, and $dp[1][j] = dp[0][j-1]+2x$ for $j \ge 1$. For the general case with $i \ge 2$ and $j \ge 1$, if the penultimate ball is $1$-shaped, then $dp[i][j] = dp[i-1][j-1]+2x$ (Observation 1). Otherwise, we can choose to match the last $0$-shaped ball with the previous $0$-shaped ball or the rightmost $1$-shaped ball (Observation 2), namely, $dp[i][j] = \min(dp[i-2][j]+C,dp[i-1][j-1])+2x$.

The final answer is $dp[N_0][N_1]$, where $N_0$ and $N_1$ denote the total number of $0$-shaped and $1$-shaped balls, respectively. The time complexity of this algorithm is $\mathcal O(N^2)$.
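The recurrence above might be implemented along the following lines for one side of the origin. This is a sketch, not the official solution: the function name `solve_side_quadratic`, the input format (a list of `(distance, shape)` pairs), and the assumption that all distances are distinct and positive are choices made for this illustration. The symmetric case is handled by letting `t` denote the shape of the rightmost ball.

```python
def solve_side_quadratic(balls, c):
    """O(N^2) DP over (number of 0-shaped, number of 1-shaped) prefixes.

    balls: list of (distance, shape) pairs on one side of the origin,
    assumed to have distinct positive distances.
    """
    pos = (sorted(x for x, s in balls if s == 0),
           sorted(x for x, s in balls if s == 1))
    n0, n1 = len(pos[0]), len(pos[1])
    dp = [[0] * (n1 + 1) for _ in range(n0 + 1)]
    for i in range(n0 + 1):
        for j in range(n1 + 1):
            if i + j == 0:
                continue
            # t is the shape of the rightmost ball among the chosen balls.
            t = 0 if j == 0 or (i > 0 and pos[0][i - 1] > pos[1][j - 1]) else 1
            a, b = (i, j) if t == 0 else (j, i)   # counts: shape t, other shape
            x = pos[t][a - 1]

            def prev(da, db):
                # dp value with da fewer t-shaped and db fewer other-shaped balls.
                return dp[i - da][j - db] if t == 0 else dp[i - db][j - da]

            if b == 0:
                best = prev(1, 0)                  # last ball goes alone
                if a >= 2:
                    best = min(best, prev(2, 0) + c)
            elif a == 1 or pos[1 - t][b - 1] > pos[t][a - 2]:
                best = prev(1, 1)                  # Observation 1
            else:
                best = min(prev(2, 0) + c, prev(1, 1))   # Observation 2
            dp[i][j] = best + 2 * x
    return dp[n0][n1]
```

The table has $(N_0+1)(N_1+1)$ entries and each entry is filled in constant time, which matches the $\mathcal O(N^2)$ bound.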

Test Set 2

Using dynamic programming from a different angle, we can solve the problem in linear time, apart from the initial sorting. Let $dp[i]$ be the optimum time to collect the first $i$ balls. As the base cases, $dp[0] = 0$ and $dp[1] = 2X_1$. To calculate $dp[i]$ for $i \ge 2$, suppose once more that the $i$-th ball is $0$-shaped. If the $(i-1)$-th ball is $1$-shaped, we can match the last two balls and $dp[i] = dp[i-2]+2X_i$ (Observation 1). Otherwise, using Observation 2, we have the options to match the last two $0$-shaped balls and collect all balls in $dp[i-2]+C+2X_i$ seconds, or to match the $i$-th ball with the rightmost $1$-shaped ball $j$. The dynamic programming recurrence is not obvious in the latter case, though, as we do not know the optimum matching for the first $i-1$ balls except for ball $j$. What happens to the $0$-shaped balls in between $j$ and $i$? We are missing another key observation here.

Observation 3: If there is an optimal matching of the first $i$ balls such that the $0$-shaped ball $i$ is matched with the rightmost $1$-shaped ball $j$ and $i-1 \ne j$, then the $0$-shaped ball $i-1$ is not matched with another $0$-shaped ball.

Proof: Assume to the contrary that we have two pairs of matched balls $(i, j)$ and $(i-1, k)$, $k < i-1$, such that ball $k$ is $0$-shaped. These two matched pairs contribute $2X_i+2X_{i-1}+C$ seconds to the overall matching cost. But then we can rearrange the matchings as $(i, i-1)$ and $(j, k)$, costing us only $2X_i+C+2 \times \max(X_j,X_k)$ seconds, which is $2(X_{i-1}-\max(X_j,X_k))$ seconds less. This contradicts the optimality assumption of the given matching.

It follows from Observation 3 that the $0$-shaped ball $i-1$ must be matched with another $1$-shaped ball, specifically the rightmost unmatched $1$-shaped ball. And we can extend this argument and repeatedly match $0$-shaped balls with $1$-shaped balls sweeping leftward for as long as there is another $0$-shaped ball to the right of a matched $1$-shaped ball. This process is illustrated in the drawing below.

The image shows the last 12 of the first i balls with the following shapes: ??1111001000. ? stands for Undefined. The last 0-shaped ball is labeled i. The last 1-shaped ball is labeled j. The second ball with unspecified shape is labeled k. Lines between balls indicate matchings (i,i-3),(i-1,i-6),(i-2,i-7),(i-4,i-8), and (i-5,i-9).

Let $k$ be the rightmost unmatched ball after the above $0$-$1$ matching process. There are no unmatched balls in the set $\{k+1, k+2, \dots, i\}$, and since every pair there consists of a $0$-shaped ball and a $1$-shaped ball to its left, the cost of collecting those balls is twice the sum $X_\text{0-shaped}(k+1, i)$ of $x$-coordinates of $0$-shaped balls in $\{k+1, k+2, \dots, i\}$. Therefore, the cost of collecting all $i$ balls in this way is $dp[k]+2 \times X_\text{0-shaped}(k+1, i)$.

$X_\text{0-shaped}(k+1, i)$ can be calculated in $\mathcal O(1)$ time using prefix sums. But how do we get the index $k$ efficiently without actually carrying out the matching process? Note that $k$ is the largest index such that $k < i$ and the set $\{k+1, k+2, \dots, i\}$ contains an equal number of $0$-shaped and $1$-shaped balls. Consider the balance $b_i$ of $0$/$1$ balls at each index $i$, namely, $b_i = z_i-o_i$, where $z_i$ and $o_i$ are the numbers of $0$-shaped and $1$-shaped balls in the set $\{1, 2, \dots, i\}$. The set $\{k+1, k+2, \dots, i\}$ has an equal number of $0$-shaped and $1$-shaped balls if and only if $b_k = b_i$. The index $k$ can be looked up in $\mathcal O(1)$ time if we maintain a hash table that maps each balance to the index at which it was last registered. If the current balance $b_i$ is seen for the first time, it means that there are not enough $1$-shaped balls to match all $0$-shaped balls with, and we can choose $k = 0$.
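The balance lookup can be sketched in isolation as follows. The function name `previous_equal_balance` and the 1-based result list are assumptions made for this illustration; the balance $b_0 = 0$ is registered at index $0$ before the scan starts.

```python
def previous_equal_balance(shapes):
    """For each 1-based index i, return the largest k < i with b_k == b_i.

    shapes: list of 0/1 values in ascending order of coordinate.
    Returns a list `result` of length len(shapes) + 1, where result[i] is
    that k, or 0 if the balance b_i has not been seen before.
    """
    last_seen = {0: 0}      # balance -> last index where it was registered
    balance = 0
    result = [0] * (len(shapes) + 1)
    for i, s in enumerate(shapes, start=1):
        balance += 1 if s == 0 else -1
        result[i] = last_seen.get(balance, 0)
        last_seen[balance] = i
    return result
```

On the ten balls with known shapes from the drawing, `1111001000`, the last index maps to $k = 0$, that is, to the position just before the first of those ten balls, which agrees with the ball labeled $k$ in the picture.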

We are performing a constant number of operations at each index in this approach, so the overall time complexity is dominated by the sorting, thus $\mathcal O(N \log N)$.
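Putting the pieces together, a one-sided solver might look like the sketch below. It is an illustration under stated assumptions rather than the reference implementation: the name `solve_side_linear`, the `(distance, shape)` input format, and distinct positive distances are all assumed. The symmetric case of a $1$-shaped last ball is covered by keeping one prefix-sum array per shape.

```python
def solve_side_linear(balls, c):
    """Sort, then run the linear DP with prefix sums and a balance table.

    balls: list of (distance, shape) pairs on one side of the origin,
    assumed to have distinct positive distances.
    """
    balls = sorted(balls)
    n = len(balls)
    xs = [0] + [x for x, _ in balls]            # 1-based coordinates
    shapes = [None] + [s for _, s in balls]     # 1-based shapes
    # prefix[t][i]: sum of coordinates of t-shaped balls among the first i.
    prefix = [[0] * (n + 1) for _ in range(2)]
    dp = [0] * (n + 1)
    last_seen = {0: 0}                          # balance -> last index
    balance = 0
    for i in range(1, n + 1):
        t = shapes[i]
        for u in range(2):
            prefix[u][i] = prefix[u][i - 1] + (xs[i] if u == t else 0)
        balance += 1 if t == 0 else -1
        k = last_seen.get(balance, 0)
        last_seen[balance] = i
        # Sweep option: balls k+1..i are paired across shapes, or k = 0.
        best = dp[k] + 2 * (prefix[t][i] - prefix[t][k])
        # Same-shape option: pair the last two balls and pay C.
        if i >= 2 and shapes[i - 1] == t:
            best = min(best, dp[i - 2] + c + 2 * xs[i])
        dp[i] = best
    return dp[n]
```

When the last two balls differ in shape, the lookup yields $k = i-2$ and the sweep option reduces to $dp[i-2]+2X_i$, so Observation 1 needs no separate branch.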
