The first three sections of this chapter cover the three most common techniques used in amortized analysis. Section 18.1 starts with the aggregate method, in which we determine an upper bound T ( n ) on the total cost of a sequence of n operations. The amortized cost per operation is then T ( n ) /n .
Section 18.2 covers the accounting method, in which we determine an amortized cost of each operation. When there is more than one type of operation, each type of operation may have a different amortized cost. The accounting method overcharges some operations early in the sequence, storing the overcharge as "prepaid credit" on specific objects in the data structure. The credit is used later in the sequence to pay for operations that are charged less than they actually cost.
Section 18.3 discusses the potential method, which is like the accounting method in that we determine the amortized cost of each operation and may overcharge operations early on to compensate for undercharges later. The potential method maintains the credit as the "potential energy" of the data structure instead of associating the credit with individual objects within the data structure.
We shall use two examples to examine these three models. One is a stack with the additional operation MULTIPOP , which pops several objects at once. The other is a binary counter that counts up from 0 by means of the single operation INCREMENT .
While reading this chapter, bear in mind that the charges assigned during an amortized analysis are for analysis purposes only. They should not appear in the code. If, for example, a credit is assigned to an object x when using the accounting method, there is no need to assign an appropriate amount to some attribute credit [ x ] in the code.
The insight into a particular data structure gained by performing an amortized analysis can help in optimizing the design. In Section 18.4, for example, we shall use the potential method to analyze a dynamically expanding and contracting table.
18.1 The aggregate methodPUSH ( S , x ) pushes object x onto stack S .
POP ( S ) pops the top of stack S and returns the popped object.
Since each of these operations runs in O (1) time, let us consider the cost of each to be 1. The total cost of a sequence of n PUSH and POP operations is therefore n , and the actual running time for n operations is therefore ( n ).
MULTIPOP(S,k)
1 while not STACK-EMPTY(S) and k 0
2 do POP(S)
3 k k - 1
Figure 18.1 shows an example of MULTIPOP .
What is the running time of MULTIPOP ( S, k ) on a stack of s objects? The actual running time is linear in the number of POP operations actually executed, and thus it suffices to analyze MULTIPOP in terms of the abstract costs of 1 each for PUSH and POP . The number of iterations of the while loop is the number min( s, k ) of objects popped off the stack. For each iteration of the loop, one call is made to P OP in line 2. Thus, the total cost of MULTIPOP is min( s, k ), and the actual running time is a linear function of this cost.
Let us analyze a sequence of n PUSH , POP , and MULTIPOP operations on an initially empty stack. The worst-case cost of a MULTIPOP operation in the sequence is O ( n ), since the stack size is at most n . The worst-case time of any stack operation is therefore O ( n ), and hence a sequence of n operations costs O ( n 2 ), since we may have O ( n ) MULTIPOP operations costing O ( n ) each. Although this analysis is correct, the O ( n 2 ) result, obtained by considering the worst-case cost of each operation individually, is not tight.
Using the aggregate method of amortized analysis, we can obtain a better upper bound that considers the entire sequence of n operations. In fact, although a single MULTIPOP operation can be expensive, any sequence of n PUSH , POP , and MULTIPOP operations on an initially empty stack can cost at most O ( n ). Why? Each object can be popped at most once for each time it is pushed. Therefore, the number of times that POP can be called on a nonempty stack, including calls within MULTIPOP , is at most the number of PUSH operations, which is at most n . For any value of n , any sequence of n PUSH , POP , and MULTIPOP operations takes a total of O ( n ) time. The amortized cost of an operation is the average: O ( n )/ n = O (1).
We emphasize again that although we have just shown that the average cost, and hence running time, of a stack operation is O (1), no probabilistic reasoning was involved. We actually showed a worst-case bound of O ( n ) on a sequence of n operations. Dividing this total cost by n yielded the average cost per operation, or the amortized cost.
INCREMENT(A)
1 i 0
2 while i length[A] and A[i] = 1
3 do A[i] 0
4 i i + 1
5 if i < length[A]
6 then A[i] 1
This algorithm is essentially the same one implemented in hardware by a ripple-carry counter (see Section 29.2.1). Figure 18.2 shows what happens to a binary counter as it is incremented 16 times, starting with the initial value 0 and ending with the value 16. At the start of each iteration of the while loop in lines 2-4, we wish to add a 1 into position i . If A [ i ] = 1, then adding 1 flips the bit to 0 in position i and yields a carry of 1, to be added into position i + 1 on the next iteration of the loop. Otherwise, the loop ends, and then, if i < k , we know that A [ i ] = 0, so that adding a 1 into position i , flipping the 0 to a 1, is taken care of in line 6. The cost of each INCREMENT operation is linear in the number of bits flipped.
As with the stack example, a cursory analysis yields a bound that is correct but not tight. A single execution of INCREMENT takes time ( k ) in the worst case, in which array A contains all 1's. Thus, a sequence of n INCREMENT operations on an initially zero counter takes time O ( nk ) in the worst case.
We can tighten our analysis to yield a worst-case cost of O ( n ) for a sequence of n I NCREMENT'S by observing that not all bits flip each time INCREMENT is called. As Figure 18.2 shows, A [0] does flip each time INCREMENT is called. The next-highest-order bit, A [1], flips only every other time: a sequence of n INCREMENT operations on an initially zero counter causes A [1] to flip n /2 times. Similarly, bit A [2] flips only every fourth time, or n/4 times in a sequence of n I NCREMENT'S . In general, for i = 0, 1, . . . , lg n , bit A [ i ] flips n /2 i times in a sequence of n INCREMENT operations on an initially zero counter. For i > lg n , bit A [ i ] never flips at all. The total number of flips in the sequence is thus
by equation (3.4). The worst-case time for a sequence of n I NCREMENT operations on an initially zero counter is therefore O ( n ), so the amortized cost of each operation is O ( n )/ n = O (1).
ExercisesA sequence of n operations is performed on a data structure. The i th operation costs i if i is an exact power of 2, and 1 otherwise. Use an aggregate method of analysis to determine the amortized cost per operation.
18.2 The accounting methodOne must choose the amortized costs of operations carefully. If we want analysis with amortized costs to show that in the worst case the average cost per operation is small, the total amortized cost of a sequence of operations must be an upper bound on the total actual cost of the sequence. Moreover, as in the aggregate method, this relationship must hold for all sequences of operations. Thus, the total credit associated with the data structure must be nonnegative at all times, since it represents the amount by which the total amortized costs incurred exceed the total actual costs incurred. If the total credit were ever allowed to become negative (the result of undercharging early operations with the promise of repaying the account later on), then the total amortized costs incurred at that time would be below the total actual costs incurred; for the sequence of operations up to that time, the total amortized cost would not be an upper bound on the total actual cost. Thus, we must take care that the total credit in the data structure never becomes negative.
PUSH 1 ,
POP 1 ,
MULTIPOP min(k,s) ,
where k is the argument supplied to MULTIPOP and s is the stack size when it is called. Let us assign the following amortized costs:
PUSH 2 ,
POP 0 ,
MULTIPOP 0 .
Note that the amortized cost of MULTIPOP is a constant (0), whereas the actual cost is variable. Here, all three amortized costs are O (l), although in general the amortized costs of the operations under consideration may differ asymptotically.
We shall now show that we can pay for any sequence of stack operations by charging the amortized costs. Suppose we use a dollar bill to represent each unit of cost. We start with an empty stack. Recall the analogy of Section 11.1 between the stack data structure and a stack of plates in a cafeteria. When we push a plate on the stack, we use 1 dollar to pay the actual cost of the push and are left with a credit of 1 dollar (out of the 2 dollars charged), which we put on top of the plate. At any point in time, every plate on the stack has a dollar of credit on it.
The dollar stored on the plate is prepayment for the cost of popping it from the stack. When we execute a POP operation, we charge the operation nothing and pay its actual cost using the credit stored in the stack. To pop a plate, we take the dollar of credit off the plate and use it to pay the actual cost of the operation. Thus, by charging the PUSH operation a little bit more, we needn't charge the POP operation anything.
Moreover, we needn't charge MULTIPOP operations anything either. To pop the first plate, we take the dollar of credit off the plate and use it to pay the actual cost of a POP operation. To pop a second plate, we again have a dollar of credit on the plate to pay for the POP operation, and so on. Thus, we have always charged at least enough up front to pay for MULTIPOP operations. In other words, since each plate on the stack has 1 dollar of credit on it, and the stack always has a nonnegative number of plates, we have ensured that the amount of credit is always nonnegative. Thus, for any sequence of n PUSH , POP , and MULTIPOP operations, the total amortized cost is an upper bound on the total actual cost. Since the total amortized cost is O ( n ), so is the total actual cost.
For the amortized analysis, let us charge an amortized cost of 2 dollars to set a bit to 1. When a bit is set, we use 1 dollar (out of the 2 dollars charged) to pay for the actual setting of the bit, and we place the other dollar on the bit as credit. At any point in time, every 1 in the counter has a dollar of credit on it, and thus we needn't charge anything to reset a bit to 0; we just pay for the reset with the dollar bill on the bit.
The amortized cost of INCREMENT can now be determined. The cost of resetting the bits within the while loop is paid for by the dollars on the bits that are reset. At most one bit is set, in line 6 of INCREMENT , and therefore the amortized cost of an INCREMENT operation is at most 2 dollars. The number of l's in the counter is never negative, and thus the amount of credit is always nonnegative. Thus, for n INCREMENT operations, the total amortized cost is O ( n ), which bounds the total actual cost.
ExercisesA sequence of stack operations is performed on a stack whose size never exceeds k . After every k operations, a copy of the entire stack is made for backup purposes. Show that the cost of n stack operations, including copying the stack, is O ( n ) by assigning suitable amortized costs to the various stack operations.
18.3 The potential methodThe amortized cost of each operation is therefore its actual cost plus the increase in potential due to the operation. By equation (18.1), the total amortized cost of the n operations is
The second equality follows from equation (3.7), since the ( D i ) telescope.
If we can define a potential function so that ( D n ) ( D 0 ), then the total amortized cost is an upper bound on the total actual cost. In practice, we do not always know how many operations might be performed. Therefore, if we require that ( D i ) ( D 0 ) for all i , then we guarantee, as in the accounting method, that we pay in advance. It is often convenient to define ( D 0 ) to be 0 and then to show that ( D i ) 0 for all i . (See Exercise 18.3-1 for an easy way to handle cases in which ( D 0 ) 0.)
Intuitively, if the potential difference ( D i ) - ( D i - 1 ) of the i th operation is positive, then the amortized cost represents an overcharge to the i th operation, and the potential of the data structure increases. If the potential difference is negative, then the amortized cost represents an undercharge to the i th operation, and the actual cost of the operation is paid by the decrease in the potential.
The amortized costs defined by equations (18.1) and (18.2) depend on the choice of the potential function . Different potential functions may yield different amortized costs yet still be upper bounds on the actual costs. There are often trade-offs that can be made in choosing a potential function; the best potential function to use depends on the desired time bounds.
(Di) 0
= (D0).
The total amortized cost of n operations with respect to therefore represents an upper bound on the actual cost.
Let us now compute the amortized costs of the various stack operations. If the i th operation on a stack containing s objects is a PUSH operation, then the potential difference is
(Di) - (Di - 1) = (s + 1 ) - s
By equation (18.1), the amortized cost of this PUSH operation is
Suppose that the i th operation on the stack is MULTIPOP ( S,k ) and that k ' = min( k,s ) objects are popped off the stack. The actual cost of the operation is k ' , and the potential difference is
(Di) - (Di-1) = -k'.
Thus, the amortized cost of the MULTIPOP operation is
Similarly, the amortized cost of an ordinary POP operation is 0.
The amortized cost of each of the three operations is O (1), and thus the total amortized cost of a sequence of n operations is O ( n ). Since we have already argued that ( D i ) ( D 0 ), the total amortized cost of n operations is an upper bound on the total actual cost. The worst-case cost of n operations is therefore O ( n ).
Let us compute the amortized cost of an INCREMENT operation. Suppose that the i th INCREMENT operation resets t i bits. The actual cost of the operation is therefore at most t i +1, since in addition to resetting t i bits, it sets at most one bit to a 1. The number of 1's in the counter after the i th operation is therefore b i b i-1 - t i + 1, and the potential difference is
(Di) - (Di-1) (bi-1 - ti + 1) - bi-1
= 1 - ti.
The amortized cost is therefore
If the counter starts at zero, then ( D 0 ) = 0. Since ( D i ) 0 for all i , the total amortized cost of a sequence of n INCREMENT operations is an upper bound on the total actual cost, and so the worst-case cost of n INCREMENT operations is O ( n ).
The potential method gives us an easy way to analyze the counter even when it does not start at zero. There are initially b 0 1's, and after n INCREMENT operations there are b n 1's, where 0 b 0 , b n k . We can rewrite equation (18.2) as
We have for all 1 i n . Since ( D 0 ) = b 0 and ( D n ) = b n , the total actual cost of n INCREMENT operations is
Note in particular that since b 0 k , if we execute at least n = ( k ) INCREMENT operations, the total actual cost is O ( n ), no matter what initial value the counter contains.
ExercisesSuppose we have a potential function such that ( D i ) ( D 0 ) for all i , but (D 0 ) 0. Show that there exists a potential function ' such that ' ( D 0 ) = 0, ' ( D i ) 0 for all i 1, and the amortized costs using ' are the same as the amortized costs using .
Redo Exercise 18.1-3 using a potential method of analysis.
What is the total cost of executing n of the stack operations PUSH , POP , and MULTIPOP , assuming that the stack begins with s 0 objects and finishes with s n objects?
Suppose that a counter begins at a number with b 1's in its binary representation, rather than at 0. Show that the cost of performing n INCREMENT operations is O ( n ) if n = ( b ). (Do not assume that b is constant.)
Show how to implement a queue with two ordinary stacks (Exercise 11.1-6) so that the amortized cost of each ENQUEUE and each DEQUEUE operation is O (1).
18.4 Dynamic tablesWe assume that the dynamic table supports the operations TABLE - INSERT and TABLE - DELETE . TABLE - INSERT inserts into the table an item that occupies a single slot , that is, a space for one item. Likewise, TABLE - DELETE can be thought of as removing an item from the table, thereby freeing a slot. The details of the data-structuring method used to organize the table are unimportant; we might use a stack (Section 11.1), a heap (Section 7.1), or a hash table (Chapter 12). We might also use an array or collection of arrays to implement object storage, as we did in Section 11.3.
We start by analyzing a dynamic table in which only insertions are performed. We then consider the more general case in which both insertions and deletions are allowed.
18.4.1 Table expansion1 In some situations, such as an open-address hash table, we may wish to consider a table to be full if its load factor equals some constant strictly less than 1. (See Exercise 18.4-2.)
A common heuristic is to allocate a new table that has twice as many slots as the old one. If only insertions are performed, the load factor of a table is always at least 1/2, and thus the amount of wasted space never exceeds half the total space in the table.
In the following pseudocode, we assume that T is an object representing the table. The field table [ T ] contains a pointer to the block of storage representing the table. The field num [ T ]contains the number of items in the table, and the field size [ T ] is the total number of slots in the table. Initially, the table is empty: num [ T ] = size [ T ] = 0.
TABLE-INSERT(T,x)
1 if size[T] = 0
2 then allocate table [T] with 1 slot
3 size[T] 1
4 if num[T] = size[T]
5 then allocate new-table with 2 size[T] slots
6 insert all items in table[T] into new-table
7 free table[T]
8 table[T] new-table
9 size[T] 2 size[T]
10 insert x into table[T]
11 num[T] num[T] + 1
Notice that we have two "insertion" procedures here: the TABLE - INSERT procedure itself and the elementary insertion into a table in lines 6 and 10. We can analyze the running time of TABLE - INSERT in terms of the number of elementary insertions by assigning a cost of 1 to each elementary insertion. We assume that the actual running time of TABLE - INSERT is linear in the time to insert individual items, so that the overhead for allocating an initial table in line 2 is constant and the overhead for allocating and freeing storage in lines 5 and 7 is dominated by the cost of transferring items in line 6. We call the event in which the then clause in lines 5-9 is executed an expansion .
Let us analyze a sequence of n TABLE - INSERT operations on an initially empty table. What is the cost c i of the i th operation? If there is room in the current table (or if this is the first operation), then c i = 1, since we need only perform the one elementary insertion in line 10. If the current table is full, however, and an expansion occurs, then c i = i : the cost is 1 for the elementary insertion in line 10 plus i - 1 for the items that must be copied from the old table to the new table in line 6. If n operations are performed, the worst-case cost of an operation is O ( n ), which leads to an upper bound of O ( n 2 ) on the total running time for n operations.
This bound is not tight, because the cost of expanding the table is not borne often in the course of n TABLE - INSERT operations. Specifically, the i th operation causes an expansion only when i - 1 is an exact power of 2. The amortized cost of an operation is in fact O (1), as we can show using the aggregate method. The cost of the i th operation is
The total cost of n TABLE - INSERT operations is therefore
since there are at most n operations that cost 1 and the costs of the remaining operations form a geometric series. Since the total cost of n TABLE - INSERT operations is 3 n , the amortized cost of a single operation is 3.
(T) = 2 num[T] - size[T]
is one possibility. Immediately after an expansion, we have num [ T ] = size [ T ]/2, and thus ( T ) = 0, as desired. Immediately before an expansion, we have num [ T ] = size [ T ], and thus ( T ) = num [ T ], as desired. The initial value of the potential is 0, and since the table is always at least half full, num [ T ] size [ T ]/2, which implies that ( T ) is always nonnegative. Thus, the sum of the amortized costs of n TABLE - INSERT operations is an upper bound on the sum of the actual costs.
To analyze the amortized cost of the i th TABLE - INSERT operation, we let num i denote the number of items stored in the table after the i th operation, size i denote the total size of the table after the i th operation, and i denote the potential after the i th operation. Initially, we have num 0 = 0, size 0 =0, and 0 = 0.
If the i th TABLE - INSERT operation does not trigger an expansion, then size i = size i - 1 and the amortized cost of the operation is
If the i th operation does trigger an expansion, then size i /2 = size i -1 = num i - 1, and the amortized cost of the operation is
Figure 18.3 plots the values of num i , size i , and i against i . Notice how the potential builds to pay for the expansion of the table.
18.4.2 Table expansion and contractionTo implement a TABLE - DELETE operation, it is simple enough to remove the specified item from the table. It is often desirable, however, to contract the table when the load factor of the table becomes too small, so that the wasted space is not exorbitant. Table contraction is analogous to table expansion: when the number of items in the table drops too low, we allocate a new, smaller table and then copy the items from the old table into the new one. The storage for the old table can then be freed by returning it to the memory-management system. Ideally, we would like to preserve two properties:
the load factor of the dynamic table is bounded below by a constant, and
the amortized cost of a table operation is bounded above by a constant.
We assume that cost can be measured in terms of elementary insertions and deletions.
A natural strategy for expansion and contraction is to double the table size when an item is inserted into a full table and halve the size when a deletion would cause the table to become less than half full. This strategy guarantees that the load factor of the table never drops below 1/2, but unfortunately, it can cause the amortized cost of an operation to be quite large. Consider the following scenario. We perform n operations on a table T , where n is an exact power of 2. The first n/2 operations are insertions, which by our previous analysis cost a total of ( n ). At the end of this sequence of insertions, num [ T ] = size [ T ] = n/2. For the second n/2 operations, we perform the following sequence:
I, D, D, I, I, D, D, I, I. ,
where I stands for an insertion and D stands for a deletion. The first insertion causes an expansion of the table to size n . The two following deletions cause a contraction of the table back to size n/2 . Two further insertions cause another expansion, and so forth. The cost of each expansion and contraction is ( n ), and there are ( n ) of them. Thus, the total cost of the n operations is ( n 2 ), and the amortized cost of an operation is ( n ).
We can improve upon this strategy by allowing the load factor of the table to drop below 1/2. Specifically, we continue to double the table size when an item is inserted into a full table, but we halve the table size when a deletion causes the table to become less than 1/4 full, rather than 1/2 full as before. The load factor of the table is therefore bounded below by the constant 1/4. The idea is that after an expansion, the load factor of the table is 1/2. Thus, half the items in the table must be deleted before a contraction can occur, since contraction does not occur unless the load factor would fall below 1/4. Likewise, after a contraction, the load factor of the table is also 1/2. Thus, the number of items in the table must be doubled by insertions before an expansion can occur, since expansion occurs only when the load factor would exceed 1.
Observe that the potential of an empty table is 0 and that the potential is never negative. Thus, the total amortized cost of a sequence of operations with respect to is an upper bound on their actual cost.
Before proceeding with a precise analysis, we pause to observe some properties of the potential function. Notice that when the load factor is 1/2, the potential is 0. When it is 1, we have size [ T ] = num [ T ] , which implies (T) = num [ T ] , and thus the potential can pay for an expansion if an item is inserted. When the load factor is 1/4, we have size [ T ] = 4 num [ T ] , which implies ( T ) = num [ T ] , and thus the potential can pay for a contraction if an item is deleted. Figure 18.4 illustrates how the potential behaves for a sequence of operations.
To analyze a sequence of n TABLE - INSERT and TABLE - DELETE operations, we let c i denote the actual cost of the i th operation, denote its amortized cost with respect to , num i denote the number of items stored in the table after the i th operation, size i denote the total size of the table after the i th operation, i denote the load factor of the table after the i th operation, and i denote the potential after the i th operation. Initially, num 0 = 0 , size 0 = 0, 0 = 1, and 0 = 0.
We start with the case in which the i th operation is TABLE - INSERT . If i- 1 1/2 , the analysis is identical to that for table expansion in Section 18.4.1. Whether the table expands or not, the amortized cost of the operation is at most 3. If i- 1 < 1/2, the table cannot expand as a result of the operation, since expansion occurs only when i- 1 = 1. If i < 1/2 as well, then the amortized cost of the i th operation is
If i-1 < 1/2 but i 1/2, then
Thus, the amortized cost of a TABLE - INSERT operation is at most 3.
When the i th operation is a TABLE - DELETE and i -1 1/2, the amortized cost is also bounded above by a constant. The analysis is left as Exercise 18.4-3.
In summary, since the amortized cost of each operation is bounded above by a constant, the actual time for any sequence of n operations on a dynamic table is O(n) .
ExercisesArgue intuitively that if i- 1 1/2 and i 1/2, then the amortized cost of a TABLE - INSERT operation is 0.
Show that if the i th operation on a dynamic table is TABLE - DELETE and i- 1 1/2, then the amortized cost of the operation with respect to the potential function (18.5) is bounded above by a constant.
Suppose that instead of contracting a table by halving its size when its load factor drops below 1/4, we contract it by multiplying its size by 2/3 when its load factor drops below 1/3. Using the potential function
(T) = |2 num[T] - size[T]| ,
show that the amortized cost of a TABLE - DELETE that uses this strategy is bounded above by a constant.
ProblemsWe can express each index a as a k -bit sequence a k- 1 , a k-2 , . . . , a 0 , where . We define
revk( ak-1, ak-2. a0 ) = ( a0, a1. ak-1 ;
For example, if n = 16 (or, equivalently, k = 4), then rev k (3) = 12, since the 4-bit representation of 3 is 0011, which when reversed gives 1100, the 4-bit representation of 12.
a. Given a function rev k that runs in ( k ) time, write an algorithm to perform the bit-reversal permutation on an array of length n = 2 k in O ( nk ) time.
0000, 1000, 0100, 1100, 0010, 1010. = 0, 8, 4, 12, 2, 10. .
b. Assume that the words in your computer store k -bit values and that in unit time, your computer can manipulate the binary values with operations such as shifting left or right by arbitrary amounts, bitwise-AND, bitwise-OR, etc. Describe an implementation of the BIT - REVERSED - INCREMENT procedure that allows the bit-reversal permutation on an n -element array to be performed in a total of O ( n ) time.
c. Suppose that you can shift a word left or right by only one bit in unit time. Is it still possible to implement an O ( n )-time bit-reversal permutation?
Specifically, suppose that we wish to support SEARCH and INSERT on a set of n elements. Let k = lg( n + 1) , and let the binary representation of n be n k-1 , n k-2 , . . . , n 0 . We have k sorted arrays A 0 , A 1 , . . . , A k -1 , where for i = 0, 1, . . . , k - 1, the length of array A i is 2 i . Each array is either full or empty, depending on whether n i = 1 or n i = 0, respectively. The total number of elements held in all k arrays is therefore . Although each individual array is sorted, there is no particular relationship between elements in different arrays.
a . Describe how to perform the SEARCH operation for this data structure. Analyze its worst-case running time.
b . Describe how to insert a new element into this data structure. Analyze its worst-case and amortized running times.
c . Discuss how to implement DELETE .
size[left[x]] . size[x]
size[right[x]] . size[x].
The tree as a whole is -balanced if every node in the tree is -balanced. The following amortized approach to maintaining weight-balanced trees was suggested by G. Varghese.
a . A 1/2-balanced tree is, in a sense, as balanced as it can be. Given a node x in an arbitrary binary search tree, show how to rebuild the subtree rooted at x so that it becomes 1/2-balanced. Your algorithm should run in time ( size [ x ], and it can use O ( size [ x ]) auxiliary storage.
b . Show that performing a search in an n -node -balanced binary search tree takes O (lg n ) worst-case time.
For the remainder of this problem, assume that the constant is strictly greater than 1/2. Suppose that INSERT and DELETE are implemented as usual for an n -node binary search tree, except that after every such operation, if any node in the tree is no longer -balanced, then the subtree rooted at the highest such node in the tree is "rebuilt" so that it becomes 1/2-balanced.
We shall analyze this rebuilding scheme using the potential method. For a node x in a binary search tree T , we define
(x) = |size[left[x]] - size[right[x]]| ,
and we define the potential of T as
where c is a sufficiently large constant that depends on .
c. Argue that any binary search tree has nonnegative potential and that a 1/2-balanced tree has potential 0.
d . Suppose that m units of potential can pay for rebuilding an m -node subtree. How large must c be in terms of in order for it to take O (1) amortized time to rebuild a subtree that is not -balanced?
e. Show that inserting a node into or deleting a node from an n -node -balanced tree costs O (lg n) amortized time.
The aggregate method of amortized' analysis was used by Aho, Hopcroft, and Ullman [4]. Tarjan [189] surveys the accounting and potential methods of amortized analysis and presents several applications. He attributes the accounting method to several authors, including M. R. Brown, R. E. Tarjan, S. Huddleston, and K. Mehlhorn. He attributes the potential method to D. D. Sleator. The term "amortized" is due to D. D. Sleator and R. E. Tarjan.