8.1 Straight Insertion and Shell's Method: Will Draw The Line, However, at The Inefficient
8.1 Straight Insertion and Shell's Method: Will Draw The Line, However, at The Inefficient
Chapter 8.
Sorting
For small N one does better to use an algorithm whose operation count goes as a higher, i.e., poorer, power of N , if the constant in front is small enough. For N < 20, roughly, the method of straight insertion (8.1) is concise and fast enough. We include it with some trepidation: It is an N 2 algorithm, whose potential for misuse (by using it for too large an N ) is great. The resultant waste of computer time is so awesome, that we were tempted not to include any N 2 routine at all. We will draw the line, however, at the inefcient N 2 algorithm, beloved of elementary computer science texts, called bubble sort. If you know what bubble sort is, wipe it from your mind; if you dont know, make a point of never nding out! For N < 50, roughly, Shells method (8.1), only slightly more complicated to program than straight insertion, is competitive with the more complicated Quicksort on many machines. This method goes as N 3/2 in the worst case, but is usually faster. See references [1,2] for further information on the subject of sorting, and for detailed references to the literature.
CITED REFERENCES AND FURTHER READING: Knuth, D.E. 1973, Sorting and Searching, vol. 3 of The Art of Computer Programming (Reading, MA: Addison-Wesley). [1] Sedgewick, R. 1988, Algorithms, 2nd ed. (Reading, MA: Addison-Wesley), Chapters 813. [2]
Sample page from NUMERICAL RECIPES IN C: THE ART OF SCIENTIFIC COMPUTING (ISBN 0-521-43108-5) Copyright (C) 1988-1992 by Cambridge University Press. Programs Copyright (C) 1988-1992 by Numerical Recipes Software. Permission is granted for internet users to make one paper copy for their own personal use. Further reproduction, or any copying of machinereadable files (including this one) to any server computer, is strictly prohibited. To order Numerical Recipes books or CDROMs, visit website http://www.nr.com or call 1-800-872-7423 (North America only), or send email to [email protected] (outside North America).
Insert it.
What if you also want to rearrange an array brr at the same time as you sort arr? Simply move an element of brr whenever you move an element of arr:
331
void piksr2(int n, float arr[], float brr[]) Sorts an array arr[1..n] into ascending numerical order, by straight insertion, while making the corresponding rearrangement of the array brr[1..n]. { int i,j; float a,b; for (j=2;j<=n;j++) { a=arr[j]; b=brr[j]; i=j-1; while (i > 0 && arr[i] > a) { arr[i+1]=arr[i]; brr[i+1]=brr[i]; i--; } arr[i+1]=a; brr[i+1]=b; } } Pick out each element in turn.
Sample page from NUMERICAL RECIPES IN C: THE ART OF SCIENTIFIC COMPUTING (ISBN 0-521-43108-5) Copyright (C) 1988-1992 by Cambridge University Press. Programs Copyright (C) 1988-1992 by Numerical Recipes Software. Permission is granted for internet users to make one paper copy for their own personal use. Further reproduction, or any copying of machinereadable files (including this one) to any server computer, is strictly prohibited. To order Numerical Recipes books or CDROMs, visit website http://www.nr.com or call 1-800-872-7423 (North America only), or send email to [email protected] (outside North America).
Insert it.
For the case of rearranging a larger number of arrays by sorting on one of them, see 8.4.
Shells Method
This is actually a variant on straight insertion, but a very powerful variant indeed. The rough idea, e.g., for the case of sorting 16 numbers n 1 . . . n16 , is this: First sort, by straight insertion, each of the 8 groups of 2 (n 1 , n9 ), (n2 , n10 ), . . . , (n8 , n16 ). Next, sort each of the 4 groups of 4 (n 1 , n5 , n9 , n13 ), . . . , (n4 , n8 , n12 , n16 ). Next sort the 2 groups of 8 records, beginning with (n 1 , n3 , n5 , n7 , n9 , n11 , n13 , n15 ). Finally, sort the whole list of 16 numbers. Of course, only the last sort is necessary for putting the numbers into order. So what is the purpose of the previous partial sorts? The answer is that the previous sorts allow numbers efciently to lter up or down to positions close to their nal resting places. Therefore, the straight insertion passes on the nal sort rarely have to go past more than a few elements before nding the right place. (Think of sorting a hand of cards that are already almost in order.) The spacings between the numbers sorted on each pass through the data (8,4,2,1 in the above example) are called the increments, and a Shell sort is sometimes called a diminishing increment sort. There has been a lot of research into how to choose a good set of increments, but the optimum choice is not known. The set . . . , 8, 4, 2, 1 is in fact not a good choice, especially for N a power of 2. A much better choice is the sequence (3k 1)/2, . . . , 40, 13, 4, 1 which can be generated by the recurrence i1 = 1, ik+1 = 3ik + 1, k = 1, 2, . . . (8.1.2) (8.1.1)
It can be shown (see [1]) that for this sequence of increments the number of operations required in all is of order N 3/2 for the worst possible ordering of the original data.
332
Chapter 8.
Sorting
For randomly ordered data, the operations count goes approximately as N 1.25 , at least for N < 60000. For N > 50, however, Quicksort is generally faster. The program follows:
void shell(unsigned long n, float a[]) Sorts an array a[] into ascending numerical order by Shells method (diminishing increment sort). a is replaced on output by its sorted rearrangement. Normally, the argument n should be set to the size of array a, but if n is smaller than this, then only the rst n elements of a are sorted. This feature is used in selip. { unsigned long i,j,inc; float v; inc=1; Determine the starting increment. do { inc *= 3; inc++; } while (inc <= n); do { Loop over the partial sorts. inc /= 3; for (i=inc+1;i<=n;i++) { Outer loop of straight insertion. v=a[i]; j=i; while (a[j-inc] > v) { Inner loop of straight insertion. a[j]=a[j-inc]; j -= inc; if (j <= inc) break; } a[j]=v; } } while (inc > 1); }
Sample page from NUMERICAL RECIPES IN C: THE ART OF SCIENTIFIC COMPUTING (ISBN 0-521-43108-5) Copyright (C) 1988-1992 by Cambridge University Press. Programs Copyright (C) 1988-1992 by Numerical Recipes Software. Permission is granted for internet users to make one paper copy for their own personal use. Further reproduction, or any copying of machinereadable files (including this one) to any server computer, is strictly prohibited. To order Numerical Recipes books or CDROMs, visit website http://www.nr.com or call 1-800-872-7423 (North America only), or send email to [email protected] (outside North America).
CITED REFERENCES AND FURTHER READING: Knuth, D.E. 1973, Sorting and Searching, vol. 3 of The Art of Computer Programming (Reading, MA: Addison-Wesley), 5.2.1. [1] Sedgewick, R. 1988, Algorithms, 2nd ed. (Reading, MA: Addison-Wesley), Chapter 8.
8.2 Quicksort
Quicksort is, on most machines, on average, for large N , the fastest known sorting algorithm. It is a partition-exchange sorting method: A partitioning element a is selected from the array. Then by pairwise exchanges of elements, the original array is partitioned into two subarrays. At the end of a round of partitioning, the element a is in its nal place in the array. All elements in the left subarray are a, while all elements in the right subarray are a. The process is then repeated on the left and right subarrays independently, and so on. The partitioning process is carried out by selecting some element, say the leftmost, as the partitioning element a. Scan a pointer up the array until you nd an element > a, and then scan another pointer down from the end of the array until you nd an element < a. These two elements are clearly out of place for the nal partitioned array, so exchange them. Continue this process until the pointers cross. This is the right place to insert a, and that round of partitioning is done. The