
Computational Complexity: Section 1
By misof
TopCoder Member

In this article I'll try to introduce you to the area of computational complexity. The article will be a bit long before we get to the actual formal definitions because I feel that the rationale behind these definitions needs to be explained as well - and that understanding the rationale is even more important than the definitions alone.
Why is it important?

Example 1. Suppose you were assigned to write a program to process some records your company receives from time to time. You implemented two different algorithms and tested them on several sets of test data. The processing times you obtained are in Table 1.

# of records    10     20     50     100    1000    5000
algorithm 1     0.00s  0.01s  0.05s  0.47s  23.92s  47min
algorithm 2     0.05s  0.05s  0.06s  0.11s  0.78s   14.22s
Table 1. Runtimes of two fictional algorithms.
In practice, we probably could tell which of the two implementations is better for us (as we usually can estimate the amount of data we will have to process). For the company this solution may be fine. But from the programmer's point of view, it would be much better if he could estimate the values in Table 1 before writing the actual code - then he could simply implement the better algorithm.

The same situation occurs during programming contests: The size of the input data is given in the problem statement. Suppose I found an algorithm. Questions I have to answer before I start to type should be: Is my algorithm worth implementing? Will it solve the largest test cases in time? If I know more algorithms solving the problem, which of them shall I implement?

This leads us to the question: How to compare algorithms? Before we answer this question in general, let's return to our simple example. If we extrapolate the data in Table 1, we may assume that if the number of processed records is larger than 1000, algorithm 2 will be substantially faster. In other words, if we consider all possible inputs, algorithm 2 will be better for almost all of them.

It turns out that this is almost always the case - given two algorithms, either one of them is almost always better, or they are approximately the same. Thus, this will be our definition of a better algorithm. Later, as we define everything formally, this will be the general idea behind the definitions.

A neat trick

If you think about Example 1 for a while, it shouldn't be too difficult to see that there is an algorithm with runtimes similar to those in Table 2:

# of records    10     20     50     100    1000   5000
algorithm 3     0.00s  0.01s  0.05s  0.11s  0.78s  14.22s

Table 2. Runtimes of a new fictional algorithm.

The idea behind this algorithm: Check the number of records. If it is small enough, run
algorithm 1, otherwise run algorithm 2.
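
In code, such a dispatch might look roughly like the following sketch (the record type, the two processing routines and the threshold of 100 records are placeholders standing in for the fictional setup of Example 1, not anything prescribed by it):

#include <cstddef>
#include <vector>

struct Record { int value; };                    // placeholder record type

// Stand-ins for the two fictional algorithms from Table 1.
void processSmall(std::vector<Record>& records) { /* algorithm 1 would go here */ }
void processLarge(std::vector<Record>& records) { /* algorithm 2 would go here */ }

// "Algorithm 3": pick whichever algorithm is faster for the given input size.
void processRecords(std::vector<Record>& records) {
    const std::size_t THRESHOLD = 100;           // assumed crossover point, read off Table 1
    if (records.size() <= THRESHOLD)
        processSmall(records);                   // algorithm 1 wins on small inputs
    else
        processLarge(records);                   // algorithm 2 wins on large inputs
}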

Similar ideas are often used in practice. As an example, consider most of the sort() functions provided by various libraries. Often such a function is an implementation of QuickSort with various improvements, such as:

if the number of elements is too small, run InsertSort instead (as InsertSort is faster for small inputs)
if the pivot choices lead to poor results, fall back to MergeSort
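
As a rough illustration of the first of these improvements, a QuickSort that switches to InsertSort on short subarrays might look like the sketch below (the cutoff value 16 is an arbitrary guess; real library implementations are tuned far more carefully):

#include <utility>
#include <vector>

// Plain insertion sort - fast on very short ranges.
void insertionSort(std::vector<int>& a, int lo, int hi) {
    for (int i = lo + 1; i <= hi; i++) {
        int key = a[i], j = i - 1;
        while (j >= lo && a[j] > key) { a[j + 1] = a[j]; j--; }
        a[j + 1] = key;
    }
}

// QuickSort that falls back to insertion sort on small subarrays.
void hybridSort(std::vector<int>& a, int lo, int hi) {
    const int CUTOFF = 16;                       // assumed threshold for "too small"
    if (hi - lo + 1 <= CUTOFF) {
        insertionSort(a, lo, hi);
        return;
    }
    int pivot = a[lo + (hi - lo) / 2];
    int i = lo, j = hi;
    while (i <= j) {                             // partition around the pivot
        while (a[i] < pivot) i++;
        while (a[j] > pivot) j--;
        if (i <= j) std::swap(a[i++], a[j--]);
    }
    if (lo < j) hybridSort(a, lo, j);
    if (i < hi) hybridSort(a, i, hi);
}

void sortAll(std::vector<int>& a) {
    if (!a.empty()) hybridSort(a, 0, (int)a.size() - 1);
}
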
What is efficiency?

Example 2. Suppose you have a concrete implementation of some algorithm. (The example
code presented below is actually an implementation of MinSort - a slow but simple sorting
algorithm.)

for (int i=0; i<N; i++)            // step 1
    for (int j=i+1; j<N; j++)      // step 2
        if (A[i] > A[j])           // step 3
            swap( A[i], A[j] );    // step 4

If we are given an input to this algorithm (in our case, the array A and its size N), we can
exactly compute the number of steps our algorithm does on this input. We could even count
the processor instructions if we wanted to. However, there are too many possible inputs for
this approach to be practical.

And we still need to answer one important question: What exactly is it that we are interested in? Most often it is the behavior of our program in the worst possible case - we need to look at the input data and determine an upper bound on how long it will take if we run the program.

But then, what is the worst possible case? Surely we can always make the program run
longer simply by giving it a larger input. Some of the more important questions are: What is
the worst input with 700 elements? How fast does the maximum runtime grow when we
increase the input size?

Formal notes on the input size

What exactly is this "input size" we started to talk about? In the formal definitions this is the
size of the input written in some fixed finite alphabet (with at least 2 "letters"). For our needs,
we may consider this alphabet to be the numbers 0..255. Then the "input size" turns out to be
exactly the size of the input file in bytes.

Usually a part of the input is a number (or several numbers) such that the size of the input is
proportional to the number.

E.g. in Example 2 we are given an int N and an array containing N ints. The size of the input
file will be roughly 5N (depending on the OS and architecture, but always linear in N).

In such cases, we may choose this number to represent the size of the input. Thus when talking about problems on arrays/strings, the input size is the length of the array/string; when talking about graph problems, the input size depends both on the number of vertices (N) and the number of edges (M); etc.

We will adopt this approach and use N as the input size in the following parts of the article.

There is one tricky special case you sometimes need to be aware of. To write a (possibly large) number we need only logarithmic space. (E.g. to write 123456, we need only roughly log10(123456) ≈ 6 digits.) This is why the naive primality test does not run in polynomial time - its runtime is polynomial in the value of the number, but not in its number of digits, and the number of digits is what counts as the input size! If you didn't understand the part about polynomial time, don't worry, we'll get there later.
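
To make this concrete, consider a naive primality test based on trial division (a minimal sketch; stopping at sqrt(N) is the usual optimization). It performs roughly sqrt(N) divisions, which is polynomial in the value N, but sqrt(N) is about 10^(d/2) for a d-digit number - exponential in the input size.

#include <cstdint>

// Naive primality test: trial division by every candidate up to sqrt(N).
// The condition d <= n / d is an overflow-safe way of writing d*d <= n.
// The loop runs about sqrt(N) times - fast when N fits in a machine word,
// but exponential in the number of digits of N.
bool isPrime(std::uint64_t n) {
    if (n < 2) return false;
    for (std::uint64_t d = 2; d <= n / d; d++)
        if (n % d == 0) return false;
    return true;
}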

How to measure efficiency?

We already mentioned that given an input we are able to count the number of steps an
algorithm makes simply by simulating it. Suppose we do this for all inputs of size at most N
and find the worst of these inputs (i.e. the one that causes the algorithm to do the most
steps). Let f(N) be this number of steps. We will call this function the time complexity, or, for short, the runtime of our algorithm.

In other words, if we have any input of size N, solving it will require at most f(N) steps.

Let's return to the algorithm from Example 2. What is the worst case of size N? In other
words, what array with N elements will cause the algorithm to make the most steps? If we
take a look at the algorithm, we can easily see that:

the first step is executed exactly N times
the second and third steps are executed exactly N(N - 1)/2 times each
the fourth step is executed at most N(N - 1)/2 times

Clearly, if the elements in A are in descending order at the beginning, the fourth step will always be executed. Thus in this case the algorithm makes 3N(N - 1)/2 + N = 1.5N^2 - 0.5N steps. Therefore our algorithm has f(N) = 1.5N^2 - 0.5N.
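
If you want to double-check this formula rather than trust the argument, a short program can count the steps directly on a descending array and compare the count with 1.5N^2 - 0.5N (this instrumented version of the Example 2 code is only a sanity check, not something you would write in a contest):

#include <cstdio>
#include <utility>
#include <vector>

int main() {
    const int N = 1000;
    std::vector<int> A(N);
    for (int i = 0; i < N; i++) A[i] = N - i;       // worst case: descending order

    long long steps = 0;
    for (int i = 0; i < N; i++) {
        steps++;                                     // step 1
        for (int j = i + 1; j < N; j++) {
            steps += 2;                              // steps 2 and 3
            if (A[i] > A[j]) {
                steps++;                             // step 4
                std::swap(A[i], A[j]);
            }
        }
    }
    // Both values should be 1499500, i.e. 1.5*1000^2 - 0.5*1000.
    std::printf("counted: %lld   formula: %lld\n", steps, 3LL * N * (N - 1) / 2 + N);
    return 0;
}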

As you can see, determining the exact function f for more complicated programs is painful. Moreover, it isn't even necessary. In our case, clearly the -0.5N term can be neglected. It will usually be much smaller than the 1.5N^2 term and it won't affect the runtime significantly. The result "f(N) is roughly equal to 1.5N^2" gives us all the information we need. As we will show now, if we want to compare this algorithm with some other algorithm solving the same problem, even the constant 1.5 is not that important.

Consider two algorithms, one with the runtime N^2, the other with the runtime 0.001N^3. One can easily see that for N greater than 1 000 the first algorithm is faster - and soon this difference becomes apparent. While the first algorithm is able to solve inputs with N = 20 000 in a matter of seconds, the second one will already need several minutes on current machines.

Clearly this will always occur when one of the runtime functions grows asymptotically faster than the other (i.e. when N grows beyond all bounds the limit of their quotient is zero or infinity). Regardless of the constant factors, an algorithm with runtime proportional to N^2 will always be better than an algorithm with runtime proportional to N^3 on almost all inputs.
And this observation is exactly what we base our formal definition on.

Finally, formal definitions

Let f, g be positive non-decreasing functions defined on positive integers. (Note that all runtime functions satisfy these conditions.) We say that f(N) is O(g(N)) (read: f is big-oh of g) if for some c and N0 the following condition holds:

∀ N > N0: f(N) < c·g(N)

In human words, f(N) is O(g(N)) if for some c almost the entire graph of the function f is below the graph of the function c·g. Note that this means that f grows at most as fast as c·g does.

Instead of "f(N) is O(g(N))" we usually write f(N) = O(g(N)). Note that this "equation" is not symmetric - the notation "O(g(N)) = f(N)" makes no sense and "g(N) = O(f(N))" doesn't have to be true (as we will see later). (If you are not comfortable with this notation, imagine O(g(N)) to be a set of functions and imagine that there is a ∈ instead of the =.)

What we defined above is known as the big-oh notation and is conveniently used to specify
upper bounds on function growth.

E.g. consider the function f(N) = 3N(N - 1)/2 + N = 1.5N^2 - 0.5N from Example 2. We may say that f(N) = O(N^2). One possibility for the constants is c = 2 and N0 = 0: indeed, 1.5N^2 - 0.5N < 2N^2 holds for every positive N, because the difference 0.5N^2 + 0.5N is positive. This means that f doesn't grow (asymptotically) faster than N^2.

Note that even the exact runtime function f doesn't give an exact answer to the question
"How long will the program run on my machine?" But the important observation in the
example case is that the runtime function is quadratic. If we double the input size, the
runtime will increase approximately to four times the current runtime, no matter how fast our
computer is.
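
The following small experiment illustrates this with the MinSort code from Example 2 (the sizes 10 000 and 20 000 are arbitrary choices; the absolute times depend on the machine, but the ratio of the two measurements should come out close to 4):

#include <chrono>
#include <cstdio>
#include <random>
#include <utility>
#include <vector>

// The algorithm from Example 2, wrapped in a function.
void minSort(std::vector<int>& A) {
    int N = (int)A.size();
    for (int i = 0; i < N; i++)
        for (int j = i + 1; j < N; j++)
            if (A[i] > A[j]) std::swap(A[i], A[j]);
}

// Runtime of minSort on a random array of the given size, in seconds.
double timeMinSort(int N) {
    std::mt19937 rng(42);
    std::vector<int> A(N);
    for (int& x : A) x = (int)(rng() % 1000000);
    auto start = std::chrono::steady_clock::now();
    minSort(A);
    auto end = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(end - start).count();
}

int main() {
    double t1 = timeMinSort(10000);
    double t2 = timeMinSort(20000);
    // Doubling the input size of a quadratic algorithm roughly quadruples the time.
    std::printf("N=10000: %.2fs   N=20000: %.2fs   ratio: %.2f\n", t1, t2, t2 / t1);
    return 0;
}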

The f(N) = O(N^2) upper bound gives us almost the same - it guarantees that the growth of the runtime function is at most quadratic.

Thus, we will use the O-notation to describe the time (and sometimes also memory) complexity of algorithms. For the algorithm from Example 2 we would say "The time complexity of this algorithm is O(N^2)" or, for short, "This algorithm is O(N^2)".

In a similar way as we defined O we may define Ω and Θ.

We say that f(N) is Ω(g(N)) if g(N) = O(f(N)), in other words if f grows at least as fast as g.

We say that f(N) = Θ(g(N)) if f(N) = O(g(N)) and g(N) = O(f(N)), in other words if both functions have approximately the same rate of growth.

As should be obvious, Ω is used to specify lower bounds and Θ is used to give a tight asymptotic bound on a function. There are other similar bounds, but these are the ones you'll encounter most of the time.

Some examples of using the notation

1.5N^2 - 0.5N = O(N^2).
47N log N = O(N^2).
N log N + 1 000 047N = Θ(N log N).
All polynomials of order k are O(N^k).
The time complexity of the algorithm in Example 2 is Θ(N^2).
If an algorithm is O(N^2), it is also O(N^5).
Each comparison-based sorting algorithm is Ω(N log N).
MergeSort run on an array with N elements does roughly N log N comparisons. Thus the time complexity of MergeSort is Θ(N log N). If we trust the previous statement, this means that MergeSort is an asymptotically optimal general sorting algorithm.
The algorithm in Example 2 uses Θ(N) bytes of memory.
The function giving my number of teeth in time is O(1).
A naive backtracking algorithm trying to solve chess is O(1), as the tree of positions it will examine is finite. (But of course in this case the constant hidden behind the O(1) is unbelievably large.)

The statement "Time complexity of this algorithm is at least O(N^2)" is meaningless. (It says: "Time complexity of this algorithm is at least at most roughly quadratic." The speaker probably wanted to say: "Time complexity of this algorithm is Ω(N^2).")

When speaking about the time/memory complexity of an algorithm, instead of using the formal Θ(f(N))-notation we may simply state the class of functions f belongs to. E.g. if f(N) = Θ(N), we call the algorithm linear. More examples:

f(N) = Θ(log N): logarithmic
f(N) = Θ(N^2): quadratic
f(N) = Θ(N^3): cubic
f(N) = O(N^k) for some k: polynomial
f(N) = Θ(2^N): exponential

For graph problems, the complexity Θ(N + M) is known as "linear in the graph size".

Determining execution time from an asymptotic bound

For most algorithms you may encounter in practice, the constant hidden behind the O (or Θ) is usually relatively small. If an algorithm is Θ(N^2), you may expect that the exact time complexity is something like 10N^2, not 10^7 N^2.

The same observation in other words: if the constant is large, it is usually somehow related
to some constant in the problem statement. In this case it is good practice to give this
constant a name and to include it in the asymptotic notation.

An example: The problem is to count occurrences of each letter in a string of N letters. A naive algorithm passes through the whole string once for each possible letter. The size of the alphabet is fixed (e.g. at most 255 in C), thus the algorithm is linear in N. Still, it is better to write that its time complexity is Θ(|S|·N), where S is the alphabet used. (Note that there is a better algorithm solving this problem in Θ(|S| + N).)
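
A sketch of that better algorithm: one pass over the string to accumulate the counts, then one pass over the alphabet to report them, which is Θ(|S| + N) in total (here the alphabet is assumed to be the 256 possible byte values):

#include <cstdio>
#include <string>
#include <vector>

// Count occurrences of each character: Theta(N) for the single pass over the
// string plus Theta(|S|) for the pass over the counts.
std::vector<int> countLetters(const std::string& text) {
    std::vector<int> count(256, 0);              // one counter per possible byte value
    for (unsigned char c : text)                 // Theta(N)
        count[c]++;
    return count;
}

int main() {
    std::vector<int> count = countLetters("hello world");
    for (int c = 0; c < 256; c++)                // Theta(|S|)
        if (count[c] > 0)
            std::printf("'%c': %d\n", (char)c, count[c]);
    return 0;
}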

In a TopCoder contest, an algorithm doing 1 000 000 000 multiplications runs barely in time.
This fact together with the above observation and some experience with TopCoder problems
can help us fill the following table:

complexity     maximum N

Θ(N)           100 000 000
Θ(N log N)     40 000 000
Θ(N^2)         10 000
Θ(N^3)         500
Θ(N^4)         90
Θ(2^N)         20
Θ(N!)          11

Table 3. Approximate maximum problem size solvable in 8 seconds.

A note on algorithm analysis

Usually if we present an algorithm, the best way to present its time complexity is to give a Θ-bound. However, it is common practice to only give an O-bound - the other bound is usually trivial, O is much easier to type and better known. Still, don't forget that O represents only an upper bound. Usually we try to find an O-bound that's as good as possible.

Example 3. We are given a sorted array A. Determine whether it contains two elements with the difference D. Consider the following code solving this problem:

int j=0;
for (int i=0; i<N; i++) {
    // advance j while the difference is still too large
    while ( (j<N-1) && (A[i]-A[j] > D) )
        j++;
    if (A[i]-A[j] == D) return 1;
}

It is easy to give an O(N^2) bound for the time complexity of this algorithm - the inner while loop is entered N times, and each time we increase j at most N times. But a more careful analysis shows that in fact we can give an O(N) bound on the time complexity of this algorithm - it is sufficient to realize that during the whole execution of the algorithm the command "j++;" is executed no more than N times.

If we said "this algorithm is O(N^2)", we would have been right. But by saying "this algorithm is O(N)" we give more information about the algorithm.

Conclusion

We have shown how to write bounds on the time complexity of algorithms. We have also
demonstrated why this way of characterizing algorithms is natural and (usually more-or-less)
sufficient.

The next logical step is to show how to estimate the time complexity of a given algorithm. As
we have already seen in Example 3, sometimes this can be messy. It gets really messy
when recursion is involved. We will address these issues in the second part of this article.

...continue to Section 2
