Introduction to Computer Science:

Programming Methodology

Lecture 8
Data Structure and Algorithm - Intro
Prof. Jiangfan Yu
School of Science and Engineering
Data structure and algorithm
• A data structure is a systematic way of organizing and accessing data

• An algorithm is a step-by-step procedure for performing some task in a finite amount of time
Why study data structure and algorithm?
• Important for all other branches of computer science

• Plays a key role in modern technological innovation

• Moore's law predicts that the density of transistors in integrated circuits doubles roughly every 1 to 2 years

• However, in many areas, performance gains due to improvements in algorithms have greatly exceeded even the dramatic performance gains due to increased processor speed
Why study data structure and algorithm?
• Provides a novel "lens" on processes outside of computer science and technology, such as quantum mechanics, economic markets, and evolution

• Challenging (good for your brain!!) and fun

Example: Integer Multiplication
• Inputs: two n-digit numbers x and y

• Output: the product of x and y

• Primitive operations: adding or multiplying two single-digit numbers

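The slides do not show code for this example; a minimal Python sketch of the familiar grade-school algorithm (an assumed baseline, not necessarily the algorithm intended on the slide) makes the primitive-operation count concrete:

```python
def grade_school_multiply(x, y):
    """Multiply two non-negative integers digit by digit, counting
    single-digit multiplications as the primitive operations."""
    x_digits = [int(d) for d in str(x)][::-1]  # least-significant digit first
    y_digits = [int(d) for d in str(y)][::-1]
    ops = 0
    result = 0
    for i, dy in enumerate(y_digits):
        for j, dx in enumerate(x_digits):
            # one single-digit multiplication; the additions are folded
            # into Python's big-int arithmetic for brevity
            result += dx * dy * 10 ** (i + j)
            ops += 1
    return result, ops

product, ops = grade_school_multiply(5678, 1234)
print(product, ops)  # 7006652 16 -- about n * n single-digit multiplications
```

For two n-digit inputs the nested loops perform n × n single-digit multiplications, which is the cost the lecture's later analysis vocabulary will describe as quadratic.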
The algorithm designer’s mantra

• “Perhaps the most important principle for the good


algorithm designer is to refuse to be content”

Aho, Hopcroft, and Ullman, The Design and Analysis of Computer


Algorithms, 1974
How do we define a "good" algorithm?

• The primary analysis of algorithms involves characterizing the running times and space usage of algorithms and data structure operations

• Running time is a natural measure of "goodness," since time is a precious resource: computer solutions should run as fast as possible

• Space usage is another major issue to consider when we design an algorithm, since we have only limited storage space
Measuring the running time experimentally
Visualize the running time
• Running time and space usage are dependent on the size of the input

• Perform independent experiments on many different test inputs of various sizes

• Visualize the results by plotting the performance of each run of the algorithm as a point
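As a concrete illustration (not from the slides), one way to run such an experiment in Python is with the time module, using the built-in sort as a stand-in for whatever algorithm is under study:

```python
import time
from random import randint

# Run the same algorithm on inputs of increasing size; each
# (n, elapsed) pair would become one point on the plot.
for n in (1_000, 10_000, 100_000, 1_000_000):
    data = [randint(0, n) for _ in range(n)]
    start = time.perf_counter()   # high-resolution wall-clock timer
    sorted(data)
    elapsed = time.perf_counter() - start
    print(n, elapsed)
```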
Challenges of experimental analysis

• Experimental running times of two algorithms are difficult to directly


compare unless the experiments are performed in the same hardware
and software environments

• Experiments can be done only on a limited set of test inputs; hence,


they leave out the running times of inputs not included in the
experiment (and these inputs may be important)

• An algorithm must be fully implemented in order to execute it to


study its running time experimentally
Principle of algorithm analysis 1: Counting
primitive operations
• To analyse the running time of an algorithm without performing experiments, we perform an analysis directly on a high-level description of the algorithm

• We define a set of primitive operations such as the following:
  ✓ Assigning an identifier to an object
  ✓ Determining the object associated with an identifier
  ✓ Performing an arithmetic operation (for example, adding two numbers)
  ✓ Comparing two numbers
  ✓ Accessing a single element of a Python list by index
  ✓ Calling a function (excluding operations executed within the function)
  ✓ Returning from a function
Principle of algorithm analysis 2: Measuring
Operations as a Function of Input Size

• To capture the order of growth of an algorithm's running time, we will associate with each algorithm a function f(n) that characterizes the number of primitive operations performed as a function of the input size n
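For instance (an illustrative accounting, not from the slides), the following loop performs a number of primitive operations proportional to the input size:

```python
def sum_list(data):
    """Sum a list of n numbers, annotated with a primitive-operation count."""
    total = 0                  # 1 assignment
    for value in data:         # n list-element accesses
        total = total + value  # n additions and n assignments
    return total               # 1 return

# Under this accounting f(n) = 3n + 2, so the number of primitive
# operations grows linearly with the input size n.
```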
Principle of algorithm analysis 3: Focusing
on the Worst-Case Input
• An algorithm may run faster on some inputs than it does on others of the same size. Thus, we may wish to express the running time of an algorithm as the function of the input size obtained by taking the average over all possible inputs of the same size

• Unfortunately, such an average-case analysis is typically quite challenging. It requires us to define a probability distribution on the set of inputs, which is often a difficult task

• We will therefore characterize running times in terms of the worst case, as a function of the input size n of the algorithm
The 7 functions used in algorithm analysis
• We may use the following 7 functions to measure the time complexity of an algorithm: constant (1), logarithm (log n), linear (n), n-log-n (n log n), quadratic (n^2), cubic (n^3) and other polynomials, and exponential (b^n)
Asymptotic Analysis
• In algorithm analysis, we focus on the growth rate of the running time as a function of the input size n, taking a "big-picture" approach

• Vocabulary for the analysis and design of algorithms

• "Sweet spot" for high-level reasoning about algorithms

• Coarse enough to suppress unnecessary details, e.g. architecture/language/compiler…

• Sharp enough to make meaningful comparisons between algorithms
The big Oh notation
• Let f(n) and g(n) be functions mapping positive integers to positive real numbers

• We say that f(n) is O(g(n)) if there is a real constant c > 0 and an integer constant n0 ≥ 1 such that
  f(n) ≤ c·g(n), for all n ≥ n0

• This definition is often referred to as the "big-Oh" notation

• Example: The function 8n+5 is O(n), since 8n+5 ≤ 13n for all n ≥ 1 (take c = 13 and n0 = 1)
The big Oh notation
• The big-Oh notation allows us to say that a function f(n) is "less than or equal to" another function g(n) up to a constant factor and in the asymptotic sense as n grows toward infinity

• The big-Oh notation is used widely to characterize running times and space bounds in terms of some parameter n, which varies from problem to problem, but is always defined as a chosen measure of the "size" of the problem
Some Properties of the Big-Oh
Notation
• The big-Oh notation allows us to ignore constant factors and lower-order terms and focus on the main components of a function that affect its growth

• Example: 5n^4 + 3n^3 + 2n^2 + 4n + 1 is O(n^4)

• Example: 2^(n+2) is O(2^n)

• Example: 2n + 100 log n is O(n)

• In general, we should use the big-Oh notation to characterize a function as closely as possible
Comparative analysis
Question: Suppose two algorithms solving the same problem are available: an algorithm A, which has a running time of O(n), and an algorithm B, which has a running time of O(n^2). Which algorithm is better?

Answer: Algorithm A is asymptotically better than algorithm B
Comparative analysis
• We can use the big-Oh notation to order classes of functions by asymptotic growth rate

• Our seven functions are ordered by increasing growth rate in the following sequence:
  1, log n, n, n log n, n^2, n^3, 2^n
Why is AlphaGo a remarkable achievement?
• If we use brute force to search for the best move in Go, the time complexity is on the order of O(10^170)

• The search space is even larger than the number of atoms in the universe!!!
The line of tractability
• To differentiate efficient from inefficient algorithms, the general line is drawn between polynomial-time algorithms and exponential-time algorithms

• The distinction between polynomial-time and exponential-time algorithms is considered a robust measure of tractability
Example: Finding the smallest number in a list

• What is the time complexity of this algorithm?


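The algorithm itself appears only as an image in the original slides; a minimal sketch of the kind of linear scan the question refers to (the function name is an assumption):

```python
def find_min(data):
    """Return the smallest value in a non-empty list using a single scan."""
    smallest = data[0]        # start with the first element
    for value in data:        # examine each of the n elements once
        if value < smallest:  # one comparison per element
            smallest = value
    return smallest

print(find_min([31, 41, 59, 26, 53]))  # 26
```

Since each element is examined exactly once and each iteration performs a constant number of primitive operations, the running time is O(n).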
Recursion
• Recursion is a technique by which a function makes one or more calls to itself during execution

• Recursion provides an elegant and powerful alternative for performing repetitive tasks

• Recursion is an important technique in the study of data structures and algorithms
Inception (the film's dream-within-a-dream plot is a popular analogy for recursion)
Example: The factorial function
• The factorial of a positive integer n, denoted n!, is defined as n! = n · (n−1) · · · 2 · 1, with the convention that 0! = 1

• The factorial function is important because it is known to equal the number of ways in which n distinct items can be arranged into a sequence, that is, the number of permutations of n items
The recursive definition

• First, a recursive definition contains one or more base cases, which are defined non-recursively in terms of fixed quantities

• Second, it also contains one or more recursive cases, which are defined by appealing to the definition of the function being defined
The recursive definition of
factorial function
• The factorial function can be naturally defined in a recursive way, for example, 5! = 5 · (4 · 3 · 2 · 1) = 5 · 4!

• More generally, for a positive integer n, we can define n! to be n · (n−1)!

• Therefore, the recursive definition of the factorial function is:
  n! = 1 if n = 0, and n! = n · (n−1)! if n ≥ 1
Solution
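The solution slide's code is not reproduced in this text; the recursive definition above translates directly into Python along these lines:

```python
def factorial(n):
    """Compute n! for a non-negative integer n using the recursive definition."""
    if n == 0:                       # base case: 0! = 1
        return 1
    return n * factorial(n - 1)      # recursive case: n! = n * (n-1)!

print(factorial(5))  # 120
```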
How Python implements recursion
• In Python, each time a function (recursive or otherwise) is called, a structure known as an activation record or frame is created to store information about the progress of that invocation of the function

• This activation record stores the function call's parameters and local variables

• When the execution of a function leads to a nested function call, the execution of the former call is suspended and its activation record stores the place in the source code at which the flow of control should continue upon return of the nested call
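One practical consequence (not on the slides) is that Python caps the number of simultaneously active frames to guard against runaway recursion; the limit can be inspected or raised via the standard sys module:

```python
import sys

print(sys.getrecursionlimit())  # typically 1000 by default

# A call such as factorial(2000), with factorial as defined above,
# would exceed the default limit and raise RecursionError; the limit
# can be raised when deeper recursion is genuinely needed.
sys.setrecursionlimit(5000)
```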
The recursive trace
Example: Drawing an English ruler
• We denote the length of the tick designating a whole inch as the major tick length

• Between the marks for whole inches, the ruler contains a series of minor ticks, placed at intervals of 1/2 inch, 1/4 inch, and so on

• As the size of the interval decreases by half, the tick length decreases by one
Recursive implementation of English
ruler

• An interval with a central tick length L ≥ 1 is composed of:
  ✓ An interval with a central tick length L−1
  ✓ A single tick of length L
  ✓ An interval with a central tick length L−1
Solution
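The solution slide's code is not reproduced here; a sketch following the decomposition described above (function names are assumptions):

```python
def draw_line(tick_length, tick_label=''):
    """Draw one line with the given tick length, followed by an optional label."""
    line = '-' * tick_length
    if tick_label:
        line += ' ' + tick_label
    print(line)

def draw_interval(center_length):
    """Draw the minor ticks of an interval around a central tick of this length."""
    if center_length > 0:                 # stop once the tick length reaches 0
        draw_interval(center_length - 1)  # recursively draw the top half
        draw_line(center_length)          # draw the central tick
        draw_interval(center_length - 1)  # recursively draw the bottom half

def draw_ruler(num_inches, major_length):
    """Draw an English ruler with the given number of inches and major tick length."""
    draw_line(major_length, '0')           # line for inch 0
    for j in range(1, 1 + num_inches):
        draw_interval(major_length - 1)    # interior ticks for this inch
        draw_line(major_length, str(j))    # line and label for inch j

draw_ruler(2, 3)
```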
The recursive
trace for
English ruler
Example: Binary search
• A classic and very useful recursive algorithm, binary search, can be used to efficiently locate a target value within a sorted sequence of n elements

• When the sequence is unsorted, the standard approach to search for a target value is to use a loop to examine every element, until either finding the target or exhausting the data set; this is known as the sequential search algorithm
Binary search
• When the sequence is sorted and indexable, binary search is a much more efficient algorithm

• For any index j, we know that all the values stored at indices 0, …, j−1 are less than or equal to the value at index j, and all the values stored at indices j+1, …, n−1 are greater than or equal to it
The strategy of binary search
• We call an element of the sequence a candidate if, at the current stage of the search, we cannot rule out that this item matches the target

• The algorithm maintains two parameters, low and high, such that all the candidate entries have index at least low and at most high

• Initially, low = 0 and high = n−1. We then compare the target value to the median candidate, that is, the item data[mid] with index mid = (low + high) // 2 (using integer division)
The strategy of binary search
• If the target equals data[mid], then we have found the item we are looking for, and the search terminates successfully

• If target < data[mid], then we recur on the first half of the sequence, that is, on the interval of indices from low to mid−1

• If target > data[mid], then we recur on the second half of the sequence, that is, on the interval of indices from mid+1 to high
Solution
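The solution slide's code is not reproduced in this text; the strategy above translates into Python along these lines:

```python
def binary_search(data, target, low, high):
    """Return True if target is found in data[low:high+1]; data must be sorted."""
    if low > high:
        return False                     # interval is empty; no match
    mid = (low + high) // 2
    if target == data[mid]:              # found a match
        return True
    elif target < data[mid]:
        # recur on the first half, indices low to mid-1
        return binary_search(data, target, low, mid - 1)
    else:
        # recur on the second half, indices mid+1 to high
        return binary_search(data, target, mid + 1, high)

data = [2, 4, 5, 7, 8, 9, 12, 14, 17, 19, 22, 25, 27, 28, 33, 37]
print(binary_search(data, 22, 0, len(data) - 1))  # True
```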
Time complexity of binary search

Proposition: The binary search algorithm runs in O(log n) time for a sorted sequence with n elements
Proof

Each recursive call performs a constant number of primitive operations and reduces the number of remaining candidates, high − low + 1, by at least half. Starting from n candidates, after j recursive calls at most n/2^j candidates remain, so the recursion must stop after at most ⌊log n⌋ + 1 calls. Hence the total running time is O(log n).
