Principles of Algorithm Analysis: Biostatistics 615

This document discusses principles for analyzing algorithms. It introduces common relationships between the input size N and an algorithm's running time, such as O(N), O(log N), and O(N^2). It compares the sequential search and binary search algorithms, showing that binary search has better worst-case performance of O(log N) compared to sequential search's O(N). Empirical testing of the algorithms on various input sizes is also presented.

Uploaded by

ZeedArt

Principles of Algorithm Analysis

Biostatistics 615
Lecture 3
Problem Set…
z Questions?

z FAQ:
• How to compile C code?
• 10 minute introduction for the adventurous…
C Compilers
z Popular commercial compilers are:
• Borland C++ Builder
• Microsoft Visual C++
• Metrowerks Codewarrior

z Several free compilers also available

• Borland C++ (older version, no graphics)
• GCC
GCC
z GNU C Compiler
• Available on most UNIX systems
• Also on newer Macintosh computers

z For Windows, download from

• www.mingw.org
• www.cygwin.com
Running GCC
z Command line application
z Basic usage is:

gcc -o program_name source.c

• Use extension “.c” or “.cpp” for source code

z Reads in source(s), creates executable program
A simple program …
/* This is a comment */
#include <stdio.h>
#include <stdlib.h>

int main()
{
    int lucky;              // Variable declaration

    srand(123456);          // Initialize random numbers

    lucky = rand() % 49;    // Generate a random number from 0 to 48

    printf("Hello! My lucky number is %d\n", lucky);

    return 0;
}
Good editors for C programs…
z Commercial compilers provide very fancy editors

z A good free alternative is nedit

• On Windows, available when you install cygwin.
Today
z Strategies for comparing algorithms

z Common relationships between algorithm complexity and input data

z Compare two simple search algorithms

Objectives
z Framework for

• empirical testing
• approximate analysis

z Highlight performance characteristics of algorithms
Specific Questions
z Compare two algorithms for one task

z Predict performance in a new environment

• If we had a computer that was 10x faster and could handle 10x more data, how would each approach perform?

z Set values of algorithm parameters

Two Common Mistakes
z Ignore performance of algorithm
• Shun faster algorithms to avoid complexity in program
• Instead, wait for simple N^2 algorithms, when N log N alternatives of modest complexity are available

z Too much weight on performance of algorithm

• Improving a very fast program is not worth it
• Spending too much time tinkering with code is rarely a good use of time
Empirical analysis
z Given two algorithms … which is better?

z Run both
• Say, algorithm A takes 3 seconds
• Say, algorithm B takes 30 seconds

z Empirical studies may not always be practical

• Some algorithms may take too long to run!
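
A minimal sketch of "run both and compare", assuming CPU time from the standard clock() function is an acceptable measure. The names time_call, sum_formula and sum_loop are hypothetical stand-ins for algorithms A and B; they are not from the lecture.

```c
#include <assert.h>
#include <stddef.h>
#include <time.h>

/* Hypothetical algorithm A: sum 1..n with a closed form (constant work). */
long sum_formula(long n)
{
    return n * (n + 1) / 2;
}

/* Hypothetical algorithm B: sum 1..n with a loop (work grows with n). */
long sum_loop(long n)
{
    long total = 0;
    for (long i = 1; i <= n; i++)
        total += i;
    return total;
}

/* Run f(n) once, store its result in *result, and return the elapsed
   CPU time in seconds as reported by clock(). */
double time_call(long (*f)(long), long n, long *result)
{
    assert(f != NULL && result != NULL);
    clock_t begin = clock();
    *result = f(n);
    clock_t end = clock();
    return (double)(end - begin) / CLOCKS_PER_SEC;
}
```

Calling time_call once per algorithm on the same n gives two timings to compare. A single run can be too short for the resolution of clock(), so in practice the call is usually repeated many times and the total is reported.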
Choices of Input Data
z Actual data
• Measures performance in use
z Random data
• Generic approach, may not be representative
z Perverse data
• Attempt worst case analysis
Limitations of Empirical Analysis
z Quality of implementation
• Is our favored implementation coded more carefully than another?

z Extraneous factors
• Compiler
• Machine
• Computer system
Limitations of Empirical Analysis
z Requires a working program

z Theoretical analysis is an alternative

• Estimate potential gains
z Predict effectiveness relative to new algorithms or computers (that may not yet exist)
Theoretical Analysis
z Predict performance of algorithm based on theoretical properties

z “Independent” of actual implementation

z Several constructs occur frequently in algorithm analysis
Limitations of Theoretical Analysis

z Efficiency can depend on compiler

z Efficiency may fluctuate with input data

z Some algorithms are not well understood

The idea…
z Given a code fragment

// Find parent of node i

i = a[i];

z Consider how many times it is executed

z But not how long each execution takes
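
To make the counting idea concrete, here is a small sketch that repeats the fragment until it reaches a root and reports how many times it ran. The convention that a root is its own parent (a[i] == i) and the function name count_parent_steps are assumptions made for illustration; the slides do not specify them.

```c
#include <assert.h>
#include <stddef.h>

/* Follow parent links from node i until reaching a root, where a root is
   assumed to be a node that is its own parent. Returns the number of times
   the statement i = a[i] was executed. Assumes a[] has no cycles other
   than the self-loops at roots. */
int count_parent_steps(const int a[], int i)
{
    assert(a != NULL && i >= 0);
    int steps = 0;
    while (a[i] != i) {
        i = a[i];       /* the fragment being counted */
        steps++;
    }
    return steps;
}
```

For a chain 4 → 3 → 2 → 1 → 0 the count from node 4 is 4, while for a flat tree in which every node points directly at node 0 it is 1. The analysis cares about this count, not about how long each assignment takes.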
Two typical analyses
z Average-case for random input

z Worst-case

z Are these representative of real world problems?
• Check with empirical predictions…
The Primary Parameter N
z Examples
• Degree of polynomial
• Number of characters in a string
• Size of file to be sorted
• Number of input data items
• Some other abstract measure of problem size

z With multiple parameters, we can often hold one of them constant
Running time as a function of N
f(N)       Description    Running time when N doubles…
1          constant       -
log N      logarithmic    constant increase
N          linear         doubles
N log N    log-linear     more than doubles
N^2        quadratic      increases fourfold
N^3        cubic          increases eightfold
2^N        exponential    running time squares
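
The doubling behaviour in the table can be checked numerically. The sketch below uses integer arithmetic only, with base-2 logarithms and N restricted to powers of two so that log N is exact. The function names (f_log, f_nlogn and so on) are illustrative choices, not part of the lecture.

```c
#include <assert.h>

typedef unsigned long long u64;

/* floor(log2(n)) for n >= 1, by repeated halving. */
u64 f_log(u64 n)
{
    assert(n >= 1);
    u64 k = 0;
    while (n > 1) {
        n /= 2;
        k++;
    }
    return k;
}

u64 f_linear(u64 n) { return n; }
u64 f_nlogn(u64 n)  { return n * f_log(n); }
u64 f_quad(u64 n)   { return n * n; }
u64 f_cubic(u64 n)  { return n * n * n; }

/* 2^n, valid only for n < 64 so that the result fits in 64 bits. */
u64 f_exp(u64 n)
{
    assert(n < 64);
    return 1ULL << n;
}
```

With N = 1024, f_log grows by exactly 1 when N doubles, f_linear doubles, f_nlogn goes from 10240 to 22528 (a little more than double), f_quad grows fourfold and f_cubic eightfold. For the exponential row, f_exp(10) equals f_exp(5) squared.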
Running time as a function of N
z Multiple terms may be involved
• e.g. N + N log N

z Typically, we ignore
• Smaller terms
• Constant coefficient
• Focus on inner loop

z In rare cases, smaller terms and constant coefficient will be important
Time to Solve Large Problem

Problem Size N = 1,000,000

operations
per second    N          N log N    N^2
10^6          seconds    minutes    months
10^9          instant    instant    hours
10^12         instant    instant    seconds

Time to Solve Huge Problem

Problem Size N = 1,000,000,000

operations
per second    N          N log N    N^2
10^6          hours      days       never
10^9          seconds    minutes    centuries
10^12         instant    instant    months
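
The arithmetic behind tables like these is the operation count divided by machine speed. The sketch below assumes the algorithm performs exactly f(N) basic operations, that is, a constant factor of 1, which is an idealisation. The helper names seconds_needed and seconds_to_days are hypothetical.

```c
#include <assert.h>

/* Seconds needed to execute `operations` basic steps on a machine that
   performs `ops_per_second` steps per second (constant factor of 1). */
double seconds_needed(double operations, double ops_per_second)
{
    assert(operations >= 0.0 && ops_per_second > 0.0);
    return operations / ops_per_second;
}

/* Convert a duration in seconds to days. */
double seconds_to_days(double seconds)
{
    return seconds / 86400.0;
}
```

For example, with N = 1,000,000 an N^2 algorithm needs 10^12 operations, so seconds_needed(1e12, 1e6) is 10^6 seconds, or about 11.6 days. The labels in the tables are coarse, and real figures also depend on the constant factor that this sketch ignores.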

Big-Oh Notation
z Algorithm is O(N) or O(N log N)
• Common statement
• What does it mean?
z Summarizes performance for large N

z Focuses on leading terms of expression describing running time
Big-Oh Notation
z Consider function g(N)

z It is said to be O(f(N))

z If there exist constants $c_0$ and $N_0$ such that:

• $N > N_0$ implies $c_0 f(N) > g(N)$
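
As a worked instance of this definition, take a running time of 3N + 10 steps. The function and the constants below are illustrative choices, not from the lecture.

```latex
\[
g(N) = 3N + 10, \qquad f(N) = N, \qquad c_0 = 4, \qquad N_0 = 10
\]
\[
N > N_0 \;\Rightarrow\; c_0 f(N) = 4N = 3N + N > 3N + 10 = g(N)
\]
```

So this g(N) is O(N): beyond N = 10, it is always below 4N.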

From N to Running Time…
z Common relationships
• N^2
• log N
• N log N
• N

z Describe examples of how these arise

z Cost of running program is $C_N$
O(N^2)
z Loop through input successively, eliminate one item at a time

\begin{align*}
C_N &= C_{N-1} + N \quad \text{for } N \ge 2,\ C_1 = 1 \\
    &= C_{N-2} + (N-1) + N \\
    &\;\;\vdots \\
    &= 1 + 2 + \cdots + (N-1) + N \\
    &= \frac{N(N+1)}{2}
\end{align*}
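O(log N)
z Recursive program, halves input in one step (with $N = 2^n$)

\begin{align*}
C_{2^n} &= C_{2^{n-1}} + 1 \quad \text{for } N \ge 2,\ C_1 = 1 \\
        &= C_{2^{n-2}} + 1 + 1 \\
        &= C_{2^{n-3}} + 3 \\
        &\;\;\vdots \\
        &= C_{2^0} + n \\
        &= n + 1
\end{align*}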
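O(N log N)
z Recursive program, processes each item, splits input into two halves, examines each one…

\begin{align*}
C_N &= 2C_{N/2} + N \quad \text{for } N \ge 2,\ C_1 = 0 \\
C_{2^n} &= 2C_{2^{n-1}} + 2^n \\
\frac{C_{2^n}}{2^n} &= \frac{2C_{2^{n-1}} + 2^n}{2^n} \\
    &= \frac{C_{2^{n-1}}}{2^{n-1}} + 1 \\
    &= \frac{C_{2^{n-2}}}{2^{n-2}} + 1 + 1 \\
    &\;\;\vdots \\
    &= n
\end{align*}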
O(N)

z Halves input, must examine each item…

\begin{align*}
C_N &= C_{N/2} + N \quad \text{for } N \ge 2,\ C_1 = 1 \\
    &= N + \frac{N}{2} + \frac{N}{4} + \frac{N}{8} + \cdots \\
    &\approx 2N
\end{align*}
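
The four recurrences above can be evaluated directly and compared with their closed forms. This is a sketch for N a power of two; the function names are hypothetical and chosen only to describe each recurrence.

```c
#include <assert.h>

/* C_N = C_{N-1} + N, C_1 = 1: eliminate one item per pass. */
unsigned long cost_eliminate_one(unsigned long n)
{
    assert(n >= 1);
    unsigned long c = 1;
    for (unsigned long k = 2; k <= n; k++)
        c += k;
    return c;
}

/* C_N = C_{N/2} + 1, C_1 = 1: halve the input in one step. */
unsigned long cost_halve(unsigned long n)
{
    assert(n >= 1);
    return n == 1 ? 1 : cost_halve(n / 2) + 1;
}

/* C_N = 2 C_{N/2} + N, C_1 = 0: process each item, recurse on both halves. */
unsigned long cost_split_both(unsigned long n)
{
    assert(n >= 1);
    return n == 1 ? 0 : 2 * cost_split_both(n / 2) + n;
}

/* C_N = C_{N/2} + N, C_1 = 1: examine each item, recurse on one half. */
unsigned long cost_halve_scan(unsigned long n)
{
    assert(n >= 1);
    return n == 1 ? 1 : cost_halve_scan(n / 2) + n;
}
```

For N = 1024 (n = 10) these give 524800 = N(N+1)/2, 11 = n + 1, 10240 = N · n, and 2047, which is just under 2N.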
Application
z Analysis of two search algorithms

z Consider a set of items

• Evaluate functions to decide whether a particular item is present…
Sequential Search
int search(int a[], int value, int start, int stop)
{
    // Variable declarations
    int i;

    // Search through each item
    for (i = start; i <= stop; i++)
        if (value == a[i])
            return i;

    // Search failed
    return -1;
}
Sequential Search Properties
z Algorithm:
• Look through array sequentially, until we find a match

z Average cost
• If match found: N/2
• If match not found: N

z Actual cost depends on fraction of successful searches
Better Sequential Search
z If items are sorted…

z Stop unsuccessful search early, when we reach item with higher value
• Cost for unsuccessful searches is now N/2

z Overall, algorithm is still O(N)
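
A sketch of the early-stopping variant, assuming the array is sorted in ascending order. The name search_sorted is hypothetical; the interface mirrors the search function shown earlier.

```c
#include <assert.h>
#include <stddef.h>

/* Sequential search in a[start..stop], which is assumed to be sorted in
   ascending order. Returns the index of value, or -1 if it is absent.
   The loop stops as soon as it meets an item greater than value, since
   no later item can match. */
int search_sorted(const int a[], int value, int start, int stop)
{
    assert(a != NULL);
    for (int i = start; i <= stop; i++) {
        if (a[i] == value)
            return i;
        if (a[i] > value)
            break;          /* passed the place where value would be */
    }
    return -1;
}
```

With a = {2, 4, 6, 8, 10}, a search for 5 stops at the third item instead of scanning all five. Every search still examines a number of items proportional to N on average, which is why the method remains O(N).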

Binary Search
int search(int a[], int value, int start, int stop)
{
    while (stop >= start)
    {
        // Find midpoint (written this way so start + stop cannot overflow)
        int mid = start + (stop - start) / 2;

        // Compare midpoint to value
        if (value == a[mid]) return mid;

        // Reduce input in half!!!
        if (value > a[mid])
            { start = mid + 1; }
        else
            { stop = mid - 1; }
    }

    // Search failed
    return -1;
}
Binary Search Properties
z Algorithm:
• Halve number of items to consider with each comparison

z Worst-case cost
• Maximum number of midpoints examined is never greater than ⌊log2 N⌋ + 1

z Much better than sequential search, but even better methods exist!
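
The logarithmic worst case can be observed by counting how many midpoints a search examines. The sketch below is an instrumented copy of binary search; the names search_counting and max_probes and the probes output parameter are additions for illustration. It assumes the array is sorted in ascending order.

```c
#include <assert.h>
#include <stddef.h>

/* Binary search in a[start..stop] (sorted ascending). Returns the index
   of value or -1, and stores in *probes the number of midpoints examined. */
int search_counting(const int a[], int value, int start, int stop, int *probes)
{
    assert(a != NULL && probes != NULL);
    *probes = 0;
    while (stop >= start) {
        int mid = start + (stop - start) / 2;
        (*probes)++;
        if (value == a[mid])
            return mid;
        if (value > a[mid])
            start = mid + 1;
        else
            stop = mid - 1;
    }
    return -1;
}

/* floor(log2(n)) + 1 for n >= 1: an upper bound on the number of probes
   binary search makes in a table of n items. */
int max_probes(int n)
{
    assert(n >= 1);
    int k = 0;
    while (n > 0) {
        n /= 2;
        k++;
    }
    return k;
}
```

For a table of 1,000 items the bound is max_probes(1000) = 10, so no search, successful or not, examines more than ten midpoints, compared with up to 1,000 items for sequential search.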
Sequential vs. Binary Search
          M = 1,000     M = 10,000    M = 100,000
N         S      B      S      B      S       B
125       1      1      13     2      130     20
250       3      0      25     2      251     22
500       5      0      49     3      492     23
1250      13     0      128    3      1276    25
2500      26     1      267    3      *       28
Timings in seconds, for M searches in table of N elements (S = sequential search, B = binary search)
Summary
z Outlined principles for analysis of algorithms

z Introduced some common relationships between N and running time

z Described two simple search algorithms

Further Reading
z Read chapter 2 of Sedgewick
Tip of the Day: Defensive Programming
z Document code
• Indicate intended purpose
• Specify required inputs
• Always indicate author

z Check for error conditions
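
One way these tips might look when applied to the sequential search from this lecture. The header comment layout, the name search_checked and the error code -2 are assumptions made for illustration, not a prescribed style.

```c
#include <stddef.h>

/*
 * Purpose:  find value in a[start..stop] by sequential search.
 * Inputs:   a      array of ints, must not be NULL
 *           value  item to look for
 *           start  first index to examine, must be >= 0
 *           stop   last index to examine (inclusive)
 * Returns:  index of the first match, -1 if value is absent,
 *           -2 if the inputs are invalid (hypothetical error code).
 * Author:   (your name here)
 */
int search_checked(const int a[], int value, int start, int stop)
{
    /* Check for error conditions before touching the array */
    if (a == NULL || start < 0)
        return -2;

    for (int i = start; i <= stop; i++)
        if (a[i] == value)
            return i;

    /* Search failed (this also covers an empty range, stop < start) */
    return -1;
}
```

The function cannot verify that stop is within the bounds of the array, since C does not carry array lengths; that requirement stays the caller's responsibility and belongs in the documentation.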
