Short Notes On Knuth

Uploaded by

Janhavi Bhati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views2 pages

Short Notes On Knuth

Uploaded by

Janhavi Bhati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Short Notes on Knuth-Morris-Pratt (KMP) Algorithm

1. Purpose:
The KMP algorithm is a pattern-matching algorithm used to find the occurrence of a pattern PPP of length
mmm in a text TTT of length nnn. It avoids redundant comparisons, achieving a time complexity of
O(n+m)O(n + m)O(n+m).
2. Key Idea:
Instead of starting over after a mismatch, the algorithm uses information from the pattern itself to skip
unnecessary comparisons. This is done using a failure function.
3. Failure Function:
o The failure function f(j)f(j)f(j) for a pattern PPP represents the length of the longest prefix of PPP that
is also a suffix of P[1..j]P[1..j]P[1..j].
o Example for P="abacab"P = "abacab"P="abacab": j:012345P[j]:abacabf(j):001012j: 0 \quad 1 \quad
2 \quad 3 \quad 4 \quad 5 P[j]: a \quad b \quad a \quad c \quad a \quad b f(j): 0 \quad 0 \quad 1 \
quad 0 \quad 1 \quad 2 j:012345P[j]:abacabf(j):001012
4. Algorithm Steps:
o Preprocessing: Compute the failure function fff for PPP in O(m)O(m)O(m).
o Matching: Use fff to determine how far to shift the pattern after a mismatch, reducing unnecessary
comparisons.
5. Performance:
o Worst-case time complexity: O(n+m)O(n + m)O(n+m).
o This is optimal since every character in both TTT and PPP is processed at most once.
6. Advantages:
o Efficient for large TTT and PPP.
o Reduces the need to recheck previously matched characters.
7. C++ Implementation:
The algorithm can be implemented in C++ using two functions: one for matching (KMPMatch) and another
for computing the failure function (computeFailFunction).

Descriptive Questions and Answers

1. Q1: Explain the key concept of the KMP algorithm. Why is it more efficient than the brute-force approach?
Answer:
The KMP algorithm avoids redundant comparisons by using the failure function. When a mismatch occurs,
the failure function provides the next index to continue the search, skipping unnecessary characters. In
contrast, the brute-force approach restarts the comparison from the next character in the text, leading to
redundant checks. The KMP algorithm thus achieves O(n+m)O(n + m)O(n+m) time complexity, whereas brute
force can take O(n⋅m)O(n \cdot m)O(n⋅m) in the worst case.

Q2: Define the failure function f(j)f(j)f(j) and explain its significance in the KMP algorithm.
Answer:
The failure function f(j)f(j)f(j) is defined as the length of the longest prefix of PPP that is also a suffix of
P[1..j]P[1..j]P[1..j]. It helps the KMP algorithm efficiently shift the pattern PPP in the text TTT after a mismatch,
ensuring no redundant comparisons are made. It encodes information about repeated substrings within the pattern.

Q5: Analyze the time complexity of the KMP algorithm.

Answer:
 The failure function is computed in O(m)O(m)O(m).
 The matching phase processes nnn characters of TTT and uses fff to skip unnecessary comparisons. Each
iteration either increments iii or reduces jjj, ensuring at most 2n2n2n iterations.
 Total time complexity: O(m+n)O(m + n)O(m+n).

Let's compute the failure function f(j)f(j) for the pattern P="ababaca"P = "ababaca" step by step. The failure function
f(j)f(j) represents the length of the longest prefix of PP that is also a suffix of P[1..j]P[1..j].
Pattern PP:
P="a b a b a c a"P = "a \ b \ a \ b \ a \ c \ a"
Steps for f(j)f(j):
1. Initialization:
o f(0)=0f(0) = 0 (by definition).
o Start with i=1i = 1 (current position) and j=0j = 0 (length of longest prefix).

2. Step-by-Step Calculation:
o i=1i = 1:
P[1]=b≠P[0]=aP[1] = b \neq P[0] = a, so f(1)=0f(1) = 0.
No prefix matches the suffix for P[1..1]="ab"P[1..1] = "ab".
o i=2i = 2:
P[2]=a=P[0]P[2] = a = P[0], so f(2)=1f(2) = 1.
Prefix "a""a" matches suffix for P[1..2]="aba"P[1..2] = "aba".
o i=3i = 3:
P[3]=b=P[1]P[3] = b = P[1], so f(3)=2f(3) = 2.
Prefix "ab""ab" matches suffix for P[1..3]="abab"P[1..3] = "abab".
o i=4i = 4:
P[4]=a=P[2]P[4] = a = P[2], so f(4)=3f(4) = 3.
Prefix "aba""aba" matches suffix for P[1..4]="ababa"P[1..4] = "ababa".
o i=5i = 5:
P[5]=c≠P[3]=bP[5] = c \neq P[3] = b, so we use f(3)=2f(3) = 2.
P[5]=c≠P[2]=aP[5] = c \neq P[2] = a, so f(5)=0f(5) = 0.
No prefix matches the suffix for P[1..5]="ababac"P[1..5] = "ababac".
o i=6i = 6:
P[6]=a=P[0]P[6] = a = P[0], so f(6)=1f(6) = 1.
Prefix "a""a" matches suffix for P[1..6]="ababaca"P[1..6] = "ababaca".

Final Failure Function:

j:0123456P[j]:ababacaf(j):0012301j: \quad 0 \quad 1 \quad 2 \quad 3 \quad 4 \quad 5 \quad 6 P[j]: \quad a \quad
b \quad a \quad b \quad a \quad c \quad a f(j): \quad 0 \quad 0 \quad 1 \quad 2 \quad 3 \quad 0 \quad 1

Explanation:
 f(j)f(j) gives us the information to skip unnecessary comparisons in the Knuth-Morris-Pratt algorithm when
there’s a mismatch during pattern matching.

54.string Inotes
No ratings yet
54.string Inotes
20 pages
Knuth Morris Pratt Algorithm
No ratings yet
Knuth Morris Pratt Algorithm
4 pages
Design & Analysis of Algorithm - 6
No ratings yet
Design & Analysis of Algorithm - 6
32 pages
DS Unit-5 Topic
No ratings yet
DS Unit-5 Topic
26 pages
M269 - Lec8 Fall 1819
No ratings yet
M269 - Lec8 Fall 1819
24 pages
Kumboji Pattern Matching Alg
No ratings yet
Kumboji Pattern Matching Alg
4 pages
KMP Algorithm: Engineerpro - K01
No ratings yet
KMP Algorithm: Engineerpro - K01
16 pages
Ch-5 Numerical Daa
No ratings yet
Ch-5 Numerical Daa
11 pages
KMP Algorithm
No ratings yet
KMP Algorithm
21 pages
KMP Algorithm
No ratings yet
KMP Algorithm
19 pages
Module III Problem Solving
No ratings yet
Module III Problem Solving
16 pages
Knuth Morris Pratt Algorithms - Notes
No ratings yet
Knuth Morris Pratt Algorithms - Notes
6 pages
String Matching Chapter 12 Goodrich Nep
No ratings yet
String Matching Chapter 12 Goodrich Nep
43 pages
BNP Unit-5 Lecture 20 KMP 5.2
No ratings yet
BNP Unit-5 Lecture 20 KMP 5.2
14 pages
Unit 3
No ratings yet
Unit 3
34 pages
20BCS5977 - DAA LAB WORKSHEET 3.3pdf
No ratings yet
20BCS5977 - DAA LAB WORKSHEET 3.3pdf
5 pages
Week14 Chap7 String Algorithms
No ratings yet
Week14 Chap7 String Algorithms
13 pages
Lec 7
No ratings yet
Lec 7
24 pages
CSE 205 Lab Manual 12 KMP
No ratings yet
CSE 205 Lab Manual 12 KMP
6 pages
Lecture 34, 35 36 - String Matching Algorithms
No ratings yet
Lecture 34, 35 36 - String Matching Algorithms
42 pages
KMP Algo
No ratings yet
KMP Algo
16 pages
12 StringMatching
No ratings yet
12 StringMatching
23 pages
Week 9 String Algorithms, Approximation
No ratings yet
Week 9 String Algorithms, Approximation
22 pages
Week4 PPT SM
No ratings yet
Week4 PPT SM
35 pages
AAD-String Matching
No ratings yet
AAD-String Matching
15 pages
Cse 217
No ratings yet
Cse 217
10 pages
AAD Lec11
No ratings yet
AAD Lec11
5 pages
Draft 1
No ratings yet
Draft 1
6 pages
DAA DA Output
No ratings yet
DAA DA Output
9 pages
W 9 Presentation
No ratings yet
W 9 Presentation
20 pages
Daa Da
No ratings yet
Daa Da
9 pages
Patternmatching
No ratings yet
Patternmatching
29 pages
BCS304 DS Module 1 KMP Algorithm
No ratings yet
BCS304 DS Module 1 KMP Algorithm
6 pages
Lecture 39 Knutt Morris Pratt
No ratings yet
Lecture 39 Knutt Morris Pratt
15 pages
String Matching - RYS - Lect - 1 - 2 - 3 - Update
No ratings yet
String Matching - RYS - Lect - 1 - 2 - 3 - Update
61 pages
Knuth-Morris-Pratt Algorithm
No ratings yet
Knuth-Morris-Pratt Algorithm
4 pages
The Knuth Morris Pratt Algorithm
No ratings yet
The Knuth Morris Pratt Algorithm
7 pages
Advanced String Lecture
No ratings yet
Advanced String Lecture
50 pages
Corporate Training
No ratings yet
Corporate Training
11 pages
資料工程 Data Engineering: Pattern Matching 張賢宗
No ratings yet
資料工程 Data Engineering: Pattern Matching 張賢宗
38 pages
AOA Module 6 - String of Algorithms - Aeraxia - in
No ratings yet
AOA Module 6 - String of Algorithms - Aeraxia - in
26 pages
String Matching
No ratings yet
String Matching
27 pages
Today's Lecture: String Matching Algorithm Naïve / Brute Force RK
No ratings yet
Today's Lecture: String Matching Algorithm Naïve / Brute Force RK
20 pages
Lecture 56string Matching
No ratings yet
Lecture 56string Matching
43 pages
KMP 2
No ratings yet
KMP 2
7 pages
KMP Algorithm
No ratings yet
KMP Algorithm
20 pages
Toyota Engineering Standard
100% (2)
Toyota Engineering Standard
10 pages
DAA Unit 5
No ratings yet
DAA Unit 5
22 pages
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
No ratings yet
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
18 pages
Ada Notes Unit 4
No ratings yet
Ada Notes Unit 4
28 pages
Knuth Moris 2797348
No ratings yet
Knuth Moris 2797348
21 pages
Caterpillar Model
100% (1)
Caterpillar Model
109 pages
CS 240 Tutorial 11 Notes: C A A B A
No ratings yet
CS 240 Tutorial 11 Notes: C A A B A
2 pages
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
No ratings yet
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
18 pages
Algorithms in Bioinformatics
No ratings yet
Algorithms in Bioinformatics
7 pages
SCM Module1 Questions and Answers 1
No ratings yet
SCM Module1 Questions and Answers 1
11 pages
How A Search Engine Works
No ratings yet
How A Search Engine Works
28 pages
Espan140 Solution 54860159 8697
No ratings yet
Espan140 Solution 54860159 8697
39 pages
A357460420 - 22393 - 2 - 2018 - String Matching
No ratings yet
A357460420 - 22393 - 2 - 2018 - String Matching
27 pages
4 Word Processor
No ratings yet
4 Word Processor
22 pages
Modern Indian History Vision
No ratings yet
Modern Indian History Vision
15 pages
14th June - CSLIVE - Express Analysis
No ratings yet
14th June - CSLIVE - Express Analysis
22 pages
String Matching Problem
No ratings yet
String Matching Problem
16 pages
Hephaestus 7100 - Quick Reference Guide
No ratings yet
Hephaestus 7100 - Quick Reference Guide
4 pages
Midjourney Cheat Sheet PROMPT
89% (9)
Midjourney Cheat Sheet PROMPT
126 pages
W9 Presentation
No ratings yet
W9 Presentation
20 pages
Auction of Dead Stock - Auction Notice of CT
No ratings yet
Auction of Dead Stock - Auction Notice of CT
1 page
Globe Telecom Accounting Case Study
No ratings yet
Globe Telecom Accounting Case Study
20 pages
Lesson Plan On Algebra
No ratings yet
Lesson Plan On Algebra
5 pages
Instruction Manual: Programmable Automatic Shift System
No ratings yet
Instruction Manual: Programmable Automatic Shift System
25 pages
LJ CG Unit 2
No ratings yet
LJ CG Unit 2
2 pages
248HSL
No ratings yet
248HSL
8 pages
Physics Investigatory Project
No ratings yet
Physics Investigatory Project
17 pages
Pavani Profile (Salesforce Developer)
No ratings yet
Pavani Profile (Salesforce Developer)
3 pages
IVth Year Orientation
No ratings yet
IVth Year Orientation
12 pages
Kareem Shagar Formation An Oil Field Located in Ras Gharib Development
No ratings yet
Kareem Shagar Formation An Oil Field Located in Ras Gharib Development
53 pages
Simple Packer-In C Gunther
No ratings yet
Simple Packer-In C Gunther
10 pages
Jtac Notes
No ratings yet
Jtac Notes
18 pages
MS 02 230
No ratings yet
MS 02 230
58 pages
Icmlp 1501
No ratings yet
Icmlp 1501
2 pages
Tut - 03 - 020843
No ratings yet
Tut - 03 - 020843
25 pages
Lowongan Pekerjaan - Employee Referral Program (10022021)
No ratings yet
Lowongan Pekerjaan - Employee Referral Program (10022021)
5 pages
Pathfinder Solution Overview
No ratings yet
Pathfinder Solution Overview
2 pages
NLP Extc Sem8 Final Exam IMPs
No ratings yet
NLP Extc Sem8 Final Exam IMPs
3 pages
Geu Admit Card Back
No ratings yet
Geu Admit Card Back
1 page
LT08
No ratings yet
LT08
5 pages
JDS Call For Papers Air Power and India - 250123
No ratings yet
JDS Call For Papers Air Power and India - 250123
3 pages
Cyber Insurance Policy
No ratings yet
Cyber Insurance Policy
4 pages
Abstract
No ratings yet
Abstract
12 pages
DS BSC Hons Guidelines
No ratings yet
DS BSC Hons Guidelines
2 pages
FPFF RF PDF
No ratings yet
FPFF RF PDF
1 page
5G TF, 5G-NR and DSS (Dynamic Spectrum Sharing)
No ratings yet
5G TF, 5G-NR and DSS (Dynamic Spectrum Sharing)
1 page
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Programming with MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
Programming with MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
4.5/5 (3)
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet

Short Notes On Knuth

Uploaded by

Short Notes On Knuth

Uploaded by

Short Notes on Knuth-Morris-Pratt (KMP) Algorithm

Descriptive Questions and Answers

Q5: Analyze the time complexity of the KMP algorithm.

Final Failure Function:

You might also like