KMP Algorithm

The Knuth-Morris-Pratt algorithm improves on the Morris-Pratt algorithm for string matching by allowing longer shifts of the pattern when a mismatch occurs. It introduces a kmpNext table that is precomputed from the pattern to store the length of the longest prefix that is also a suffix. This allows starting the comparison after a mismatch at kmpNext[i] instead of at the beginning, avoiding re-checking characters. The algorithm runs in O(m+n) time and performs at most 2n-1 character comparisons.

Uploaded by

muffi840

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

625 views3 pages

KMP Algorithm

Uploaded by

muffi840

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 3

Knuth-Morris-Pratt algorithm

Description
The design of the Knuth-Morris-Pratt algorithm follows a tight analysis of the
Morris and Pratt algorithm. Let us look more closely at the Morris-Pratt
algorithm. It is possible to improve the length of the shifts.

Consider an attempt at a left position j, that is when the the window is

positioned on the text factor y[j .. j+m-1]. Assume that the first mismatch
occurs between x[i] and y[i+j] with 0 < i < m. Then, x[0 .. i-1] = y[j .. i+j-1] =u
and a = x[i] y[i+j]=b.

When shifting, it is reasonable to expect that a prefix v of the pattern matches

some suffix of the portion u of the text. Moreover, if we want to avoid another
immediate mismatch, the character following the prefix v in the pattern must be
different from a. The longest such prefix v is called the tagged border of u (it
occurs at both ends of u followed by different characters in x).

This introduces the notation: let kmpNext[i] be the length of the longest border
of x[0 .. i-1] followed by a character c different from x[i] and -1 if no such
tagged border exits, for 0 < i m. Then, after a shift, the comparisons can
resume between characters x[kmpNext[i]] and y[i+j] without missing any
occurrence of x in y, and avoiding a backtrack on the text (see figure 7.1). The
value of kmpNext[0] is set to -1.

Figure 7.1: Shift in the Knuth-Morris-Pratt algorithm (v border of u and c b).

The table kmpNext can be computed in O(m) space and time before the
searching phase, applying the same searching algorithm to the pattern itself,
as if x=y.
The searching phase can be performed in O(m+n) time. The Knuth-Morris-
Pratt algorithm performs at most 2n-1 text character comparisons during the
searching phase. The delay (maximal number of comparisons for a single text

character) is bounded by log (m) where is the golden ratio ( ).

The C code
void preKmp(char *x, int m, int kmpNext[]) {
int i, j;

i = 0;
j = kmpNext[0] = -1;
while (i < m) {
while (j > -1 && x[i] != x[j])
j = kmpNext[j];
i++;
j++;
if (x[i] == x[j])
kmpNext[i] = kmpNext[j];
else
kmpNext[i] = j;
}
}

void KMP(char x, int m, char y, int n) {

int i, j, kmpNext[XSIZE];

/* Preprocessing */
preKmp(x, m, kmpNext);
/* Searching */
i = j = 0;
while (j < n) {
while (i > -1 && x[i] != y[j])
i = kmpNext[i];
i++;
j++;
if (i >= m) {
OUTPUT(j - i);
i = kmpNext[i];
}
}
}

The example

Preprocessing phase

The kmpNext table

Searching phase

Oil Circuit Diagrams - 700R4 - MD8
100% (3)
Oil Circuit Diagrams - 700R4 - MD8
11 pages
Connect Plus Test Centre Administration Guide
80% (5)
Connect Plus Test Centre Administration Guide
45 pages
Brute Force
No ratings yet
Brute Force
5 pages
Abstract
No ratings yet
Abstract
12 pages
Knuth-Morris-Pratt Algorithm KENT
No ratings yet
Knuth-Morris-Pratt Algorithm KENT
4 pages
Week 9 String Algorithms, Approximation
No ratings yet
Week 9 String Algorithms, Approximation
22 pages
KMP 2
No ratings yet
KMP 2
7 pages
Algorithms in Bioinformatics
No ratings yet
Algorithms in Bioinformatics
7 pages
String Matching
No ratings yet
String Matching
35 pages
String Searching Algorithm
No ratings yet
String Searching Algorithm
22 pages
AOA Module 6 - String of Algorithms - Aeraxia - in
No ratings yet
AOA Module 6 - String of Algorithms - Aeraxia - in
26 pages
A357460420 - 22393 - 2 - 2018 - String Matching
No ratings yet
A357460420 - 22393 - 2 - 2018 - String Matching
27 pages
A Two Way Pattern Matching Algorithm Using Sliding Patterns
No ratings yet
A Two Way Pattern Matching Algorithm Using Sliding Patterns
5 pages
Nuth-Orris - Ratt Algorithm KMP: Ahmadreza Nazemi 8432310225 University of Kashan 2006
No ratings yet
Nuth-Orris - Ratt Algorithm KMP: Ahmadreza Nazemi 8432310225 University of Kashan 2006
11 pages
KMP Algorithm
No ratings yet
KMP Algorithm
21 pages
Module III Problem Solving
No ratings yet
Module III Problem Solving
16 pages
Daa Da
No ratings yet
Daa Da
9 pages
String Search: 1 2 I I+1 I+m-1 N
No ratings yet
String Search: 1 2 I I+1 I+m-1 N
8 pages
Lecture 39 Knutt Morris Pratt
No ratings yet
Lecture 39 Knutt Morris Pratt
15 pages
Ads Unit5
No ratings yet
Ads Unit5
26 pages
Lecture 34, 35 36 - String Matching Algorithms
No ratings yet
Lecture 34, 35 36 - String Matching Algorithms
42 pages
KMP Algo
No ratings yet
KMP Algo
16 pages
String Matching Chapter 12 Goodrich Nep
No ratings yet
String Matching Chapter 12 Goodrich Nep
43 pages
String Matching Kmprabin Karp and Naive
No ratings yet
String Matching Kmprabin Karp and Naive
41 pages
String Matching
No ratings yet
String Matching
34 pages
BCS304 DS Module 1 KMP Algorithm
No ratings yet
BCS304 DS Module 1 KMP Algorithm
6 pages
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
No ratings yet
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
18 pages
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
No ratings yet
Cse2012 Design and Analysis of Algorithms Lab Digital Assignment 2
18 pages
Unit 5
No ratings yet
Unit 5
14 pages
String Matching Problem
No ratings yet
String Matching Problem
16 pages
M269 - Lec8 Fall 1819
No ratings yet
M269 - Lec8 Fall 1819
24 pages
32.4 The Knuth-Morris-Pratt Algorithm: Either
No ratings yet
32.4 The Knuth-Morris-Pratt Algorithm: Either
10 pages
AAD Lec11
No ratings yet
AAD Lec11
5 pages
5CS4-AOA-Unit-3 @zammers
No ratings yet
5CS4-AOA-Unit-3 @zammers
7 pages
KMP Algorithm 1
No ratings yet
KMP Algorithm 1
22 pages
Lecture 18 - String Matching-KMP
No ratings yet
Lecture 18 - String Matching-KMP
40 pages
CS 240 Tutorial 11 Notes: C A A B A
No ratings yet
CS 240 Tutorial 11 Notes: C A A B A
2 pages
Knuth Moris 2797348
No ratings yet
Knuth Moris 2797348
21 pages
4string Matching Kmprabin Karp and Naive
No ratings yet
4string Matching Kmprabin Karp and Naive
57 pages
W 9 Presentation
No ratings yet
W 9 Presentation
20 pages
W9 Presentation
No ratings yet
W9 Presentation
20 pages
AoA Exp10
No ratings yet
AoA Exp10
8 pages
How A Search Engine Works
No ratings yet
How A Search Engine Works
28 pages
String Matching
No ratings yet
String Matching
27 pages
Internetalgo
No ratings yet
Internetalgo
13 pages
Naïve Method. Code:: Naive, Rabin-Karp, and Knuth-Morris-Pratt Algorithms For String Matching
No ratings yet
Naïve Method. Code:: Naive, Rabin-Karp, and Knuth-Morris-Pratt Algorithms For String Matching
5 pages
Algo Lecture 7
No ratings yet
Algo Lecture 7
52 pages
KMP Algorithm
No ratings yet
KMP Algorithm
19 pages
The Knuth Morris Pratt Algorithm
No ratings yet
The Knuth Morris Pratt Algorithm
7 pages
Ada Notes Unit 4
No ratings yet
Ada Notes Unit 4
28 pages
CH 8
No ratings yet
CH 8
26 pages
Sandeep Singh (Iii B.Tech I.T)
No ratings yet
Sandeep Singh (Iii B.Tech I.T)
179 pages
Patternmatching
No ratings yet
Patternmatching
29 pages
Data Structures Using C: Example 4.13
No ratings yet
Data Structures Using C: Example 4.13
5 pages
Lecture 04
No ratings yet
Lecture 04
18 pages
DAA Unit 5
No ratings yet
DAA Unit 5
22 pages
Mathematical Model For String Pattern Matching Algorithm (Boyer-Moore's Algorithm)
No ratings yet
Mathematical Model For String Pattern Matching Algorithm (Boyer-Moore's Algorithm)
5 pages
DAA DA Output
No ratings yet
DAA DA Output
9 pages
KMP Algorithm
No ratings yet
KMP Algorithm
26 pages
String Matching Algorithms
No ratings yet
String Matching Algorithms
25 pages
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
From Everand
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
Yue Jiang
4.5/5 (2)
Fifth Dimension: The Light to See
From Everand
Fifth Dimension: The Light to See
Marc E. King
No ratings yet
Centos Gui
No ratings yet
Centos Gui
2 pages
Intermediate Documents (Idocs) : What Is An Idoc
No ratings yet
Intermediate Documents (Idocs) : What Is An Idoc
21 pages
Principles of Flight PDF
No ratings yet
Principles of Flight PDF
34 pages
Summer Internship Report
No ratings yet
Summer Internship Report
13 pages
Iat-I QP
No ratings yet
Iat-I QP
2 pages
PCL Barcode Manual
No ratings yet
PCL Barcode Manual
59 pages
Irrig Operating Procedure
No ratings yet
Irrig Operating Procedure
3 pages
Wait Event
No ratings yet
Wait Event
8 pages
Ka Ku Analysis
No ratings yet
Ka Ku Analysis
12 pages
Barlington, Spectramag-6 Six-Channel Spectrum Analyser PDF
No ratings yet
Barlington, Spectramag-6 Six-Channel Spectrum Analyser PDF
68 pages
Lec 1 Introduction Frequencty Response
No ratings yet
Lec 1 Introduction Frequencty Response
61 pages
Re Liquefaction System EcoRel - Cryostar Magazine 10
100% (1)
Re Liquefaction System EcoRel - Cryostar Magazine 10
12 pages
HTML & XML For Beginners
100% (1)
HTML & XML For Beginners
417 pages
Iheartmedia: Brand Guidelines
No ratings yet
Iheartmedia: Brand Guidelines
20 pages
WCU NURS 492 - Nursing Capstone - 2014fall - I - 8-4-14
No ratings yet
WCU NURS 492 - Nursing Capstone - 2014fall - I - 8-4-14
14 pages
Storage & Content Delivery: Amazon Simple Storage Service AWS Import/Export
100% (1)
Storage & Content Delivery: Amazon Simple Storage Service AWS Import/Export
20 pages
Manual Ad330
No ratings yet
Manual Ad330
32 pages
Structural Health Monitoring of Aluminum Plate Using Acoustic Sensors
No ratings yet
Structural Health Monitoring of Aluminum Plate Using Acoustic Sensors
10 pages
Subsurface Textile Irrigation
50% (2)
Subsurface Textile Irrigation
2 pages
Spring: May Is National Electrical Safety Month!
No ratings yet
Spring: May Is National Electrical Safety Month!
4 pages
Mgmtbooks PDF
No ratings yet
Mgmtbooks PDF
60 pages
Osaka Steel PLC: Mechanical Department)
No ratings yet
Osaka Steel PLC: Mechanical Department)
11 pages
VIP5662W Wireless IPTV Receiver: Installation Guide
No ratings yet
VIP5662W Wireless IPTV Receiver: Installation Guide
24 pages
Vipul CV
No ratings yet
Vipul CV
4 pages
Louis Ballington 2nd Place Winchester GT 2023 - Space Wolves Allies
No ratings yet
Louis Ballington 2nd Place Winchester GT 2023 - Space Wolves Allies
3 pages
ProVaC Catalogue V2
No ratings yet
ProVaC Catalogue V2
4 pages
BJT - MKWI4201 - Bahasa Inggris - ROBENTUS LACAN - TMK2
No ratings yet
BJT - MKWI4201 - Bahasa Inggris - ROBENTUS LACAN - TMK2
3 pages
Illlffl F
No ratings yet
Illlffl F
26 pages

KMP Algorithm

Uploaded by

KMP Algorithm

Uploaded by

Knuth-Morris-Pratt algorithm

Consider an attempt at a left position j, that is when the the window is

When shifting, it is reasonable to expect that a prefix v of the pattern matches

Figure 7.1: Shift in the Knuth-Morris-Pratt algorithm (v border of u and c b).

character) is bounded by log (m) where is the golden ratio ( ).

void KMP(char *x, int m, char *y, int n) {

The kmpNext table

You might also like

void KMP(char x, int m, char y, int n) {