0% found this document useful (0 votes)

7 views4 pages

Module V

Uploaded by

nayankonar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views4 pages

Module V

Uploaded by

nayankonar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

**PGCSE104: Advanced Algorithms

Module V - (4L)
Set and String Problems**

In this module, we explore important problems related to sets and strings, focusing on
optimization techniques and algorithms.

1. Set Cover Problem

The Set Cover Problem is a classical optimization problem where we aim to cover all elements of a
universal set with the minimum number of subsets from a given collection.

Problem Statement

Given a universe U and a collection S = {S1 , S2 , … , Sm } of subsets of U , find the minimum

number of subsets from S whose union equals U .

Approach
The problem is NP-hard, but a greedy algorithm provides an approximate solution. The greedy
approach selects the subset that covers the most uncovered elements of U at each step.

Greedy Algorithm
Initialize the set of covered elements as empty.
While there are uncovered elements, select the subset that covers the maximum number of
uncovered elements.
Repeat until all elements are covered.

Code Example (Python)

python Copy code

def set_cover(universe, subsets): covered = set() selected_subsets = [] while covered

!= universe: # Choose the subset that covers the most uncovered elements subset =
max(subsets, key=lambda s: len(s - covered)) selected_subsets.append(subset) covered
|= subset return selected_subsets # Example usage universe = {1, 2, 3, 4, 5} subsets
= [{1, 2, 3}, {2, 4}, {3, 4, 5}, {5}] solution = set_cover(universe, subsets)
print("Selected subsets:", solution)
2. String Matching
String Matching refers to the problem of finding one or more occurrences of a pattern string
within a larger text string.

Naive String Matching Algorithm

The simplest way to solve this problem is the naive algorithm, which slides the pattern over the
text one character at a time and checks for a match.

Code Example (Python)

python Copy code

def naive_string_matching(text, pattern): n = len(text) m = len(pattern) occurrences

= [] for i in range(n - m + 1): if text[i:i+m] == pattern: occurrences.append(i)
return occurrences # Example usage text = "abracadabra" pattern = "abra"
print("Pattern found at positions:", naive_string_matching(text, pattern))

KMP Algorithm (Knuth-Morris-Pratt)

The KMP algorithm is an efficient string matching algorithm that preprocesses the pattern to
avoid unnecessary comparisons. It uses a partial match table (also called the "lps" array) to skip
sections of the text.

3. Approximate String Matching

Approximate String Matching (also known as fuzzy string matching) is the problem of finding
substrings that match a pattern approximately, allowing for some mismatches or errors (insertions,
deletions, or substitutions).

Dynamic Programming Approach

The most common way to solve this problem is to use dynamic programming to compute
the edit distance (Levenshtein distance), which is the minimum number of operations (insertions,
deletions, or substitutions) required to convert one string into another.

Code Example (Python)

python Copy code

def edit_distance(s1, s2): n = len(s1) m = len(s2) dp = [[0] * (m + 1) for _ in

range(n + 1)] for i in range(n + 1): for j in range(m + 1): if i == 0: dp[i][j] = j
elif j == 0: dp[i][j] = i elif s1[i-1] == s2[j-1]: dp[i][j] = dp[i-1][j-1] else:
dp[i][j] = 1 + min(dp[i-1][j], dp[i][j-1], dp[i-1][j-1]) return dp[n][m] # Example
usage s1 = "kitten" s2 = "sitting" print("Edit distance:", edit_distance(s1, s2))
This algorithm runs in O(n × m), where n and m are the lengths of the two strings.

4. Longest Common Subsequence (LCS)

The Longest Common Subsequence (LCS) problem is a classic dynamic programming problem
where we seek to find the longest subsequence common to two sequences. Unlike substrings,
subsequences are not required to occupy consecutive positions.

Problem Statement

Given two sequences X and Y , find the longest subsequence that appears in both sequences in
the same order (but not necessarily consecutively).

Dynamic Programming Approach

Let dp[i][j] represent the length of the LCS of the first i characters of X and the first j characters
of Y . The recurrence relation is:

dp[i − 1][j − 1] + 1 if X[i − 1] == Y [j − 1]

dp[i][j] = {
max(dp[i − 1][j], dp[i][j − 1]) if X[i − 1] =
 Y [j − 1]

Code Example (Python)

python Copy code

def lcs(X, Y): m = len(X) n = len(Y) dp = [[0] * (n + 1) for _ in range(m + 1)] for i
in range(1, m + 1): for j in range(1, n + 1): if X[i-1] == Y[j-1]: dp[i][j] = dp[i-1]
[j-1] + 1 else: dp[i][j] = max(dp[i-1][j], dp[i][j-1]) return dp[m][n] # Example
usage X = "AGGTAB" Y = "GXTXAYB" print("Length of LCS:", lcs(X, Y))

Time Complexity

The time complexity of this algorithm is O(m × n), where m and n are the lengths of the two
sequences.

Summary
In this module, we covered several key problems related to sets and strings:
Set Cover: An NP-hard optimization problem, approximated using a greedy approach.
String Matching: Finding exact occurrences of a pattern in a text using naive and efficient
algorithms like KMP.
Approximate String Matching: Finding close matches between strings using dynamic
programming to compute edit distances.
Longest Common Subsequence: A dynamic programming problem that finds the longest
subsequence common to two sequences.

These problems have wide-ranging applications in optimization, data analysis, and computational
biology.

Cambridge Lower Secondary Computing 7 (Ben Barnes, Tristan Kirkpatrick Etc.) (Z-Library)
86% (14)
Cambridge Lower Secondary Computing 7 (Ben Barnes, Tristan Kirkpatrick Etc.) (Z-Library)
239 pages
hw10 Solution PDF
No ratings yet
hw10 Solution PDF
5 pages
String Matching Algorithms
No ratings yet
String Matching Algorithms
13 pages
Semester Final Project Report
No ratings yet
Semester Final Project Report
11 pages
Abdul Rauf (021!21!0019) Assignment2
No ratings yet
Abdul Rauf (021!21!0019) Assignment2
3 pages
String Matching
No ratings yet
String Matching
5 pages
CPS Final Project
No ratings yet
CPS Final Project
4 pages
Introduction To String Matching
No ratings yet
Introduction To String Matching
28 pages
Lecture 18 - String Matching-KMP
No ratings yet
Lecture 18 - String Matching-KMP
40 pages
DAA Summarized Unit 5
No ratings yet
DAA Summarized Unit 5
21 pages
Strings and Pattern Searching
100% (1)
Strings and Pattern Searching
80 pages
String Matching Algorithms
No ratings yet
String Matching Algorithms
25 pages
Imp Question
No ratings yet
Imp Question
5 pages
Approximate Matching
No ratings yet
Approximate Matching
16 pages
16 String Matching - Naive String Algorithm
100% (1)
16 String Matching - Naive String Algorithm
9 pages
Arrays and Strings
No ratings yet
Arrays and Strings
8 pages
Aoa Assignment
No ratings yet
Aoa Assignment
5 pages
Unit 2 Daa PDF
No ratings yet
Unit 2 Daa PDF
99 pages
KMP 2
No ratings yet
KMP 2
7 pages
Internetalgo
No ratings yet
Internetalgo
13 pages
DP Problem Algortithms
No ratings yet
DP Problem Algortithms
16 pages
Unit 3
No ratings yet
Unit 3
34 pages
Algorithms in Bioinformatics
No ratings yet
Algorithms in Bioinformatics
7 pages
String Matching Algorithms
No ratings yet
String Matching Algorithms
46 pages
Lectures 5-8
No ratings yet
Lectures 5-8
11 pages
Design and Analysis of Algorithms Lab - 3
No ratings yet
Design and Analysis of Algorithms Lab - 3
15 pages
Disjoint Set and Next
No ratings yet
Disjoint Set and Next
6 pages
Foundations of Sequence Analysis
No ratings yet
Foundations of Sequence Analysis
161 pages
Module III Problem Solving
No ratings yet
Module III Problem Solving
16 pages
Ch-5 Numerical Daa
No ratings yet
Ch-5 Numerical Daa
11 pages
54.string Inotes
No ratings yet
54.string Inotes
20 pages
W9 Presentation
No ratings yet
W9 Presentation
20 pages
W 9 Presentation
No ratings yet
W 9 Presentation
20 pages
Exercise 1
No ratings yet
Exercise 1
17 pages
Unit 4
No ratings yet
Unit 4
66 pages
DAA Unit5 Theory 50q
No ratings yet
DAA Unit5 Theory 50q
35 pages
Project Explanation
No ratings yet
Project Explanation
50 pages
AAD-String Matching
No ratings yet
AAD-String Matching
15 pages
Abstract
No ratings yet
Abstract
12 pages
11339AoA - EX-7
No ratings yet
11339AoA - EX-7
7 pages
Daa
No ratings yet
Daa
10 pages
Python Program For Array Rotation
No ratings yet
Python Program For Array Rotation
3 pages
Sandeep Singh (Iii B.Tech I.T)
No ratings yet
Sandeep Singh (Iii B.Tech I.T)
179 pages
M269 - Lec8 Fall 1819
No ratings yet
M269 - Lec8 Fall 1819
24 pages
8 LCS 19 01 2024
No ratings yet
8 LCS 19 01 2024
17 pages
1 Obtaining A Sum From A Subsequence of Digits: COMP9021, Session 1, 2016
No ratings yet
1 Obtaining A Sum From A Subsequence of Digits: COMP9021, Session 1, 2016
5 pages
Unit 7
No ratings yet
Unit 7
60 pages
String Matching
No ratings yet
String Matching
35 pages
ADSA IA2 Solution
No ratings yet
ADSA IA2 Solution
14 pages
String Matching
No ratings yet
String Matching
63 pages
Experiment No.09: Part A
No ratings yet
Experiment No.09: Part A
7 pages
4 Module Algorithms
No ratings yet
4 Module Algorithms
28 pages
Lecture Notes On Pattern Matching Algorithms
No ratings yet
Lecture Notes On Pattern Matching Algorithms
16 pages
Lecture Notes On Pattern Matching Algorithms
No ratings yet
Lecture Notes On Pattern Matching Algorithms
16 pages
Pattren Matching
No ratings yet
Pattren Matching
3 pages
CS 240 Tutorial 11 Notes: C A A B A
No ratings yet
CS 240 Tutorial 11 Notes: C A A B A
2 pages
Day 11
No ratings yet
Day 11
7 pages
Unit II
No ratings yet
Unit II
94 pages
Github Jakehoare Leetcode 4splits 100scale With Difficulty
No ratings yet
Github Jakehoare Leetcode 4splits 100scale With Difficulty
221 pages
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Digital Logic Design Assignment 01 Converting Binary Floating Point Number To Decimal
No ratings yet
Digital Logic Design Assignment 01 Converting Binary Floating Point Number To Decimal
8 pages
Ece221 Notes Chapter 3 Part 1
No ratings yet
Ece221 Notes Chapter 3 Part 1
14 pages
PHP Module 2
No ratings yet
PHP Module 2
64 pages
P1 Ab Initio Basic Components
100% (1)
P1 Ab Initio Basic Components
25 pages
Computer Science Green Book 1 3
No ratings yet
Computer Science Green Book 1 3
944 pages
Harish Data Structures Q
No ratings yet
Harish Data Structures Q
3 pages
Arrays - Assignment
No ratings yet
Arrays - Assignment
4 pages
DMC Mid Ii Bit Bank
No ratings yet
DMC Mid Ii Bit Bank
23 pages
Digital Principles and Computer Organization - CS3351 - Important Questions With Answer - Unit 2 - Synchronous Sequential Logic
No ratings yet
Digital Principles and Computer Organization - CS3351 - Important Questions With Answer - Unit 2 - Synchronous Sequential Logic
9 pages
Sem4 - Important Id
No ratings yet
Sem4 - Important Id
6 pages
Python Practical 2
No ratings yet
Python Practical 2
10 pages
Rs Y7 C3: 32 Marks From 32 Questions
No ratings yet
Rs Y7 C3: 32 Marks From 32 Questions
6 pages
MD050 - GL - GENERATE GL Voucher Num - v1.0
No ratings yet
MD050 - GL - GENERATE GL Voucher Num - v1.0
16 pages
Sequential Logic Quiz
No ratings yet
Sequential Logic Quiz
5 pages
OOPS - Summer - Carry-Over Paper
No ratings yet
OOPS - Summer - Carry-Over Paper
3 pages
A Mathematical Model For Assessing Cryptographic Agility in Security Systems - Internship Sherbrooke
No ratings yet
A Mathematical Model For Assessing Cryptographic Agility in Security Systems - Internship Sherbrooke
30 pages
B.SC., IT
No ratings yet
B.SC., IT
28 pages
Viva Questions
No ratings yet
Viva Questions
2 pages
MCS 012
No ratings yet
MCS 012
118 pages
Java 3
No ratings yet
Java 3
1 page
MCS 011
No ratings yet
MCS 011
4 pages
Vision Transformers (ViT) in Image Recognition - Full Guide - Viso - Ai
No ratings yet
Vision Transformers (ViT) in Image Recognition - Full Guide - Viso - Ai
11 pages
7 - Exponent Laws - Student Notes
No ratings yet
7 - Exponent Laws - Student Notes
2 pages
Dumpssheet Uipath Ardv1 Uipath
No ratings yet
Dumpssheet Uipath Ardv1 Uipath
9 pages
Bitwise Operators
No ratings yet
Bitwise Operators
5 pages
J Session 1
No ratings yet
J Session 1
41 pages
Practice Q 01
No ratings yet
Practice Q 01
2 pages
Log - 2022 01 29
No ratings yet
Log - 2022 01 29
35 pages
Project 4 2595095609675856
No ratings yet
Project 4 2595095609675856
3 pages

Module V

Uploaded by

Module V

Uploaded by

**PGCSE104: Advanced Algorithms

1. Set Cover Problem

Given a universe U and a collection S = {S1 , S2 , … , Sm } of subsets of U , find the minimum

number of subsets from S whose union equals U .

Code Example (Python)

python Copy code

def set_cover(universe, subsets): covered = set() selected_subsets = [] while covered

Naive String Matching Algorithm

Code Example (Python)

python Copy code

def naive_string_matching(text, pattern): n = len(text) m = len(pattern) occurrences

KMP Algorithm (Knuth-Morris-Pratt)

3. Approximate String Matching

Dynamic Programming Approach

Code Example (Python)

python Copy code

def edit_distance(s1, s2): n = len(s1) m = len(s2) dp = [[0] * (m + 1) for _ in

4. Longest Common Subsequence (LCS)

Dynamic Programming Approach

dp[i − 1][j − 1] + 1 if X[i − 1] == Y [j − 1]

Code Example (Python)

python Copy code

You might also like