A. Yet Another Problem With Strings: ACM ICPC Practice Contest, 8 November, 2015

This document provides an explanation of an algorithm to solve a problem involving performing queries on a set of strings S. The queries include contains queries to check if a string is a prefix of any string in S, and append queries to append a character to a string in S. The algorithm uses two data structures: an augmented trie to represent the strings in S, with counters to track the number of strings that are prefixes of each node, and a hash map of the strings in S. Contains queries are solved by finding the longest prefix of the query string that is in the trie and has a positive counter, while append queries update the trie and hash map lazily. The overall time complexity is linear for append queries and O(

Uploaded by

Vikram Saurabh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views2 pages

A. Yet Another Problem With Strings: ACM ICPC Practice Contest, 8 November, 2015

Uploaded by

Vikram Saurabh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

ACM ICPC Practice Contest, 8th November, 2015

A. Yet Another Problem with Strings

Editorial

NOTE: In this problem, the purpose of using LAST YES indices to generate queries, is to force participants to
implement an online solution. From now, we assume that all queries represent input queries already transformed
using LAST YES variable as described in the statement.

You are given an array S[1..N ] of N strings consisting only of lowercase English letters. Your task is to perform Q
queries on S. There are two type of queries to handle:
contains(t) := returns YES if at least one string from S is a prefix of string t. Otherwise returns NO.
append(i, c) := appends character c at the end of Si .

In the below explanation, we denote a hash of a string as its rolling hash, sometimes also called a polynomial hash.
It is defined as follows. For a string t[1..k] its hash value h(t) := t1 X k1 + t2 X k2 + . . . + tk , where ti is the integer
representing the ith letter of t. The hash is defined for some integer X and computed modulo some prime number.

In order to solve the problem, we are going use two data structures:
augmented trie
set of hashes of strings currently in S.

Augmented trie

The idea is to represent S as a trie with counters. In more details, initially, we insert all strings from S to the trie.
Then, for each node of the trie, we want to know how many strings from S are prefixes of the string corresponding
to this node. For the initial S, it is very easy to do: first, we assign 1 to counters of nodes corresponding to strings
from S, and then we propagate all these ones down to the trie using for example a DFS.
The trie augmented in such a way, will be used to answer contains(t) queries. The idea is that for a given suffix
tj of t, we are going to find its longest prefix which is in the trie and its counter is positive. Notice that a positive
counter means that there is a prefix of the string corresponding to the node which is currently in S, so this prefix is
a prefix of tj , thus its a substring of t.
Sounds good, but do not forget that we have to handle append queries also, and they have an impact on the trie and
its counters. The good thing is that for any append(i, c) query, we can use a lazy propagation in order to update
counters in the trie. In more details, first, we find the node v corresponding to Si . Then we decrement its counter,
since it now represents one string less than it used to be. Next, if there is no node corresponding to Si with appended
c, we create it and set up its counter. The last thing is to decrement counters for the subtrees of the other children
of v. We have to do this, because each such node has now one less prefix which is in S than before the update.
Notice that it is not a good idea to update counters in each subtree every time, because it may lead to updating a
very large amount of nodes for a single query. The crucial observation is that in order to handle contains query, we
only need the information if a counter for a particular node is positive or not, we do not need it exact value. Based
on that fact, we can use lazy propagation concept and update immediately only counters which become zero - notice
that counters at any path from the root forms a non-decreasing sequence. For any other node, it is sufficient to store
the information that counters in all nodes in its subtree has to be decreased by some amount in the future, and as
soon as this amount becomes zero, we update it and propagate the update down the subtree. This technique allows
us to achieve an amortized linear time for all append queries. Doing this, we have a trie with up to date information
if a given node has a prefix which is currently in S.

Set of hashes

In addition to the trie, we will maintain a set of hashes of all strings currenctly in S. It is a good idea have it
implemented as a map in order to store not only the information if a given string is in S, but also a pointer to its
node. Notice that this map can be easily updated while performing append queries. We will use the map to check if
a particular string is in S and the pointer to check out if its node has a positive counter or not.

Handling queries

Handling append queries is explained in the trie section above. Now, we will focus on contains(t) queries. For a
given t, the first thing we do is to compute hashes for all its prefixes. We will use them in order to be able to compute
a hash for any substring of t in a constant time.
Next, we iterate over all suffixes of t. For a particular suffix tj = t[j, . . . , |t|], we are going to find its longest prefix
which is in S and has a positive counter. In order to do that, we can use a binary search. In more details, we are
going to find the greatest index i, for which t[j, . . . i] is in S and its counter is positive. To do so, we can use hashes
computed for t to compute hash of every prefix of tj , and check fast using our map of hashes, if this prefix is in S.
If it is, we can easily get its counter using a pointer. Notice that if and only if this counter is positive, then there
is at least one prefix of t which is currently in S. The reason that we are looking for the longest prefix of t is that
counters on any path from the root of the trie form a non decreasing sequence, so its sufficient to find the longest
prefix with a positive counter. Since we are doing it for all suffixes of t, we are checking all possible starting points
of substrings of t, so all cases are covered.
The time complexity of a single contains(t) query is O(|t| log |t| f ), where f is the cost of checking if a given string
is in S using the map. This is true, because there are |t| suffixes of t, and for each one, we are finding its longest
prefix which is in S using binary search and our map implementation. You can use an expected constant time access
map implementation, but is sufficient to use an implementation of the map as a balanced binary tree, to achieve
(O|t| log|t| log N ) per single contains(t) query.

Tries Data Structures (Trie) PPT
100% (1)
Tries Data Structures (Trie) PPT
11 pages
Business Transformation Enablement Program
No ratings yet
Business Transformation Enablement Program
48 pages
GRC Training - Terminology
0% (1)
GRC Training - Terminology
13 pages
Empowerment Technologies Unit 1 Lesson 1 Introduction To Information and Communication Technologies
No ratings yet
Empowerment Technologies Unit 1 Lesson 1 Introduction To Information and Communication Technologies
17 pages
Vdgs Technical Descriptionpdf PDF
No ratings yet
Vdgs Technical Descriptionpdf PDF
20 pages
Final Documentation (G-9)
100% (1)
Final Documentation (G-9)
81 pages
IEEE 610-5-1990 - w2000 Glossary of Data Management Terminology
No ratings yet
IEEE 610-5-1990 - w2000 Glossary of Data Management Terminology
76 pages
Several Problems of The Polish Physics Olympiad: Waldemar Gorzkowski
No ratings yet
Several Problems of The Polish Physics Olympiad: Waldemar Gorzkowski
4 pages
Codeforces Tutorial
No ratings yet
Codeforces Tutorial
72 pages
Radix Search Tree
100% (1)
Radix Search Tree
18 pages
Crack Maang Companies - DSA Questions (C++) - 1
No ratings yet
Crack Maang Companies - DSA Questions (C++) - 1
257 pages
5.4. ADS - Tries - Standard Tries
No ratings yet
5.4. ADS - Tries - Standard Tries
34 pages
Notes 06 Text Indexing PDF
No ratings yet
Notes 06 Text Indexing PDF
162 pages
Suffix Array Tutorial
No ratings yet
Suffix Array Tutorial
17 pages
A2SV - Trie Lecture (No Code)
No ratings yet
A2SV - Trie Lecture (No Code)
39 pages
Advance Data Structures
No ratings yet
Advance Data Structures
184 pages
Iota Bits Mid-II
No ratings yet
Iota Bits Mid-II
18 pages
Rules of Netiquette 1.1
No ratings yet
Rules of Netiquette 1.1
78 pages
Suffix Trees, Suffix Arrays, and Their Applications
No ratings yet
Suffix Trees, Suffix Arrays, and Their Applications
29 pages
Lecture4 - Indexing and Searching I
No ratings yet
Lecture4 - Indexing and Searching I
56 pages
Module 06. String Algorithms Lecture 3-6
No ratings yet
Module 06. String Algorithms Lecture 3-6
48 pages
Trie Tree
No ratings yet
Trie Tree
21 pages
An Efficient Implementation of Trie Structures
No ratings yet
An Efficient Implementation of Trie Structures
27 pages
Suffixtrees
No ratings yet
Suffixtrees
50 pages
Project Phase 1
No ratings yet
Project Phase 1
29 pages
Unit5 Trie
No ratings yet
Unit5 Trie
23 pages
Suffix Trees and Suffix Arrays
No ratings yet
Suffix Trees and Suffix Arrays
33 pages
Pydub
No ratings yet
Pydub
26 pages
Chapter 09 Advanced Data Structures
No ratings yet
Chapter 09 Advanced Data Structures
9 pages
Tries and Radix Tree1
No ratings yet
Tries and Radix Tree1
27 pages
Tries and Suffix Tries
No ratings yet
Tries and Suffix Tries
29 pages
Triesearches PDF
No ratings yet
Triesearches PDF
22 pages
1.advanced Tree Structures
No ratings yet
1.advanced Tree Structures
29 pages
Trie
No ratings yet
Trie
6 pages
X-Fast and Y-Fast Tries
No ratings yet
X-Fast and Y-Fast Tries
66 pages
Types of Tries
No ratings yet
Types of Tries
20 pages
Lecture 04 Inaryseachtree
No ratings yet
Lecture 04 Inaryseachtree
20 pages
Obi Observations
No ratings yet
Obi Observations
16 pages
Trie Insertion
No ratings yet
Trie Insertion
31 pages
Oracle-Db Essentials Imp
No ratings yet
Oracle-Db Essentials Imp
21 pages
Applications of Suffix Trees
No ratings yet
Applications of Suffix Trees
40 pages
Current Challenges in Textual Databases: Gonzalo Navarro
No ratings yet
Current Challenges in Textual Databases: Gonzalo Navarro
44 pages
Unipower Company 2018pptx
No ratings yet
Unipower Company 2018pptx
22 pages
55 TriesNOTES
No ratings yet
55 TriesNOTES
18 pages
Unit 3 Tries
No ratings yet
Unit 3 Tries
16 pages
Assignment 4
No ratings yet
Assignment 4
21 pages
9 Suffix Trees: Tttta
No ratings yet
9 Suffix Trees: Tttta
9 pages
Tries 1427
No ratings yet
Tries 1427
19 pages
Ads 2 Part 4
No ratings yet
Ads 2 Part 4
18 pages
Radio
No ratings yet
Radio
21 pages
Chapter 3 Part 2
No ratings yet
Chapter 3 Part 2
22 pages
Tries
No ratings yet
Tries
17 pages
Advance Data Structures: Tries
No ratings yet
Advance Data Structures: Tries
26 pages
Search:: A Really Simple Database
No ratings yet
Search:: A Really Simple Database
30 pages
Representation:: Insertion and Search in Trie Data Structure
No ratings yet
Representation:: Insertion and Search in Trie Data Structure
25 pages
Seaman Resume 1
No ratings yet
Seaman Resume 1
1 page
10 String Algorithms
No ratings yet
10 String Algorithms
36 pages
Cci & DF Lab Ex - No 1a - 1b
No ratings yet
Cci & DF Lab Ex - No 1a - 1b
10 pages
CSC10004: Data Structures and Algorithms
No ratings yet
CSC10004: Data Structures and Algorithms
20 pages
Tries: Symbol Table Review
No ratings yet
Tries: Symbol Table Review
8 pages
Tries and Huffman Encoding
No ratings yet
Tries and Huffman Encoding
16 pages
Software Development Process - Object-Oriented Approach - Encapsulation
No ratings yet
Software Development Process - Object-Oriented Approach - Encapsulation
11 pages
Jnaanendra Saurabh: Academic Details
No ratings yet
Jnaanendra Saurabh: Academic Details
2 pages
Indexed Search Tree (Trie) : Nelson Padua-Perez Chau-Wen Tseng
No ratings yet
Indexed Search Tree (Trie) : Nelson Padua-Perez Chau-Wen Tseng
21 pages
Topic - Q Implement Trie (Prefix Tree) - Information O..
No ratings yet
Topic - Q Implement Trie (Prefix Tree) - Information O..
3 pages
SolidWorks Tutorial 3D Trusses
No ratings yet
SolidWorks Tutorial 3D Trusses
15 pages
Advantages Relative To Other Search Algorithms
No ratings yet
Advantages Relative To Other Search Algorithms
7 pages
Daa Tut 6 Sudhanshu Raut: Pseudo Code For KMP Algorithm
No ratings yet
Daa Tut 6 Sudhanshu Raut: Pseudo Code For KMP Algorithm
11 pages
Trie Vs BST Vs HashTable
No ratings yet
Trie Vs BST Vs HashTable
2 pages
Trie - Wikipedia
No ratings yet
Trie - Wikipedia
10 pages
Can AI Language Models Replace Human Participants
No ratings yet
Can AI Language Models Replace Human Participants
4 pages
Suffix Tree
No ratings yet
Suffix Tree
6 pages
Tries
No ratings yet
Tries
5 pages
Tries
No ratings yet
Tries
4 pages
Lecture Notes On Tries
No ratings yet
Lecture Notes On Tries
10 pages
SONTEK FlowTracker2 Acoustic Doppler Velocimeter
No ratings yet
SONTEK FlowTracker2 Acoustic Doppler Velocimeter
4 pages
Toc
No ratings yet
Toc
6 pages
5991 3719en
No ratings yet
5991 3719en
8 pages
Winter Petrozavodsk Camp Andrew Stankevich Contest 1 en
No ratings yet
Winter Petrozavodsk Camp Andrew Stankevich Contest 1 en
8 pages
Pass Res B1plus UT 4A
No ratings yet
Pass Res B1plus UT 4A
3 pages
SAP Simple Finance Training Course Content
No ratings yet
SAP Simple Finance Training Course Content
5 pages
ECONOMIA
No ratings yet
ECONOMIA
9 pages
jssor-API - UI Definitio
No ratings yet
jssor-API - UI Definitio
5 pages
Algorithm Design and Synthesis For Wireless Sensor Networks
No ratings yet
Algorithm Design and Synthesis For Wireless Sensor Networks
8 pages
Convexity: 1 Warm-Up
No ratings yet
Convexity: 1 Warm-Up
7 pages
Suffix Arrays: Justin Zhang 24 May 2017
No ratings yet
Suffix Arrays: Justin Zhang 24 May 2017
5 pages
Programming Reviewer
No ratings yet
Programming Reviewer
3 pages
Java Assignment 2024
No ratings yet
Java Assignment 2024
2 pages
Top 10 Developer Articles For 2020
No ratings yet
Top 10 Developer Articles For 2020
3 pages
6.851 Advanced Data Structures (Spring'12) Prof. Erik Demaine Problem 9 Sample Solution
No ratings yet
6.851 Advanced Data Structures (Spring'12) Prof. Erik Demaine Problem 9 Sample Solution
2 pages
Easy Recursion Questions
No ratings yet
Easy Recursion Questions
2 pages
1 Matrix en
No ratings yet
1 Matrix en
1 page
Zonal Informatics Olympiad, 2015: Solutions
No ratings yet
Zonal Informatics Olympiad, 2015: Solutions
1 page
1 Purify en
No ratings yet
1 Purify en
1 page
Bank's Seal: Visakhapatnam Steel Plant
No ratings yet
Bank's Seal: Visakhapatnam Steel Plant
1 page

A. Yet Another Problem With Strings: ACM ICPC Practice Contest, 8 November, 2015

Uploaded by

A. Yet Another Problem With Strings: ACM ICPC Practice Contest, 8 November, 2015

Uploaded by

ACM ICPC Practice Contest, 8th November, 2015

A. Yet Another Problem with Strings

You might also like