0% found this document useful (0 votes)

2 views8 pages

Chapter 24

This chapter covers character strings and regular expressions in C#. It explains the StringBuilder class for mutable strings, the immutability of strings, and the basics of using regular expressions for pattern matching. Key concepts include string manipulation methods, the efficiency of StringBuilder for frequent changes, and the use of Regex for advanced string searching and replacing.

Uploaded by

brightmohlala06

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views8 pages

Chapter 24

Uploaded by

brightmohlala06

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

1

Chapter 24
Strings and Regular expressions

1. Introduction
In this chapter we focus on character strings. We build further on the content of the section on strings
in Chapter 10 and you must be fully familiar with that content before we can proceed. We discuss the
StringBuilder class as container for mutable strings and then touch on the basics of regular
expressions.

2. Strings
A character string is a data structure that contains characters. The most important thing to remember
about character strings is that a string is an array of characters. This means that we can index and
access individual characters in a string in the same way that we can access elements in any other array
such as int[]. You should be fully conversant with the content of the section on strings in Chapter
10. Make sure that you understand how to use the Compare method and that you are familiar with the
methods listed on page 180. Note also how the static Join method can be used to create a comma-
separated string of an array or list of string objects:

List<string> words = new List<string>(new string[] { "John", "Mike", "Susan", "Anna" } );

string listOfWords = string.Join(", ", words);

The example on csharp.pl3.co.za contains several examples of the usage of character strings. Study
and make sure that you understand everything. Make sure that you understand how to use
IntelliSense to determine the parameter and return types of the available built-in methods of the
string class.

2.1 Character strings

String is a reference type and should be treated the same as other reference types with the
exception that we do not have to use new to instantiate it.

There is a fundamental difference between a null and empty string. A null is an absence of a
value and an empty string is a value that is empty.

string s; // s = nothing, null, no value

string s = ""; // s is empty

We can test with s.IsNullOrEmpty.

• Nesteruk (2019, p 250)

Strings are immutable. That means that we cannot make a change to the initial value of a string.
A statement such as

string s = "a";
s += "b";

Copyright: PJ Blignaut, 2020

make a copy of s and then overwrites the original memory cell. You should use StringBuilder
if you want a mutable string. If you foresee many changes to a string, StringBuilder will be
much more efficient and faster.

Unlike other reference types, when we pass a string parameter, the reference is passed by value.
This means that we make a second copy of the pointer which in effect means that we have two
different memory cells that behave independently of each other.

See also
• https://stackoverflow.com/questions/1096449/c-sharp-string-reference-type

2.2 The StringBuilder class

Strings in the string class are immutable. This means that the contents of a memory cell is fixed and
cannot be changed. A statement such as

string s = "abc";
s += "d";

effectively means that the variable name s points to a new memory cell with the contents "abcd".
The old s is left in memory until the garbage collector cleans it up. Appending or changing string
instances is slow and time consuming.

The StringBuilder class, on the other hand, is capable of amending the contents of a memory cell
in-place. This means that

StringBuilder sb = new StringBuilder("abc");

sb.Append("d");

effectively changes the contents of sb without creating a new memory cell. Note that although the
Append method returns an instance of StringBuilder, it is not necessary to do this:

sb = sb.Append("d");

A StringBuilder object maintains a buffer to accommodate expansions to the string. New data is
appended to the buffer if room is available; otherwise, a new, larger buffer is allocated, data from the
original buffer is copied to the new buffer, and the new data is then appended to the new buffer.

For small or infrequent changes to a character string, the overheads that are involved with
StringBuilder does not make it worthwhile. However, for cases where frequent changes to the same
character string is expected, the usage of a StringBuilder object will imply a huge saving of CPU
time.

Study the examples on csharp.pl3.co.za to become familiar with the StringBuilder versions of
Length, Insert, Remove, Replace, etc. Also look at the method Timing to see the difference in time
between the append procedure of the two classes.

Also read this and pay specific attention to the Capacity property:
https://docs.microsoft.com/en-us/dotnet/api/system.text.stringbuilder

Copyright: PJ Blignaut, 2020

3. Regular expressions
Mostly, we can get away with the methods and properties that the string and StringBuilder classes
provide, but there are cases when we need more power. It is, for example, easy to find the domain
name in an email address with the IndexOf and Substring methods of the string class, but it is not
so easy to determine if a given email address is valid.

The Regex class in C# provides an interface for the Regular Expression (RE) language for character
strings. Regex is huge and we are just going to touch on some of the basic elements to make you
aware of the existence and power of regular expressions. The examples below are also available in
Listing 24.3.

3.1 Find a substring in a larger string

There is nothing here that cannot be done with IndexOf in the string class, but just to get the ball
rolling, consider the following code fragment. We instantiate a Regex object and then use its Match
method to return a Match object. This object has properties to indicate whether a match was found
(Success), its Index and Value. Use IntelliSense to discover the other possible methods and
properties. The same can be achieved through static methods of the Regex class.

string str1 = "the quick brown fox jumped over the lazy dog";

Regex reg = new Regex("the");

Match match = reg.Match(str1);
if (match.Success)
{
int matchPos = match.Index;
Console.WriteLine("\tFound '" + match.Value + "' at position : " + matchPos);
}

//Alternatively with static methods

if (Regex.IsMatch(str1, "brown"))
{
int matchPos = Regex.Match(str1, "brown").Index;
Console.WriteLine("\tFound 'brown' at position : " + matchPos);
}

Listing 24.3.1 Using an RE determine if a substring is present in a larger string

Copyright: PJ Blignaut, 2020

3.2 Find multiple occurrences of a substring in a larger string

It is not undoable, but it is quite tricky with existing string methods and properties. With Regex, it is
easy to find all occurrences of the word "the" in a given string.

Regex reg = new Regex("the");

string str1 = "the quick brown fox jumped over the lazy dog";
MatchCollection matches = reg.Matches(str1);
if (matches.Count > 0)
foreach (Match aMatch in matches)
Console.WriteLine("\tFound '" + aMatch.Value + "' at: " + aMatch.Index);

Listing 24.3.2 Using an RE to find all occurrences of a substring in a larger string

We can use the pipe symbol "|", to list alternatives, e.g.

Regex reg = new Regex("the|dog|lazy");

will find all occurrences of "the", "dog" and "lazy".

3.3 Replace all occurrences of a substring with another

This is easy with the existing Replace method in the string class:
string s = "the quick brown fox jumped over the brown dog";
s = s.Replace("brown", "black"):

With Regex, it can be done like this:

s = Regex.Replace(s, "brown", "black");

3.4 Wildcards and quantifiers

All the examples below are contained in the Quantifiers method in Listing 24.3. I advise you
strongly to follow the discussion below while running the example. Comment out all lines of code
except for the one that you are inspecting at that moment and make sure that you understand why you
get the specific output.

Consider the following array of words that will serve as basis for the examples:

string[] words = new string[] {"abdomen", "bad", "baad", "baaad", "life", "lobby",
"boy", "bear", "bend", "bobby", "lend", "death"};

In Regex terminology, we refer to a substring in a larger string as a pattern. If we want to find all
words in the above list that contain the pattern "bd", we can do this:

foreach (string word in words)

if (Regex.IsMatch(word, "bd")) //abdomen
Console.Write(word + ", ");

If we want to find all words with the same pattern, but with any character between the "b" and the
"d", we can write "b.d" and do this

foreach (string word in words)

if (Regex.IsMatch(word, "b.d")) //bad
Console.Write(word + ", ");

Copyright: PJ Blignaut, 2020

We refer to the period (".") as a wildcard that stands in the place of anything else. If the pattern is
"b..d", we will find "baad" and "bend". If the pattern is "b." (nothing following the period), we
will find "abdomen", "bad", "baad", "baaad", "lobby", "boy", "bear", "bend" and
"bobby".

Instead of writing "bb", we can write "b{2}". The number between curly braces is a quantifier. This
will find "lobby" and "bobby". What will we get if the pattern is "b.{2}d"? If we write "ba{1,2}d",
we find "bad" and "baad", but not "baaad". In general, {m,n} refers to the minimum and maximum
number of occurrences of the previous character.

We can use the + symbol to reflect on the character preceding it and find words containing one or
more of those characters. The "*" symbol will find matches that contain zero or more occurrences of
the preceding character. For example, "b+d" wil find "abdomen", while "b*d" will find all words
containing "d". The "?" symbol will find occurrences of zero or one of the preceding character.

3.5 Greedy and lazy

Consider the html string below:

string words = "Part of this string is bold";

If the pattern is "<.>", it means that we want to find "<" followed by any single character followed
by ">". This will then find "" in the above string.

The search pattern "<.+>" means that we want to find "<" followed by any character one or more
times followed by ">". There are two possibilities, namely "" and "string". If we
define a greedy search, the regular expression will find as many characters as possible, thus
"string". A lazy search will stop at the first fulfilment of the pattern, thus "". A lazy
search are defined by the symbol "?" and thus the pattern must be "<.+?>".

If we write the code as below, there is a loop that will find all matches of a lazy search, resulting in
both "" and "".

string pattern = "<.+?>"; // '<' followed by any character one or more times lazy
MatchCollection matches = Regex.Matches(words, pattern);
for (int i = 0; i < matches.Count; i++)
Console.Write(matches[i].Value + ", ");

In Section 4.3 above, replace

s = Regex.Replace(s, "brown", "black");

with one of the following and examine the output:

s = Regex.Replace(s, "b.n", "black");

s = Regex.Replace(s, "b...n", "black");
s = Regex.Replace(s, "b.+n", "black");
s = Regex.Replace(s, "b.+?n", "black");

3.6 Special characters

You would have noticed that we have special characters in the expressions, namely ".", "+", "?",
"(", ")", "{", "}", "[", "]". If we want to include those characters in the search string, we have to
precede them with "\". If we want to include "\" in the pattern, we have to write "\\". For example,
Copyright: PJ Blignaut, 2020
6

if we want to find a period in the expression, we have to write "\.". Since a "\" has a special meaning
in a C# string, we have to write @"\.". So, to find periods, we can do this:

string sentences = "First sentence. Second sentence.";

string pattern = @"\.";
MatchCollection matches = Regex.Matches(sentences, pattern);
for (int i = 0; i < matches.Count; i++)
Console.Write(matches[i].Value + ", ");

3.7 Character classes

In the examples above, we had to specify specific characters in the search pattern. If we want to find
a specific category (or class) of characters, we need a character class. Character classes are written
between [ ]. Don't confuse the meaning of the word "class" in this context with its normal meaning
in object oriented programming.

As an example, consider the pattern "[\w]" which will find all letters, digits and underscores.
Characters between "[" and "]" are treated as special characters. "\" precedes a character class and
should not be confused with the meaning of "\" outside of [ ] (cf Section 4.6 above).

Consider the following string on which the subsequent examples are based. For easy reference to
indexes within the string, two indexing strings are also provided:

" 1 2 3 4 5 6 7 8 9 "
"012345678901234567890123456789012345678901234567890123456789012345678901234567890123456789012"
"Note: The -8- years old quick brown_fox, with 4 toes, jumped over the (12 year) old lazy dog."

The table below lists some character classes along with typical patterns and the matches in the above
example string. You should run the CharacterClasses method in Listing 24.3 in conjunction with
the table entries. Comment out all the re variables except for the one that you want to inspect.

For the sake of readability, the @ and quotes are omitted in the table, but they should always be there,
for example [\w] must be written as @"[\w]".

Class Explanation Pattern Matches Indexes

Word characters Any character, including letters, digits and [\w] 'N' 0
underscore. Excluding spaces and punctuation. 'o' 1
Equivalent: [A-Za-z0-9_] 't' 2
'e' 3
'T' 6
… …
Non-word characters Spaces and punctuation [\W] ':' 4
' ' 5
' ' 9
'-' 10
… …
Digits All digits, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 [\d] '8' 11
Equivalent: [0-9] '4' 46
'1' 71
'2' 72
Non-digits Everything but a digit [\D] 'N' 0
'o' 1
't' 2
'e' 3
':' 4
' ' 5
… …

Class Explanation Pattern Matches Indexes

Capital letters All capital letters, 'A' through 'Z' [A-Z] 'N' 0
'T' 6
Lower case letters All lower case letters, 'a' through 'z' [a-z] 'o' 1
't' 2
'e' 3
'h' 7
… …
Vowels All vowels [aeiou] 'o' 1
'e' 3
'e' 8
… …
Not E.g. not vowels [^aeiou] 'N' 0
't' 2
':' 4
' ' 5
… …
Spaces All spaces [\s] ' ' 5
' ' 9
' ' 13
' ' 19
… …
Non-spaces All but spaces [\S] 'N' 0
'o' 1
't' 2
'e' 3
':' 4
'T' 6
… …
Combinations E.g. all vowels and digits [aeiou0-9] 'o' 1
'e' 3
'e' 8
'8' 11
… …

3.8 Anchors

Regular expressions can be modified with special characters to mark word or sentence boundaries.

The table below shows the matches of some example patterns in the following string.

" 1 2 3 4 5 6 7 8"
"012345678901234567890123456789012345678901234567890123456789012345678901234567890"
"The 8 years old quick brown fox with 4 toes jumped over the 12 year old lazy dog."

Assertion Explanation Example Index(es)

pattern of matches
^ Marks the beginning of a sentence "^[Tt]" 0
$ Marks the end of the sentence @"\.$" 80
\b Word boundary @"d\b" 14, 49, 70
@"\bd" 77
@"\byear\b" 63
@"\b[Tt]he\b" 0, 56

3.9 Cheat sheets

The wildcards, quantifiers, character classes and anchors that we referred to above are by far not all
that the regular expression language has to offer. See the following websites for many more
possibilities.

http://www.mikesdotnetting.com/Article/46/CSharp-Regular-Expressions-Cheat-Sheet
http://msdn.microsoft.com/en-us/library/az24scfc(v=vs.110).aspx

3.10 Test the validity of strings

We can use a regular expression to test the validity of email addresses, postal codes, telephone
numbers, dates and many more. The following regular expression can be used to test the validity of
an email address. See if you can analyse it into its parts:

string regExp = @"^((([\w]+\.[\w]+)+)|([\w]+))@(([\w]+\.)+)([A-Za-z]{1,3})$";

It can then be used to prompt a user to enter an email address until it is valid:

string sInput = "";

bool isValid = false;
do
{
Console.Write("E-mail address: ");
sInput = Console.ReadLine();

string re = @"^((([\w]+\.[\w]+)+)|([\w]+))@(([\w]+\.)+)([A-Za-z]{1,3})$";
isValid = Regex.Match(sInput, re).Success;
} while (!isValid);

You can rest assured that we will not expect you to develop a complex regular expression as above.
You can, however, be expected to use a given RE in an application as above.

4. Summary
We discussed the following key concepts in this chapter:

• The String class

• StringBuilder class
• Regular expressions
- Find and replace occurrences of a substring
- Wildcards and quantifiers
- Greedy and lazy searches
- Special characters
- Character classes
- Anchors
- Cheat sheets
- Test the validity of email addresses and other strings

1998-2001 25L Ford Ranger PCM Pin Out Chart
71% (7)
1998-2001 25L Ford Ranger PCM Pin Out Chart
4 pages
TestBank IntroToIS 8e TechGuide4
No ratings yet
TestBank IntroToIS 8e TechGuide4
17 pages
C# Cheat Sheet PDF For Your Quick Reference
No ratings yet
C# Cheat Sheet PDF For Your Quick Reference
25 pages
Lect None On Chapter 2 - Part II - String and StringBuilder
No ratings yet
Lect None On Chapter 2 - Part II - String and StringBuilder
49 pages
Chapter 2 StringBuilder Regex
No ratings yet
Chapter 2 StringBuilder Regex
59 pages
String Operation in C#
No ratings yet
String Operation in C#
12 pages
C# - Strings: Creating A String Object
No ratings yet
C# - Strings: Creating A String Object
6 pages
Chapter 2 Part 2
No ratings yet
Chapter 2 Part 2
62 pages
Code Listings 4
No ratings yet
Code Listings 4
4 pages
String Functions in C#
No ratings yet
String Functions in C#
36 pages
C Sharp (C#) : Benadir University
No ratings yet
C Sharp (C#) : Benadir University
33 pages
Module 8 Strings Chars
No ratings yet
Module 8 Strings Chars
9 pages
STR Processing
No ratings yet
STR Processing
22 pages
C# Strings: String Vs String
No ratings yet
C# Strings: String Vs String
58 pages
Strings
No ratings yet
Strings
19 pages
C# Strings: Shubhangi Shinde
No ratings yet
C# Strings: Shubhangi Shinde
11 pages
String Manipulation
No ratings yet
String Manipulation
51 pages
IP 07 Handout 1
No ratings yet
IP 07 Handout 1
4 pages
07 Handout 1
No ratings yet
07 Handout 1
4 pages
Strings (C# Programming Guide)
No ratings yet
Strings (C# Programming Guide)
10 pages
Strings in C#: Bibek Kumar CSE Deptt. DITU Bibek - Kumar@dituniversity - Edu.in
No ratings yet
Strings in C#: Bibek Kumar CSE Deptt. DITU Bibek - Kumar@dituniversity - Edu.in
19 pages
November 13, 2003 Week 2
No ratings yet
November 13, 2003 Week 2
146 pages
C# String
No ratings yet
C# String
26 pages
Final String Project
No ratings yet
Final String Project
15 pages
Core CSharp and NET Quick Reference
100% (5)
Core CSharp and NET Quick Reference
2 pages
Trucos C#
100% (1)
Trucos C#
7 pages
Core CSharp and NET Quick Reference
100% (11)
Core CSharp and NET Quick Reference
2 pages
6 String
No ratings yet
6 String
4 pages
More About Processing Data
No ratings yet
More About Processing Data
31 pages
C# Unit 4
No ratings yet
C# Unit 4
50 pages
Vbhtp2e 16 Beta
No ratings yet
Vbhtp2e 16 Beta
76 pages
Lab 04 - Classes and Object C#
No ratings yet
Lab 04 - Classes and Object C#
7 pages
Chapter 3 - Data Structures in C#
No ratings yet
Chapter 3 - Data Structures in C#
55 pages
OOPS Programming in C#
No ratings yet
OOPS Programming in C#
28 pages
Lec7 Strings
No ratings yet
Lec7 Strings
14 pages
Programming Strings Using C#: Mahesh Chand
No ratings yet
Programming Strings Using C#: Mahesh Chand
20 pages
Complete-Reference-Vb Net 11
No ratings yet
Complete-Reference-Vb Net 11
1 page
Re Expression
No ratings yet
Re Expression
23 pages
cs321 Winter 2023 Lecture 4 Strings
No ratings yet
cs321 Winter 2023 Lecture 4 Strings
62 pages
C Training Day 4.1
No ratings yet
C Training Day 4.1
20 pages
Chapter 2
No ratings yet
Chapter 2
28 pages
Cs321 Winter 2023 Lecture 4 Strings
No ratings yet
Cs321 Winter 2023 Lecture 4 Strings
62 pages
Final Exam (C#)
No ratings yet
Final Exam (C#)
25 pages
Com Pro
No ratings yet
Com Pro
12 pages
Gaddis Python 4e Chapter 08
0% (1)
Gaddis Python 4e Chapter 08
22 pages
String Methods of C#
No ratings yet
String Methods of C#
3 pages
DotNet W
No ratings yet
DotNet W
9 pages
String Handling and Regular Expressions
No ratings yet
String Handling and Regular Expressions
19 pages
9 C# .NET String Manipulation
No ratings yet
9 C# .NET String Manipulation
18 pages
Tutorial 3: Ans: S Hello S1 He S2 Hel S3 Llo S4 LL
No ratings yet
Tutorial 3: Ans: S Hello S1 He S2 Hel S3 Llo S4 LL
6 pages
String & Math Functions
No ratings yet
String & Math Functions
38 pages
Lecture 3 Chapter 3
No ratings yet
Lecture 3 Chapter 3
23 pages
C# Substring Programs: Getting First Part
No ratings yet
C# Substring Programs: Getting First Part
22 pages
Gaddis Python 3e Chapter 08
No ratings yet
Gaddis Python 3e Chapter 08
22 pages
Starting Out With Python - Chapter3 - More About Strings
No ratings yet
Starting Out With Python - Chapter3 - More About Strings
22 pages
Python String Functions
No ratings yet
Python String Functions
3 pages
Inbuilt Functions
No ratings yet
Inbuilt Functions
25 pages
Unit 1 Array and String
No ratings yet
Unit 1 Array and String
9 pages
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Ian Talks Regex A-Z
From Everand
Ian Talks Regex A-Z
Ian Eress
No ratings yet
Chapter 21
No ratings yet
Chapter 21
25 pages
Chapter 27
No ratings yet
Chapter 27
11 pages
Chapter 19
No ratings yet
Chapter 19
14 pages
Chapter 18
No ratings yet
Chapter 18
8 pages
Chapter 16
No ratings yet
Chapter 16
10 pages
Manual Siwarex Wp521 Wp522 en
No ratings yet
Manual Siwarex Wp521 Wp522 en
176 pages
Technical - Manual - Midea - Aqua Thermal - MC - SUxxx - RN8L - B
No ratings yet
Technical - Manual - Midea - Aqua Thermal - MC - SUxxx - RN8L - B
62 pages
Chavan Motors Solapur
No ratings yet
Chavan Motors Solapur
2 pages
H3C S5560X-EI Series Converged Gigabit Switches: Product Overview
No ratings yet
H3C S5560X-EI Series Converged Gigabit Switches: Product Overview
16 pages
Introduction To Cellular Mobile Radio Systems
No ratings yet
Introduction To Cellular Mobile Radio Systems
83 pages
271352751-HVAC-Training-ppt (Compatibility Mode) (Repaired)
No ratings yet
271352751-HVAC-Training-ppt (Compatibility Mode) (Repaired)
29 pages
BMW Case
No ratings yet
BMW Case
2 pages
ReCAT Doc1
No ratings yet
ReCAT Doc1
2 pages
Канада
No ratings yet
Канада
9 pages
Jenkins End To End
No ratings yet
Jenkins End To End
6 pages
Poltical Science Worksheet-Globalization: Section-B
No ratings yet
Poltical Science Worksheet-Globalization: Section-B
2 pages
Dxa9ka 1
No ratings yet
Dxa9ka 1
1 page
Auditing Project Report
No ratings yet
Auditing Project Report
41 pages
COPE WAME Best Practices
No ratings yet
COPE WAME Best Practices
3 pages
Web Services - 252 Course Outline
No ratings yet
Web Services - 252 Course Outline
6 pages
Project Report of RICHA
No ratings yet
Project Report of RICHA
31 pages
STATISTICS FINAL EXAM (MPM) Answer Sheet
No ratings yet
STATISTICS FINAL EXAM (MPM) Answer Sheet
15 pages
cDAQ Hands On
No ratings yet
cDAQ Hands On
124 pages
Hioki Im3570 Handbuch en A981 08
No ratings yet
Hioki Im3570 Handbuch en A981 08
458 pages
Are QSM Manual Rev 08
No ratings yet
Are QSM Manual Rev 08
43 pages
LINKWELL
No ratings yet
LINKWELL
3 pages
Components of CPU
No ratings yet
Components of CPU
8 pages
Conditional Formatting
No ratings yet
Conditional Formatting
32 pages
Minbooklist 136254
No ratings yet
Minbooklist 136254
156 pages
SANS Cheatsheet Google-Workspace
No ratings yet
SANS Cheatsheet Google-Workspace
1 page
Presentation On Impact of DG On Power Quality
No ratings yet
Presentation On Impact of DG On Power Quality
19 pages
P6 File Corruption
No ratings yet
P6 File Corruption
20 pages
LTE Training-Celcite
No ratings yet
LTE Training-Celcite
72 pages

Chapter 24

Uploaded by

Chapter 24

Uploaded by

1

List<string> words = new List<string>(new string[] { "John", "Mike", "Susan", "Anna" } );

2.1 Character strings

string s; // s = nothing, null, no value

We can test with s.IsNullOrEmpty.

• Nesteruk (2019, p 250)

Copyright: PJ Blignaut, 2020

2.2 The StringBuilder class

StringBuilder sb = new StringBuilder("abc");

Copyright: PJ Blignaut, 2020

3.1 Find a substring in a larger string

Regex reg = new Regex("the");

//Alternatively with static methods

Listing 24.3.1 Using an RE determine if a substring is present in a larger string

Copyright: PJ Blignaut, 2020

3.2 Find multiple occurrences of a substring in a larger string

Regex reg = new Regex("the");

Listing 24.3.2 Using an RE to find all occurrences of a substring in a larger string

We can use the pipe symbol "|", to list alternatives, e.g.

Regex reg = new Regex("the|dog|lazy");

will find all occurrences of "the", "dog" and "lazy".

3.3 Replace all occurrences of a substring with another

With Regex, it can be done like this:

s = Regex.Replace(s, "brown", "black");

3.4 Wildcards and quantifiers

foreach (string word in words)

foreach (string word in words)

Copyright: PJ Blignaut, 2020

3.5 Greedy and lazy

Consider the html string below:

In Section 4.3 above, replace

s = Regex.Replace(s, "brown", "black");

with one of the following and examine the output:

s = Regex.Replace(s, "b.n", "black");

3.6 Special characters

string sentences = "First sentence. Second sentence.";

3.7 Character classes

Class Explanation Pattern Matches Indexes

Copyright: PJ Blignaut, 2020

Class Explanation Pattern Matches Indexes

Assertion Explanation Example Index(es)

Copyright: PJ Blignaut, 2020

3.9 Cheat sheets

3.10 Test the validity of strings

string regExp = @"^((([\w]+\.[\w]+)+)|([\w]+))@(([\w]+\.)+)([A-Za-z]{1,3})$";

string sInput = "";

• The String class

Copyright: PJ Blignaut, 2020

You might also like