Unit - IV

Filters
Contents
• The Grep Family
• Other Filters
• The stream editor sed
• The awk pattern scanning and processing language
• Good Files and Good Filters
The Grep Family

• grep searches the named files or the standard input and prints each
line that contains an instance of the pattern
• Patterns are a slightly restricted form of the string specifiers called regular
expressions
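• A minimal usage sketch (the file and user names here are made up):

      grep pattern filenames...        # search the named files for pattern
      who | grep mary                  # or filter standard input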

• The option -n prints line numbers, -v inverts the sense of the test, and -y
makes lower-case letters in the pattern match letters of either case in the file
(modern versions of grep spell this option -i)
• The metacharacters ^ and $ anchor the pattern to the beginning (^) and
end ($) of the line.
• For example, grep '^From' $MAIL prints lines that begin with From, which are
more likely to be message header lines.
• grep supports character classes, so [a-z] matches any lower-case letter and
[^0-9] matches any non-digit.
• A period ‘.’ matches any character.

• The closure operator '*' applies to the previous character or metacharacter
in the expression, and together they match any number of successive
occurrences of that character or metacharacter.
– For example, x* matches a sequence of x's, as long as possible
– [a-zA-Z]* matches an alphabetic string
– .* matches anything up to a newline
– .*x matches anything up to and including the last x on the line
• Closure applies to only one character
• No grep expression matches a newline
• For example, grep '^[^:]*::' /etc/passwd searches for users without
passwords, i.e. lines whose second (password) field is empty
• fgrep searches for many literal strings simultaneously
• egrep interprets true regular expressions, augmented with an 'or'
operator (|) and parentheses to group expressions
• Both egrep and fgrep accept a -f option to specify a
file from which to read the patterns
• There are two other closure operators in egrep: + and ?.
– The pattern x+ matches one or more x's
– The pattern x? matches zero or one x
• egrep is excellent at word games that involve searching the
dictionary for words with special properties
– For example, to find all words of six or more letters that have their letters
in alphabetical order, a pattern file can be used, as sketched below
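• A sketch following the original example (the dictionary path varies; many
modern systems use /usr/share/dict/words instead of /usr/dict/words):

      $ cat alpha
      ^a?b?c?d?e?f?g?h?i?j?k?l?m?n?o?p?q?r?s?t?u?v?w?x?y?z?$
      $ egrep -f alpha /usr/dict/words | grep '......'

The anchored pattern accepts a word only if its letters appear in
alphabetical order; grep '......' then keeps words of six or more letters.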

• Why are there three grep programs?

– fgrep interprets no metacharacters, but can look efficiently for thousands of
words in parallel, and thus is used primarily for bibliographic searches.
– egrep interprets more general expressions and runs significantly faster.
Other Filters

• sort sorts its input into order, line by line

• Given a list of words, one per line, the command

      sort wordlist | uniq

prints the unique words
• uniq -d prints only those lines that are duplicated
• uniq -u prints only those that are unique, i.e. not
duplicated
• uniq -c counts the number of occurrences of each line
• The comm command compares two sorted input files
• comm -12 f1 f2 prints only those lines that are in both files
• comm -23 f1 f2 prints lines that are in the first file but not in the second file
• This is useful for comparing directories and for comparing a word list with a
dictionary
• The tr command transliterates the characters in its input. By far the most
common use of tr is case conversion, as in tr a-z A-Z, which maps
lower case to upper case

• The dd command will also do case conversion, and can convert from
ASCII to EBCDIC and vice versa

• dd is intended primarily for processing tape data from other
systems
• What can be accomplished by combining
filters? As a classic example, a short pipeline can print the 10 most
frequent words in its input, as sketched below
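• A sketch of such a pipeline (the script name wordfreq is made up):

      $ cat wordfreq
      cat $* |
      tr -sc A-Za-z '\012' |       # turn each run of non-letters into one newline
      sort |                       # bring identical words together
      uniq -c |                    # count occurrences of each word
      sort -n |                    # sort by count, smallest first
      tail                         # last 10 lines = the 10 most frequent words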
The stream editor sed
• The basic idea of sed is simple:

      sed 'list of ed commands' filenames...

reads lines one at a time from the input files, applies the commands
from the list, in order, to each line, and writes its edited form on
standard output.
• For example, you can change UNIX to UNIX(TM)
everywhere it occurs in a set of files with

      sed 's/UNIX/UNIX(TM)/g' filenames... > output
• sed does not alter the contents of the input files

• du -a prints the size and the filename of every file; piping it through

      sed 's/.*→//'

(where → stands for a tab) deletes all characters (.*) up
to and including the rightmost tab, leaving only the filename.
• In a similar way, you could select usernames
and login times from the output of who:

      who | sed 's/ .* / /'

• The s command replaces a blank and everything
that follows it, up to another blank, by a single
blank.
• The same sed idea can be used to make a
program getname that will return your user name,
as sketched below
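• A sketch of getname, following the description above:

      $ cat getname
      who am i | sed 's/ .*//'

Here 's/ .*//' deletes everything from the first blank onward, leaving
only the first field, the user name.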

• It is also possible to put sed commands in a file
and execute them from there, with

      sed -f cmdfile filenames...

• sed '/pattern/q' prints its input up to and including the
first line matching pattern, and
• sed '/pattern/d' deletes every line that contains the
pattern.
• sed provides the ability to write on multiple
output files. For example,

      sed -n '/pat/w file1
      /pat/!w file2' filenames

writes lines matching pat on file1 and lines not
matching pat on file2
The awk pattern scanning and processing language

• The idea in awk is much the same as in sed, but the details are based more
on the C programming language than on a text editor
• Usage is just like sed:

      awk 'program' filenames...

but the program is different: it is a sequence of patterns and actions,
as sketched below
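• The general form of an awk program (either the pattern or the action
may be omitted):

      pattern { action }
      pattern { action }
      ...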

• awk reads the input in the filenames one line at a time.
• Each line is compared with each pattern in order; for each pattern that
matches the line, the corresponding action is performed.
• Like sed, awk does not alter its input files
• awk '/regular expression/ { print }' filenames prints every line that matches
the regular expression
• If the action is omitted, the default action is to print matching lines
• If the pattern is omitted, then the action part is
done for every input line.
• So awk '{ print }' does what cat does
• It is possible to present the program to awk from
a file, with awk -f cmdfile filenames...
• awk splits each line automatically into fields, that
is, strings of non-blank characters separated by
blanks or tabs.
• By this definition, the output of who has five fields
• awk calls the fields $1, $2, ..., $NF, where NF is a variable
whose value is set to the number of fields (here NF = 5)
• To print the names of people logged in and the time of
login, one per line:

      who | awk '{ print $1, $5 }'

• To print name and time of login sorted by time:

      who | awk '{ print $5, $1 }' | sort

• To print just the usernames, which come from the first field:

      who | awk '{ print $1 }'
• The built-in variable NR is the number of the
current input "record" or line
• So to add line numbers to an input stream, use

      awk '{ print NR, $0 }'

• To print line numbers in a field 4 digits wide:

      awk '{ printf "%4d %s\n", NR, $0 }'
• Suppose you want to look in /etc/passwd for
people who have no passwords, i.e. lines whose second field is empty
• You can write this pattern in a variety of ways,
as sketched below
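• A sketch of the variants (run with the field separator set to a colon,
e.g. awk -F: '$2 == ""' /etc/passwd):

      $2 == ""          # 2nd field is the empty string
      $2 ~ /^$/         # 2nd field matches the empty line
      $2 !~ /./         # 2nd field doesn't match any character
      length($2) == 0   # length of 2nd field is zero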

• One common use of patterns in awk is for
simple data validation tasks
• For example, the pattern sketched below checks
that every input record has an even number of
fields, printing those that do not
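• A sketch of such a validation pattern (data is a made-up file name;
the action fires only on offending lines):

      awk 'NF % 2 != 0 { print "line", NR, "has an odd number of fields" }' data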
• Another built-in function, substr, can be used to print a warning
and part of a too-long line, as sketched below
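• A sketch, flagging lines longer than 72 characters:

      awk 'length($0) > 72 { print "Line", NR, "too long:", substr($0, 1, 60) }'

Here substr($0, 1, 60) returns the first 60 characters of the line.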
• substr can also select the hour and minute from the output of date,
as sketched below
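• A sketch (assuming the usual date layout, where the time is the 4th
field, e.g. "Wed Sep 28 21:52:20 IST 2011"):

      date | awk '{ print substr($4, 1, 5) }'

This prints, for example, 21:52.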
• awk provides two special patterns, BEGIN and END.

• BEGIN actions are performed before the first input line has been read

• END actions are performed after the last line of input has been
processed

• So awk 'END { print NR }' prints the number of lines of input
• To illustrate the use of variables in awk (see the sketches below):
– Add up all the numbers in the first column

– Print both the sum and the average

– Count the input lines, words and
characters, like wc
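• Sketches of the three programs:

      # add up all the numbers in the first column
      awk '{ s += $1 } END { print s }'

      # print both the sum and the average
      awk '{ s += $1 } END { print s, s/NR }'

      # count lines, words and characters, like wc
      # (the + 1 counts the newline at the end of each line)
      awk '{ nw += NF; nc += length($0) + 1 } END { print NR, nw, nc }'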
• The if statement is just like that in C:

      if (condition) statement1 else statement2

If the condition is true, statement1 is executed; if
it is false and if there is an else part, statement2
is executed; the else part is optional
• The for statement is a loop like the one in C,
and every for is identical to a corresponding while statement,
as sketched below
• For example, for (i = 2; i <= NF; i++) runs the loop with i set in turn
to 2, 3, ..., up to the number of fields NF.
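• A sketch of the equivalence:

      for (expression1; condition; expression2)
          statement

is identical to

      expression1
      while (condition) {
          statement
          expression2
      }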
• The break statement causes an immediate exit from the
enclosing while or for
• The continue statement causes the next iteration to begin
• The next statement causes the next input line to be read
and the pattern matching to resume at the beginning of
the awk program.
• The exit statement causes an immediate transfer to the END
pattern
• awk provides arrays
– For example, the awk program sketched below collects each line of
input in a separate array element, indexed by line
number, then prints them out in reverse order.
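• A sketch of the reverse-printing program:

      awk '{ line[NR] = $0 }                             # remember each input line
            END { for (i = NR; i > 0; i--) print line[i] }' filenames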
• The function call n = split(s, arr, sep) splits the string s into fields that
are stored in elements 1 through n of the array
arr. If the separator character sep is provided, it is
used; otherwise the current value of FS is used
• awk provides associative arrays
– The sketch below is a complete program for
adding up and printing the sums for name-value pairs.
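• A sketch, for input lines of the form "name value":

      { sum[$1] += $2 }
      END { for (name in sum) print name, sum[name] }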
• Syntactically, for (name in sum) is a variant of the for statement:
it loops over all the subscripts of the array, in no particular order
• In awk, there is no explicit string concatenation operator;
strings are concatenated when they are written next to each other

• As in C, the assignment statement can be used as an
expression, so the construction

      if ((n = length($0)) > 72) ...

assigns the length of the input line
to n before testing the value. Notice the parentheses
• A program field n will print the nth field from
each line of input
– For example, who | field 1 prints only the login names.
– One implementation uses single quotes
– Another approach uses double quotes (both are sketched below)
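• Sketches of the two implementations (the shell substitutes its own $1,
the script's argument, into the awk program):

      $ cat field
      awk '{ print $'$1' }'          # close the quote so the shell expands $1

or, using double quotes and escaping the dollar that awk must see:

      $ cat field
      awk "{ print \$$1 }"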
• A second example is addup n, which adds up
the numbers in the nth field
• A third example computes separate sums of each of n
columns, plus a grand total (both are sketched below)
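• A sketch of addup:

      $ cat addup
      awk '{ s += $'$1' }
           END { print s }'

• A minimal sketch of the n-column version (a variant of the original,
where the argument $1 is the number of columns to sum):

      $ cat addup.n
      awk '
      BEGIN { n = '$1' }
            { for (i = 1; i <= n; i++) sum[i] += $i }
      END   { for (i = 1; i <= n; i++) {
                  printf "%g ", sum[i]
                  total += sum[i]
              }
              printf "; total = %g\n", total }
      '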
Good Files and Good Filters
• Many uses of awk are simple one- or two-line programs to do
some filtering as part of a larger pipeline
• Programs like wc or grep can count interesting items or search for
them by name. When more information is present for each object,
the file is still line-by-line, but columnated into fields separated by
blanks or tabs, as in the output of ls -l
• Given data divided into fields, programs like awk can easily select,
process or rearrange the information
• The arguments of filters specify input, never output, so the output
of a command can always be fed to a pipeline. Optional arguments
precede any file names. Finally, error messages are written on the
standard error, so they will not vanish down a pipeline
