Simple Filters
By: Prof. Brijesha Rao
Assistant Professor,
IT Department,
DDU Nadiad
Filters:
head - Displaying the beginning of a file
tail - Displaying the end of a file
cut - Splitting a file vertically
paste - Pasting files
sort - Ordering a file
uniq - Locating repeated & nonrepeated lines
grep - scans its input for a pattern and displays lines
containing that pattern.
sed - stream editor; it can perform many operations on
a file, such as searching, find and replace, insertion,
and deletion.
awk - a simple command-line filtering tool
head - Displaying the beginning of a file:
It displays the top of the file.
When used without an option, it displays the first
10 lines of the specified file.
Syntax : $ head [options] filename
Options:
-n : display the first n lines
-c : display the first n bytes of the file
Ex:- $ head data_list
$ head -n 3 data_list
$ head -3 data_list
$ head -c 50 data_list
$ vi `ls -t | head -1` (opens the most recently modified file)
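The examples are easier to follow on a concrete file; the contents of data_list shown below are assumed only for illustration (id|name|designation|date of birth|salary):
$ cat data_list
1006|anil patel|manager|26/01/85|52000
1002|meena shah|clerk|12/09/90|18000
1004|rahul mehta|manager|03/07/88|51000
1001|priya desai|clerk|22/11/92|17500
1003|sanjay jadeja|peon|15/04/79|12000
$ head -3 data_list
1006|anil patel|manager|26/01/85|52000
1002|meena shah|clerk|12/09/90|18000
1004|rahul mehta|manager|03/07/88|51000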
tail - Displaying the end of a file:
It is just the reverse of head.
Syntax : $ tail [options] filename
Options:
-n : display the last n lines
-c : extract bytes instead of lines
Ex:- $ tail data_list
$ tail -n 3 data_list
$ tail -3 data_list
$ tail -c -50 data_list
$ tail -c +50 data_list
$ tail -c 50 data_list
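With the same assumed data_list, tail picks lines from the end of the file:
$ tail -2 data_list
1001|priya desai|clerk|22/11/92|17500
1003|sanjay jadeja|peon|15/04/79|12000
tail -c -50 data_list would print only the last 50 bytes (the last line and part of the one before it), while tail -c +50 prints everything from the 50th byte onwards.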
cut - Splitting a file vertically:
It lets us extract particular characters or fields
from a file, slicing the file vertically rather than
horizontally.
Syntax : $ cut [options] filename
Options:
-c : extract particular columns by characters
-b : extract particular columns by bytes
-f : cut fields
-d : use DELIM instead of TAB as the field
delimiter
Ex:- $ cut -c 3-5,15-18 data_list
$ cut -d \| -f 2,3 data_list
$ who | cut -d " " -f 1,2
$ cat data_list | cut -d "|" -f 1,3
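With the assumed data_list, cutting fields 2 and 3 gives the name and designation columns; cut keeps the delimiter between the selected fields:
$ cut -d "|" -f 2,3 data_list
anil patel|manager
meena shah|clerk
rahul mehta|manager
priya desai|clerk
sanjay jadeja|peon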
paste - Pasting files:
Whatever we have cut, we can paste it back, but
vertically rather than horizontally.
Syntax : $ paste [options] file1 file2
Options:
-d : specify the delimiter
-s : join all lines of a file into one line
Ex:- $ cut -d "|" -f 1,2 data_list | tee ab
$ cut -d "|" -f 3,4 data_list | tee ab1
$ paste ab ab1
$ paste -d "$" ab ab1
$ cut -d "|" -f 1,2 data_list | paste -d "#" ab -
$ cut -d "|" -f 1,2 data_list | paste -d "#" - ab
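A short run on the assumed data_list shows the two cut pieces being glued back together; the filenames names and pay are arbitrary:
$ cut -d "|" -f 2 data_list > names
$ cut -d "|" -f 5 data_list > pay
$ paste -d "|" names pay
anil patel|52000
meena shah|18000
rahul mehta|51000
priya desai|17500
sanjay jadeja|12000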
sort - Ordering a file:
It sorts on the specified fields.
There are many options for this command.
Syntax : $ sort [options] filename
Options      Description
-t char      Uses char as the delimiter to identify fields.
-k n         Sorts on the nth field.
-k m,n       Starts sort on the mth field & ends sort on the nth field.
-k m.n       Starts sort on the nth column of the mth field.
-u           Removes repeated lines.
-n           Sorts numerically.
-r           Reverses sort order.
-f           Case-insensitive sort.
-c           Checks if the file is sorted.
-o f_name    Places output in file f_name.
Options:
-k : sort on the specified field.
Ex:- $ sort -t "|" -k 2 data_list
-r : reverse the sort order.
Ex:- $ sort -t "|" -r -k 2 data_list
-k m,n : sort starts at the mth field & ends at the nth field.
Ex:- $ sort -t "|" -k 3,3 -k 2,2 data_list
-k m.n : sort on the nth column of the mth field.
Ex:- $ sort -t "|" -k 4.7,4.8 data_list
-n : sort numerically.
Ex:- $ sort -n data_list
-u : remove repeated lines.
Ex:- $ cut -d "|" -f 3 data_list | sort -u
-o f_name : store the output in f_name.
Ex:- $ sort -o abc -t "|" -k 3 data_list
-c : check whether the file is sorted or not.
Ex:- $ sort -c data_list
Ex:- $ sort -t "|" -c -k 2 data_list
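For instance, a two-level sort on the assumed data_list orders the lines by designation first and by name within each designation:
$ sort -t "|" -k 3,3 -k 2,2 data_list
1002|meena shah|clerk|12/09/90|18000
1001|priya desai|clerk|22/11/92|17500
1006|anil patel|manager|26/01/85|52000
1004|rahul mehta|manager|03/07/88|51000
1003|sanjay jadeja|peon|15/04/79|12000
Similarly, sort -t "|" -k 4.7,4.8 data_list orders the lines by year of birth, i.e. characters 7 and 8 of the fourth field.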
uniq - Locating repeated & nonrepeated lines:
When you merge files, you'll face the problem of
duplicate entries.
The 'uniq' command displays only the unique
lines of a sorted file.
Syntax: $ uniq [options] filename.
Options:
-u : select the nonrepeated lines
-d : select the duplicate lines
-c : count the frequency of occurrence
Ex:- $ cut -d "|" -f 3 data_list | sort | uniq -u
Ex:- $ cut -d "|" -f 3 data_list | sort | uniq -d
Ex:- $ cut -d "|" -f 3 data_list | sort | uniq -c
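On the assumed data_list, the designation column summarises like this:
$ cut -d "|" -f 3 data_list | sort | uniq -c
      2 clerk
      2 manager
      1 peon
Here uniq -d would list only clerk and manager (the repeated entries), while uniq -u would list only peon.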
Advanced Filters: grep & sed
grep:
grep scans its input for a pattern and displays
lines containing that pattern.
When used with different options, it can also
display line numbers or filenames containing
the required pattern.
Syntax:
grep options pattern filename(s)
Ex: grep “abc” std_db
Being a filter, grep can also read the standard
input and search it for the desired pattern.
Its output can also be redirected to a file.
Note: We could write the pattern without the
quotes, but it is safe to use either double or single
quotes while writing the pattern.
Ex: grep bbb patel std_db
Ex: grep "bbb patel" std_db
When grep doesn't find the pattern, it silently
returns the prompt.
When grep is used with multiple filenames, it
displays the respective filename at the start of
each matching line.
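A quick run makes the quoting issue and the filename prefix visible; the contents of std_db below are assumed only for illustration:
$ cat std_db
101|mohan agarwal|CE
102|bbb patel|IT
103|sita Agrawal|EC
$ grep bbb patel std_db
grep: patel: No such file or directory
std_db:102|bbb patel|IT
$ grep "bbb patel" std_db
102|bbb patel|IT
In the unquoted form, grep treats patel as a filename; in the quoted form, the whole string is the pattern.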
grep Options:
Ignoring case (-i) :
When you are not sure of the case of the
required pattern, you could use the -i option.
Ex: grep -i "agarwal" std_db
Deleting Lines (-v)
To invert the role of grep, i.e. to select all the
lines except those containing the pattern, you
can use the -v option.
Ex: grep -v "agarwal" std_db
Displaying Line Numbers (-n)
When you want to display the line numbers
containing the pattern, you can use the -n
option.
Ex: grep -n "agarwal" std_db
If you want to extract only the line numbers
containing the pattern, you can use cut along
with this.
Counting Lines containing Pattern (-c):
If you want to know the total number of lines
containing the pattern, you can use the -c
option.
Note: This count is different from the number
of occurrences of that pattern.
Example:
grep -c "professor" *.txt
cat *.txt | grep -c "professor"
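The difference can be seen with grep -o (not covered above), which prints each match on its own line; the file notes.txt is assumed:
$ cat notes.txt
the professor met another professor
a student arrived
$ grep -c "professor" notes.txt
1
$ grep -o "professor" notes.txt | wc -l
2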
Displaying Filenames (-l):
The -l (list) option displays only the names of
the files containing the pattern.
Example:
grep -l "professor" *.txt
Matching Multiple Patterns (-e):
If you want to match multiple patterns, like
agarwal, aggarwal, Agrawal, etc., then you need
to use the -e option.
Example:
grep -e "agarwal" -e "Agrawal" f1
Taking Patterns from a file (-f) :
Instead of mentioning each pattern explicitly on
the command line, you can store the patterns in
a file and supply that filename with the -f option.
Example:
File: patternfile (one pattern per line)
agarwal
Agrawal
grep -f patternfile f1
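Continuing with the std_db contents assumed earlier, the two patterns in patternfile pick up both spellings:
$ grep -f patternfile std_db
101|mohan agarwal|CE
103|sita Agrawal|EC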
Basic Regular Expression (BRE)
Like the Shell's Wild-Card Characters, grep uses
an expression of a different type to match a
group of similar patterns.
However, unlike Wild-Cards, this expression is a
feature of the command that uses it and has
nothing to do with the shell.
If an expression uses any of the characters listed
below, it is termed a Regular Expression.
Regular Expressions belong to two categories:
(i) Basic Regular Expressions
(ii) Extended Regular Expressions.
grep supports Basic Regular Expressions (BRE)
by default and Extended Regular Expressions
(ERE) with the -E option.
sed supports only the BRE set.
• BRE Character Set:
* : Zero or more occurrences of the
previous character
a* : Nothing or a or aa or aaa, etc.
. : A single Character
.* : Nothing or any no. of Characters
[ijk] : A single Character either i, j or k
[x-z] : Any single character between x & z
[^x-z] : Any single character not between x & z
^abc : Pattern abc at beginning of the line
abc$ : Pattern abc at end of the line
^abc$ : abc as the only word in the line
^$ : Line contains nothing
Examples:
If you want to match Agarwal and agrawal both,
you could use the below expression:
[aA]g[ar][ar]wal
grep "[aA]g[ar][ar]wal" f1
Note that the expression [ar][ar] matches four
patterns (aa, ar, ra & rr), but only two of them
are of importance to us.
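With an assumed file f1 holding a few name variants, the character classes pick out exactly the two spellings we care about:
$ cat f1
mohan agarwal
sita agrawal
ram aggarwal
$ grep "[aA]g[ar][ar]wal" f1
mohan agarwal
sita agrawal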
Examples :
If you want to match aggarwal in addition to
Agarwal and agrawal, you could use the asterisk
in your expression:
grep "[aA]gg*[ar][ar]wal" f1
As * means zero or more occurrences of the
previous character, it works fine here. Note that
this is different from the shell's wild-card *, which
matches any number of any characters.
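Adding g* to the expression now takes in the double-g spelling as well (using the f1 contents assumed above):
$ grep "[aA]gg*[ar][ar]wal" f1
mohan agarwal
sita agrawal
ram aggarwal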
Example:
While the shell uses ' ? ' to match a single
character, BRE set has ' . ' (dot) to match a single
character.
Ex: grep "emp*.c" f1
emp1.c
emp2.c
And so on....
Ex: grep "a.* agarwal" f1
Examples:
If you want all the lines beginning with Hello,
you could use –
grep "Hello" f1
But would that be correct? No,
because Hello could occur anywhere in the line.
So you need to use:
grep "^Hello" f1
Similarly use $ for the end of line matching.
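A small assumed file makes the difference clear:
$ cat greet
Hello world
say Hello there
Goodbye
$ grep "Hello" greet
Hello world
say Hello there
$ grep "^Hello" greet
Hello world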
Examples:
If you want to reverse your search and search
for all the lines not containing H in the
beginning, then the expression would be:
grep "^[^H]" f1
Hence, the caret (^) has three roles to play.
1) [^abc] : not a, b or c
2) ^abc : abc at the beginning of the line
3) a^b : here the caret matches literally
Examples:
$ ls -l | grep "^d"
$ grep "5...$" f1
Examples:
The '-' loses its special meaning when it is placed
at the beginning or end of the character class, or
when used outside the class.
The '.' and '*' lose their special meaning when
placed inside the character class.
If '*' is the first character of an expression, it is
matched literally.
Extended Regular Expression (ERE)
a+ : Matches one or more occurrences of a
a? : Matches zero or one occurrence of a
exp1|exp2 : Matches either expression exp1 or exp2
(x1|x2)x3 : Matches either x1x3 or x2x3
Examples :
The characters + and ? restrict the scope of the
match as compared to the *.
For matching Agarwal and Aggarwal, we can use
the expression Agg*arwal.
But this would also match Aggggggarwal.
To restrict this, we could use the expression
Agg?arwal.
Usage: grep -E "Agg?arwal" f1
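A quick comparison of * and ? (the input lines are supplied with printf just for illustration):
$ printf "Agarwal\nAggarwal\nAgggarwal\n" | grep "Agg*arwal"
Agarwal
Aggarwal
Agggarwal
$ printf "Agarwal\nAggarwal\nAgggarwal\n" | grep -E "Agg?arwal"
Agarwal
Aggarwal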
Examples :
For matching two strings, foolish or girlish, we
could use either of two expressions with the pipe:
1) foolish|girlish
2) (foo|gir)lish
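For example, with an assumed file f2:
$ cat f2
She looks foolish
That was girlish
childish pranks
$ grep -E "(foo|gir)lish" f2
She looks foolish
That was girlish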
• sed - stream editor; it can perform many operations
on a file, such as searching, find and replace,
insertion, and deletion.
• It works well with character-based processing.
• Example 1: sed -n '/hello/p' file1
• This command displays all the lines that contain hello.
• Example 2: sed 's/hello/HELLO/' file1
• This command substitutes the first occurrence of hello
with HELLO on every line; add the g flag
(s/hello/HELLO/g) to replace every occurrence on a line.
• Example 3: sed '/hello/,+2d' file1
• This command deletes each line matching hello together
with the next two lines (the +2 address is a GNU sed
extension).
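• A short demonstration of all three commands on an assumed file1:
$ cat file1
hello world
nothing here
say hello again
last line
$ sed -n '/hello/p' file1
hello world
say hello again
$ sed 's/hello/HELLO/' file1
HELLO world
nothing here
say HELLO again
last line
$ sed '/hello/,+2d' file1
last line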
• awk - a simple command-line filtering tool.
• awk is mostly used for pattern scanning and
processing. It searches one or more files to see
if they contain lines that match the specified
patterns and then performs the associated
actions.
• Syntax:
• awk 'script' filename
• Where 'script' is a set of commands that are
understood by awk and are executed on the file
filename.
• $ awk '/manager/ {print}' employee.txt
• $ awk '{print $1,$4}' employee.txt
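• A short run on an assumed employee.txt (whitespace-separated fields) shows both commands:
$ cat employee.txt
ajay manager account 45000
sunil clerk account 25000
varun manager sales 50000
$ awk '/manager/ {print}' employee.txt
ajay manager account 45000
varun manager sales 50000
$ awk '{print $1,$4}' employee.txt
ajay 45000
sunil 25000
varun 50000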