Exercise I: Basic Unix for manipulating NGS data

C. Hahn, July 2014

The purpose of this exercise is to practice manipulating NGS (Next Generation
Sequencing) data using simple but incredibly powerful Unix commands.

Try to solve the tasks below using your Unix skills. Do not hesitate to consult
Google for help! Hints for every task can be found in the Hints section below.
Possible solutions are suggested in the Solutions section.

1. Download the test data from
https://www.dropbox.com/s/wcanmej6z03yfmt/testdata_1.tar?dl=0. Create a
directory (exercise_1) in your home directory and copy the archive testdata_1.tar
from ~/Downloads/ to this directory.
2. Decompress the archive.
3. Examine the content of the data files. For basic information on fastq format
please visit: http://en.wikipedia.org/wiki/FASTQ_format. With this information
try to interpret the content of your data files. Do you know what all the lines
represent?
a) Which quality encoding (which offset) do these files use?
b) What is the header of the third read in the file? Is this a single-end (forward)
or a paired-end (reverse) read?
c) What is the header of the last read in the file? Is this a forward or reverse
read?
4. How many lines does the file have?
5. How many reads does the file contain?
6. How many single-end reads (“se”, also referred to as forward reads) and how
many paired-end reads (“pe”, also referred to as reverse reads) does the file contain?
7. Extract all se/pe reads and write them to separate files testdata_1.fastq and
testdata_2.fastq.
8. a) Count the number of reads that contain the sequence TGCACTAC in
testdata_1.fastq.
b) Count the number of reads that start with TGCACTAC (referred to as in-line
barcode) in testdata_1.fastq.
9. Modify all headers in the file testdata_1.fastq. Replace the part of the header,
which identifies the read as se by “/1” and write the data to
testdata_1_newheader.fastq.
10. Extract the first 1000 reads from testdata_1_newheader.fastq and save them to a
file called testdata_1_sub1000.fastq. Gzip this file.
11. Perform the tasks of 9. (except change to “/2”) and 10. above in one single
command using pipes for testdata_2.fastq and write the data into a compressed
file called testdata_2_sub1000.fastq.gz.
12. Identify all reads with the in-line barcode TGCACTAC from the file
testdata_1_sub1000.fastq.gz and write them to the file
sample_TGCACTAC_sub.1.fastq.

Advanced:

13. Which are the 24 most common barcodes (of length 8bp) in the file
testdata_1.fastq and how often do they occur?
14. Extract the pe reads corresponding to the reads in
sample_TGCACTAC_sub.1.fastq from testdata_2_sub1000.fastq.gz and write them
to sample_TGCACTAC_sub.2.fastq.
15. The file “barcodes” contains a list of the in-line barcodes used during the
preparation of the current library. Count the number of reads for every barcode
in testdata_1.fastq.

Hints: The following hints suggest commands that might be useful to solve the
above problems. Not all commands will be needed in all cases, but different
combinations and applications of different subsets of commands can be applied.

1. cd; mkdir; cp;
2. tar;
3. gunzip; cat; zcat; less; more; head; tail;
a) Is the offset of the quality encoding Phred+33 or Phred+64? Visit
http://en.wikipedia.org/wiki/FASTQ_format for help.
4. gunzip; cat; wc; zcat;
5. cat; zcat; grep; wc; |;
6. cat; zcat; grep; wc; |;
7. cat; zcat; grep (-A, -v); >; |;
8. cat; zcat; grep; wc; |;
For many applications it might be useful to sequence DNA from
several DNA extracts (different individuals, species, etc.) during the same run of an
NGS instrument (often termed multiplexing). Specific short DNA sequences
(barcodes) are ligated to the DNA fragments of different samples during NGS
library preparation. After sequencing, reads can be assigned back to individual
DNA samples based on this barcode. In our case individuals are identified by an
8 bp in-line barcode, i.e. the first 8 bp of the se reads.
9. cat; sed; >; |;
(example would be: @DHK1:324:C2:4:23:19:41 1:N:0:TGCAACTGG ->
@DHK1:324:C2:4:23:19:41/1;)
10. head; >; gzip;
11. head; |; gzip; >;
12. cat; zcat; grep (-A, -B); >;

Advanced:

13. cat; zcat; sed -n; cut -c; sort; uniq; head; |;
14. for loop; cat; grep; |; >;
15. for loop; cat; sed; grep; cut; uniq; sort; |;

Solutions: Note that there are usually several ways to solve the problems and the
ones stated below represent just examples. If you found another way to do it –
Congratulations! Lines starting with “$” represent commands to be executed in
the terminal window. Italicized text after the “#” gives some extra info on what
the command is doing.

1. $ cd ~/your_directory
$ mkdir exercise_1
$ cd exercise_1
$ cp ~/Downloads/testdata_1.tar .

2. $ tar xvf testdata_1.tar #this will produce the directory “testdata_1”, which
contains the gzipped file testdata_interleaved.fastq.gz
$ ls -hlrt testdata_1/ #look at what is in the directory and have the content
listed with some information on file size in human readable format

3. $ cd testdata_1
$ gunzip testdata_interleaved.fastq.gz #decompresses the “gzipped” file.
Note that per default a new decompressed file is created, while the gzipped
version disappears. This behavior can be modified in various ways. See
manual.
$ less testdata_interleaved.fastq #less is a useful program to look at large
text files. It does not have to read the entire text file before starting so it is
much faster to open large files than your standard text editor. Navigate
with up/down key. Many, many functions including pattern search. Look at
manual for details. Quit with “q”.
$ more testdata_interleaved.fastq #similar to less but slightly different
functionality
$ head testdata_interleaved.fastq #writes the first 10 lines of the file to
your screen (usually called standard output or STDOUT). The number of lines
to be written out can be controlled with -n (see man)
$ tail testdata_interleaved.fastq #writes the last 10 lines of the file to
STDOUT.
$ cat testdata_interleaved.fastq #writes the entire content of the file to
STDOUT; not very helpful at first glance (stop the process by pressing
“CTRL-c”), but you can use “pipe” (“|”) to forward the content of the
STDOUT directly to another program without displaying it -> very powerful!!
See below.

You may perform all these actions also directly on compressed files.

$ gzip testdata_interleaved.fastq
$ gunzip -c testdata_interleaved.fastq.gz #decompresses the file, but instead of
creating a new file this command writes the content of the file to the
STDOUT.
$ zcat testdata_interleaved.fastq.gz #writes the content of a compressed file
to STDOUT.

$ zcat testdata_interleaved.fastq.gz | head #head cannot directly display
the content of a gzipped file in human readable form (try it). It can however
be combined with other commands using pipe.
$ gunzip -c testdata_interleaved.fastq.gz | tail
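
As noted above, gzip and gunzip replace their input file by default. The following is a minimal sketch of one way to keep both versions, using only the -c flag and output redirection. The file name demo.fastq and its content are made up for illustration and are not part of the test data.

```shell
# Work in a throwaway directory so nothing in exercise_1 is touched.
workdir=$(mktemp -d)

# Hypothetical one-read fastq file, only for demonstration.
printf '@DEMO:1 1:N:0:ACGT\nACGTACGT\n+\nIIIIIIII\n' > "$workdir/demo.fastq"

# -c writes the compressed data to STDOUT; redirecting it keeps demo.fastq.
gzip -c "$workdir/demo.fastq" > "$workdir/demo.fastq.gz"

# gunzip -c decompresses to STDOUT and leaves demo.fastq.gz in place.
gunzip -c "$workdir/demo.fastq.gz" > "$workdir/roundtrip.fastq"

# cmp -s is silent and returns 0 only if both files are byte-identical.
if cmp -s "$workdir/demo.fastq" "$workdir/roundtrip.fastq"; then
    roundtrip="identical"
else
    roundtrip="different"
fi
echo "round trip: $roundtrip"
```

After running this, both demo.fastq and demo.fastq.gz exist side by side in the temporary directory.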

a) The quality scores in the file are encoded in Phred+33 (Sanger) format.
b) The header of the third read (forward read) is:
@DHKW5DQ1:324:C2G0EACXX:4:2308:19447:41921 1:N:0:TGAACTGG
c) The header of the last read (reverse read) is:
@DHKW5DQ1:324:C2G0EACXX:4:2316:21327:100822 2:N:0:TGAACTGG

Note: This data file contains both se and pe reads in what is sometimes
referred to as interleaved format. That means that se and pe reads from a
given DNA fragment occur in the file in consecutive order. Quite often se
and pe reads are provided in separate files. In this case the first read in
the se (often named something_1.fastq) file corresponds to the first read
in the pe file (e.g. something_2.fastq) and so on.
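
The answer to 3a) can also be checked on the command line rather than by eye. The sketch below looks for the lowest ASCII character in the quality lines: the +64 encodings do not use characters below ASCII 59 (“;”), so any lower character points to Phred+33. The demo file and the variable names are assumptions for illustration; the reverse conclusion (no low characters found) is only a hint, not proof of Phred+64.

```shell
workdir=$(mktemp -d)

# Hypothetical one-read fastq file; the quality line contains "#" (ASCII 35).
printf '@DEMO:1 1:N:0:ACGT\nACGT\n+\nII#5\n' > "$workdir/demo.fastq"

# Every 4th line is a quality line. fold puts each character on its own line,
# and sorting in the C locale orders the characters by their ASCII code.
min_char=$(awk 'NR % 4 == 0' "$workdir/demo.fastq" | fold -w 1 | LC_ALL=C sort -u | head -n 1)

# A leading single quote makes printf print the character's numeric code.
min_code=$(printf '%d' "'$min_char")

if [ "$min_code" -lt 59 ]; then
    encoding="Phred+33"
else
    encoding="possibly Phred+64"
fi
echo "lowest quality character: $min_char (ASCII $min_code) -> $encoding"
```

On a real data file it is usually enough to run this on the first few thousand lines (e.g. via head) rather than the whole file.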

4. $ wc -l testdata_interleaved.fastq
$ cat testdata_interleaved.fastq | wc -l
$ zcat testdata_interleaved.fastq.gz | wc -l

The file contains 6705040 lines.

5. You already know that the standard fastq format per definition has 4 lines
per read, so the number of reads in the file is simply the number of
lines/4, i.e. 6705040/4 = 1676260. But you could also simply count all
lines containing headers in the file. Can you find a pattern you could
search for that is true for every header and does not occur in any other
line?
Per definition the header of a fastq record starts with “@”, so you could
count all lines in the file that start with “@”:

$ grep "^@" testdata_interleaved.fastq | wc -l #grep is a very powerful
command for pattern search. The pattern in this case is “@”. The “^” in the
command is a special character and defines the start of the line. So by “^@”
you search for lines starting with @.

The result of the linecount is: 2159438, i.e. not the expected 1676260.
What’s wrong? By chance some of the quality lines might also start with
@. Have a look:

$ grep "^@" testdata_interleaved.fastq | less

So, our pattern has to be more specific. We know that a fastq header
usually starts with an id specific to the machine, which was used to
generate the data. In our case “@DHKW5DQ1” should thus be a pattern
that occurs only in headers.

$ grep "^@DHKW5DQ1" testdata_interleaved.fastq | wc -l
or
$ grep -c "^@DHKW5DQ1" testdata_interleaved.fastq

Result: 1676260

You can try to find other patterns that are specific to headers.
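
The three ways of counting discussed above can be compared on a tiny example. This is a sketch on a made-up two-read file in which one quality line happens to start with “@”; the instrument ID “DEMO” is hypothetical and stands in for “DHKW5DQ1”.

```shell
workdir=$(mktemp -d)

# Hypothetical interleaved file with two reads. Line 4 is a quality line
# that starts with "@", just like some quality lines in the real data.
cat > "$workdir/demo.fastq" <<'EOF'
@DEMO:1 1:N:0:ACGT
ACGT
+
@III
@DEMO:1 2:N:0:ACGT
TTTT
+
IIII
EOF

# 1) Number of lines divided by 4.
lines=$(wc -l < "$workdir/demo.fastq")
reads_by_lines=$((lines / 4))

# 2) Naive header count: also counts the quality line starting with "@".
reads_naive=$(grep -c "^@" "$workdir/demo.fastq")

# 3) Header count using the instrument ID.
reads_by_id=$(grep -c "^@DEMO" "$workdir/demo.fastq")

echo "lines/4: $reads_by_lines, ^@: $reads_naive, ^@DEMO: $reads_by_id"
```

Only the naive pattern overcounts here, for the same reason as in the exercise.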

6. Se/pe reads can be identified immediately by looking at their header. In
our case a typical se read would be identified by a header containing “ 1”,
while a pe read would contain “ 2”. There are different conventions.
Sometimes se/pe reads are identified by headers that end in “/1” and
“/2”, respectively. Use grep to count the number of se/pe reads, by e.g.:

$ grep -c " 1:" testdata_interleaved.fastq
$ zcat testdata_interleaved.fastq.gz | grep -c " 1:"
$ grep " 2:" testdata_interleaved.fastq | wc -l

The file contains 838130 se reads and 838130 pe reads.

7. Grep enables you to search for a pattern and then output the line
containing the pattern plus a given number of lines after the pattern using
the -A flag. While building a complex command you may want to limit
your data to only a subset of the actual data file, and only run the full
command once you are sure it is correct, e.g.:

$ head -n 20 testdata_interleaved.fastq | grep " 1:" -A 3 #searches only the
first 20 lines of the file

Examining the output of the above command you will see that the -A
option separates every hit by lines containing “--”. This is another pattern
that you can look for. However, in this case you will want to invert the pattern
search and only display lines that do not contain the pattern.

$ head -n 20 testdata_interleaved.fastq | grep " 1:" -A 3 | grep -v "^--$"
#Note the use of the special characters “^” for “the beginning of the line”
and “$” for “the end of the line”.

Does the result look ok? If yes you can simply replace head by cat to
output the result for the full data file. Instead of writing the result to the
STDOUT you may save it directly to a file (using “>”).

$ cat testdata_interleaved.fastq | grep " 1:" -A 3 | grep -v "^--$" > testdata_1.fastq

for the pe reads:
$ grep " 2:" -A 3 testdata_interleaved.fastq | grep -v "^--$" > testdata_2.fastq

directly from a compressed data file:
$ zcat testdata_interleaved.fastq.gz | grep " 1:" -A 3 | grep -v "^--$" > testdata_1.fastq
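
An alternative that avoids the “--” separator lines altogether is to split by line position instead of by pattern. The sketch below is not the approach used above; it assumes the file is strictly interleaved (4 lines se, 4 lines pe, and so on), and the demo file and its headers are made up for illustration.

```shell
workdir=$(mktemp -d)

# Hypothetical interleaved file with two read pairs (16 lines).
cat > "$workdir/demo_interleaved.fastq" <<'EOF'
@DEMO:1 1:N:0:ACGT
AAAA
+
IIII
@DEMO:1 2:N:0:ACGT
CCCC
+
IIII
@DEMO:2 1:N:0:ACGT
GGGG
+
IIII
@DEMO:2 2:N:0:ACGT
TTTT
+
IIII
EOF

# Within every block of 8 lines, lines 1-4 are the se read, lines 5-8 the pe read.
awk '(NR - 1) % 8 < 4'  "$workdir/demo_interleaved.fastq" > "$workdir/demo_1.fastq"
awk '(NR - 1) % 8 >= 4' "$workdir/demo_interleaved.fastq" > "$workdir/demo_2.fastq"

# Sanity check: count se and pe headers in each output file.
se_in_1=$(grep -c " 1:" "$workdir/demo_1.fastq")
pe_in_1=$(grep -c " 2:" "$workdir/demo_1.fastq")
pe_in_2=$(grep -c " 2:" "$workdir/demo_2.fastq")
echo "demo_1: $se_in_1 se, $pe_in_1 pe; demo_2: $pe_in_2 pe"
```

If the file is not strictly interleaved (e.g. some reads lost their mate), the pattern-based approach with grep is the safer choice.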

8. A simple pattern search as before should do here:

a)
$ grep -c "TGCACTAC" testdata_1.fastq
$ grep "TGCACTAC" testdata_1.fastq | wc -l

This identifies 11565 reads.

b)
$ grep -c "^TGCACTAC" testdata_1.fastq
$ grep "^TGCACTAC" testdata_1.fastq | wc -l

This identifies 9582 reads.
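
Note that grep searches every line of the file, including headers, and the headers in this data set end in an index sequence. The following sketch shows, on a made-up three-read file, how restricting the search to the sequence lines (every 4th line starting at line 2) can change the count. The file content is hypothetical; whether this matters for the real test data depends on the index sequences it contains.

```shell
workdir=$(mktemp -d)
barcode="TGCACTAC"

# Hypothetical se file: read 1 has the barcode only in its header,
# read 2 starts with it, read 3 contains it further inside the read.
cat > "$workdir/demo_1.fastq" <<'EOF'
@DEMO:1 1:N:0:TGCACTAC
AAAAAAAAAAAA
+
IIIIIIIIIIII
@DEMO:2 1:N:0:GGGGGGGG
TGCACTACAAAA
+
IIIIIIIIIIII
@DEMO:3 1:N:0:GGGGGGGG
AATGCACTACAA
+
IIIIIIIIIIII
EOF

# Any line containing the barcode (headers included).
any_line=$(grep -c "$barcode" "$workdir/demo_1.fastq")

# Sequence lines only: contains the barcode / starts with the barcode.
contains=$(awk 'NR % 4 == 2' "$workdir/demo_1.fastq" | grep -c "$barcode")
starts=$(awk 'NR % 4 == 2' "$workdir/demo_1.fastq" | grep -c "^$barcode")

echo "any line: $any_line, sequence contains: $contains, sequence starts with: $starts"
```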

9. A very useful program to find and replace patterns in text files is sed. Sed
is a very powerful tool. A basic usage for find and replace would be:
$ sed 's/pattern/replace/g' file. This would output the content of the file
to STDOUT. Each occurrence of “pattern” in the file would be replaced by
“replace”. Sed is case sensitive. For example, you may want to replace
every “C” in your file with a “c”. Try it:

$ sed 's/C/c/g' testdata_1.fastq | less

Note that in Unix some characters have a special meaning. You are already
familiar with “^” for “the beginning of a line” and “$” for “the end of a
line”. Other examples: “.” is interpreted by sed as “any character”, while
“.*” is interpreted as “any number of arbitrary characters”, i.e. it matches
everything up to the end of the line. These special characters are very
useful in the context of regular expressions (google “Unix regular
expressions” to find out more). However, you can tell sed (and also grep)
to interpret these characters in their literal meaning by prepending
“\” to the character. For example, if you want to replace every literal “.” in a
file, instead of replacing “every character” in the file:
$ sed 's/\./REPLACE/g' testdata_1.fastq | less

A solution to our problem could thus be:

$ sed 's/ 1:.*/\/1/g' testdata_1.fastq > testdata_1_newheader.fastq #Note
the use of “.*” to specify everything that follows after “ 1:” and the use of “\”
before “/1” to force the literal interpretation of “/”, instead of specifying the
end of the pattern as part of sed.
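
The substitution can be tried on a single header before running it on the whole file. The sketch below uses the example header from the hints and also shows a second, equivalent spelling: sed accepts other delimiter characters than “/” in the s command, which avoids the need to escape the “/” in “/1”. The variable names are made up for illustration.

```shell
# Example header taken from the hint for task 9.
old_header='@DHK1:324:C2:4:23:19:41 1:N:0:TGCAACTGG'

# Variant 1: "/" as delimiter, so the literal "/" has to be escaped.
new_escaped=$(printf '%s\n' "$old_header" | sed 's/ 1:.*/\/1/')

# Variant 2: "|" as delimiter, so "/" needs no escaping.
new_alt=$(printf '%s\n' "$old_header" | sed 's| 1:.*|/1|')

echo "$new_escaped"
echo "$new_alt"
```

Both variants print the same new header.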

10. $ head -n 4000 testdata_1_newheader.fastq > testdata_1_sub1000.fastq
$ gzip testdata_1_sub1000.fastq

11. $ sed 's/ 2:.*/\/2/g' testdata_2.fastq | head -n 4000 | gzip > testdata_2_sub1000.fastq.gz
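
The number 4000 in the two solutions above is simply 1000 reads times 4 lines per read. A minimal sketch of the same pipeline with the number of reads kept in a variable follows; the demo file, its headers and the variable name n_reads are assumptions for illustration.

```shell
workdir=$(mktemp -d)
n_reads=2   # number of reads to keep (1000 in the exercise)

# Hypothetical pe file with three reads.
for i in 1 2 3; do
    printf '@DEMO:%s 2:N:0:ACGT\nACGT\n+\nIIII\n' "$i"
done > "$workdir/demo_2.fastq"

# Rewrite the headers, keep the first n_reads reads (4 lines each), compress.
sed 's/ 2:.*/\/2/' "$workdir/demo_2.fastq" |
    head -n $((n_reads * 4)) |
    gzip > "$workdir/demo_2_sub.fastq.gz"

# Check the result without decompressing the file on disk.
reads_in_sub=$(gunzip -c "$workdir/demo_2_sub.fastq.gz" | awk 'NR % 4 == 1 { n++ } END { print n + 0 }')
first_header=$(gunzip -c "$workdir/demo_2_sub.fastq.gz" | head -n 1)
echo "$reads_in_sub reads, first header: $first_header"
```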

12. $ zcat testdata_1_sub1000.fastq.gz | grep "^TGCACTAC" -A 2 -B 1 |
grep "^--$" -v > sample_TGCACTAC_sub.1.fastq
#in addition to using -A for “lines after pattern” you can also specify the
number of lines before the pattern that you want with “-B”

The resulting file should contain 11 sequences.
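
Another way to pull out complete records, sketched here as an alternative to grep with -A and -B, is to let awk remember the lines of each record and print all four once the sequence line is known to start with the barcode. The demo file and variable names are made up; gunzip -c is used in place of zcat, which is equivalent for .gz files.

```shell
workdir=$(mktemp -d)
barcode="TGCACTAC"

# Hypothetical compressed se file with three reads; only read 2 starts with the barcode.
printf '%s\n' \
    '@DEMO:1/1' 'AAAATGCACTAC' '+' 'IIIIIIIIIIII' \
    '@DEMO:2/1' 'TGCACTACGGGG' '+' 'IIIIIIIIIIII' \
    '@DEMO:3/1' 'CCCCCCCCCCCC' '+' 'IIIIIIIIIIII' |
    gzip > "$workdir/demo_1_sub.fastq.gz"

# Store header, sequence and "+" line; on the 4th line of each record decide
# whether to print. index(seq, bc) == 1 means "seq starts with bc".
gunzip -c "$workdir/demo_1_sub.fastq.gz" |
    awk -v bc="$barcode" '
        NR % 4 == 1 { header = $0 }
        NR % 4 == 2 { seq = $0 }
        NR % 4 == 3 { plus = $0 }
        NR % 4 == 0 && index(seq, bc) == 1 {
            print header; print seq; print plus; print $0
        }' > "$workdir/sample_sub.1.fastq"

n_lines=$(awk 'END { print NR }' "$workdir/sample_sub.1.fastq")
hit_header=$(head -n 1 "$workdir/sample_sub.1.fastq")
echo "$n_lines lines written, header: $hit_header"
```

Because the decision is made per record, no “--” separator lines appear and nothing has to be filtered out afterwards.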

13. The current problem will require a complex command, which we will try
to build step by step. The only relevant lines for this task are those
containing the nucleotide sequence. Per definition fastq format contains
the nucleotide sequence in the second line. Every 4th line after that will be
another nucleotide sequence.
A very handy function of sed is to print only a specified line followed by
every nth line after that. In our case we will want to print every 4th line
starting from the second line of the file.

$ sed -n '2~4p' testdata_1.fastq | less

The command cut can be used to only print selected characters of a line.
In our case we will want to reduce our file to only the first 8 characters (i.e.
characters 1-8) of each line, because we know that the in-line barcode is
located there. Build on the previous command:

$ sed -n '2~4p' testdata_1.fastq | cut -c 1-8 | less

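The command sort can be used to sort given data in particular ways, for
example alphabetically (the default) or numerically (-n). The default is
sufficient here, because all we need is for identical barcodes to end up on
consecutive lines.

$ sed -n '2~4p' testdata_1.fastq | cut -c 1-8 | sort | less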

The command uniq can be used to collapse consecutive identical lines and
thus reduce redundancy in files.

$ sed -n '2~4p' testdata_1.fastq | cut -c 1-8 | sort | uniq | less

With a little extra flag uniq can at the same time count the number of
occurrences for each line.

$ sed -n '2~4p' testdata_1.fastq | cut -c 1-8 | sort | uniq -c | less

The next step will be to sort the result numerically and at the same time
to reverse the sorting order, so that the highest number appears on top.

$ sed -n '2~4p' testdata_1.fastq | cut -c 1-8 | sort | uniq -c | sort -nr | less

Finally, we simply limit our output to the first 24 lines, which in our case
corresponds to the 24 most abundant barcodes.

$ sed -n '2~4p' testdata_1.fastq | cut -c 1-8 | sort | uniq -c | sort -nr | head -n 24

Extra question:
From our sequencing provider we know that the sequence “CCGTCTAC” is
not a barcode that has been used during library preparation, yet it comes
up among the 24 most abundant sequences. What could be an explanation
for its high abundance?
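
The address form “2~4” (first line, then step) is a GNU extension of sed and may not be available on other systems. The sketch below runs the same pipeline on a made-up three-read file, using awk 'NR % 4 == 2' as a portable stand-in for the sed step; the file content and variable names are assumptions for illustration.

```shell
workdir=$(mktemp -d)

# Hypothetical se file: two reads start with TGCACTAC, one with AAAAAAAA.
printf '%s\n' \
    '@DEMO:1/1' 'TGCACTACGGGG' '+' 'IIIIIIIIIIII' \
    '@DEMO:2/1' 'AAAAAAAACCCC' '+' 'IIIIIIIIIIII' \
    '@DEMO:3/1' 'TGCACTACTTTT' '+' 'IIIIIIIIIIII' \
    > "$workdir/demo_1.fastq"

# Sequence lines -> first 8 characters -> group -> count -> most frequent first.
top=$(awk 'NR % 4 == 2' "$workdir/demo_1.fastq" |
    cut -c 1-8 | sort | uniq -c | sort -nr | head -n 1)

# uniq -c pads the count with spaces; awk splits on whitespace regardless.
top_count=$(printf '%s\n' "$top" | awk '{ print $1 }')
top_barcode=$(printf '%s\n' "$top" | awk '{ print $2 }')
echo "most common barcode: $top_barcode ($top_count reads)"
```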

14. You know that the headers of corresponding se and pe reads are largely
identical. They only differ in the part that identifies the read as se/pe. So,
one can use the ids of the se reads in a pattern search to identify the
corresponding pe reads. This could be done manually, by copying the
header of the first read in sample_TGCACTAC_sub.1.fastq, remove the
se/pe specific part and use it in a pattern search:

$ zcat testdata_2_sub1000.fastq.gz |
grep "^@DHKW5DQ1:324:C2G0EACXX:4:2308:2072:42111" -A 3 > sample_TGCACTAC_sub.2.fastq

Then do the same for the second header in sample_TGCACTAC_sub.1.fastq
and append the result to sample_TGCACTAC_sub.2.fastq with the “>>”
operator.

$ zcat testdata_2_sub1000.fastq.gz |
grep "^@DHKW5DQ1:324:C2G0EACXX:4:2308:8096:42248" -A 3 >> sample_TGCACTAC_sub.2.fastq

.. and so on for the remaining 9 sequences. A much more efficient way
would be to assign the header ID to a variable and to use this variable as a
dynamically changing search pattern.

You can assign the search pattern as a value (in this case a string) to a
variable, e.g. named header ..

$ header="@DHKW5DQ1:324:C2G0EACXX:4:2308:2072:42111"

.. and subsequently use the variable in the search in place of the text. To
tell Unix to search for the string stored in the variable instead of the literal
“header” you identify header as a variable by prepending “$”, like
“$header”. Try the following:

$ echo header #this will write header to your screen

$ echo $header #this identifies header as a variable and the string assigned
to the variable will be written to STDOUT

You can now assign a different value to the variable, e.g.:

$ header="I am learning Unix"

and display the new content of the variable using the same command:

$ echo $header

In the context of our search you can try:

$ header="@DHKW5DQ1:324:C2G0EACXX:4:2308:2072:42111"
$ zcat testdata_2_sub1000.fastq.gz | grep "^$header" -A 3 >
sample_TGCACTAC_sub.2.using_variable.fastq

$ header="@DHKW5DQ1:324:C2G0EACXX:4:2308:8096:42248"
$ zcat testdata_2_sub1000.fastq.gz | grep "^$header" -A 3 >>
sample_TGCACTAC_sub.2.using_variable.fastq

Instead of manually assigning new search strings to the variable you can
do this automatically.
First produce a simple command that lists the IDs of the 11 sequences in
sample_TGCACTAC_sub.1.fastq and remove the se specific part of the
header, e.g. replace it with an empty string.

$ grep "/1$" sample_TGCACTAC_sub.1.fastq | sed 's/\/1$//'

Write the resulting list to a file:

$ grep "/1$" sample_TGCACTAC_sub.1.fastq | sed 's/\/1$//' > list

Now read this file line by line and assign the content of each line in turn to
the variable $header. The content of the variable is then used as search
pattern as before. This can be done via a “for loop” (google “Unix for loop”
for many useful examples).

$ for header in $(cat list); do zcat testdata_2_sub1000.fastq.gz |
grep "^$header" -A 3 | grep "^--$" -v; done > sample_TGCACTAC_sub.2.fastq

You could even do it all in one go:

$ for header in $(grep "/1$" sample_TGCACTAC_sub.1.fastq | sed 's/\/1$//'); do
zcat testdata_2_sub1000.fastq.gz | grep "^$header" -A 3 | grep "^--$" -v;
done > sample_TGCACTAC_sub.2.fastq
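
The loop above decompresses and scans the pe file once per header, and a pattern such as "^$header" also matches any longer ID that merely begins with the same characters. The following is a sketch of an alternative, not the method of this exercise: awk first loads the list of IDs and then makes one pass over the pe reads, comparing complete IDs. The demo files, the “DEMO” IDs and the variable names are made up for illustration.

```shell
workdir=$(mktemp -d)

# Hypothetical se sample with one read, and a compressed pe file with three reads.
printf '%s\n' '@DEMO:2/1' 'TGCACTACAAAA' '+' 'IIIIIIIIIIII' \
    > "$workdir/sample_sub.1.fastq"
printf '%s\n' \
    '@DEMO:1/2' 'AAAA' '+' 'IIII' \
    '@DEMO:2/2' 'CCCC' '+' 'IIII' \
    '@DEMO:22/2' 'GGGG' '+' 'IIII' |
    gzip > "$workdir/demo_2_sub.fastq.gz"

# List of read IDs without the se-specific "/1".
grep "^@DEMO" "$workdir/sample_sub.1.fastq" | sed 's|/1$||' > "$workdir/list"

# A prefix search would hit both @DEMO:2/2 and @DEMO:22/2.
prefix_hits=$(gunzip -c "$workdir/demo_2_sub.fastq.gz" | grep -c "^@DEMO:2")

# First input (list): remember every ID. Second input ("-" = STDIN): on each
# header line strip "/2" and look the ID up; print lines while keep is true.
gunzip -c "$workdir/demo_2_sub.fastq.gz" |
    awk 'NR == FNR { wanted[$0] = 1; next }
         FNR % 4 == 1 { id = $0; sub(/\/2$/, "", id); keep = (id in wanted) }
         keep' "$workdir/list" - > "$workdir/sample_sub.2.fastq"

exact_hits=$(grep -c "^@DEMO" "$workdir/sample_sub.2.fastq")
echo "prefix search: $prefix_hits headers, exact ID match: $exact_hits header"
```

The NR == FNR idiom relies on the list being non-empty; with an empty list awk would treat the reads as the first input.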

15. Using principally the same approach as in the previous exercise we can
loop line by line through the barcodes file and in turn use each line as
search pattern. For every search pattern we simply count the occurrences.
Finally, we sort the barcodes in descending order of their counts, just for
convenience.

$ for line in $(cat barcodes); do sed -n '2~4p' testdata_1.fastq |
grep "^$line" | cut -c 1-8 | uniq -c; done | sort -rn

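With the uniq -c approach, a barcode that does not occur in any read produces no output line at all. A minimal sketch of a variant that also reports such barcodes with a count of 0 follows, using grep -c inside a while loop. The barcode list, the reads and the variable names are made up for illustration.

```shell
workdir=$(mktemp -d)

# Hypothetical barcode list; GGGGGGGG does not occur in the reads below.
printf '%s\n' 'TGCACTAC' 'AAAAAAAA' 'GGGGGGGG' > "$workdir/barcodes"

# Hypothetical se file: two reads with TGCACTAC, one with AAAAAAAA.
printf '%s\n' \
    '@DEMO:1/1' 'TGCACTACGGGG' '+' 'IIIIIIIIIIII' \
    '@DEMO:2/1' 'AAAAAAAACCCC' '+' 'IIIIIIIIIIII' \
    '@DEMO:3/1' 'TGCACTACTTTT' '+' 'IIIIIIIIIIII' \
    > "$workdir/demo_1.fastq"

# For every barcode count the sequence lines that start with it.
# grep -c prints 0 when nothing matches, so every barcode gets a line.
counts=$(while read -r bc; do
    n=$(awk 'NR % 4 == 2' "$workdir/demo_1.fastq" | grep -c "^$bc")
    printf '%s\t%s\n' "$n" "$bc"
done < "$workdir/barcodes" | sort -rn)

printf '%s\n' "$counts"
```

The output has one tab-separated line per barcode, count first, sorted from most to least frequent.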