IBT DNA Seq Analysis

The document outlines the learning objectives and outcomes of an online course on Bioinformatics, focusing on DNA sequence analysis. It covers key topics such as extracting DNA sequences, identifying sequence features, primer design, and gene prediction. The course also discusses the use of biological databases, sequence formats, and various tools for data manipulation and analysis.

Introduction to Bioinformatics online course: IBT

Bioinformatics resources and databases:

Lecture 3: DNA sequence analysis
Nicola Mulder

Learning Objectives

• Objective: Basic DNA sequence analysis: finding sequence features
• Sub objectives:
– Understand how to extract a DNA sequence from the database
– Use online or local tools for simple DNA sequence analysis: finding features on the sequence and their applications

Learning Outcomes

• Understand how to find a DNA sequence and save it in the correct format
• Identify features on the sequence such as coding regions, restriction enzyme sites, etc.
• Design primers for amplification of a DNA sequence
• Interpret sequence analysis results and understand the biological impact of functional regions

Two major components to Bioinformatics
• Storing and retrieving data:
– Biological databases
– Querying these to retrieve data
• Manipulating the data with tools, e.g.:
– Finding features on sequences
– Sequence similarity searches
– Protein families and function prediction
– Comparing sequences (phylogenetics)
– Etc.

Aspects of sequence analysis

[Figure: gene schematic showing the regulatory region, promoter, transcription start, protein-coding region (CDS) and stop codon, with the analyses that apply at each level]
• DNA sequence: gene and promoter prediction; restriction mapping for cloning; primer design for PCR
• RNA sequence: RNA secondary structure, gene expression
• Protein sequence: protein sequence analysis

Overview
• Assume sequence is retrieved from the database
• General text/format manipulation and accession numbers
• DNA sequences
– Restriction analysis
– Primer design
– Finding features: coding and non-coding
– Gene prediction
• RNA sequence analysis
– Summary of kinds of analyses possible

Sequence formats: Fasta

> [title]
[sequence]

>seq1
GGAAAATTAGATGCATGGGAAAAAATTA
GGAAAATTAGACAAATGGGAAAAAATTA
>seq2
AAGTCCCTGGATTTACCCAATGCAGTCGA
CATCGCATTT
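As an illustration of the layout above, here is a minimal Python sketch that reads FASTA text into a dictionary of title to sequence. The function name `parse_fasta` is made up for this example and is not part of the course material; it assumes every record starts with a `>` title line and that sequence lines may wrap.

```python
def parse_fasta(text):
    """Return a dict mapping each record title to its full sequence."""
    records = {}
    title = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new record begins; everything after '>' is the title.
            title = line[1:].strip()
            records[title] = []
        elif title is not None:
            # Sequence lines are collected and joined at the end.
            records[title].append(line)
    return {t: "".join(parts) for t, parts in records.items()}


example = """>seq1
GGAAAATTAGATGCATGGGAAAAAATTA
GGAAAATTAGACAAATGGGAAAAAATTA
>seq2
AAGTCCCTGGATTTACCCAATGCAGTCGA
CATCGCATTT
"""

records = parse_fasta(example)
for title, seq in records.items():
    print(title, len(seq))
```

Wrapped sequence lines are joined, so each record comes back as one continuous string.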

Sequence formats: GenBank
LOCUS 525-42 1588 bp
DEFINITION 525-42 1588 bp
TITLE 525-42
FEATURES Location/Qualifiers
exon 39..70
/note="exon1 is believed to have an alternative splice donor site"
ORIGIN

1 ATGTT AAGAG GGGGA AAATT AGATG CATGG GAAAA AATTA GGTTA AGGCC
51 AGGGG GAAAG AAATG CTATA NGATA AAACA CCTAG TATGG GCAAG CAGGG
101 AGCTG GAAAG ATTTG CACTT AACCC TGGCC TTTTA GAGAC ATCAG ANGGC
151 TGTAA ACAAA TAATG NAACA GATAC AACCA GCTCT TCAGA CAGGA ACAGA

Converting between sequence formats (save options)
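Format conversion is normally done with the save options of the database or an existing tool. Purely as a sketch of what such a conversion involves, the Python below pulls the name from the LOCUS line and the bases from the ORIGIN block, then writes FASTA. The function name `genbank_to_fasta`, the 60-column wrap and the `//` terminator handling are assumptions for this example; it ignores the FEATURES table entirely.

```python
def genbank_to_fasta(text, width=60):
    """Convert a flat-file record with LOCUS and ORIGIN sections to FASTA."""
    name = "unknown"
    seq_parts = []
    in_origin = False
    for line in text.splitlines():
        if line.startswith("LOCUS"):
            fields = line.split()
            if len(fields) > 1:
                name = fields[1]  # second field of the LOCUS line
        elif line.startswith("ORIGIN"):
            in_origin = True
        elif line.startswith("//"):
            in_origin = False  # end-of-record marker
        elif in_origin:
            # Keep letters only: drops the position counter and spaces.
            seq_parts.append("".join(ch for ch in line if ch.isalpha()))
    sequence = "".join(seq_parts)
    wrapped = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + name + "\n" + "\n".join(wrapped) + "\n"


record = """LOCUS 525-42 1588 bp
ORIGIN
1 ATGTT AAGAG GGGGA AAATT AGATG CATGG GAAAA AATTA GGTTA AGGCC
51 AGGGG GAAAG AAATG CTATA NGATA AAACA CCTAG TATGG GCAAG CAGGG
//
"""

fasta = genbank_to_fasta(record)
print(fasta)
```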

DNA sequence composition

• Nucleotide composition (% GC vs AT content)

• GC base pairs (three hydrogen bonds) are more stable than AT base pairs (two hydrogen bonds)
• Applications:
– Horizontal gene transfer analysis
– Gene prediction
– Primer design
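Nucleotide composition is simple to compute. The sketch below gives % GC for a whole sequence and for sliding windows, which is one simple way to look for regions of unusual composition. The function names, window size and step are illustrative assumptions, and ambiguous bases such as N are left out of the count.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous bases of seq."""
    counted = [b for b in seq.upper() if b in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(1 for b in counted if b in "GC")
    return 100.0 * gc / len(counted)


def sliding_gc(seq, window, step):
    """(start, %GC) for each full window along the sequence."""
    return [(i, gc_content(seq[i:i + window]))
            for i in range(0, len(seq) - window + 1, step)]


print(gc_content("ATGC"))
print(sliding_gc("AAAAGGGG", window=4, step=4))
```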

Accession numbers

• GenBank/EMBL/DDBJ: 1 letter & 5 digits, e.g.: U12345, or 2 letters & 6 digits, e.g.: AY123456
• GenPept sequence records: 3 letters & 5 digits, e.g.: AAA12345
• UniProt: all 6 characters: [A,B,O,P,Q] [0-9] [A-Z,0-9] [A-Z,0-9] [A-Z,0-9] [0-9], e.g.: P12345 and Q9JJS7
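The accession formats listed above can be written as regular expressions. This sketch encodes only the patterns given on this slide, so it is not a complete or current specification of any database's accession scheme. The labels and the function name `classify_accession` are made up for illustration. A string can match more than one pattern, so the function returns every match.

```python
import re

# Patterns transcribed from the slide; real databases accept more formats.
PATTERNS = {
    "GenBank/EMBL/DDBJ nucleotide":
        re.compile(r"^(?:[A-Z][0-9]{5}|[A-Z]{2}[0-9]{6})$"),
    "GenPept protein":
        re.compile(r"^[A-Z]{3}[0-9]{5}$"),
    "UniProt":
        re.compile(r"^[ABOPQ][0-9][A-Z0-9]{3}[0-9]$"),
}


def classify_accession(acc):
    """Return the names of all slide patterns that the accession matches."""
    acc = acc.strip().upper()
    return [name for name, pattern in PATTERNS.items() if pattern.match(acc)]


for acc in ["U12345", "AY123456", "AAA12345", "Q9JJS7", "P12345"]:
    print(acc, classify_accession(acc))
```

Note that P12345 fits both the one-letter nucleotide pattern and the UniProt pattern, so format alone does not always identify the source database.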

Cross-referencing identifiers

• So many different IDs for the same thing, e.g. Ensembl, EMBL, HGNC, UniGene, UniProt, Affy ID, etc.
• Need mapping files to move between them to avoid having to parse every entry
• UniProt website mapper (www.uniprot.org)
• PICR (http://www.ebi.ac.uk/Tools/picr/) enables mapping between IDs
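A mapping file can be as simple as two tab-separated columns. The sketch below loads such a file into a lookup table. The two-column layout and the identifiers are hypothetical placeholders, not the format of any particular resource. One source ID may map to several target IDs, so the values are sets.

```python
import csv
import io
from collections import defaultdict


def load_mapping(handle):
    """Read 'source<TAB>target' lines into a dict of source -> set of targets."""
    mapping = defaultdict(set)
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2 or row[0].startswith("#"):
            continue  # skip comments and incomplete lines
        mapping[row[0]].add(row[1])
    return mapping


# Made-up identifiers, for illustration only.
example = "ID_A1\tID_B7\nID_A1\tID_B9\nID_A2\tID_B3\n"
mapping = load_mapping(io.StringIO(example))
print(sorted(mapping["ID_A1"]))
```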

Example conversion

DNA sequence analysis

• Restriction analysis, e.g. for cloning: looks for recognition sites
• Primer design
• Finding features on a sequence
• Gene prediction:
– Translation
– Promoter prediction

Bioinformatics and cloning

• Retrieving sequence of interest

• Identifying restriction enzyme sites
• Matching these to RE sites in cloning vector

Restriction enzyme analysis

• Restriction enzymes recognize specific, defined sequences of 4 to 8 base pairs in DNA and cut the DNA there

Microorganism              Enzyme   Recognition sequence (gap = cut)   Ends produced
Haemophilus aegyptius      HaeIII   5’…GG CC…3’                        Blunt end
                                    3’…CC GG…5’
Haemophilus haemolyticus   HhaI     5’…GCG C…3’                        3’ single-strand overhang
                                    3’…C GCG…5’
Escherichia coli           EcoRI    5’…G AATTC…3’                      5’ single-strand overhang
                                    3’…CTTAA G…5’
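A restriction search is essentially an exact string search for each recognition site. The sketch below covers the three enzymes in the table, storing each site with the position of the top-strand cut inside it. The function names and the test sequence are invented for this example. It treats the DNA as linear and reports top-strand cuts only; because all three sites are palindromic, searching one strand finds every site.

```python
# Enzyme -> (recognition site, top-strand cut offset within the site)
ENZYMES = {
    "HaeIII": ("GGCC", 2),    # GG^CC, blunt
    "HhaI": ("GCGC", 3),      # GCG^C, 3' overhang
    "EcoRI": ("GAATTC", 1),   # G^AATTC, 5' overhang
}


def find_sites(seq, site):
    """Start positions (0-based) of every occurrence of site, overlaps included."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def digest(seq, enzyme):
    """Top-strand fragments produced by cutting a linear sequence."""
    site, offset = ENZYMES[enzyme]
    fragments = []
    previous = 0
    for cut in (p + offset for p in find_sites(seq, site)):
        fragments.append(seq[previous:cut])
        previous = cut
    fragments.append(seq[previous:])
    return fragments


example = "AAGAATTCTTGGCCAAGCGCTT"  # invented test sequence
for name in ENZYMES:
    print(name, find_sites(example, ENZYMES[name][0]), digest(example, name))
```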

Restriction map

Removing vector sequence

• Vector contamination can be identified by searching your sequence against a database of vector sequences (UniVec), e.g. http://www.ncbi.nlm.nih.gov/VecScreen/VecScreen.html, which uses BLASTN
• Ideally vector sequence is found only at the extremities; vector within the insert indicates contamination

PCR and primer design

• Can engineer restriction sites into primers
• Primers should be of similar length and melting temperature (Tm)
• Should amplify only the required piece from the genome
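The "similar length and Tm" check can be sketched with the Wallace rule, Tm ≈ 2(A+T) + 4(G+C) °C. This is a rough rule of thumb for short oligonucleotides; dedicated tools such as Primer-BLAST use more accurate models. The tolerance values and function names below are assumptions chosen for illustration. The example primers are the first 18 bases of the lecture's translation example sequence and of its reverse complement.

```python
def wallace_tm(primer):
    """Approximate melting temperature (deg C) by the Wallace rule."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def compatible(forward, reverse, max_len_diff=3, max_tm_diff=5):
    """True if the two primers are similar in length and estimated Tm.

    The default tolerances are illustrative, not recommended values.
    """
    return (abs(len(forward) - len(reverse)) <= max_len_diff
            and abs(wallace_tm(forward) - wallace_tm(reverse)) <= max_tm_diff)


forward = "AGTCGGCTGACTGCGTTT"
reverse = "AAGGGAGTAATCGCATTC"
print(wallace_tm(forward), wallace_tm(reverse), compatible(forward, reverse))

# Engineering a restriction site: an EcoRI site added to the 5' end.
tailed_forward = "GAATTC" + forward
print(tailed_forward)
```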

Example with Primer BLAST

Gene Prediction
Wikipedia: A gene is a locatable region of genomic sequence, corresponding to a unit of inheritance, which is associated with regulatory regions, transcribed regions and/or other functional sequence regions

• Look for gene structures
• Move along sequence looking for coding regions and intergenic regions
• Check reading frame: translate
• Look for promoters and poly-adenylation signals
• In eukaryotes look for introns and exons
• Use EST or BLAST support (to reduce pseudogene predictions)

Translation

• Can choose frame if you know it
• Otherwise 6-frame translation:
– Choose start codon ATG
– Otherwise lists all codons between stop codons
• Results: usually the longest ORF starting with Met and ending in a stop codon, with no stop codons inside
• Can confirm this with promoter prediction
• Should use appropriate codon usage table

Open reading frame

• String of in-frame triplets of bases (codons), each specifying an amino acid
• Starts with ATG (Met) or, less commonly, GTG (normally a Val codon)
• Ends with stop codon
• A one-base insertion or deletion puts the sequence out of frame (frameshift)

Genetic code

• Each amino acid is specified by a triplet of bases (a codon)
• 4 bases (A, C, G, T) give 4 x 4 x 4 = 64 possible codons: 61 amino acid codons + 3 stop codons
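The codon count can be checked by enumerating all triplets. The sketch below builds the standard genetic code from the usual compact encoding (bases in the order T, C, A, G, with `*` marking stop codons) and counts sense and stop codons. It assumes the standard code; other codes, such as the vertebrate mitochondrial code, differ.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")

CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

stop_codons = sorted(c for c, aa in CODON_TABLE.items() if aa == "*")
sense_codons = [c for c, aa in CODON_TABLE.items() if aa != "*"]

print(len(CODON_TABLE), len(sense_codons), stop_codons)
```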

Translating sequences

• 6 possible reading frames, 3 in each direction

Forward strand:
AGTCGGCTGACTGCGTTTACGAATGCGATTACTCCCTT
+1: Ser Arg Leu (AGT CGG CTG)
+2: Val Gly Stop (GTC GGC TGA)
+3: Ser Ala Asp (TCG GCT GAC)
Shifting by one more base returns to frame +1: Arg Leu Thr (CGG CTG ACT)

Reverse complement (frames -1, -2, -3):
AAGGGAGTAATCGCATTCGTAAACGCAGTCAGCCGACT
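The six reading frames of the example sequence can be reproduced with a few lines of Python. This is a minimal sketch assuming the standard genetic code. The function names are made up for illustration, stop codons are shown as `*`, and any codon containing an ambiguous base is shown as `X`.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Complement every base, then reverse the string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def six_frames(seq):
    """Translations of frames +1..+3 and -1..-3, keyed by frame label."""
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames["+%d" % (offset + 1)] = translate(seq[offset:])
        frames["-%d" % (offset + 1)] = translate(rc[offset:])
    return frames


dna = "AGTCGGCTGACTGCGTTTACGAATGCGATTACTCCCTT"
frames = six_frames(dna)
for label in ["+1", "+2", "+3", "-1", "-2", "-3"]:
    print(label, frames[label])
```

Frame +2 hits a stop codon (TGA) at its third codon, which is why a real translation tool would not report it as one continuous protein.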

Getting the final protein

• Six-frame translation
• Find longest ORF with initiation site and start codon, ending with a stop codon
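The "longest ORF" step can be sketched as a scan of each of the six frames for an ATG followed in-frame by the first stop codon. This simplified version only accepts ATG as the start and ignores ORFs that run off the end of the sequence; tools such as NCBI ORF Finder offer more options. The function names and the test sequence are invented for this example.

```python
STOPS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def orfs_in_frame(seq, offset):
    """All ATG...stop stretches (stop included) in one reading frame."""
    found = []
    start = None
    for i in range(offset, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if start is None and codon == "ATG":
            start = i                      # first ATG opens the ORF
        elif start is not None and codon in STOPS:
            found.append(seq[start:i + 3])  # first in-frame stop closes it
            start = None
    return found


def longest_orf(seq):
    """Return (orf_sequence, frame_label) for the longest ORF in six frames."""
    seq = seq.upper()
    rc = seq.translate(COMPLEMENT)[::-1]
    best = ("", None)
    for strand, strand_seq in (("+", seq), ("-", rc)):
        for offset in range(3):
            for orf in orfs_in_frame(strand_seq, offset):
                if len(orf) > len(best[0]):
                    best = (orf, strand + str(offset + 1))
    return best


example = "CCATGAAATTTGGGTAACC"  # invented test sequence
print(longest_orf(example))
```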

Gene prediction: bacteria

[Figure: bacterial gene structure, in order: promoter, start codon, CDS, stop codon]

Complex Eukaryotic systems
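• Promoter region: many transcription factor binding sites (TFBS), found with pattern matching
• Gene structure: Exon 1, Intron 1, Exon 2, Intron 2, Exon 3, with splice junctions at the exon/intron boundaries
• Alternative splicing: different combinations of exons are joined in the transcripts, e.g. Exon 1 + Exon 2 + Exon 3, or Exon 2 + Exon 3
[Figure: eukaryotic gene structure and alternatively spliced transcripts]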

Human introns and exons

Introns are much larger than exons; introns can represent up to 95% of a gene

Gene prediction in eukaryotes
• Identifying features (sometimes by position-specific scoring matrices, PSSMs):
– splice sites
– start and stop sites
• Predict exons based on these signals
• Score exons based on signals and exon characteristics (coding sequences may have compositional biases)
• Use composition and homology information
• Assemble components into predicted gene structure
• Some methods use hidden Markov models (HMMs), in which features are states
• Use EST information
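To illustrate how a PSSM scores a signal, the sketch below builds a log-odds matrix from a few aligned example sites and slides it along a sequence. The training sites are toy examples invented for this sketch, not real splice-site data. The pseudocount and the uniform background frequency of 0.25 are also assumptions. Input sequences are expected to contain only A, C, G and T.

```python
import math


def build_pssm(sites, pseudocount=1.0, background=0.25):
    """Log2-odds score for each base at each position of the aligned sites."""
    pssm = []
    for pos in range(len(sites[0])):
        column = [site[pos] for site in sites]
        scores = {}
        for base in "ACGT":
            freq = ((column.count(base) + pseudocount)
                    / (len(sites) + 4 * pseudocount))
            scores[base] = math.log2(freq / background)
        pssm.append(scores)
    return pssm


def best_hit(seq, pssm):
    """(position, score) of the highest-scoring window in seq."""
    width = len(pssm)
    best = None
    for i in range(len(seq) - width + 1):
        score = sum(pssm[j][seq[i + j]] for j in range(width))
        if best is None or score > best[1]:
            best = (i, score)
    return best


# Toy training set (hypothetical), loosely shaped like an exon/intron boundary.
toy_sites = ["AGGTAAGT", "AGGTGAGT", "TGGTAAGT", "AGGTAAGA"]
pssm = build_pssm(toy_sites)
hit = best_hit("CCCAGGTAAGTCCC", pssm)
print(hit)
```

Gene predictors combine many such signal scores with composition and homology evidence, so a single PSSM hit is only one piece of evidence.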

Using EST data: mRNA against genomic sequence
exon
CONTIG --------------------------------------------------------------------------------------CGANGGCCTATCAACAATGAAAGGTCGAAACCTG
Genomic AGCTACAAACAGATCCTTGATAATTGTCGTTGATTTTACTTTATCCTAAATTTATCTCAAAAATGTTGAAATTCAGATTCGTCAAGCGAGGGCCTATCAACAATG-AAGGTCGAAACCTG

exon * ******** * **************

CONTIG CGTTTACTCCGGATACAAGATCCACCCAGGACACGGNAAAGAGACTTGTCCGTACTGACGGAAAG-------------------------------------------------------
Genomic CGTTTACTCCGGATACAAGATCCACCCAGGACACGG-AAAGAGACTTGTCCGTACTGACGGAAAGGTGAGTTCAGTTTCTCTTTGAAAGGCGTTAGCATGCTGTTAGAGCTCGTAAGGTA

intron
************************************ ****************************

CONTIG ------------------------------------------------------------------------------------------------------------------------
Genomic TATTGTAATTTTACGAGTGTTGAAGTATTGCAAAAGTAAAGCATAATCACCTTATGTATGTGTTGGTGCTATATCTTCTAGTTTTTAGAAGTTATACCATCGTTAAGCATGCCACGTGTT

CONTIG ----------------------------------------------GTCCAAATCTTCCTCAGTGGAAAGGCACTCAAGGGAGCCAAGCTTCGCCGTAACCCACGTGACATCAGATGGAC
Genomic GAGTGCGACAAACTACCGTTTCATGATTTATTTATTCAAATTTCAGGTCCAAATCTTCCTCAGTGGAAAGGCACTCAAGGGAGCCAAGCTTCGCCGTAACCCACGTGACATCAGATGGAC
exon **************************************************************************
intron

exon
CONTIG TGTCCTCTACAGAATCAAGAACAAGAAG---------------------------------------------GGAACCCACGGACAAGAGCAAGTCACCAGAAAGAAGACCAAGAAGTC
Genomic TGTCCTCTACAGAATCAAGAACAAGAAGGTACTTGAGATCCTTAAACGCAGTTGAAAATTGGTAATTTTACAGGGAACCCACGGACAAGAGCAAGTCACCAGAAAGAAGACCAAGAAGTC
**************************** ***********************************************

CONTIG CGTCCAGGTTGTTAACCGCGCCGTCGCTGGACTTTCCCTTGATGCTATCCTTGCCAAGAGAAACCAGACCGAAGACTTCCGTCGCCAACAGCGTGAACAAGCCGCTAAGATCGCCAAGGA
Genomic CGTCCAGGTTGTTAACCGCGCCGTCGCTGGACTTTCCCTTGATGCTATCCTTGCCAAGAGAAACCAGACCGAAGACTTCCGTCGCCAACAGCGTGAACAAGCCGCTAAGATCGCCAAGGA
************************************************************************************************************************

CONTIG TGCCAACAAGGCTGTCCGTGCCGCCAAGGCTGCTNCCAACAAG-----------------------------------------------------------------------------
Genomic TGCCAACAAGGCTGTCCGTGCCGCCAAGGCTGCTGCCAACAAGGTAAACTTTCTACAATATTTATTATAAACTTTAGCATGCTGTTAGAGCTTGTAAGGTATATGTGATTTTACGAGTGT
********************************** ********

intron
CONTIG -------------------------------------------------------------------------------------------------------------------GNAAA
Genomic GTTATTTGAAGCTGTAATATCAATAAGCATGTCTCGTGTGAAGTCCGACAATTTACCATATGCATGAAATTTAAAAACAAGTTAATTTTGTCAATTCTTTATCATTGGTTTTCAGGAAAA
exon * ***

CONTIG GAAGGCCTCTCAGCCAAAGACCCAGCAAAAGACCGCCAAGAATNTNAAGACTGCTGCTCCNCGTGTCGGNGGAAANCGA TAAACGTTCTCGGNCCCGTTATTGTAATAAATTTTGTTGAC

Genomic GAAGGCCTCTCAGCCAAAGACCCAGCAAAAGACCGCCAAGAATGTGAAGACTGCTGCTCCACGTGTCGGAGGAAAGCGA TAAACGTTCTCGGTCCCGTTATTGTAATAAATTTTGTTGAC
******************************************* * ************** ******** ***** **** * *********** ***************************

CONTIG C-----------------------------------------------------------------------------------------------------------------------
Genomic CGTTAAAGTTTTAATGCAAGACATCCAACAAGAAAAGTATTCTCAAATTATTATTTTAACAGAACTATCCGAATCTGTTCATTTGAGTTTGTTTAGAATGAGGACTCTTCGAATAGCCCA
*
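In an mRNA-to-genome alignment like the one above, introns appear as long runs of gap characters in the mRNA (CONTIG) row. The sketch below reports such runs from an aligned mRNA string. The 20-column minimum is an arbitrary threshold chosen for this example to separate introns from small indels. Gap runs touching either end of the alignment are skipped, since they only show that the genomic sequence extends beyond the transcript.

```python
import re


def intron_gaps(aligned_mrna, min_length=20):
    """(start, end) alignment columns of internal gap runs >= min_length."""
    pattern = re.compile("-{%d,}" % min_length)
    gaps = []
    for match in pattern.finditer(aligned_mrna):
        if match.start() == 0 or match.end() == len(aligned_mrna):
            continue  # flanking genomic sequence, not an intron
        gaps.append((match.start(), match.end()))
    return gaps


# Toy aligned mRNA row (hypothetical): flank, exon, intron, exon with a 1-base indel.
toy_row = "-" * 30 + "ACGT" + "-" * 25 + "GG-CC"
print(intron_gaps(toy_row))
```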

Gene Prediction software

• GeneMark: gene prediction for prokaryotes, eukaryotes and viruses: http://opal.biology.gatech.edu/GeneMark/
• GENSCAN: for vertebrate, maize and Arabidopsis sequences: http://genes.mit.edu/GENSCAN.html
• Microbial Gene Prediction System: http://compbio.ornl.gov/generation/
• Glimmer: bacteria, archaea and viruses: http://www.tigr.org/software/glimmer/
• GRAIL: for eukaryotes, includes splice info, homology, etc.: http://compbio.ornl.gov/grailexp/

Other translators and promoter prediction
• NCBI ORF Finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.htm)
• Promoter 2.0 Prediction Server (http://www.cbs.dtu.dk/services/Promoter/)
• McPromoter MM:II (http://genes.mit.edu/McPromoter.html)
• BPROM: prediction of bacterial promoters, etc.

RNA sequence analysis

• Many different types of RNA, e.g. tRNA, rRNA, mRNA, etc.
• Some have activities, e.g. ribozymes
• Many new programs for identification of non-coding RNA, miRNAs, etc. and their targets
• Secondary structure of RNA is important for stability and often function
• RNA levels are important for final protein levels; they are a measure of gene expression (ESTs, microarrays)

Summary and conclusions

• Basic sequence analysis is finding features on a sequence
• This could be small features
– Restriction sites -> cloning
– Primer sites -> PCR
• Or combinations of features:
– Gene signals -> gene prediction
• Features found by nature of their “conservation” or pattern matching
