0% found this document useful (0 votes)

40 views10 pages

What Is Bioinformatics?

This document provides information about bioinformatics, DNA, proteins, amino acids, and databases. It defines bioinformatics as an interdisciplinary field that combines biology, computer science, and other fields to analyze biological data through computational methods and software tools. It describes DNA as the genetic material containing four bases that pair up in a double helix structure. Proteins are made of amino acids linked by peptide bonds that fold into complex 3D shapes dictated by their sequence. Amino acids are the building blocks of proteins. The document also discusses various biological databases used for storing and analyzing genetic sequence data.

Uploaded by

Perla Universal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views10 pages

What Is Bioinformatics?

Uploaded by

Perla Universal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

What is bioinformatics?

Interdisciplinary field of science that combines biology, computer science, statistics, physics,
chemistry, mathematics and engineering by developing methods and software tools for
understanding and interpreting biological data in genetics and genomics
What is DNA, meaning, groups facts?

DNA, deoxyribonucleic acid-hereditary material in humans and almost all other organisms,
biological instructions that make each species unique

• DNA was discovered in 1869-Frederich Miescher-Watson and Crick model (1950)

• Information in DNA is stored as a code made up of four chemical bases: adenine (A), guanine
(G), cytosine (C), and thymine (T)

• DNA bases pair up with each other, A with T and C with G, to form units called base pairs

• Each base is also attached to a sugar molecule and a phosphate molecule

• Base + Sugar + Phosphate = Nucleotide

• Nucleotides are arranged in two long strands to form a spiral called double helix

All DNA follow Chargaff’s rule- “The total number of purines in a DNA molecule is equal to the total
number of pyrimidines”

• Structure of double helix resembles a ladder, with the base pairs forming the ladder’s rungs and
the sugar and phosphate molecules forming the vertical sidepieces of the ladder

• DNA can replicate or make copies of itself

• Each strand of DNA in double helix can function as a pattern for duplicating sequence of bases

• During cells division, each new cell needs to

have an exact copy of DNA present in old cell

Purines are bases that have double ring and triple bound

Pyrimidines are bases that have single ring and double

bound
What are proteins and everything about them
Proteins were huge molecules (macromolecules) made up of large
numbers of amino acids (typically from 100 to 500), picked out from a
selection of 20 “flavors” with names such as alanine, glycine,
tyrosine, glutamine, and so on….
Proteins with similar sequences would fold into similar shapes
 Proteins with similar structures would be encoded by similar
sequences of amino acids
 Function of a protein turned out to be a direct consequence of its
3-D structure
-structural bioinformatics- Branch of bioinformatics which is related
to the analysis and prediction of the three-dimensional structure of
biological macromolecules such as proteins, RNA, and DNA.

Final 3-D shape of protein molecule is

uniquely dictated by its sequence
because some amino-acid types (for
instance, hydrophobic residues L, V,
I) have no desire whatsoever to be at
the surface interacting with the
surrounding water — while others (for instance,
hydrophilic residues D, S, K) are actively looking for
such an opportunity
 Protein chain is also affected by other influences,
such as the electric charges carried by some of the
amino acids, or their capacity to fit with their immediate neighbors
 First 3-D structure of a protein was determined in 1958 by Drs. Kendrew and Perutz, using the
complicated technique of X-ray crystallography

HEMOGLOBIN

Hemoglobin is the protein that makes blood red

 Made up of four protein chains (polypeptide chains), two alpha
chains (141 amino acid residues each) and two beta chains (146 amino
acid residues each), each with a ring-like heme group containing an
iron atom
 There are four binding sites for oxygen on the hemoglobin molecule,
because each chain contains one heme group
 Alpha and beta chains have different sequences of amino acids, but
fold up to form similar three-dimensional structures
 Four chains are held together by noncovalent interactions

Oxygen binds reversibly to these
iron atoms and is transported
through blood

A protein is a polymer of
amino acids linked together by
peptide bonds- Primary structure
is the sequence of amino acids
in the chain
 Backbone of the protein will fold to form a regular repeating pattern called secondary structure
 Protein folds upon itself when regions of secondary structure are interrupted by irregularly
folded loops and turns. It helps to visualize the helices as pink and the turns as white. This pattern
repeats for the entire length of whole protein chain. The irregular folding of the whole protein into a
compact globular structure is called tertiary structure
 Some proteins are actually a collection of smaller proteins called subunits. Hemoglobin is made
of four subunits. The arrangement of subunits in a protein is its quaternary structure. It helps to
visualize the subunits as different colors

AMINO ACIDS

Amino acids are linked together as a chain — and that the true identity of a protein is derived not
only from its composition, but also from the precise order of its constituent amino acids
• First amino-acid sequence of protein insulin-determined in 1951.
• Actual recipe for human insulin, from which all its biological properties derive, is the following
chain of 110 residues
insulin=MALWMRLLPLLALLALWGPDPAAAFVNQHLCGSHLVEALYLVCGERGFFYTP
KTRREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKRGIVEQCCTSICSLYQLENYCN
NH2 and COOH groups of atoms are used to form peptidic bonds between successive residues in
the sequence
 Protein molecule is made when a free NH2 group links chemically with a COOH group, forming
the peptide bond CO-NH
 As a result of this chaining process, your protein molecule is going to be left with an unused
NH2 at one end and an unused COOH at the other end known as N-terminus and C-terminus of
protein chain
 books, databases, and so on defines the sequence of a protein or of a protein fragment as the
succession of its constituent amino acids, listed in order from the N-terminus to the C-terminus.

MAVLD= Met-Ala-Val-Leu-Asp = Methionine–

Alanine-Valine–Leucine- Aspartic

Databases
Primary Database
•Original submission by experimentalists who have researched
•Content controlled by submitters
•Example: GENEBANK, SNP, GEO...

Secondary Database
•Built up from primary data which is retrieved by
primary database
•Content controlled by third party NCBI
•Example: RefSeq, RefSNP, NCBI, Structure, Protein

Shortcuts :

European Nucleotide Archive-ENA

Protein Information Resource (PIR)
European Molecular Biology Laboratory (EMBL)
UniProtKB/Swiss-Prot Protein Knowledgebase
ExPASy (Expert Protein Analysis System
Swiss Institute of Bioinformatics (SIB)
National Library of Medicine (NLM),
NCBI(National center for biotechnology information)
DNA DataBank of Japan (DDBJ),
the European Molecular Biology Laboratory (EMBL) and GenBank at NCBI
PubMed is a database developed by NCBI National Library of Medicine (NLM),
it works as a part of the NCBI Entrez retrieval system
PubMed Central (PMC)

Entrez Global Query Cross-Database Search System

Protein data bank (PDB)

Worldwide Protein Data Bank, wwPDB.

Sequence retrieval system (SRS;

Sequence homology vs sequence similarity

DNA and proteins are products of evolution

The building blocks of these biological macromolecules, nucleotide bases, and amino acids form
linear sequences that determine the primary structure of the molecules
The molecular sequences undergo random changes
Selected sequences gradually accumulate mutations and diverge over time, traces of evolution
may still remain in certain portions of the sequences to allow identification of the common ancestry
For example, active site residues of an enzyme family tend to be conserved because they are
responsible for catalytic functions.
Comparing sequences through alignment, patterns of conservation and variation can be
identified
sequence alignment can be used as basis for prediction of structure and function of
uncharacterized sequences.

When two sequences are descended from a common evolutionary origin, they are said to have a
homologous relationship or share homology. Sequence similarity, which is the percentage of
aligned residues that are similar in physiochemical properties such as size, charge, and
hydrophobicity.

sequence homology similarity

An inference or a conclusion about a common a direct result of observation from the
ancestral relationship drawn from sequence sequence alignment
similarity comparison when the two sequences
share a high enough degree of similarity

Sequence similarity can be quantified using

percentages (For example, one may say that two
sequences share 40% similarity. It is incorrect to
say that the two sequences share 40% homology.
They are either homologous or nonhomologous)

An identity of 30% or higher can be safely regarded

as having close homology. They are sometimes
referred to as being in the “safe zone” If their identity
level falls between 20% and 30%, determination of
homologous relationships in this range becomes less
certain. This is the area often regarded as the
“twilight zone,” Below 20% identity, where high
proportions of nonrelated sequences are present,
homologous relationships cannot be reliably
determined and thus fall into the “midnight zone.”

Sequence similarity and sequence identity are synonymous for nucleotide sequences. For protein
sequences, however, the two concepts are very different In a protein sequence alignment,
sequence identity refers to the percentage of matches of the same amino acid residues between
two aligned sequences. Similarity refers to the percentage of aligned residues that have similar
physicochemical characteristics and can be more readily substituted for each other

Calculation of sequence similarity/identity

S is the percentage sequence similarity Ls is the number of aligned residues with similar
characteristic L a and L b are the total lengths of each individual sequence

METHODS Global Alignment and Local Local Alignment

Alignment Global Alignment
Two sequences to be aligned are assumed to be Does not assume that the two sequences in
generally similar over their entire length question have similarity over the entire length
Alignment is carried out from beginning to end of It only finds local regions with the highest level of
both sequences to find the best possible similarity between the two sequences and aligns
alignment across the entire length between the these regions without regard for the alignment of
two sequences the rest of the sequence regions
Fails to recognize highly similar local regions more appropriate for aligning divergent biological
between the two sequences. sequences containing only modules that are
similar, which are referred to as domains or
motifs.

Задача

ALGORITAMS

Dynamic Programming for Global Alignment

Needleman–Wunsch algorithm
In this algorithm, an optimal alignment is obtained over the entire lengths of the two sequences.
One of the few web servers dedicated to global pairwise alignment is GAP. GAP
(http://bioinformatics.iastate.edu/aat/align/align.html) is a web-based pairwise global alignment
program.
It aligns two sequences without penalizing terminal gaps so similar sequences of unequal
lengths can be aligned.
To be able to insert long gaps in the alignment, such gaps are treated with a constant penalty.
This feature is useful in aligning cDNA to exons in genomic DNA containing the same gene.

The first application of dynamic programming in local alignment is the Smith–Waterman algorithm
•In this algorithm, positive scores are assigned for matching residues and zeros for mismatches.
• No negative scores are used.
•This approach may be suitable for aligning divergent sequences or sequences with multiple
domains that may be of different origins
Most commonly used pairwise alignment web servers apply the local alignment strategy, which
include SIM, SSEARCH, and LALIGN. SIM (http://bioinformatics.iastate.edu/aat/align/align.html) is
a web-based program for pairwise alignment using the Smith–Waterman algorithm that finds the
best scored non overlapping local alignments between two sequences.
•It is able to handle tens of kilobases of genomic sequence.
•The user has the option to set a scoring matrix and gap penalty scores.
•A specified number of best scored alignments are produced.

SSEARCH (http://pir.georgetown.edu/pirwww/search/pairwise.html) is a simpleweb-based

programs that uses the Smith–Waterman algorithm for pairwise alignment of sequences.
•Only one best scored alignment is given.
•There is no option for scoring matrices or gap penalty scores. LALIGN
(www.ch.embnet.org/software/LALIGN form.html) is a web-based program that uses a variant of
the Smith–Waterman algorithm to align two sequences.
•Unlike SSEARCH, which returns the single best scored alignment, LALIGN gives a specified
number of best scored alignments.
•The user has the option to set the scoring matrix and gap penalty scores.
•The same web interface also provides an option for global alignment performed by the ALIGN
program.

Major types of RNA

•mRNA messenger RNA (mRNA) RNA molecule that specifies the amino acid sequence of a
protein.

•rRNA ribosomal RNA (rRNA) Any one of a number of specific RNA molecules that form part of
the structure of a ribosome and participate in the synthesis of proteins

•tRNA transfer RNA (tRNA) Set of small RNA molecules used in protein synthesis as an interface
(adaptor) between messenger RNA and amino acids.
DNA encodes hereditary information (genotype) -> decoded into RNA -> protein
(phenotype)

TRANSLATION
Conversion of RNA into amino acid sequence that makes a protein
•The mRNA leaves the nucleus and enters the cytoplasm
• Ribosomes attach to mRNA
• tRNA (carrying anti-codon) picks up the correct amino acids and carries them to the mRNA
strand forming the protein
Ex:
–tRNA carries GAU (anti-codon)& looks for CUA on mRNA

Transcription
•Transcription- process that makes mRNA from DNA

1.DNA unzips into 2 separate strands A. DNA Helicase is the enzyme that breaks H-bond 2. Free
floating RNA NITROGEN BASES in the nucleus pair up w/unzipped DNA NITROGEN BASES: A.
Cytosine(C) pairs with Guanine(G) * (G) with (C) B. Uracil(U) pairs with Adenine(A) * (A) with (U)
C. Thymine (T) pairs with Adenine (A) ***remember (T) is only with DNA

3. After all the pairing is done:

•a single strand of RNA has been produced. 4. Genetic code from DNA is transferred to mRNA 5.
The code obtained from DNA lets the mRNA know which amino acids to pick up:

•code is a set of 3 nitrogen bases = Codon

Overall process

TRANSCRIPTION VS TRANSLATION

RNA splicing
In molecular biology and genetics, splicing is a modification of the nascent pre-messenger
RNA(pre-mRNA) transcript in which introns are removed and exons are joined. For nuclear-
encoded genes,splicing takes place within the nucleus after or concurrently with transcription.
•carried out by spliceosomes
•Spliceosomes
–complex of proteins and several small nuclear ribonucleoproteins (snRNPs)
–Recognize splice sites (specific RNA sequences)
–cleave out introns and splice together exons (coding
region)
In most eukaryotic genes, coding regions (exons) are
interrupted by noncoding regions (introns). During
transcription, the entire gene is copied into a pre-mRNA,
which includes exons and introns. During the process of
RNA splicing, introns are removed and exons joined to
form a contiguous coding sequence.

Function of RNA

Storage/transfer of genetic information

• Genomes
• many viruses have RNA genomes single-stranded
(ssRNA) e.g., retroviruses (HIV) double-stranded
(dsRNA)
• Transfer of genetic information
• mRNA = "coding RNA" - encodes proteins
A non-coding RNA (ncRNA) is an RNAmolecule that is not translated into a protein. Less-
frequently used synonyms are non-protein-coding RNA (npcRNA), non-messenger RNA
(nmRNA) and functional RNA (fRNA). The DNA sequence from which a functional non-coding
RNA is transcribed is often called an RNA gene

Structural
• e.g., rRNA, which is a major structural component of ribosomes BUT - its role is not just
structural, also: Catalytic RNA in the ribosome has peptidyltransferase activity
• Enzymatic activity responsible for peptide bond formation between amino acids in growing
peptide chain
• Also, many small RNAs are enzymes "ribozymes"

Regulatory Recently discovered important new roles for RNAs In normal cells:
• in "defense" - esp. in plants
• in normal development e.g., siRNAs, miRNA
As tools:
• for gene therapy or to modify gene expression
• RNAi
• RNA aptamers

Cell-The Unit of Life - Shobhit Nirwan
80% (40)
Cell-The Unit of Life - Shobhit Nirwan
23 pages
Bioinformatics 2
No ratings yet
Bioinformatics 2
50 pages
Bif 401 100% Solved Final Term Paper by Sulman Ali
No ratings yet
Bif 401 100% Solved Final Term Paper by Sulman Ali
5 pages
Bioinfo Training Material
No ratings yet
Bioinfo Training Material
42 pages
Protein Folds and Structure
No ratings yet
Protein Folds and Structure
19 pages
Francisco J. Ruiz-Ruano CV
No ratings yet
Francisco J. Ruiz-Ruano CV
18 pages
Chapter 01
No ratings yet
Chapter 01
20 pages
Lecture 5 - Proteins and Nucleic Acids PDF
No ratings yet
Lecture 5 - Proteins and Nucleic Acids PDF
49 pages
BIF501-Bioinformatics-II Solved Questions FINAL TERM (PAST PAPERS)
No ratings yet
BIF501-Bioinformatics-II Solved Questions FINAL TERM (PAST PAPERS)
23 pages
Lecture2 - Background
No ratings yet
Lecture2 - Background
43 pages
Protein English Aug 2006
No ratings yet
Protein English Aug 2006
18 pages
IGCSE Biology Chapter 4: Biological Molecules
67% (3)
IGCSE Biology Chapter 4: Biological Molecules
7 pages
BIF401 Midterm Past Papers Subjective
No ratings yet
BIF401 Midterm Past Papers Subjective
10 pages
GE Chem Nucleic Acids and Proteins Reviewer
No ratings yet
GE Chem Nucleic Acids and Proteins Reviewer
5 pages
Amino Acids & Proteins
No ratings yet
Amino Acids & Proteins
42 pages
Alignments Lecture
No ratings yet
Alignments Lecture
15 pages
Biochemistry PDF
No ratings yet
Biochemistry PDF
8 pages
Algorithms in Bioinformatics: A Practical Introduction: Introduction To Molecular Biology
No ratings yet
Algorithms in Bioinformatics: A Practical Introduction: Introduction To Molecular Biology
78 pages
Protein Structure
No ratings yet
Protein Structure
52 pages
SQH7001 Bioinformatics Task - Velda Rifka Almira
No ratings yet
SQH7001 Bioinformatics Task - Velda Rifka Almira
9 pages
Lecture 01
No ratings yet
Lecture 01
20 pages
Biological Molecule
No ratings yet
Biological Molecule
7 pages
2.protein Primary Structure
No ratings yet
2.protein Primary Structure
83 pages
L2 Proteomics, Genomics and Bioinformatics
No ratings yet
L2 Proteomics, Genomics and Bioinformatics
30 pages
CE6068 Lecture 1
No ratings yet
CE6068 Lecture 1
89 pages
Protein Mcq's (HUZAIFA) : B) Amino Acids
100% (4)
Protein Mcq's (HUZAIFA) : B) Amino Acids
4 pages
BIF401 Midterm Short Notes
No ratings yet
BIF401 Midterm Short Notes
45 pages
Answer For Hots Question
No ratings yet
Answer For Hots Question
24 pages
Protein Chemistry
No ratings yet
Protein Chemistry
92 pages
Lec 01
No ratings yet
Lec 01
93 pages
Protein Folding
No ratings yet
Protein Folding
21 pages
Peptide Bonds
No ratings yet
Peptide Bonds
30 pages
Molecular Biology
No ratings yet
Molecular Biology
34 pages
Into To Bioinfo
No ratings yet
Into To Bioinfo
53 pages
Ab Initio
No ratings yet
Ab Initio
9 pages
Bioinformatics: Farhan Haq, PHD Department of Biosciences Cui
No ratings yet
Bioinformatics: Farhan Haq, PHD Department of Biosciences Cui
24 pages
Samian AQA Biology GCSE Combined B1 Practice Answers
No ratings yet
Samian AQA Biology GCSE Combined B1 Practice Answers
2 pages
Lecture Bioinfo Databases
No ratings yet
Lecture Bioinfo Databases
27 pages
Computational Biology: Spring Semester 2012
No ratings yet
Computational Biology: Spring Semester 2012
46 pages
Ap Bio 2.0 PDF
No ratings yet
Ap Bio 2.0 PDF
63 pages
Bio 103 L 5 DNA RNA & Protein F
No ratings yet
Bio 103 L 5 DNA RNA & Protein F
34 pages
AP BIO 2.0
No ratings yet
AP BIO 2.0
28 pages
Bio 103 L 5 Dna Rna - Protein F
No ratings yet
Bio 103 L 5 Dna Rna - Protein F
34 pages
Structural Bioinformatics
No ratings yet
Structural Bioinformatics
75 pages
4 - Biological Molecules
No ratings yet
4 - Biological Molecules
23 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
66 pages
21BTB102T 2024 09 25 ClassExtra N1
No ratings yet
21BTB102T 2024 09 25 ClassExtra N1
48 pages
Bioinf Lecture1-2
No ratings yet
Bioinf Lecture1-2
44 pages
Gene Pridiction and Orf
No ratings yet
Gene Pridiction and Orf
34 pages
Bioinfo - S1 2021 - L9 - Protein Structure - 1 Slide
No ratings yet
Bioinfo - S1 2021 - L9 - Protein Structure - 1 Slide
87 pages
Bio Info Merged
No ratings yet
Bio Info Merged
154 pages
BIF401 Current Papers Solution Part 1
No ratings yet
BIF401 Current Papers Solution Part 1
6 pages
Bio Part 1 - Watermark
No ratings yet
Bio Part 1 - Watermark
5 pages
Biology Sem Exam Suggestion
No ratings yet
Biology Sem Exam Suggestion
16 pages
Protein Structure and Function
No ratings yet
Protein Structure and Function
52 pages
Biochemistry Notes
No ratings yet
Biochemistry Notes
53 pages
Protein Modeling: Protein Structure Prediction Other Topics
No ratings yet
Protein Modeling: Protein Structure Prediction Other Topics
76 pages
Chapter 13 - Genes and Life - v1
No ratings yet
Chapter 13 - Genes and Life - v1
36 pages
Biological Molecule3
No ratings yet
Biological Molecule3
5 pages
L2 Centraldogma
No ratings yet
L2 Centraldogma
41 pages
Lecture 1 Biochemistry
No ratings yet
Lecture 1 Biochemistry
58 pages
Sankalp Sanjeevni Neet: Biology
No ratings yet
Sankalp Sanjeevni Neet: Biology
13 pages
Testbank For Fundamentals of Biochemistry Life at The Molecular Level 5th Edition Voet
No ratings yet
Testbank For Fundamentals of Biochemistry Life at The Molecular Level 5th Edition Voet
18 pages
Fifth Lecture Protiens 4
No ratings yet
Fifth Lecture Protiens 4
32 pages
Cell and Molecular Biology: Gerald Karp
No ratings yet
Cell and Molecular Biology: Gerald Karp
53 pages
Lecture 12 (Structural Bioinformatics)
No ratings yet
Lecture 12 (Structural Bioinformatics)
30 pages
Lecture 01 - Agricultural Biotechnology - History & Scope
No ratings yet
Lecture 01 - Agricultural Biotechnology - History & Scope
16 pages
All Chapters Grade 12 Life Sciences Notes 1 045329
No ratings yet
All Chapters Grade 12 Life Sciences Notes 1 045329
60 pages
Transgenic Plants
No ratings yet
Transgenic Plants
10 pages
Memoria2019 2020
No ratings yet
Memoria2019 2020
269 pages
Answers: H2 Biology 9744/02
No ratings yet
Answers: H2 Biology 9744/02
22 pages
On Job Training Proposal TOTO
No ratings yet
On Job Training Proposal TOTO
6 pages
Rista Susanti - Coevolution Bursera and Blepharida
No ratings yet
Rista Susanti - Coevolution Bursera and Blepharida
9 pages
MRK - Spring 2020 - BT502 - 2 - BC170203159
No ratings yet
MRK - Spring 2020 - BT502 - 2 - BC170203159
11 pages
1.10 Competitive Binding Assays
No ratings yet
1.10 Competitive Binding Assays
4 pages
Phylogeny
No ratings yet
Phylogeny
43 pages
Cracking The Code of Life - Answers
No ratings yet
Cracking The Code of Life - Answers
10 pages
Chapter8 Primerdesigning
No ratings yet
Chapter8 Primerdesigning
8 pages
National Institute of Technology, Rourkela - 769 008 (ODISHA)
No ratings yet
National Institute of Technology, Rourkela - 769 008 (ODISHA)
2 pages
Masters's Degree Program in Chemistry - 120 ECTS
No ratings yet
Masters's Degree Program in Chemistry - 120 ECTS
5 pages
N9-20132+Rev.+A Cytek+Muse+Micro+Product+Brochure
No ratings yet
N9-20132+Rev.+A Cytek+Muse+Micro+Product+Brochure
8 pages
Angew Chem Int Ed - 2024 - Zandieh - Selection of Plastic Binding DNA Aptamers For Microplastics Detection
No ratings yet
Angew Chem Int Ed - 2024 - Zandieh - Selection of Plastic Binding DNA Aptamers For Microplastics Detection
8 pages
MCQ On Animal Biotechnology - MCQ Biology - Learning Biology Through MCQs
100% (1)
MCQ On Animal Biotechnology - MCQ Biology - Learning Biology Through MCQs
5 pages
No Uptake Mediated by Zosmanrt2 Is A Na - Dependent Mechanism
No ratings yet
No Uptake Mediated by Zosmanrt2 Is A Na - Dependent Mechanism
1 page
Certificate
No ratings yet
Certificate
1 page
Student Exploration: Cell Division: Vocabulary: Cell Division, Centriole, Centromere, Chromatid, Chromatin, Chromosome
No ratings yet
Student Exploration: Cell Division: Vocabulary: Cell Division, Centriole, Centromere, Chromatid, Chromatin, Chromosome
4 pages
HSC Botany Board Paper 2013
No ratings yet
HSC Botany Board Paper 2013
2 pages
Cloning Vector B.pharm
No ratings yet
Cloning Vector B.pharm
9 pages
GENBIO 1 Q2 Periodic Test
No ratings yet
GENBIO 1 Q2 Periodic Test
5 pages
Thoughts on the Origin of Life
From Everand
Thoughts on the Origin of Life
RB Raikow
No ratings yet

What Is Bioinformatics?

Uploaded by

What Is Bioinformatics?

Uploaded by

What is bioinformatics?

• DNA was discovered in 1869-Frederich Miescher-Watson and Crick model (1950)

• Each base is also attached to a sugar molecule and a phosphate molecule

• Base + Sugar + Phosphate = Nucleotide

• DNA can replicate or make copies of itself

• During cells division, each new cell needs to

Pyrimidines are bases that have single ring and double

Final 3-D shape of protein molecule is

Hemoglobin is the protein that makes blood red

MAVLD= Met-Ala-Val-Leu-Asp = Methionine–

European Nucleotide Archive-ENA

Entrez Global Query Cross-Database Search System

Protein data bank (PDB)

Worldwide Protein Data Bank, wwPDB.

Sequence retrieval system (SRS;

Sequence homology vs sequence similarity

DNA and proteins are products of evolution

sequence homology similarity

Sequence similarity can be quantified using

An identity of 30% or higher can be safely regarded

Calculation of sequence similarity/identity

METHODS Global Alignment and Local Local Alignment

Dynamic Programming for Global Alignment

SSEARCH (http://pir.georgetown.edu/pirwww/search/pairwise.html) is a simpleweb-based

Major types of RNA

3. After all the pairing is done:

•code is a set of 3 nitrogen bases = Codon

Storage/transfer of genetic information

You might also like