0% found this document useful (0 votes)

48 views8 pages

Lec 3 Terms and Definitions in Bioinformatics

Uploaded by

hamza.khurshid.989

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views8 pages

Lec 3 Terms and Definitions in Bioinformatics

Uploaded by

hamza.khurshid.989

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

Terms and Definitions in Bioinformatics

Bioinformatics:

An absolute definition of bioinformatics has not been agreed upon.

The first level, however, can be defined as the design and application of
methods for the collection, organization, indexing, storage, and analysis
of biological sequences (both nucleic acids [DNA and RNA] and
proteins).

The next stage of bioinformatics is the derivation of knowledge

concerning the pathways, functions, and interactions of these genes
(functional genomics) and proteins (proteomics).

Bioinformatics is also referred to as computational biology.

Annotation (explanation): Text fields of information about a

biosequence which are added to a sequence databases. Annotation (the
elucidation and description of biologically relevant features in the
sequence) consists of the description of the following items:

 Function(s) of the protein.

 Post-translational modification(s). For example carbohydrates,
phosphorylation, acetylation, GPI-anchor, etc.
 Domains and sites. For example calcium binding regions, ATP-
binding sites, zinc fingers, homeobox, kringle, etc.
 Secondary structure.
 Quaternary structure. For example homodimer, heterotrimer, etc.
 Similarities to other proteins.
 Disease(s) associated with deficiencie(s) in the protein.
 Sequence conflicts, variants, etc.
Assembly: The process of placing fragments of DNA that have been
sequenced into their correct position within the chromosome.
Base: One of five molecules which are assembled, along with a ribose
and a phosphate, to form nucleotides.
Adenine (A), guanine (G), cytosine (C), and thymine (T) are found in
DNA while RNA is made from adenine (A), guanine (G), cytosine (C),
and uracil (U).
Base pair (BP): The complementary bases on opposite strands of DNA
which are held together by hydrogen bonding. The atomic structure of
these bases pre-select the pairing of adenine with thymine and the
pairing of guanine with cytosine (or uracil in RNA).

BLAST: Basic Local Alignment Search Tool.

A program for searching biosequence databases which was
developed and is maintained by a group at the National Center for
Biotechnology Information (NCBI). There are several versions of
BLAST:
BLASTP which searches a protein database,
BLASTN to search a nucleotide database,
TBLASTN which searches for a protein sequence in a nucleotide
database by translating nucleotide sequences in all 6 reading frames,
BLASTX which can search for a nucleotide sequence against a
protein database by translating the query via all 6 reading frames,
gapped-BLAST, and psi-BLAST.
BLAST locates patches of regional similarity instead of calculating
the best overall alignment using gaps. The program then uses a scoring
matrix to rank these matches as positive, negative or zero. If the initial
match is scored highly, the search is expanded in both directions until
the ranking score falls off.

BEAUTY (BLAST Enhanced Alignment Utility): A tool developed at

Baylor College of Medicine (Worley et al. 1995) which uses BLAST to
search several custom databases and incorporates sequence family
information, location of conserved domains, and information about any
annotated sites or domains directly into the BLAST query results.

BLITZ: EBI's ultra-fast protein database search which uses the

MPsearch algorithm.

cDNA (complementary DNA): An artificial piece of DNA that is

synthesized from an mRNA (messenger RNA) template and is created
using reverse transcriptase.
The single stranded form of cDNA is frequently used as a probe in the
preparation of a physical map of a genome. cDNA is preferred for
sequence analysis because the introns found in DNA are removed in
translation from DNA ----> mRNA ----> cDNA.

CLUSTAL W: A general purpose program for multiple alignments of

DNA and protein sequences developed by Thompson, et. al. in 1994.
Complementarity: The sequence-specific or shape-specific recognition
that occurs when two or more molecules bind together.

DNA forms double stranded helixes because the complementary

orientation of the bases in each strand facilitate the formation of the
hydrogen bonds which hold the strands together.

Computational biology: See bioinformatics

Consensus sequence: a sequence of DNA having similar structure and

function in different organisms.
or
The most commonly occurring amino acid or nucleotide at each
position of an aligned series of proteins or polynucleotides.
Consensus map: The location of all consensus sequences in a series of
multiply aligned proteins or polynucleotides.
Conserved sequence: A sequence within DNA or protein that is
consistent across species or has remained unchanged within the species
over its evolutionary period.

EMBL: The European Molecular Biology Laboratory

(http://www.embl-heidelberg.de/) which is located in Heidelberg
Germany.

EBI: The European Bioinformatics Institute (http://www.ebi.ac.uk/) is a

part of the EMBL.
EMBL Nucleotide Sequence Database: Europe's primary nucleotide
sequence resource. Main sources for DNA and RNA sequences are
direct submissions from individual researchers, genome sequencing
projects and patent applications. The database is produced in
collaboration with GenBank and the DNA Database of Japan (DDBJ).
Each of the three groups collects a portion of the total sequence data
reported worldwide, and all new and updated database entries are
exchanged between the groups on a daily basis.

Entrez: A WWW-based database retrieval program created by the

National Center for Biotechnology Information (NCBI), a division of the
NIH.

EST (Expressed Sequence Tag): A partial sequence of a cDNA clone

that can be used to identify sites in a gene.

FASTA: An alignment program for protein sequences created by

Pearson and Lipman in 1988. The program is one of the many heuristic
algorithms proposed to speed up sequence comparison. The basic idea is
to add a fast prescreen step to locate the highly matching segments
between two sequences, and then extend these matching segments to
local alignments using more rigorous algorithms such as Smith-
Waterman.

GenBank: The NIH genetic sequence database.

An annotated collection of all publicly available DNA sequences
which is located at http://www.ncbi.nlm.nih.gov/. There are
approximately 2,162,000,000 bases in 3,044,000 sequence records as of
December 1998.
GenBank is part of the International Nucleotide Sequence
Database Collaboration, which is comprised of the DNA DataBank of
Japan (DDBJ), the European Molecular Biology Laboratory (EMBL),
and GenBank at NCBI. These three organizations exchange data on a
daily basis.

HUGO: The Human Genome Organization.

Multiple Alignment: A set of biosequences arranged in a table such that

each row of the table consists of one sequence padded by gaps. The
columns of the table highlight similarity (or residue conservation)
between positions of each biosequence. An Optimal Multiple Alignment
is one that has the highest degree of similarity, or the lowest cost.

NCBI: The National Center for Biotechnology Information

(http://www.ncgi.nlm.nih.gov/), a division of the NIH, is the home of the
BLAST and Entrez servers.

NCGR: The National Center for Genome Resources

(http://www.ncgr.org/).

NHGRI: The National Human Genome Research Institute of the NIH

(http://www.nhgri.nih.gov/)

PDB (Protein Data Bank): An international repository for the results of

macromolecular studies using NMR, X-ray crystallography, or
homology methods. The results of structural studies of proteins, RNA,
DNA, viruses, and polysaccharides are presently available.
The term PDB also defines a standard file format for publishing protein
and nucleotide structures for use in computer programs.

Primer: The short sequence of nucleotides (usually eight) which serve

to prime the DNA polymerase process during cell division. Primers are
produced by the enzyme primase. Primers also can be customized to
'isolate' specific sections of DNA for replication using PCR.

Scoring Function (also cost function or weight function): The

methods used to evaluate the quality of the overlap between sequences.
A variety of scoring functions are used to evaluate single replacement
operations, multiple alignments (either whole or columns), and pairwise
alignments. The score of an alignment of two sequences (a and b) is the
sum of the score of all the replacement operations that lead from a to b.

Sequencing: Determining the order of nucleotides in a gene or the order

of amino acids in a protein.

Single nucleotide polymorphism (SNP): The most common type of

DNA sequence variation.
An SNP is a change in a single base pair at a particular position along
the DNA strand. When an SNP occurs, the gene's function may change,
as seen in the development of bacterial resistance to antibiotics or of
cancer in humans.
SWISS-PROT:
An annotated protein sequence database established in 1986 and
maintained collaboratively, since 1987, by the Department of Medical
Biochemistry of the University of Geneva and the EMBL Data Library
(now the EMBL Outstation - The European Bioinformatics Institute
(EBI)).
The SWISS-PROT protein sequence data bank consists of
sequence entries. Sequence entries are composed of different line-types,
each with their own format. For standardization purposes the format of
SWISS-PROT follows as closely as possible that of the EMBL
Nucleotide Sequence Database.
The Swiss Institute of Bioinformatics (SBI):
An academic institution established on March 30, 1998 as a non-
profit foundation.
The goals of this institute are to promote the development of
software tools and databases in the field of bioinformatics, to sustain a
high-quality research program in bioinformatics, to provide, in
collaboration with academic partners, a curriculum of courses and
seminars for the formation of research scientists in the field of
bioinformatics, and to offer services to the Swiss scientific user
community through the Swiss-EMBnet node (which is currently
maintained jointly by Swiss Institute for Cancer Research and the
University of Geneva).

TIGR: The Institute for Genomic Research, located in Bethesda

Maryland (http://www.tigr.org/)

Transcription and Translation Exercise VER2 - Solution
100% (1)
Transcription and Translation Exercise VER2 - Solution
3 pages
GlOsario Bioinformatica
No ratings yet
GlOsario Bioinformatica
5 pages
Unit V DM
No ratings yet
Unit V DM
96 pages
Biological Databases Genbank
No ratings yet
Biological Databases Genbank
31 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
Lec 2 Bioinformatics Glossary
No ratings yet
Lec 2 Bioinformatics Glossary
6 pages
Bioinformatics Day3
No ratings yet
Bioinformatics Day3
4 pages
Bioinformatics (STH Sir)
No ratings yet
Bioinformatics (STH Sir)
13 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
Data Retrieval
67% (3)
Data Retrieval
17 pages
Bioinformatics Database and Applications
100% (3)
Bioinformatics Database and Applications
82 pages
Bioinformatics Overview
100% (1)
Bioinformatics Overview
18 pages
Databases Bioinformatics
No ratings yet
Databases Bioinformatics
42 pages
NT Seq Database
No ratings yet
NT Seq Database
4 pages
Bioinformatics Definition
No ratings yet
Bioinformatics Definition
11 pages
Bio Hist1586267617
No ratings yet
Bio Hist1586267617
8 pages
Biological Sequence Databases
No ratings yet
Biological Sequence Databases
33 pages
Bioinformatics
No ratings yet
Bioinformatics
8 pages
Bio in For Matics
No ratings yet
Bio in For Matics
26 pages
Basics of Bioinformatics
100% (7)
Basics of Bioinformatics
99 pages
6.1 Bioinformatics Databases and Tools - Introduction: Lecture 6: December, 28, 2001
No ratings yet
6.1 Bioinformatics Databases and Tools - Introduction: Lecture 6: December, 28, 2001
31 pages
8024 Bio Info
No ratings yet
8024 Bio Info
28 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Unit I
No ratings yet
Unit I
28 pages
Lecture2-DataMining For Bioinformatics
No ratings yet
Lecture2-DataMining For Bioinformatics
7 pages
Biological Database
No ratings yet
Biological Database
8 pages
Introduction To Bioinformatics
No ratings yet
Introduction To Bioinformatics
10 pages
Bio PPT
No ratings yet
Bio PPT
35 pages
Lista de Bases de Datos
No ratings yet
Lista de Bases de Datos
13 pages
Databases Class Work
No ratings yet
Databases Class Work
48 pages
Bio 316 - 0
No ratings yet
Bio 316 - 0
43 pages
Adv Bi Unit 1
No ratings yet
Adv Bi Unit 1
39 pages
Bioinformatics Manual
No ratings yet
Bioinformatics Manual
117 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
Bioinformatics Databases
No ratings yet
Bioinformatics Databases
7 pages
Terms 333
No ratings yet
Terms 333
18 pages
Database Dalam Bioinformatika
No ratings yet
Database Dalam Bioinformatika
34 pages
Bioinformatics Databases
No ratings yet
Bioinformatics Databases
10 pages
NCBI Handbook
No ratings yet
NCBI Handbook
391 pages
Database 2
No ratings yet
Database 2
15 pages
Bif401 Manual 2023
No ratings yet
Bif401 Manual 2023
27 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
22 pages
DDBJ Database BioInformatics Notes
No ratings yet
DDBJ Database BioInformatics Notes
8 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
Bioinformatics: Tina Elizabeth Varghese
No ratings yet
Bioinformatics: Tina Elizabeth Varghese
9 pages
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
No ratings yet
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
105 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
Introduction To Databases - NCBI, PDB and Uniprot
No ratings yet
Introduction To Databases - NCBI, PDB and Uniprot
5 pages
Database
No ratings yet
Database
40 pages
Bioinformatics Intro
No ratings yet
Bioinformatics Intro
69 pages
BCH 428 Slide
No ratings yet
BCH 428 Slide
32 pages
Biological Databases
No ratings yet
Biological Databases
41 pages
Bioinformatics Day2
No ratings yet
Bioinformatics Day2
3 pages
Bioinformatics: Intended Learning Outcomes
No ratings yet
Bioinformatics: Intended Learning Outcomes
9 pages
Biological Data and Database
No ratings yet
Biological Data and Database
13 pages
Biological Sequence Databases
No ratings yet
Biological Sequence Databases
35 pages
Bioinformatics Reviewer Full
No ratings yet
Bioinformatics Reviewer Full
16 pages
Blast
No ratings yet
Blast
6 pages
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Xenopus Development
From Everand
Xenopus Development
Malgorzata Kloc
No ratings yet
Dna Vaccines: Design of a Gene to Eradicate Hiv
From Everand
Dna Vaccines: Design of a Gene to Eradicate Hiv
Lane Scheiber
No ratings yet
Cot Curves
100% (2)
Cot Curves
4 pages
Worksheet Unit 5 - Gene Expression
No ratings yet
Worksheet Unit 5 - Gene Expression
2 pages
Gatb Biotechnology MCQ
No ratings yet
Gatb Biotechnology MCQ
5 pages
A Level - Essay - DNA Analysis and Profiling
No ratings yet
A Level - Essay - DNA Analysis and Profiling
1 page
Jumping Genes-1
100% (1)
Jumping Genes-1
56 pages
Lecture 6 - Epigenetic Regulation
No ratings yet
Lecture 6 - Epigenetic Regulation
16 pages
Botany (Hons) Questions (WBSU) - Sem4 - BOTA - 2019-22 - Library
No ratings yet
Botany (Hons) Questions (WBSU) - Sem4 - BOTA - 2019-22 - Library
17 pages
Micro Array
No ratings yet
Micro Array
13 pages
Microbiome Bioinformatics For High School Exercise - v1
No ratings yet
Microbiome Bioinformatics For High School Exercise - v1
2 pages
Kageyama Et Al 2003 J Clin Microbiol 41.full
No ratings yet
Kageyama Et Al 2003 J Clin Microbiol 41.full
10 pages
2midterm 9 - Molecular Biology TRANS
No ratings yet
2midterm 9 - Molecular Biology TRANS
3 pages
Molecular Simulation Studies of Dynamics and Interactions in Nucl
No ratings yet
Molecular Simulation Studies of Dynamics and Interactions in Nucl
339 pages
Biomolecules (Jatin Class XII Science)
No ratings yet
Biomolecules (Jatin Class XII Science)
16 pages
HSS2305 Midterm 34%
No ratings yet
HSS2305 Midterm 34%
336 pages
Module 4. Nucleic Acids
No ratings yet
Module 4. Nucleic Acids
3 pages
RNA Processing
No ratings yet
RNA Processing
258 pages
Molecular Marker Technology
No ratings yet
Molecular Marker Technology
99 pages
From Gene To Protein - Worksheet
No ratings yet
From Gene To Protein - Worksheet
4 pages
Chapter 4 Labeling Worksheets
No ratings yet
Chapter 4 Labeling Worksheets
4 pages
rRNA (Ribosomal) tRNA (Transfer) mRNA (Messenger) : Function
No ratings yet
rRNA (Ribosomal) tRNA (Transfer) mRNA (Messenger) : Function
3 pages
Get (Ebook PDF) Concepts of Genetics, Global Edition, 12th Edition Free All Chapters
100% (3)
Get (Ebook PDF) Concepts of Genetics, Global Edition, 12th Edition Free All Chapters
49 pages
Science10 - Q3 - Mod4 - Protein Synthesis - Ver3
50% (2)
Science10 - Q3 - Mod4 - Protein Synthesis - Ver3
28 pages
Biology Paper 2 TZ2 HL-4-8
No ratings yet
Biology Paper 2 TZ2 HL-4-8
5 pages
Science LESSON-03-06
No ratings yet
Science LESSON-03-06
2 pages
Document From Raveen Shavindra Perera
No ratings yet
Document From Raveen Shavindra Perera
32 pages
cDNA Libraries and Gene Cloning
No ratings yet
cDNA Libraries and Gene Cloning
8 pages
(Lesson Plan (LP) : Knowledge Skills Attitudes Values References Materials
No ratings yet
(Lesson Plan (LP) : Knowledge Skills Attitudes Values References Materials
3 pages
Actinobacteria 2
No ratings yet
Actinobacteria 2
7 pages
Assigment ON Hyperchromicity
No ratings yet
Assigment ON Hyperchromicity
13 pages

Lec 3 Terms and Definitions in Bioinformatics

Uploaded by

Lec 3 Terms and Definitions in Bioinformatics

Uploaded by

Terms and Definitions in Bioinformatics

An absolute definition of bioinformatics has not been agreed upon.

The next stage of bioinformatics is the derivation of knowledge

Bioinformatics is also referred to as computational biology.

Annotation (explanation): Text fields of information about a

 Function(s) of the protein.

BLAST: Basic Local Alignment Search Tool.

BEAUTY (BLAST Enhanced Alignment Utility): A tool developed at

BLITZ: EBI's ultra-fast protein database search which uses the

cDNA (complementary DNA): An artificial piece of DNA that is

CLUSTAL W: A general purpose program for multiple alignments of

DNA forms double stranded helixes because the complementary

Computational biology: See bioinformatics

Consensus sequence: a sequence of DNA having similar structure and

EMBL: The European Molecular Biology Laboratory

EBI: The European Bioinformatics Institute (http://www.ebi.ac.uk/) is a

Entrez: A WWW-based database retrieval program created by the

EST (Expressed Sequence Tag): A partial sequence of a cDNA clone

FASTA: An alignment program for protein sequences created by

GenBank: The NIH genetic sequence database.

HUGO: The Human Genome Organization.

Multiple Alignment: A set of biosequences arranged in a table such that

NCBI: The National Center for Biotechnology Information

NCGR: The National Center for Genome Resources

NHGRI: The National Human Genome Research Institute of the NIH

PDB (Protein Data Bank): An international repository for the results of

Primer: The short sequence of nucleotides (usually eight) which serve

Scoring Function (also cost function or weight function): The

Sequencing: Determining the order of nucleotides in a gene or the order

Single nucleotide polymorphism (SNP): The most common type of

TIGR: The Institute for Genomic Research, located in Bethesda

You might also like