Unigene

UniGene is an NCBI database that clusters EST sequences from dbEST and GenBank mRNA into gene-oriented clusters. Only ESTs with 3' ends are clustered to provide a more unique representation of transcripts. Contaminant sequences are removed before clustering the cleaned ESTs based on sequence overlaps. The final UniGene clusters represent unique genes and are annotated with gene and tissue information.

Uploaded by

Nandni Jha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

158 views7 pages

Unigene

Uploaded by

Nandni Jha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 7

UniGene

• UniGene is NCBI EST cluster database.

• Each cluster is a set of overlapping EST
sequences .
• The database is constructed based on
combined information from dbEST, GenBank
mRNA database.
• Only ESTs with 3’ ends are clustered.
• The resulting 3’EST sequences provide more
unique representation of the transcripts.
• The next step is to remove contaminant
sequences that include bacterial vectors.
• The cleaned ESTs are used to search against a
database of known unique genes with BLAST.
• The compiling step identifies sequence
overlaps and derived final sequence.
• During this step, errors in individual ESTs are
corrected, then sequences are partitioned into
clusters and assembled into contig.
• The final result is a set of nonredundant, gene
clusters known as UniGene clusters.
• Each UniGene cluster represents unique gene
and is further annotated for its gene locus
information, as well as information related to
the tissue type where gene has been
GSS
• In field of bioinformatics and computational
biology, genome survey sequences are
nucleotide sequences similar to ESTs.
• The only difference is that most of them are
genomic in origin rather than mRNA.
• Genome Survey Sequences are typically
generated and submitted to NCBI by labs
performing genome sequencing.
• They are used, amongst other things, as a
framework for the mapping and sequencing of
• Genome survey sequencing is a new way to
map the genome sequences.
• Current genome sequencing approaches are
mostly high-throughput shotgun methods,
and GSS is often used on the first step of
sequencing.
• GSSs can provide an initial global view of a
genome, which includes both coding and non-
coding DNA and contain repetitive section of
the genome.
UCSC
• The UCSC genome browser is an online
genome browser hosted by University of
California Santa Cruz.
• It is an interactive website offering access to
genome sequence data from variety of
vertebrates and invertebrates species.
• The UCSC genome browser hosts genomes
from variety of organisms: As of September
2009, this included 24 vertebrates, 14
mammals, 13 insects, 11 species of
• The UCSC genome browser is a part of
package of tools accessible from the UCSC
genome bioinformatics website.
• The UCSC genome browser provides users
with visualization of results from genome such
as SNP associated studies, linkage studies,
chromosomal positions of genes, evolutionary
relationships, alignments.
• It includes many tools such as Genome
browser, BLAT, Gene sorter, Genome graphs.
TIGR
• TIGR Gene Indices (www.tigr.org/tdb/tgi.shtml)
is an EST database that uses a different
clustering method from UniGene.
• It compiles data from dbEST, GenBank mRNA
and genomic DNA data, and TIGR’s own
sequence database.
• Sequences are only clustered if they are more
than 95% identical for over a forty nucleotide
region in pairwise comparisons.
• BLAST and FASTA are used to identify sequence
overlaps.

BIO3170 - Practice Midterm 1 PDF
No ratings yet
BIO3170 - Practice Midterm 1 PDF
5 pages
Intro and Databases
No ratings yet
Intro and Databases
30 pages
Bioinformatics Cheat Sheet
No ratings yet
Bioinformatics Cheat Sheet
4 pages
GlOsario Bioinformatica
No ratings yet
GlOsario Bioinformatica
5 pages
Lec 3 Terms and Definitions in Bioinformatics
No ratings yet
Lec 3 Terms and Definitions in Bioinformatics
8 pages
Blast Introduction
No ratings yet
Blast Introduction
42 pages
Bio Tools Booklet
No ratings yet
Bio Tools Booklet
5 pages
Lecture 2
No ratings yet
Lecture 2
28 pages
Lesson 18-DNA Technology
No ratings yet
Lesson 18-DNA Technology
7 pages
Data Retrieval
67% (3)
Data Retrieval
17 pages
Blast
100% (1)
Blast
21 pages
NCBI Genome
No ratings yet
NCBI Genome
37 pages
CUBT401 - 4 - Sequence and Genome Annotation
No ratings yet
CUBT401 - 4 - Sequence and Genome Annotation
66 pages
Using BLAST: FASTA Format
0% (1)
Using BLAST: FASTA Format
3 pages
Unit V DM
No ratings yet
Unit V DM
96 pages
Anotacion de Genomas
No ratings yet
Anotacion de Genomas
84 pages
Bif401 Manual 2023
No ratings yet
Bif401 Manual 2023
27 pages
Basics of Bioinformatics
100% (7)
Basics of Bioinformatics
99 pages
Biological Sequence Databases: A. National Center For Biotechnology Information (NCBI)
No ratings yet
Biological Sequence Databases: A. National Center For Biotechnology Information (NCBI)
41 pages
Fat Noews
No ratings yet
Fat Noews
27 pages
Genome Annotation
No ratings yet
Genome Annotation
24 pages
Plant Biotechnology
No ratings yet
Plant Biotechnology
44 pages
Mids Notes
No ratings yet
Mids Notes
11 pages
Lectura Complementaria 1
No ratings yet
Lectura Complementaria 1
3 pages
Bioinformatics: Blast and Sequence Analysis
No ratings yet
Bioinformatics: Blast and Sequence Analysis
45 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
Blast
No ratings yet
Blast
6 pages
Biology 171L - General Biology Lab I Lab 12: Introduction To Bioinformatics
No ratings yet
Biology 171L - General Biology Lab I Lab 12: Introduction To Bioinformatics
6 pages
Fat Noews
No ratings yet
Fat Noews
24 pages
Biological Sequence Databases
No ratings yet
Biological Sequence Databases
35 pages
Database Dalam Bioinformatika
No ratings yet
Database Dalam Bioinformatika
34 pages
Lecture - 02 - Comparative Sequence Analysis
No ratings yet
Lecture - 02 - Comparative Sequence Analysis
28 pages
4 Bioinformaticsdatabases
No ratings yet
4 Bioinformaticsdatabases
71 pages
Anvita Nigam 032
No ratings yet
Anvita Nigam 032
3 pages
Biological Data Searching
No ratings yet
Biological Data Searching
18 pages
Terms 333
No ratings yet
Terms 333
18 pages
Introduction
No ratings yet
Introduction
13 pages
Blast: Background: BLAST Is One of The Most Widely Used Bioinformatics Programs
100% (1)
Blast: Background: BLAST Is One of The Most Widely Used Bioinformatics Programs
4 pages
Blast Introduction
No ratings yet
Blast Introduction
42 pages
Bioinformatics: ABE 2007 Kent Koster Group 3
No ratings yet
Bioinformatics: ABE 2007 Kent Koster Group 3
43 pages
Bioinformatics: Intended Learning Outcomes
No ratings yet
Bioinformatics: Intended Learning Outcomes
9 pages
Bioinformatics 28 8 1166
No ratings yet
Bioinformatics 28 8 1166
2 pages
Model 6
No ratings yet
Model 6
132 pages
2 Blast Similarity Search 2
No ratings yet
2 Blast Similarity Search 2
2 pages
Molbio Chapter 4 Transes Midterms
No ratings yet
Molbio Chapter 4 Transes Midterms
3 pages
EST - "Expressed Sequence Tags": - Manali Mehendale
No ratings yet
EST - "Expressed Sequence Tags": - Manali Mehendale
19 pages
Ans .: DNA Annotation or Genome Annotation Is The Process of Identifying The
100% (1)
Ans .: DNA Annotation or Genome Annotation Is The Process of Identifying The
3 pages
ESTWeb Bioinformatics Services For EST
No ratings yet
ESTWeb Bioinformatics Services For EST
2 pages
Genomic Databases - Analysis Tools
No ratings yet
Genomic Databases - Analysis Tools
87 pages
Using Genbank and BLAST in The Biology Classroom: Matt Wester
No ratings yet
Using Genbank and BLAST in The Biology Classroom: Matt Wester
9 pages
Some Significant Databases Blast Blast
No ratings yet
Some Significant Databases Blast Blast
18 pages
Bioinformatics Manual Updated
No ratings yet
Bioinformatics Manual Updated
48 pages
Introduction To Different Resources of Bioinformatics and Application PDF
No ratings yet
Introduction To Different Resources of Bioinformatics and Application PDF
55 pages
UCSC Genome Browser
No ratings yet
UCSC Genome Browser
9 pages
BLAST
No ratings yet
BLAST
30 pages
TY-Exercise 4
No ratings yet
TY-Exercise 4
8 pages
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
From Everand
Neuroevolution: Fundamentals and Applications for Surpassing Human Intelligence with Neuroevolution
Fouad Sabry
No ratings yet
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Introducing Epigenetics: A Graphic Guide
From Everand
Introducing Epigenetics: A Graphic Guide
Cath Ennis
3/5 (4)
Xenopus Development
From Everand
Xenopus Development
Malgorzata Kloc
No ratings yet
Gene Control: Unlocking Genetic Secrets
From Everand
Gene Control: Unlocking Genetic Secrets
Deevakar Asan
No ratings yet
Chap. 3A Amino Acids, Peptides, and Proteins: Topics
No ratings yet
Chap. 3A Amino Acids, Peptides, and Proteins: Topics
27 pages
Nerve & Muscle Physiology: - Jeff Ericksen, MD
No ratings yet
Nerve & Muscle Physiology: - Jeff Ericksen, MD
80 pages
04-Chemical Basis of Heredity
100% (1)
04-Chemical Basis of Heredity
10 pages
Automated High-Throughput Genome Editing Platform With An AI Learning in Situ Prediction Model
No ratings yet
Automated High-Throughput Genome Editing Platform With An AI Learning in Situ Prediction Model
11 pages
Activity No. 3 The Cell As A School (Module-1)
No ratings yet
Activity No. 3 The Cell As A School (Module-1)
3 pages
COT Biomolecules
No ratings yet
COT Biomolecules
64 pages
Cell Synchronization - Cell Growth Stages
No ratings yet
Cell Synchronization - Cell Growth Stages
3 pages
57-4-3 Biology
No ratings yet
57-4-3 Biology
15 pages
Plant and Animal Cell
No ratings yet
Plant and Animal Cell
38 pages
2021mol Bio Higher Rates of Processed Pseudogene Acquisition in Humans and Three Great Apes Revealed by Long-Read Assemblies
No ratings yet
2021mol Bio Higher Rates of Processed Pseudogene Acquisition in Humans and Three Great Apes Revealed by Long-Read Assemblies
9 pages
Introduction To DNA-Serology
No ratings yet
Introduction To DNA-Serology
24 pages
Lesson 4.2 DNA Replication 1
No ratings yet
Lesson 4.2 DNA Replication 1
3 pages
Genome Size and Complexity: Presentation On
No ratings yet
Genome Size and Complexity: Presentation On
16 pages
Krebs Cycle
No ratings yet
Krebs Cycle
11 pages
Babes Et Al. - 2011 - TRPM8, A Sensor For Mild Cooling in Mammalian Sensory Nerve Endings PDF
No ratings yet
Babes Et Al. - 2011 - TRPM8, A Sensor For Mild Cooling in Mammalian Sensory Nerve Endings PDF
11 pages
BOCM 3714: T: +27 (0) 51 401 9111 - Info@ufs - Ac.za - WWW - Ufs.ac - Za
No ratings yet
BOCM 3714: T: +27 (0) 51 401 9111 - Info@ufs - Ac.za - WWW - Ufs.ac - Za
25 pages
Bachelor Degree Thesis Format
100% (3)
Bachelor Degree Thesis Format
4 pages
Fungal Extracellular Vesicles: Abbreviations
No ratings yet
Fungal Extracellular Vesicles: Abbreviations
8 pages
In Class Case Study - GWAS in Dogs
No ratings yet
In Class Case Study - GWAS in Dogs
7 pages
LAPORAN PRAKTIKUM DNA Colonning Moh Jamal 226070103141001
No ratings yet
LAPORAN PRAKTIKUM DNA Colonning Moh Jamal 226070103141001
10 pages
2nd Handout
No ratings yet
2nd Handout
6 pages
Amino Acids and Protein
No ratings yet
Amino Acids and Protein
36 pages
NNN
No ratings yet
NNN
27 pages
Dokumen - Pub Machine Learning in Bioinformatics of Protein Sequences Algorithms Databases and Resources For Modern Protein Bioinformatics 9811258570 9789811258572
No ratings yet
Dokumen - Pub Machine Learning in Bioinformatics of Protein Sequences Algorithms Databases and Resources For Modern Protein Bioinformatics 9811258570 9789811258572
378 pages
Packer Dissertation 2017
No ratings yet
Packer Dissertation 2017
122 pages
Molecular Biology Bank-Chikankata
No ratings yet
Molecular Biology Bank-Chikankata
15 pages
Passive Transport
No ratings yet
Passive Transport
11 pages
Vaccines1 03
No ratings yet
Vaccines1 03
41 pages

Unigene

Uploaded by

Unigene

Uploaded by

UniGene

• UniGene is NCBI EST cluster database.

You might also like