0% found this document useful (0 votes)

10 views20 pages

The C-Value Paradox

total genome

Uploaded by

sabinp2023

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views20 pages

The C-Value Paradox

total genome

Uploaded by

sabinp2023

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

The C-value paradox

• The C-value is the total number of DNA nucleotide residues in the genome (per
haploid set of chromosomes).

• When you compare this to the complexity of the organism you find a massive
disparity.

• Clearly the amount of DNA is not proportional to that required to produce all
the proteins made by the organism.

• The E. Coli genome has 4.6 million base pairs and codes for about 3,000
different proteins (proteins of ~40,000 and 500 bp for promoters)

• Using the same assumptions the human genome should code for 1 million
proteins (3 billion base pairs (3*10^9), protein ~30,000 and promoters of 1500
bp)
Re annealing experiments
• Only about 3% of the DNA in the genome
actually codes for proteins. What is the
rest of it doing?

• Some clues come from re-annealing

experiments.

• The time it takes for DNA to re-anneal

depends on the complexity of the sequence
Cot plots
Single
stranded
DNA

Cot½

A260

Double
stranded
DNA

Co*Time (M*s)
• You need to account for the starting concentration (Co).

• Obviously if you started with more of a sequence it would

anneal quicker….more matches to find each other.

• By using the Cot value you can plot (on the same graph)
different annealing experiments with different starting
concentrations (10 – 2000 ug/mL) from the same source and
they will all lie of the graph.

• The rate of re-association, hence the time taken to renature is

dependent on the complexity of the DNA sequence.

• The complexity is defined as the number of bases in each

unique sequence e.g. poly (U)+poly (A) has a complexity of 1,
the repeating sequence AGTGCn has a complexity of 5.
• The Cot1/2 for a given DNA depends on the complexity.

• Eukaryotic genomic DNA can be divided up into 4 classes:

highly repetitive (hundreds to millions of copies),
moderately repetitive (10s to hundreds of copies), slightly
repetitive (1 – 10 copies) and single copy sequences.

• The last 2 are often combined

Highly
repetitive Moderately
repetitive

unique
CpG island
Any stretch of DNA greater than 500bp with a CG content of greater than
50%.
So, it a region of DNA in which the frequency of the CG sequence is higher
than in other regions.
"p" indicates that "C" and "G" are connected by a phosphodiester bond.
CpG island –from several hundred to several thousand base pairs long.
In humans there are about 45,000 CpG islands, mostly found at the 5'
ends of genes.

8
CpG island properties

CpG islands are often located around the promoters of genes frequently expressed in a
cell.

A promoter - specific region just upstream from a gene that acts as a binding site for
transcription factors and RNA polymerase during the initiation of transcription.

Thus, the knowledge of CpG island is important for the computational prediction of
promoters for genes. Recently it was shown that the prediction, which no associated with
CpG-island may not even be possible.

9
According to a recent study, human chromosomes 21 and 22
contain about 1100 CpG-islands and about 750 genes.
(Comprehensive analysis of CpG islands in human chromosomes 21 and 22, Proc. Natl. Acad. Sci. US, March 19,
2002)

10
CpG islands are not really a repeated sequence, but a special type of DNA sequence with a particular
function

This is a typical gene with a CpG island.

The island includes the first exon.

A function for islands: Molecular studies showed that the chromatin in these regions has an "open" configuration,
with no nucleosomes or histone H1.

This would make the DNA accessible to transcription factors, etc. and hence able to be transcribed.

11
What CpG islands are?

• CpG dinucleotides are rare in mammal DNA

• DNA Methylation only occurs at CpG sites

• Methylated cytosines may be converted to thymine by deamination over evolution

• CpG  TpG

• CpG islands are short stretches of DNA with higher frequency of the CG sequence

• Usually they are not methylated

• Definition from Gardiner-Garden & Frommer
• At least 200 bases long
• G+C content: > 50%
• observed CpG/expected CpG ratio: >= 0.6

• Definition from Takai & Jones

• Longer than 500 bp
• G+C content: > 55%
• observed CpG/expected CpG ratio: >= 0.65
• With this definition, these CpGi’s are more likely to be associated with the 5’
regions of genes and exclude most Alu’s

• There are about 29,000 such regions in the human genome

CpG islands and Genes

• CpG islands located in the promoter regions of genes can play important roles in gene silencing
• Housekeeping genes
• Almost all housekeeping genes are associated with at least one CpG island
• CpG islands are starting 5’ to the transcription start site and covering one or more exons and introns
• Tissue specific genes
• About 40 % tissue specific genes are associated with islands
• The position of these islands is not strongly toward the transcription start site as in the housekeeping genes
• Not all CpG islands are associated with genes
• Ioshikhes & Zhang determined the features to discriminate the promoter-associated and non-associated CpG
islands
• There are methylation-prone and methylation-resistant CpG islands
• Feltus et. al. found patterns to discriminate methylation-prone from methylation-resistant CpG islands
5’ end

CpGi
Gene
Promoter CpG islands
Gene

Gene CpG islands in body

Gene 3’ end CpG islands

Highly repetitive DNA
• Short sequences arranged in tandem repeats,
sometimes thousands of times.
• Short Tandem Repeats (STRs) or satellite DNA
• 16 bp sequence of "gatagatagatagata
• gata is repeated
• Microsatellites 1 – 13 nucleotides
• Minisatellites 14 – 500 nucleotides
• Often found clustered around the centromere or
the telomere.
Moderately repetitive DNA
• Segments of 100 to several thousand base pairs
repeated

• Repeated groups of genes whose products are

needed by cells in large quantities e.g. histones,
ribosomal and transfer RNA (although these are
sometimes classified in the highly repetitive group)

• Retrotransposons, DNA which has been transcribed in

reverse back from RNA
Retrotransposons
• Around 40% of the human genome

• LINES (long interspersed nuclear elements) 6 – 8 kb

segments that encode the proteins that enable the
transposition (e.g human L1 was from its retrotransposition
into the factor VIII gene causing hemophilia)

• SINES (short interspersed nuclear elements) 100 – 400 bp

sections containing remnants of tRNA transcription
machinery.

• LTR retrotransposons or long terminal repeats

Gene Families
• Most genes in the genome are only represented
once.

• Some have a few copies on the genome.

• One example is the globin family. This set of genes

contains a number of closely related sequences
which vary by only a few changes in the code.

• Sometimes found clustered together on the one

chromosome (but not always!)
Single copy genes
• Most of the genes of the organism are single copy
genes

• But they only make up a small proportion of the

total genome.

• They are the most complex group and hence take

the longest to re-anneal.

Genomics and Proteomics
100% (1)
Genomics and Proteomics
317 pages
Molecular Basis of Inheritance
No ratings yet
Molecular Basis of Inheritance
52 pages
Feralis-Booster Expanded 2022 DAT Notes
No ratings yet
Feralis-Booster Expanded 2022 DAT Notes
45 pages
DNA Structure and Chemistry
100% (4)
DNA Structure and Chemistry
37 pages
Genome Organization 1
100% (1)
Genome Organization 1
42 pages
Unique and Repetitive DNA
No ratings yet
Unique and Repetitive DNA
24 pages
Fine Structure of A Gene
No ratings yet
Fine Structure of A Gene
58 pages
Glimpses of Human Genome Project
75% (4)
Glimpses of Human Genome Project
94 pages
Structure and Organization of Human Genome
No ratings yet
Structure and Organization of Human Genome
18 pages
Genes, Chromosomes and The Content of The Human Genome
No ratings yet
Genes, Chromosomes and The Content of The Human Genome
39 pages
The Flow of Genetic Information: DNA RNA Protein
No ratings yet
The Flow of Genetic Information: DNA RNA Protein
134 pages
Cot Curve
100% (1)
Cot Curve
16 pages
G-6 Report
No ratings yet
G-6 Report
78 pages
Introduction To Humangenetics and Genomics
No ratings yet
Introduction To Humangenetics and Genomics
84 pages
Genetics Lecture 2 - DNA and Chromosome Structure PDF
No ratings yet
Genetics Lecture 2 - DNA and Chromosome Structure PDF
58 pages
IB DP Bio - D1.1 DNA Replication
No ratings yet
IB DP Bio - D1.1 DNA Replication
40 pages
Stuvia 1321801 Summary Bhcs 2003 Genetics
No ratings yet
Stuvia 1321801 Summary Bhcs 2003 Genetics
58 pages
1 Dr. Ergoren - Genes and Genomes Evolution 2022
No ratings yet
1 Dr. Ergoren - Genes and Genomes Evolution 2022
67 pages
Lecture 8 Chapter 11
No ratings yet
Lecture 8 Chapter 11
61 pages
Human Genome
No ratings yet
Human Genome
47 pages
Genomics 1
No ratings yet
Genomics 1
47 pages
Chapter 2
No ratings yet
Chapter 2
65 pages
Human Mol Gen
No ratings yet
Human Mol Gen
42 pages
Human Genome Project Class 12
100% (2)
Human Genome Project Class 12
7 pages
Omics L2
No ratings yet
Omics L2
47 pages
Cytogenetics 1 200L MBBS-4
No ratings yet
Cytogenetics 1 200L MBBS-4
42 pages
Lecture 5. Genome Organization
No ratings yet
Lecture 5. Genome Organization
38 pages
Lec 15-16
No ratings yet
Lec 15-16
33 pages
Genome
No ratings yet
Genome
31 pages
Repetitive DNA in Eukaryotic Genomes
No ratings yet
Repetitive DNA in Eukaryotic Genomes
6 pages
Human Molecular Genetics
No ratings yet
Human Molecular Genetics
46 pages
Human Molecular Genetics: Fourth Edition
No ratings yet
Human Molecular Genetics: Fourth Edition
67 pages
Genome Composition and Organization in Eukaryotes
No ratings yet
Genome Composition and Organization in Eukaryotes
27 pages
1-Genome Organisation-22-07-2024
No ratings yet
1-Genome Organisation-22-07-2024
29 pages
Genetic Resources and Food Traceability: Course
No ratings yet
Genetic Resources and Food Traceability: Course
73 pages
Gene Evolution and Supercoiling
No ratings yet
Gene Evolution and Supercoiling
26 pages
Anatomy of A Gene
No ratings yet
Anatomy of A Gene
33 pages
CYTO Transes
No ratings yet
CYTO Transes
10 pages
The Structure and Organization of Genomes
No ratings yet
The Structure and Organization of Genomes
10 pages
Human Genome Sequence
No ratings yet
Human Genome Sequence
22 pages
Genome Organization & Protein Synthesis and Processing in Plants
No ratings yet
Genome Organization & Protein Synthesis and Processing in Plants
46 pages
Molecular Biology Basics
No ratings yet
Molecular Biology Basics
52 pages
Genome Organisation
No ratings yet
Genome Organisation
9 pages
Lecture 1.1.3 Genome Organization
No ratings yet
Lecture 1.1.3 Genome Organization
13 pages
Unit Ii
No ratings yet
Unit Ii
51 pages
Coding and Non Coding
No ratings yet
Coding and Non Coding
11 pages
Computational Biology 12BBI152: Human Genome Project Ultra-Conservation in Human Genome
No ratings yet
Computational Biology 12BBI152: Human Genome Project Ultra-Conservation in Human Genome
48 pages
The Human Genome - Final
No ratings yet
The Human Genome - Final
27 pages
L20 Mutation 15
No ratings yet
L20 Mutation 15
26 pages
BIOL 3301 - Genetics Ch10D - DNA Organization 08 ST
No ratings yet
BIOL 3301 - Genetics Ch10D - DNA Organization 08 ST
27 pages
Cytogenetics and Genome Organization
No ratings yet
Cytogenetics and Genome Organization
516 pages
Zcort - 103 1 To 57
No ratings yet
Zcort - 103 1 To 57
11 pages
138 Repetitive DNA 1
No ratings yet
138 Repetitive DNA 1
5 pages
Structure of Genomes 1
No ratings yet
Structure of Genomes 1
5 pages
CPG Site
No ratings yet
CPG Site
7 pages
Noncoding DNA
No ratings yet
Noncoding DNA
5 pages
Genomics 3
No ratings yet
Genomics 3
8 pages
Molecular Basis of Inheritance - 1
No ratings yet
Molecular Basis of Inheritance - 1
103 pages
POG Lecture 12
No ratings yet
POG Lecture 12
7 pages
TRANSPOSONS
No ratings yet
TRANSPOSONS
19 pages
Human Molecular Genetics, Fourth Edition. ISBN 0815341490, 978-0815341499
100% (38)
Human Molecular Genetics, Fourth Edition. ISBN 0815341490, 978-0815341499
23 pages
Bioinformatics Notes: 1. Horizontal Gene Transfer
No ratings yet
Bioinformatics Notes: 1. Horizontal Gene Transfer
4 pages
Lewins Genes Xi
No ratings yet
Lewins Genes Xi
968 pages
Quarter 4 Lesson 4 Dna Profiling
No ratings yet
Quarter 4 Lesson 4 Dna Profiling
8 pages
Human Genemoe Project
No ratings yet
Human Genemoe Project
14 pages
ZOO 202 Notes by DR OE Ogundele - 083638
No ratings yet
ZOO 202 Notes by DR OE Ogundele - 083638
17 pages
Plant Genetics and Genomics Crops and Models
No ratings yet
Plant Genetics and Genomics Crops and Models
719 pages
Module 1 Genomics
No ratings yet
Module 1 Genomics
19 pages
(Original PDF) Molecular and Genome Evolution Download
No ratings yet
(Original PDF) Molecular and Genome Evolution Download
39 pages
The Human Genome Project Class 12th
No ratings yet
The Human Genome Project Class 12th
10 pages
Science - Abn6919 SM
No ratings yet
Science - Abn6919 SM
154 pages
Decoding The Human Genome - Machine Learning Techniques For DNA Sequencing Analysis
No ratings yet
Decoding The Human Genome - Machine Learning Techniques For DNA Sequencing Analysis
10 pages
12 ZOOLOGY ANSWER KEY 2025 (11-03-2025) by E.VINOTH KUMAR ZOOLOGY HOD
No ratings yet
12 ZOOLOGY ANSWER KEY 2025 (11-03-2025) by E.VINOTH KUMAR ZOOLOGY HOD
13 pages
Dynamic Alternative DNA Structures in Biology and Disease
No ratings yet
Dynamic Alternative DNA Structures in Biology and Disease
24 pages
Wei Et Al., 2024
No ratings yet
Wei Et Al., 2024
17 pages
Plant Genome Project - Shanza Fiaz
No ratings yet
Plant Genome Project - Shanza Fiaz
19 pages
FORENSIC SCIENCE - 2nd Semester
No ratings yet
FORENSIC SCIENCE - 2nd Semester
16 pages
Partial Sequencing Reveals The Transposable Element Composition of Coffea Genomes and Provides Evidence For Distinct Evolutionary Stories
No ratings yet
Partial Sequencing Reveals The Transposable Element Composition of Coffea Genomes and Provides Evidence For Distinct Evolutionary Stories
13 pages
Freitas Et Al 2019
No ratings yet
Freitas Et Al 2019
19 pages
Expressed Sequence Tags (Ests)
No ratings yet
Expressed Sequence Tags (Ests)
3 pages
La Chapelle 2010
No ratings yet
La Chapelle 2010
9 pages
Transposable Elements, Epigenetics
No ratings yet
Transposable Elements, Epigenetics
10 pages
The Science of Stem Cells
From Everand
The Science of Stem Cells
Jonathan M. W. Slack
No ratings yet
Xenopus Development
From Everand
Xenopus Development
Malgorzata Kloc
No ratings yet
Gene Editing, Epigenetic, Cloning and Therapy
From Everand
Gene Editing, Epigenetic, Cloning and Therapy
Amin Elsersawi Ph.D.
4.5/5 (2)
A Journey Into The Depth Of Our DNA
From Everand
A Journey Into The Depth Of Our DNA
Jimmy sidhu
No ratings yet
Introducing Epigenetics: A Graphic Guide
From Everand
Introducing Epigenetics: A Graphic Guide
Cath Ennis
3/5 (4)
Epigenetic Feeding
From Everand
Epigenetic Feeding
Carlos Herrero Carcedo
No ratings yet

The C-Value Paradox

Uploaded by

The C-Value Paradox

Uploaded by

The C-value paradox

• Some clues come from re-annealing

• The time it takes for DNA to re-anneal

• Obviously if you started with more of a sequence it would

• The rate of re-association, hence the time taken to renature is

• The complexity is defined as the number of bases in each

• Eukaryotic genomic DNA can be divided up into 4 classes:

• The last 2 are often combined

This is a typical gene with a CpG island.

• CpG dinucleotides are rare in mammal DNA

• DNA Methylation only occurs at CpG sites

• Methylated cytosines may be converted to thymine by deamination over evolution

• Usually they are not methylated

• Definition from Takai & Jones

• There are about 29,000 such regions in the human genome

Gene CpG islands in body

Gene 3’ end CpG islands

• Repeated groups of genes whose products are

• Retrotransposons, DNA which has been transcribed in

• LINES (long interspersed nuclear elements) 6 – 8 kb

• SINES (short interspersed nuclear elements) 100 – 400 bp

• LTR retrotransposons or long terminal repeats

• Some have a few copies on the genome.

• One example is the globin family. This set of genes

• Sometimes found clustered together on the one

• But they only make up a small proportion of the

• They are the most complex group and hence take

You might also like