The C-Value Paradox
The C-Value Paradox
• The C-value is the total number of DNA nucleotide residues in the genome (per
haploid set of chromosomes).
• When you compare this to the complexity of the organism you find a massive
disparity.
• Clearly the amount of DNA is not proportional to that required to produce all
the proteins made by the organism.
• The E. Coli genome has 4.6 million base pairs and codes for about 3,000
different proteins (proteins of ~40,000 and 500 bp for promoters)
• Using the same assumptions the human genome should code for 1 million
proteins (3 billion base pairs (3*10^9), protein ~30,000 and promoters of 1500
bp)
Re annealing experiments
• Only about 3% of the DNA in the genome
actually codes for proteins. What is the
rest of it doing?
Cot½
A260
Double
stranded
DNA
Co*Time (M*s)
• You need to account for the starting concentration (Co).
• By using the Cot value you can plot (on the same graph)
different annealing experiments with different starting
concentrations (10 – 2000 ug/mL) from the same source and
they will all lie of the graph.
unique
CpG island
Any stretch of DNA greater than 500bp with a CG content of greater than
50%.
So, it a region of DNA in which the frequency of the CG sequence is higher
than in other regions.
"p" indicates that "C" and "G" are connected by a phosphodiester bond.
CpG island –from several hundred to several thousand base pairs long.
In humans there are about 45,000 CpG islands, mostly found at the 5'
ends of genes.
8
CpG island properties
CpG islands are often located around the promoters of genes frequently expressed in a
cell.
A promoter - specific region just upstream from a gene that acts as a binding site for
transcription factors and RNA polymerase during the initiation of transcription.
Thus, the knowledge of CpG island is important for the computational prediction of
promoters for genes. Recently it was shown that the prediction, which no associated with
CpG-island may not even be possible.
9
According to a recent study, human chromosomes 21 and 22
contain about 1100 CpG-islands and about 750 genes.
(Comprehensive analysis of CpG islands in human chromosomes 21 and 22, Proc. Natl. Acad. Sci. US, March 19,
2002)
10
CpG islands are not really a repeated sequence, but a special type of DNA sequence with a particular
function
A function for islands: Molecular studies showed that the chromatin in these regions has an "open" configuration,
with no nucleosomes or histone H1.
This would make the DNA accessible to transcription factors, etc. and hence able to be transcribed.
11
What CpG islands are?
• CpG islands are short stretches of DNA with higher frequency of the CG sequence
• CpG islands located in the promoter regions of genes can play important roles in gene silencing
• Housekeeping genes
• Almost all housekeeping genes are associated with at least one CpG island
• CpG islands are starting 5’ to the transcription start site and covering one or more exons and introns
• Tissue specific genes
• About 40 % tissue specific genes are associated with islands
• The position of these islands is not strongly toward the transcription start site as in the housekeeping genes
• Not all CpG islands are associated with genes
• Ioshikhes & Zhang determined the features to discriminate the promoter-associated and non-associated CpG
islands
• There are methylation-prone and methylation-resistant CpG islands
• Feltus et. al. found patterns to discriminate methylation-prone from methylation-resistant CpG islands
5’ end
CpGi
Gene
Promoter CpG islands
Gene