0% found this document useful (0 votes)

30 views8 pages

DNA The Code of Life

The document discusses the Human Genome Project (HGP), which aimed to sequence the entire human genome and identify all human genes, leading to advancements in genetic research and applications like DNA fingerprinting. It highlights the methodologies used in HGP, the significance of DNA polymorphism, and the regulation of gene expression in prokaryotes, particularly through the lac operon. Additionally, it emphasizes the implications of genomic research for understanding diseases and the potential for future scientific advancements.

Uploaded by

kamogelotladi636

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views8 pages

DNA The Code of Life

Uploaded by

kamogelotladi636

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Life at Molecular, Cellular and Tissue Level

DNA: The Code of Life

Human Genome Project and DNA Fingerprinting

Did you know that scientists have determined the complete DNA sequence of humans! Yes, it’s
true, through an ambitious project called the Human Genome Project (HGP). Also, did you
know that just like your fingerprint, you also have a DNA fingerprint that is unique to you!
Want to know more about these concepts? Let’s find out.

Human Genome Project

The Human Genome Project (HGP) was a mega project launched in the year 1990. The
advances in genetic engineering techniques have made this project possible. The aims of this
project reveal the magnitude and the requirements of this project.

The human genome (i.e. the complete set of genes) has approximately 3 x 109 base pairs. If the
cost of sequencing is US$ 3 per base pair, then the cost of the entire project would be
approximately US$ 9 billion! Moreover, let’s say the sequencing data were to be stored in
books. Then if each page has 1000 letters and each book has 1000 pages, we will need 3300
such books to store the genetic information from a single cell!

This large amount of data would also need computational devices with high speed to store,
retrieve and analyse the data. Therefore, HGP aided the rapid development of another field in
biology – Bioinformatics.

Goals Of HGP

• Identify all the genes in the human genome (approximately 20,000-25,000 genes).

• Provide a complete and accurate sequence of the 3 billion base pairs that make up the
human genome.

• Store all the sequencing data in databases.

• Develop new tools to obtain and analyse data and make the information widely
available.

• Necessitate technology transfer to other sectors like industries.

• Address the ethical, legal and social implications of the project on society.
HGP was a 13-year project, coordinated by the National Institute of Health (NIH) and the U.S.
Department of Energy. It involved contributions from other countries too such as Japan,
Germany, China, France etc. The benefits of this project are that it can lead to revolutionary
new ways to diagnose, treat and prevent human diseases.

Besides the human genome, information about the genomes of non-human organisms can also
be very helpful. We can understand their natural capabilities and apply them towards solving
problems in human healthcare, agriculture, energy production etc. Therefore, scientists have
also sequenced many non-human organisms such as bacteria, yeast, fruit fly, plants etc.

Methodologies Of HGP

HGP involved two major approaches:

• Expressed Sequence Tags (ESTs) – This approach focussed on identifying all genes
expressed as RNA.

• Sequence Annotation – This blind approach involved sequencing the whole genome
(coding and non-coding) and later assigning functions to the different regions.
DNA sequencing involves the following steps:

• First, total DNA is isolated from a cell and converted into random, small-size fragments
since it is difficult to sequence long pieces of DNA. These fragments are then cloned into
a suitable host (bacteria or yeast) using special vectors such as Bacterial Artificial
Chromosomes (BAC) or Yeast Artificial Chromosomes (YAC). This amplifies each DNA
fragment so that it can be sequenced easily.

• Next, the fragments are sequenced using automated DNA sequencers. These sequences
work on the principle of Frederick Sanger’s method.

• Special computer-based programs are used to arrange and align the DNA sequences
based on overlapping regions present in them.

• Subsequently, the sequences are annotated and assigned to each chromosome.

This is a time-consuming process. Therefore, the sequence of chromosome 1 (the last
chromosome to be sequenced) was completed only in May 2006.

Salient Features of Human Genome

• There are 3164.7 million nucleotide bases in the human genome.

• An average gene has 3000 bases. However, sizes vary greatly, with the largest human
gene being ‘dystrophin’ that has 2.4 million bases.

• The original estimate of the number of genes was between 80,000 and 140,000.
However, HGP gave an estimate of about 30,000 genes. Approximately 99.9% nucleotide
bases are the same in all people.
• For over 50% of the discovered genes, the functions are unknown.

• Less than 2% of the genome codes for proteins.

• Repeated sequences form a large part of the human genome.

• Stretches of DNA sequences that are repeated many times (sometimes 100 to 1000
times) are repetitive sequences. Although they don’t code for proteins, they shed light
on chromosome structure, evolution, and dynamics.

• Chromosome 1 has the greatest number of genes (2968), and chromosome Y has the
least (231).

• HGP has identified 1.4 million locations with single base DNA differences in humans. This
information will revolutionize the identification of disease-associated sequences and
tracking of human history.

Components of the Human Genome [Source: Wikimedia Commons]

Applications of HGP and Future Challenges

The need to derive meaningful knowledge from genomic sequences and better understand
biological systems will drive future research. This enormous task will require the coordinated
effort of scientists from various fields.

A major impact of HGP is providing a radically new approach in biological research. Earlier,
researchers studied one or a few genes at a time. Now, with new technologies and whole
genome sequences, they can study all the genes in a genome i.e. all the transcripts in a tissue
or an organ. They can also study how thousands of genes work together in networks to make a
system function.
DNA Fingerprinting

As we know, 99.9% of nucleotide bases are the same in all humans. However, there are some
differences in human DNA sequences, which make them unique. This is their DNA fingerprint.
How do we determine these differences? If we compare the whole DNA sequences of two
individuals, it’ll take far too long. DNA fingerprinting is a quicker way to compare the sequences
of two individuals.

This technique involves identifying differences in the repetitive DNA regions. The peaks on a
density gradient centrifugation help to separate the repetitive part from the bulk DNA. Here,
the bulk DNA forms a major peak, while the small peaks are called satellite DNA.

Satellite DNA is classified into micro-satellites and mini-satellites based on multiple factors such
as – base composition (A:T rich or G:C rich), number of repetitive units, length of segment etc.
These sequences do not code for any protein but are abundant in the human genome. They
also show a high degree of polymorphism i.e. differences in DNA sequence and therefore,
form the basis of DNA fingerprinting.

DNA from every tissue such as hair follicle, saliva, skin, bone etc show the same degree of
polymorphism. Thus, these are very important as an identification tool in forensic applications.
Moreover, since polymorphisms are passed on from parents to children, this fingerprinting
technique is also the basis of paternity testing.

Let’s understand exactly what polymorphisms are.

Polymorphism

Polymorphisms are variations at the genetic level that arise due to mutations. In an individual,
new mutations can arise either in somatic cells or germ cells i.e. cells that generate sperm and
ovum. If the germ cell mutation doesn’t affect the individual’s ability to reproduce, then it is
passed on to the next generation and thus, spreads in the population.

DNA polymorphism is an inheritable mutation observed at a high frequency in a population.

The probability of these variations is higher on non-coding DNA since mutations in them will
not impact an individual’s reproductive ability. This then passes from generation to generation
and is one of the basis’ of variation in human evolution. Polymorphisms can be changes in a
single nucleotide or large-scale changes.

Technique

Alec Jeffreys initially developed the technique of DNA fingerprinting using a satellite DNA that
shows a very high degree of polymorphism, as a probe. It is called Variable Number of Tandem
Repeats (VNTR). VNTR belongs to the class of mini satellites. Here, a small DNA sequence is
arranged in many copies. The copy number varies between individuals and the number of
repeats shows a high degree of polymorphism.

The technique of DNA fingerprinting involves Southern blot hybridization using radiolabelled
VNTR as a probe. The steps are:

• Sample collection

• DNA isolation.

• DNA digestion using restriction endonucleases.

• Separation of DNA fragments using electrophoresis.

• Blotting (transferring) of separated DNA fragments on to synthetic membranes like

nylon or nitrocellulose.

• Hybridization with the labelled VNTR probe.

• Detection of the hybridized DNA fragments using autoradiography.

The size of VNTR ranges from 0.1 to 20 kilobases. Therefore, the autoradiogram results show
bands of multiple sizes. These bands give a characteristic pattern which differs between
individuals except for monozygotic twins. Furthermore, polymerase chain reaction (PCR)
increases the sensitivity of fingerprinting i.e. DNA from a single cell is enough to perform
fingerprinting.

DNA fingerprint [Source: Flickr]

Apart from forensic science and paternity testing, this technique is also useful in
determining population and genetic diversities. Therefore, many different probes are used
currently to generate DNA fingerprints.
Gene Expression
We now know that genes encode proteins and proteins control the functions of a cell. Are all
the genes in a cell expressed at the same time? And, are all genes expressed all the time? No!
This will not only lead to wastage of cellular energy but also affect the balance within a cell.
This is the reason why gene expression is regulated. Exactly how are genes regulated? Let’s find
out.

Regulation of Gene Expression

Protein synthesis begins at transcription, ends at translation and involves multiple steps.
Therefore, regulation of gene expression can happen at any of these steps. In eukaryotes, gene
regulation occurs at any of the following steps:

• Transcriptional level i.e. during the formation of the primary transcript.

• Processing level i.e. at the stage of splicing.

• During transport of mRNA from the nucleus to the cytoplasm.

• Translational level.

A great example of coordinated gene regulation is the development and differentiation of

embryo into adult organisms. Metabolic, physiological and environmental conditions govern
the regulation of gene expression. For example, E. coli uses lactose as a source of energy.

To do so, it synthesizes an enzyme called beta-galactosidase which hydrolyses lactose into

galactose and glucose. However, if there is no lactose around to be used as an energy source,
the E. coli does not need to synthesize beta-galactosidase.
Prokaryotic Gene Regulation

In prokaryotes, the main site for regulation of gene expression is transcription initiation. Within
a transcription unit, the activity of RNA polymerase at the promoter is regulated by ’accessory
proteins’. These proteins affect the ability of RNA polymerase to recognize start sites. These
proteins can act both positively (activators) or negatively (repressors).

In prokaryotic DNA, the accessibility of the promoter depends on the interaction of proteins
with sequences called operators. In most operons, the operator is adjacent to the promoter
elements. Moreover, in most cases, the operator has a repressor protein bound to it.
Therefore, each operon has its own, specific operator and repressor. Let’s understand this
better using lac operon as an example.

The Lac Operon

Here, ’lac’ refers to lactose. Francois Jacob and Jacque Monod were the first to elucidate
the lac operon – a transcriptionally regulated system. Lac operon consists of a polycistronic
structural gene regulated by a common promoter and regulatory genes. Such arrangements
are common in bacteria and are called operons. Other examples
include trp operon, val operon, his operon etc.

The lac operon has the following parts:

• One regulatory gene – The i gene where ’i’ is derived from ‘inhibitor’. This gene codes
for the repressor of the lac operon.

• Three structural genes –

1. The z gene that codes for the enzyme beta-galactosidase that hydrolyses lactose to
glucose and galactose.

2. The y gene codes for the enzyme permease that increases the permeability of the
cell to beta-galactosides.

3. The a gene codes for transacetylase.

Lactose metabolism requires gene products of all three genes mentioned above. Lactose, the
substrate for the enzyme beta-galactosidase, regulates the switching on or the switching off, of
the operon. Therefore, lactose is the inducer. Let’s understand how lactose switches the
operon on or off.

In the absence of lactose, the i gene synthesizes the repressor which then binds to the operator
region of the operon. This prevents RNA polymerase from transcribing the genes (z, y, a) on the
operon. Therefore, if there is no lactose, the operon does not synthesize genes for its
utilization. The action of the repressor on the lac operon is negative regulation.
In the presence of lactose, the repressor interacts with lactose and gets inactivated. Thus, RNA
polymerase is free and can transcribe the genes in the operon. Therefore, if lactose is present,
the operon synthesizes the genes for its utilization. Therefore, essentially, the presence of the
substrate i.e. lactose regulates the synthesis of enzymes for its utilization.

Human Genome Project
86% (36)
Human Genome Project
39 pages
BIOLOGY Class 12 Project
93% (29)
BIOLOGY Class 12 Project
15 pages
Anshumala Ahaturvedi 1
No ratings yet
Anshumala Ahaturvedi 1
19 pages
Human Genome Project
No ratings yet
Human Genome Project
17 pages
2020 Genome 1
No ratings yet
2020 Genome 1
38 pages
The Human Genome Project
100% (1)
The Human Genome Project
22 pages
HGP
No ratings yet
HGP
24 pages
Human Genome Project Sachi 2
No ratings yet
Human Genome Project Sachi 2
19 pages
Human Genome Project
No ratings yet
Human Genome Project
18 pages
Contents HGP
No ratings yet
Contents HGP
11 pages
Human Genome Project
No ratings yet
Human Genome Project
9 pages
Biology Investigatory Project
No ratings yet
Biology Investigatory Project
16 pages
STS Report
No ratings yet
STS Report
11 pages
HGF PDF 02
No ratings yet
HGF PDF 02
12 pages
304a0898-3496-4b32-b97b-14c2bf6616ba
No ratings yet
304a0898-3496-4b32-b97b-14c2bf6616ba
12 pages
Human Genome Project and DNA Fingerprinting
No ratings yet
Human Genome Project and DNA Fingerprinting
20 pages
The Human Genome Project
No ratings yet
The Human Genome Project
26 pages
Human Genome Project
No ratings yet
Human Genome Project
4 pages
Human Genome Project
No ratings yet
Human Genome Project
3 pages
Human Genome Project - Biology Science Fair Project Ideas
100% (1)
Human Genome Project - Biology Science Fair Project Ideas
8 pages
Bio Project Gems
No ratings yet
Bio Project Gems
20 pages
Human Genemoe Project
No ratings yet
Human Genemoe Project
14 pages
Human Genome Project-1
No ratings yet
Human Genome Project-1
4 pages
TINKU
No ratings yet
TINKU
13 pages
The Human Genome Project Class 12th
No ratings yet
The Human Genome Project Class 12th
10 pages
Biology Investigatory Project Class 12-A
No ratings yet
Biology Investigatory Project Class 12-A
24 pages
Print Human Genome Project
No ratings yet
Print Human Genome Project
6 pages
Human Genome Project-1
No ratings yet
Human Genome Project-1
3 pages
Expressed Sequence Tags (Ests)
No ratings yet
Expressed Sequence Tags (Ests)
3 pages
RDT Tools & Techniques - Part I - Prof. Parimal K. Khan
100% (1)
RDT Tools & Techniques - Part I - Prof. Parimal K. Khan
29 pages
HGP PDF
No ratings yet
HGP PDF
12 pages
Black & White Deliver Card (1) - WPS Office
No ratings yet
Black & White Deliver Card (1) - WPS Office
28 pages
Bigdye Terminator Protocol v1.1
No ratings yet
Bigdye Terminator Protocol v1.1
74 pages
Bio Investigatory Project
No ratings yet
Bio Investigatory Project
20 pages
BIOLOGY Class 12 Project
No ratings yet
BIOLOGY Class 12 Project
13 pages
Bio Final
No ratings yet
Bio Final
15 pages
Human Genome Project Assignment (M Tuaseen 9211)
0% (1)
Human Genome Project Assignment (M Tuaseen 9211)
3 pages
Human Genome Therapy
No ratings yet
Human Genome Therapy
6 pages
Kartik Project Biology
No ratings yet
Kartik Project Biology
17 pages
Genomics, Proteomics and System Biology: Seminar On Human Genome Project
No ratings yet
Genomics, Proteomics and System Biology: Seminar On Human Genome Project
20 pages
Biology Project - 20241231 - 171730 - 0000
No ratings yet
Biology Project - 20241231 - 171730 - 0000
21 pages
Human Genome Project Class 12
100% (4)
Human Genome Project Class 12
15 pages
Biology Investigatory Project
No ratings yet
Biology Investigatory Project
44 pages
Investigatory Project Bio
No ratings yet
Investigatory Project Bio
18 pages
Human Genome Project
No ratings yet
Human Genome Project
13 pages
6 Molecular Basis of Inheritance Capsule Notes
No ratings yet
6 Molecular Basis of Inheritance Capsule Notes
4 pages
Tharun MB
No ratings yet
Tharun MB
24 pages
Human Genome Project and DNA Fingerprinting
80% (10)
Human Genome Project and DNA Fingerprinting
2 pages
Kartik Agsi Biology Project
No ratings yet
Kartik Agsi Biology Project
18 pages
Abi 2024
No ratings yet
Abi 2024
22 pages
Biology Class 12 Project
No ratings yet
Biology Class 12 Project
17 pages
Biology Project: Kendriya Vidyalaya AFS Bareilly
100% (1)
Biology Project: Kendriya Vidyalaya AFS Bareilly
16 pages
12 Bio
No ratings yet
12 Bio
2 pages
ISC Biotechnology
No ratings yet
ISC Biotechnology
11 pages
Ashwin Project
100% (2)
Ashwin Project
15 pages
Saurav Biology Class 11
No ratings yet
Saurav Biology Class 11
21 pages
Biology Ip HGP
No ratings yet
Biology Ip HGP
10 pages
BIOLOGY Class 12 Project
No ratings yet
BIOLOGY Class 12 Project
19 pages
Phy Project File-1
No ratings yet
Phy Project File-1
15 pages
Meiosis
100% (1)
Meiosis
8 pages
R-Sen Lecture 01b
No ratings yet
R-Sen Lecture 01b
57 pages
BIOLOGY Class 12 Project
75% (185)
BIOLOGY Class 12 Project
15 pages
Age Piche Print
No ratings yet
Age Piche Print
3 pages
Cytology: Emmanuel A. Domingcil, BSN, RN, USRN, MAN
No ratings yet
Cytology: Emmanuel A. Domingcil, BSN, RN, USRN, MAN
32 pages
Timeline Ofthe Human Genome
No ratings yet
Timeline Ofthe Human Genome
5 pages
Cell Four
No ratings yet
Cell Four
37 pages
Harsh
No ratings yet
Harsh
11 pages
BIOLOGY Class 12 Project
No ratings yet
BIOLOGY Class 12 Project
15 pages
Bhadauria V (Ed.) - Next-Generation Sequencing and Bioinformatics For Plant Science-Caister (2017)
No ratings yet
Bhadauria V (Ed.) - Next-Generation Sequencing and Bioinformatics For Plant Science-Caister (2017)
204 pages
Rapid Sequencing Gdna Barcoding SQK Rbk114 Document Document MinION en RBK 9176 v114 RevR 04jun2025 33
No ratings yet
Rapid Sequencing Gdna Barcoding SQK Rbk114 Document Document MinION en RBK 9176 v114 RevR 04jun2025 33
38 pages
Chemical Mediators of Acute Inflammation - Handout
No ratings yet
Chemical Mediators of Acute Inflammation - Handout
24 pages
DNA Damage & Repair
No ratings yet
DNA Damage & Repair
11 pages
Schneider 2017
No ratings yet
Schneider 2017
8 pages
Chapter 10 Biomolecules PYQ
No ratings yet
Chapter 10 Biomolecules PYQ
27 pages
Chapter 18 Presentation
No ratings yet
Chapter 18 Presentation
47 pages
Biochemistry Midterm 1 Questions
No ratings yet
Biochemistry Midterm 1 Questions
2 pages
Lec1 - Cytology
No ratings yet
Lec1 - Cytology
3 pages
Life Sciences Grade 12 Term 1 Week 1 - 2021
No ratings yet
Life Sciences Grade 12 Term 1 Week 1 - 2021
4 pages
Using Entrez: An Integrated Database Search and Retrieval System
No ratings yet
Using Entrez: An Integrated Database Search and Retrieval System
53 pages
A pET Vector Is A Type of Plasmid Vector Commonly Used For Protein Expression in Escherichia Coli
No ratings yet
A pET Vector Is A Type of Plasmid Vector Commonly Used For Protein Expression in Escherichia Coli
13 pages
Life Science Assignment 3
No ratings yet
Life Science Assignment 3
17 pages
Metabolism Exam 2 - GIFT - Spring 2016
100% (2)
Metabolism Exam 2 - GIFT - Spring 2016
9 pages
Cells Practice Answers
No ratings yet
Cells Practice Answers
12 pages
MCB 250 Practice Exam I
No ratings yet
MCB 250 Practice Exam I
14 pages
Chapter 21
No ratings yet
Chapter 21
11 pages
Glucose Transporters
No ratings yet
Glucose Transporters
2 pages
AO1 Cells
No ratings yet
AO1 Cells
6 pages
Gene Guns
No ratings yet
Gene Guns
4 pages
Reproduksi Sel Ringkasan
No ratings yet
Reproduksi Sel Ringkasan
2 pages

DNA The Code of Life

Uploaded by

DNA The Code of Life

Uploaded by

Life at Molecular, Cellular and Tissue Level

DNA: The Code of Life

Human Genome Project and DNA Fingerprinting

Human Genome Project

• Store all the sequencing data in databases.

• Necessitate technology transfer to other sectors like industries.

HGP involved two major approaches:

• Subsequently, the sequences are annotated and assigned to each chromosome.

Salient Features of Human Genome

• There are 3164.7 million nucleotide bases in the human genome.

• Less than 2% of the genome codes for proteins.

• Repeated sequences form a large part of the human genome.

Components of the Human Genome [Source: Wikimedia Commons]

Applications of HGP and Future Challenges

Let’s understand exactly what polymorphisms are.

DNA polymorphism is an inheritable mutation observed at a high frequency in a population.

• DNA digestion using restriction endonucleases.

• Separation of DNA fragments using electrophoresis.

• Blotting (transferring) of separated DNA fragments on to synthetic membranes like

• Hybridization with the labelled VNTR probe.

• Detection of the hybridized DNA fragments using autoradiography.

DNA fingerprint [Source: Flickr]

Regulation of Gene Expression

• Transcriptional level i.e. during the formation of the primary transcript.

• Processing level i.e. at the stage of splicing.

• During transport of mRNA from the nucleus to the cytoplasm.

A great example of coordinated gene regulation is the development and differentiation of

To do so, it synthesizes an enzyme called beta-galactosidase which hydrolyses lactose into

The Lac Operon

The lac operon has the following parts:

• Three structural genes –

3. The a gene codes for transacetylase.

You might also like