Proteins and Functions
Macromolecules
COURSE CODE: BCH 203
Lecturer: Mrs Famutimi Olufemi
A TERM PAPER ON
PROTEIN STRUCTURE
AND DISEASES
BY
OGUNDELE
OLUWADAMILARE
OLANREWAJU
BCH/2021/144
INTRODUCTION
Protein structures are intricate arrangements of amino acids that form the building blocks of
living organisms. These structures are fundamental to various biological processes due to the
diverse functions proteins perform. Proteins serve as enzymes, catalysts for biochemical
reactions; structural components, providing support to cells and tissues; signaling molecules,
transmitting messages within and between cells; and transporters, facilitating the movement
of substances across membranes. The precise three-dimensional conformations of proteins are
essential for their proper functioning, and deviations from their native structures can lead to
dysfunction and disease. Understanding protein structures is key to unravelling the
complexities of cellular mechanisms and developing insights into the molecular basis of
health and pathology.
The intricate relationship between protein structures and the development of diseases lies in
the susceptibility of these structures to alterations, often resulting in functional abnormalities.
Proteins, with their specific three-dimensional configurations, play pivotal roles in
maintaining cellular homeostasis. However, genetic mutations, environmental factors, or
other stressors can induce changes in protein structures, leading to misfolding, aggregation,
or loss of function. Such structural anomalies are implicated in the onset and progression of
various diseases.
For instance, misfolded proteins are associated with neurodegenerative disorders like
Alzheimer’s and Parkinson’s, where aggregates contribute to neuronal damage. In certain
cancers, mutations in proteins can disrupt regulatory pathways, promoting uncontrolled cell
growth. Understanding the link between protein structures and diseases is crucial for
identifying potential therapeutic targets, designing interventions to restore normal protein
function, and advancing the field of precision medicine. Investigating the structural basis of
diseases provides insights into diagnostic methods, drug development, and ultimately,
strategies to mitigate the impact of diverse pathological conditions.
AMINO ACIDS
Amino acids are the building blocks of proteins. They are organic compounds that contain
amine (-NH2) and carboxyl (-COOH) functional groups, along with a side chain (R group)
that is specific to each amino acid (Figure 1). Twenty different amino acids are commonly
found in proteins.
Figure 1 :
All 20 common amino acids are α-amino acids, and their general structure is shown below.
Each has a carboxyl group and an amino group covalently bonded to the same α-carbon atom.
Proline is a partial exception: its side chain bonds back to the amino nitrogen, forming a
secondary amine. The amino acids differ from each other in their side chain (R) groups. Since
the rest of the structure is the same, the properties of each amino acid are primarily
determined by its side chain. Side chains may be polar or nonpolar (aliphatic), hydrophilic
or hydrophobic, acidic or basic, or aromatic. Each amino acid is abbreviated using either a
three-letter or a one-letter code.
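The two abbreviation systems can be illustrated with a short Python sketch. The table holds the standard three-letter and one-letter codes; the function name `to_one_letter` and the example peptide are illustrative choices, not something taken from the sources of this paper.

```python
# Standard three-letter and one-letter codes for the 20 common amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_sequence):
    """Convert a hyphen-separated three-letter sequence to one-letter codes."""
    residues = three_letter_sequence.split("-")
    # capitalize() normalises input such as "GLY" or "gly" to "Gly".
    return "".join(THREE_TO_ONE[residue.capitalize()] for residue in residues)


# Arbitrary example peptide (hypothetical, for illustration only).
print(to_one_letter("Gly-Ile-Val-Glu-Gln"))
```

The one-letter form is the more compact of the two and is the one normally used when whole protein sequences are written out.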
Proteins
Proteins are made from amino acids and therefore always contain the elements carbon,
hydrogen, oxygen, and nitrogen, and in some cases sulphur. Some proteins form complexes
with other molecules containing phosphorus, iron, zinc, and copper. Proteins are
macromolecules of high Mr (relative molecular mass), typically between
several thousand and several million, consisting of chains of amino acids. They are
polymers and amino acids are the monomers. There are 20 different amino acids which are
commonly found in naturally occurring proteins. The potential variety of proteins is
unlimited because the sequence of amino acids in each protein is specific for that protein and
is genetically controlled by the DNA of the cell in which it is made. Proteins are the most
abundant organic molecules to be found in cells and form over 50% of their total dry mass.
They are an essential component of the diet of animals and may be converted to both fat and
carbohydrate by the cells. Their diversity enables them to display a great range of structural
and metabolic activities within the organism.
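The claim that the potential variety of proteins is effectively unlimited follows from simple counting: with 20 possible amino acids at each position, a chain of n residues can be written in 20^n ways. A minimal Python sketch of this arithmetic is given below; the function name and the chain lengths chosen are illustrative assumptions.

```python
def possible_sequences(chain_length, alphabet_size=20):
    """Number of distinct sequences of a given length built from 20 amino acids."""
    return alphabet_size ** chain_length


# Illustrative chain lengths: a dipeptide, a short peptide, a small protein.
for n in (2, 10, 100):
    print(n, possible_sequences(n))
```

Even for a chain of modest length the count is astronomically large, which is why only a tiny fraction of the possible sequences can ever occur in living organisms.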
Structure of proteins
Each protein possesses a characteristic three-dimensional shape, its conformation. There are
four separate levels of structure and organisation as follows.
1. Primary structure
The primary structure of a protein is the linear sequence of amino acids in its polypeptide
chain, which forms the backbone of the protein molecule. Any changes or mutations in this
sequence can have profound effects on the protein’s structure and function.
The first person to work out the complete amino acid sequence of a protein was Fred Sanger,
working at the University of Cambridge, where Watson and Crick also determined the structure of
DNA. He worked with the hormone insulin, the smallest protein he could find. It took ten
years, and the results were published in 1953. Max Perutz, another great molecular biologist
of the Cavendish, recalls 'it caused a sensation, because it proved for the first time that protein
has a specific arrangement of amino acids along its chain.' Sanger was awarded the Nobel
prize for his work in 1958 (and has since won a second for work on nucleic acids). Insulin is a
protein of 51 amino acids. It is made of two polypeptide chains held together by disulphide
bridges.
2. Secondary Structure
The secondary structure of a protein refers to the local folding patterns within the polypeptide
chain. Common secondary structures include alpha helices and beta sheets, stabilized by
hydrogen bonds between the backbone C=O and N-H groups of the amino acids.
The two main types of secondary structures are:
1. Alpha Helix: A right-handed coil formed by hydrogen bonding between the carbonyl
oxygen of one amino acid and the amide hydrogen of an amino acid four residues
away. This helical structure provides stability to the protein.
2. Beta Sheet: A structure where adjacent polypeptide strands are connected by hydrogen
bonds, forming a sheet-like arrangement. Beta sheets can be parallel or antiparallel,
contributing to the protein’s overall stability.
These secondary structures result from interactions between amino acid residues and play a
crucial role in determining a protein’s three-dimensional conformation and, consequently, its
function.
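The alpha-helix bonding pattern described above, in which the carbonyl oxygen of residue i pairs with the amide hydrogen of residue i + 4, can be listed with a short Python sketch. The function name and the example sequence are hypothetical, and the sketch assumes the whole sequence is helical, which real proteins rarely are.

```python
def helix_hydrogen_bonds(sequence):
    """List (i, residue_i, i + 4, residue_i_plus_4) pairs for an ideal alpha helix.

    Positions are 1-based. Residue i contributes the carbonyl oxygen and
    residue i + 4 contributes the amide hydrogen.
    """
    return [
        (i + 1, sequence[i], i + 5, sequence[i + 4])
        for i in range(len(sequence) - 4)
    ]


# Hypothetical peptide, assumed for illustration to be entirely helical.
for bond in helix_hydrogen_bonds("AEAAKAEAAK"):
    print(bond)
```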
3. Tertiary structure
Usually, the polypeptide chain bends and folds extensively, forming a
precise, compact and 'globular' shape. This is the protein's tertiary structure,
and it is maintained by the interaction of four types of bonds, namely ionic,
hydrogen and disulphide bonds as well as hydrophobic interactions. The latter
are quantitatively the most important and occur when the protein folds to shield
hydrophobic side groups from the aqueous surroundings, at the same time
exposing hydrophilic side chains. The tertiary structure of a
protein can be determined by X-ray crystallography.
By early 1959, and after many years' work, John Kendrew and Max Perutz had
built the first atomic model of myoglobin showing secondary and tertiary
structures using this technique. They received the Nobel prize for their work in
1962.
4. Quaternary Structure
Many highly complex proteins consist of more than one polypeptide chain. The separate
chains are held together by hydrophobic interactions and hydrogen and ionic bonds. Their
precise arrangement is known as the quaternary structure. Haemoglobin shows such a
structure. It is the red oxygen-carrying pigment found in the red blood cells of vertebrates.
It consists of four separate polypeptide chains of two types, namely two α chains and two β
chains. These resemble myoglobin in structure. The two α chains each contain 141 amino
acids, while the two β chains each contain 146 amino acids. The complete structure of
haemoglobin was worked out by Kendrew and Perutz.
As is typical of globular proteins, its hydrophobic side chains point inwards to the
centre of the molecule, and its hydrophilic side chains face outwards, making it soluble in
water. In sickle cell anaemia, a mutation replaces one of the hydrophilic amino acids in the
β chain (glutamic acid) with a hydrophobic amino acid (valine), thereby reducing the
solubility of the molecule.
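The effect of swapping a hydrophilic residue for a hydrophobic one can be made concrete with a hydropathy scale. The sketch below uses the Kyte-Doolittle scale, which is an assumption introduced here and is not taken from the sources of this paper; higher values mean more hydrophobic. It compares the first eight residues of the normal β chain with the sickle-cell variant, in which glutamic acid (E) at position 6 is replaced by valine (V). The function name is hypothetical.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
KYTE_DOOLITTLE = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5,
    "M": 1.9, "A": 1.8, "G": -0.4, "T": -0.7, "S": -0.8,
    "W": -0.9, "Y": -1.3, "P": -1.6, "H": -3.2, "E": -3.5,
    "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9, "R": -4.5,
}


def mean_hydropathy(sequence):
    """Average Kyte-Doolittle hydropathy over a one-letter sequence."""
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


normal_beta_start = "VHLTPEEK"  # first eight residues of the normal beta chain
sickle_beta_start = "VHLTPVEK"  # same stretch with Glu -> Val at position 6

print("normal:", mean_hydropathy(normal_beta_start))
print("sickle:", mean_hydropathy(sickle_beta_start))
```

The variant segment scores as more hydrophobic than the normal one, which is consistent with the reduced solubility described above. A single averaged score is only a rough indicator and does not model how haemoglobin molecules actually associate.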
Protein Folding
Protein folding is the complex process by which a linear chain of amino acids, known as a
polypeptide, acquires its three-dimensional functional and biologically active structure. This
process is critical for a protein to carry out its specific biological functions within cells.
The primary structure of a protein, dictated by the sequence of amino acids encoded in its
corresponding gene, serves as the starting point for folding. The folding process is influenced
by various interactions among amino acid residues, including:
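1. Hydrophobic Interactions: Nonpolar side chains cluster in the interior of the protein,
shielded from the aqueous surroundings, which drives the chain to fold.
2. Hydrogen Bonds: Formed between a hydrogen atom bonded to nitrogen or oxygen in one
residue and an oxygen or nitrogen atom in another, contributing to the folding pattern.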
3. Disulfide Bonds: Covalent bonds between the sulphur atoms of two cysteine residues,
providing additional stability to the protein structure.
4. Ionic Interactions: Attraction or repulsion between positively and negatively charged
amino acid side chains, influencing the folding process.
The protein folds into specific secondary structures like alpha helices and beta sheets, which
further arrange into a unique tertiary structure. In some cases, multiple polypeptide chains
(subunits) come together to form the quaternary structure.
Protein folding is a highly orchestrated and regulated process within cells. Chaperone
proteins assist in the folding of nascent polypeptides, preventing misfolding or aggregation.
When proteins fail to fold correctly, it can lead to functional deficits or even contribute to the
development of diseases, including neurodegenerative disorders.
In conclusion, the intricacies of protein folding and conformation extend across multiple
layers, from molecular precision to systemic functionality. Understanding these processes is
essential not only for unravelling the mysteries of cellular biology but also for advancing
therapeutic approaches in the treatment of various diseases.
3. Fluorescence Spectroscopy:
Principle: Measurement of fluorescence emission to study protein conformational
changes and interactions.
Advantages: Sensitive, applicable to real-time studies.
Limitations: Limited structural resolution, dependence on fluorophore placement.
Contribution: Fluorescent probes can be strategically placed to study specific regions of a
protein, so that changes in their emission report on local conformational changes and
interactions.
Application: Useful for real-time monitoring of protein folding, ligand binding, and
studying structural changes in response to environmental factors.
5. Mass Spectrometry (MS):
Principle: Mass-to-charge ratios of ionized protein fragments are measured, providing
information on composition and structure.
Advantages: Useful for studying protein interactions and post-translational modifications.
Limitations: Limited in providing high-resolution structural details.
Contribution: Provides information on protein composition, post-translational
modifications, and protein interactions. Structural information can be inferred by analysing
the mass-to-charge ratios of ionized protein fragments.
Application: Valuable for studying protein dynamics, post-translational modifications, and
mapping protein-protein interactions.
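The mass-to-charge ratio measured in mass spectrometry can be worked through in a few lines of Python. The sketch assumes positive-ion mode, where a molecule of neutral mass M carrying z extra protons appears at m/z = (M + z × 1.007276) / z. The function names and the example mass are illustrative assumptions.

```python
PROTON_MASS = 1.007276  # mass of a proton in daltons (Da)


def mass_to_charge(neutral_mass, charge):
    """m/z of an ion formed by adding `charge` protons to a neutral molecule."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass + charge * PROTON_MASS) / charge


def neutral_mass_from_mz(mz, charge):
    """Recover the neutral mass from an observed m/z and an assumed charge."""
    return charge * (mz - PROTON_MASS)


# Hypothetical protein of about 5808 Da observed in several charge states.
for z in (3, 4, 5):
    print(z, round(mass_to_charge(5808.0, z), 3))
```

Because the same protein appears at several m/z values, one for each charge state, the neutral mass can be cross-checked by converting each peak back with `neutral_mass_from_mz`.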
1. Genetic Mutations:
Point Mutations: A single-base change in the DNA sequence can replace one amino acid with
another, affecting the folding process.
Insertions or Deletions: Inserting or deleting nucleotides can add or remove amino acids or,
when the reading frame shifts, change every downstream residue, disrupting the correct
folding pattern.
2. Environmental Factors:
Temperature and pH: Changes in environmental conditions, such as elevated temperatures
or extremes in pH, can destabilize protein structures, promoting misfolding.
Chemical Agents: Exposure to certain chemicals or toxins can induce protein misfolding by
interfering with the folding process.
3. Chaperone Deficiency:
Chaperone Proteins: Chaperones assist in the correct folding of proteins. Deficiencies in
chaperone function can lead to the accumulation of misfolded proteins.
Heat Shock Response: Cellular stress, such as heat shock, can overwhelm the chaperone
machinery, leading to the misfolding of proteins.
4. Post-Translational Modifications:
Aberrant Modifications: Improper addition of functional groups or post-translational
modifications can hinder the correct folding of proteins, affecting their structure and function.
5. Protein Overexpression:
Overproduction: Excessive production of a protein can overwhelm the cellular folding
machinery, leading to the accumulation of misfolded proteins.
6. Aging:
Accumulation of Damage: Over time, cells may accumulate damage to molecular
components, including proteins, which can contribute to misfolding.
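The difference between a point mutation and a frame-shifting deletion, the first cause listed above, can be sketched in Python by translating a short stretch of DNA with the standard genetic code. The sequence is an illustrative fragment corresponding to the start of the haemoglobin β chain; the function and variable names are hypothetical, and the translation ignores details such as start codons and introns.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at a stop codon ('*')."""
    protein = []
    for start in range(0, len(dna) - 2, 3):  # incomplete trailing codon is ignored
        amino_acid = CODON_TABLE[dna[start:start + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


normal = "GTGCATCTGACTCCTGAGGAGAAG"
# Point mutation: one base changed in the sixth codon (GAG -> GTG).
point_mutant = normal[:16] + "T" + normal[17:]
# Deletion of one base: every codon after the deletion is read in a new frame.
frameshift = normal[:15] + normal[16:]

print(translate(normal))
print(translate(point_mutant))
print(translate(frameshift))
```

The point mutation alters a single residue, whereas the one-base deletion changes every residue downstream of it.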
Several diseases are associated with protein misfolding, where proteins adopt abnormal
conformations and may form aggregates. These diseases are often referred to as protein
misfolding disorders. Here are some notable examples:
1. Alzheimer’s Disease:
Protein: Amyloid-beta (Aβ) and Tau proteins.
Characteristics: Formation of extracellular plaques (Aβ) and intracellular neurofibrillary
tangles (Tau) in the brain, leading to neurodegeneration.
2. Parkinson’s Disease:
Protein: Alpha-synuclein.
Characteristics: Aggregation of alpha-synuclein into Lewy bodies, which are abnormal
protein clumps in neurons. This disrupts normal cellular function and contributes to
neurodegeneration.
3. Huntington’s Disease:
Protein: Huntingtin.
Characteristics: Expansion of a CAG repeat in the huntingtin gene leads to the production
of a mutant huntingtin protein. This misfolded protein forms aggregates, especially in
neurons, causing neurodegeneration.
5. Cystic Fibrosis:
Protein: Cystic fibrosis transmembrane conductance regulator (CFTR).
Characteristics: Mutations in the CFTR gene lead to misfolding of the CFTR protein,
affecting its transport function. This results in the production of thick, sticky mucus, causing
respiratory and digestive issues.
7. Type 2 Diabetes:
Protein: Amylin.
Characteristics: Misfolding and aggregation of amylin in pancreatic islets contribute to the
formation of amyloid deposits. This impairs insulin secretion and worsens diabetes.
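The CAG repeat expansion mentioned under Huntington’s disease can be measured in a DNA sequence with a few lines of Python. Each CAG codon encodes glutamine, so a longer run gives a longer stretch of glutamine in the protein. The function name and the toy sequences are hypothetical, and the sketch is an illustration only, not a diagnostic method.

```python
import re


def longest_cag_run(dna):
    """Length, in repeats, of the longest uninterrupted run of CAG in a sequence."""
    runs = re.findall(r"(?:CAG)+", dna.upper())
    return max((len(run) // 3 for run in runs), default=0)


# Toy sequences (hypothetical): a short run and an expanded run.
short_allele = "ATG" + "CAG" * 5 + "CCGCCA"
expanded_allele = "ATG" + "CAG" * 45 + "CCGCCA"

print(longest_cag_run(short_allele))
print(longest_cag_run(expanded_allele))
```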
Case Studies
There are many case studies of successful applications of protein structure knowledge in
disease treatment. Here are some examples:
Cystic fibrosis (CF): CF is a genetic disorder caused by mutations in the CF
transmembrane conductance regulator (CFTR) protein, which is responsible for
transporting chloride ions across cell membranes. The most common mutation,
F508del, causes the protein to misfold and be degraded by the cell. Researchers have
used protein structure knowledge to design small molecules that can correct the
folding and function of F508del-CFTR, such as VX-809 and VX-661. These
molecules act as pharmacological chaperones, binding to the mutant protein and
stabilizing its conformation. In combination with VX-770, a potentiator that enhances the
activity of CFTR, and the additional corrector VX-445, these molecules have shown
significant clinical benefits for CF patients [1, 2].
HIV/AIDS: HIV is a retrovirus that infects and destroys CD4+ T cells, leading to
immunodeficiency and opportunistic infections. HIV has several key proteins that are
essential for its replication and survival, such as reverse transcriptase, protease,
integrase, and envelope glycoprotein. Researchers have used protein structure
knowledge to design inhibitors that target these proteins and block their function, such
as AZT, indinavir, raltegravir, and enfuvirtide, respectively. These inhibitors have been
used in combination therapy, known as highly active antiretroviral therapy (HAART), to
suppress viral load and improve immune function in HIV patients [3, 4].
Cancer: Cancer is a group of diseases characterized by uncontrolled cell growth and
invasion. Cancer cells have various genetic and epigenetic alterations that affect the
expression and function of many proteins involved in cell cycle, apoptosis, signal
transduction, angiogenesis, and metastasis. Researchers have used protein structure
knowledge to design drugs that target these proteins and modulate their activity, such
as imatinib, trastuzumab, vemurafenib, and olaparib. These drugs have shown
remarkable efficacy and specificity for certain types of cancer, such as chronic
myeloid leukemia, breast cancer, melanoma, and ovarian cancer, respectively.
Conclusion
Protein structure is a key determinant of protein function and regulation. Understanding
the relationship between protein structure and disease is crucial for developing novel and
effective therapeutic interventions. In this paper, I have reviewed some of the current
strategies and challenges of targeting protein structures to treat or prevent various
diseases, such as cystic fibrosis, HIV/AIDS, and cancer. I have also discussed some of
the successful applications of protein structure knowledge in drug design and discovery. I
hope that this paper will provide a useful overview and insight for researchers and
students interested in this fascinating and important field of study.
References
1. Tayyab S, Nasrulhaq A. A journey from amino acids to proteins. University of Malaya
press. 2006.
2. Cox MM, Nelson DL. Lehninger Principles of Biochemistry 2011. Fifth Edition.
WH Freeman and Company. New York (USA).
3. Monod J, Wyman J, Changeux JP. On the nature of allosteric transitions: A plausible
model. J Mol Biol. 1965; 12: 88-118.
4. In silico Analyses of Gene Expression and Protein Structure/Function and
Applications in Disease (uic.edu)
5. D.J. Taylor, N.P.O Green, G.W. Stout. Biological Science 1&2
6. Biomedicines | Special Issue : Protein Structure, Function and Dynamics in Diseases
and Therapeutics (mdpi.com)
7. https://link.springer.com/article/10.1007/s11030-023-10606-w
8. Neurodegenerative diseases distinguished through protein-structure analysis
(nature.com)