Week 2

This document provides an overview of protein structure and bioinformatics. It explains that proteins catalyze reactions in cells and regulate gene activity. Bioinformatics uses DNA sequence information to determine protein amino acid sequences and to find related proteins, from which properties, structures and functions can be deduced. The document outlines the four levels of protein structure: primary, secondary, tertiary, and quaternary. It also describes the differing properties of amino acids and how they are linked by peptide bonds to form polypeptide chains.

Uploaded by

Nurullah Mertel

Bioinformatics

Protein Structure
Assoc. Prof. Dr. Gazi Erkan BOSTANCI

Slides are mainly based on ‘Understanding Bioinformatics’ by Marketa
Zvelebil and Jeremy O. Baum
• If there is one class of molecules which could be said to live life it
would be the proteins.

• They are responsible for catalyzing almost all the chemical reactions
in the cell (RNA has a more limited but important role, as we saw
earlier), they regulate all gene activity, and they provide much of the
cellular structure.

• There is speculation that life may have started with nucleic acid
chemistry only, but it is the extraordinary functional versatility of
proteins that has enabled life to reach its current complex state.
• Proteins can function as enzymes
catalyzing a wide variety of
reactions necessary for life, and
they can be important for the
structure of living systems, such as
those proteins involved in the
cytoskeleton.*

*The cytoskeleton is the skeleton of a cell and
maintains the shape of the cell.

• The size of a protein can vary from
relatively small to quite large
macromolecules.
• The DNA sequence of a gene can be analyzed to give the amino acid
sequence of the protein product. In that aspect alone, the ready
availability of DNA sequences of genes and whole genomes from the
1980s onward revolutionized biology, as it opened up this vital
shortcut to determining the amino acid sequence of virtually any
protein.

• Bioinformatics uses this sequence information to find related proteins
and thus gather together knowledge that can help deduce the likely
properties of unknown proteins, plus their structures and functions.

• Knowing the relationship between a protein’s structure and its
function provides a greater understanding of how the protein works,
and thus often enables the researcher to propose experiments to
explore how modifying the structure will affect the function.
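The step from a gene's DNA sequence to the amino acid sequence of its product can be sketched in a few lines. The example below is a minimal illustration, assuming the standard genetic code, a coding-strand sequence that is already in frame, and no introns; the function name `translate` is chosen here for illustration and is not from the slides.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding DNA sequence into one-letter amino acids.

    Translation stops at the first stop codon. Trailing bases that do not
    fill a whole codon are ignored. Characters other than A, C, G, T raise
    a KeyError.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCTGA"))  # ATG = M, GCC = A, TGA = stop
```

Real genes need more care (finding the correct reading frame, handling introns and alternative genetic codes), so this is only a sketch of the idea.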
Primary and Secondary Structure
• A protein folds into a three-dimensional
structure, which is determined by its
protein sequence. The fold of the protein
consists of repeating structural units
called secondary structures, which will be
discussed in this section (see Flow
Diagram).

• The fold of the protein is very important
for the way the protein will function, and
whether it will function correctly.

• Therefore, studying and understanding how
proteins fold, and predicting the fold of a
protein from its sequence, are important
areas of bioinformatics.
Protein structure can be considered on
several different levels
• The analysis of protein structure by experimental techniques such as
X-ray crystallography and nuclear magnetic resonance (NMR) has
shown that proteins adopt distinct structural elements.

• In general there are four levels of protein structure to consider.

• The primary structure is the protein sequence, the types and order of
the amino acids in the protein chain.

• The secondary structure is the first level of protein folding, in which
parts of the chain fold to form generic structures that are found in all
proteins.

• The tertiary structure is formed by the further folding and packing
together of these elements to give the final three-dimensional
conformation unique to the protein.

• Many functional proteins are formed of more than one protein chain,
in which case the individual chains are called protein subunits. The
subunit composition and arrangement in such multisubunit proteins
is called the quaternary conformation.
• The structure adopted by a protein chain, and
thus its function, is determined entirely by its
amino acid sequence, but the rules that govern
how a protein chain of a given sequence folds
up are not yet fully understood, and predicting
the folded structure of a protein de novo from
its amino acid sequence alone remains difficult.

• Note: there are several studies on this, including
recent ones.

• Helping to solve this problem is one of the
challenges facing bioinformatics.
Amino acids are the building blocks of
proteins
• Proteins are made up of 20 types of naturally occurring amino acids,
with a few other amino acids occurring infrequently.

• These 20 amino acids consist solely of the elements carbon (C),
nitrogen (N), oxygen (O), and hydrogen (H), with the exception of
cysteine and methionine, which also contain sulfur (S).

• The structure of an amino acid can be divided into a common main
chain part and a side chain that differs in chemical structure among
the different amino acids. The side chain is attached to the main
chain carbon atom known as the α-carbon (Cα).
• Diagram of an amino acid.
• (A) shows the chemical structure
of two amino acids, where R
represents the side chains, which
can be different as shown in (B).

• The amino acid consists of a
central Cα atom with a main
chain N and C at either side of it.
The C is bonded to an O with a
double bond.
The differing chemical and physical properties
of amino acids are due to their side chains
• The functional properties of proteins are almost entirely due to the
side chains of the amino acids. Each type of amino acid has specific
chemical and physical properties that are conferred on it by the structure
and chemical properties of its side chain.

• The amino acids can, however, be classified into overlapping groups that
share some common physical and chemical properties, such as size and
electrical charge.
• The smallest amino acid is glycine, which has
only a hydrogen atom as its side chain. This
endows it with particular properties such as
great flexibility.

• The other extreme of side-chain flexibility is
represented by proline, an amino acid that
has a side chain bonded to the main-chain
nitrogen atom, resulting in a rigid structure.
• Some amino acids have nonpolar, uncharged side
chains and these are generally hydrophobic
(not liking water, and therefore tending to be
buried within the protein, surrounded by
other hydrophobic amino acids), while
others are polar or are positively or negatively charged.

• The charged or polar amino acids are
hydrophilic; they like to be surrounded by
water molecules with which they can form
interactions.
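One common way to put numbers on the hydrophobic/hydrophilic distinction is a hydropathy scale. The sketch below uses the Kyte-Doolittle scale, which is a choice made here and is not mentioned in the slides; the function names and the sign-based classification are illustrative simplifications.

```python
# Kyte-Doolittle hydropathy values (positive = more hydrophobic).
# The scale is an assumption of this sketch, not part of the slides.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def classify(residue: str) -> str:
    """Crude two-way split: positive hydropathy counts as hydrophobic."""
    return "hydrophobic" if KYTE_DOOLITTLE[residue.upper()] > 0 else "hydrophilic"


def mean_hydropathy(sequence: str) -> float:
    """Average hydropathy over a sequence of one-letter residue codes."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


for aa in "IVLKDE":
    print(aa, classify(aa))
print(mean_hydropathy("IVL"), mean_hydropathy("KDE"))
```

A stretch with a high mean hydropathy is more likely to be buried in the protein core, while a strongly negative stretch is more likely to be exposed to water. A sign-based split is only a rough guide.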
• As there are 20 distinct amino acids that occur in proteins, there can
be 20^n different polypeptide chains of length n.

• For example, a polypeptide chain 250 amino acids in length will be
one of more than 10^325 alternative different sequences.

• Clearly, the sequences that do occur are only a tiny fraction of those
possible. Often only a few sequence modifications are needed to
destabilize the three-dimensional conformation of a protein, and so it
is probable that the majority of these alternative sequences will not
adopt a stable conformation.
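The size of this sequence space can be checked directly, since Python integers have arbitrary precision. The sketch below uses the formula from the slide (20 choices at each of n positions); the function name `count_sequences` is illustrative.

```python
import math


def count_sequences(length: int, alphabet_size: int = 20) -> int:
    """Number of distinct chains of the given length over the alphabet."""
    return alphabet_size ** length


n = 250
total = count_sequences(n)

# Compare with the bound quoted in the slide, and show the exponent:
# log10(20^n) = n * log10(20).
print(total > 10 ** 325)
print(n * math.log10(20))
```

Since log10(20) is about 1.301, a chain of 250 residues has an exponent of about 325.26, which is where the "more than 10^325" figure comes from.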
Amino acids are covalently linked together in
the protein chain by peptide bonds
• The primary structure of a protein is the sequence of
amino acids in the linear protein chain, which consists of
covalently linked amino acids. This linear chain is often
called a polypeptide chain.

• The amino acids are linked by peptide bonds, which are
formed by a condensation reaction (the loss of a water
molecule) between the backbone carboxyl group of one
amino acid and the amino group of another.

• When linked together in this way, the individual amino
acids are conventionally called amino acid residues.

(Figure: amino acid structure)
• Peptide bonds.
• (A) gives the chemical
formulae of the peptide bond
that is formed between amino
acids to make a polypeptide
chain.
• (B) illustrates the above in a
diagrammatic form.
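Because each peptide bond is formed with the loss of one water molecule, a chain of n residues has lost n - 1 waters relative to its free amino acids. The sketch below turns that bookkeeping into a mass estimate. The approximate average molecular masses for glycine, alanine and serine are values supplied here for illustration, and the table is deliberately incomplete.

```python
WATER_MASS = 18.015  # approximate average mass of H2O, in daltons

# Approximate average masses of the FREE amino acids, in daltons.
# Illustrative subset only; extend the table before real use.
FREE_AMINO_ACID_MASS = {
    "G": 75.07,   # glycine
    "A": 89.09,   # alanine
    "S": 105.09,  # serine
}


def peptide_mass(sequence: str) -> float:
    """Mass of a linear peptide: free amino acids minus one water per bond."""
    if not sequence:
        return 0.0
    free_total = sum(FREE_AMINO_ACID_MASS[aa] for aa in sequence.upper())
    peptide_bonds = len(sequence) - 1
    return free_total - peptide_bonds * WATER_MASS


print(peptide_mass("G"))    # a single amino acid: no bond, no water lost
print(peptide_mass("GA"))   # dipeptide: one water lost
print(peptide_mass("GAS"))  # tripeptide: two waters lost
```

This is also why the units in a chain are called residues: each is what remains of an amino acid after the water has been removed.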
Implication for Bioinformatics
• In part, bioinformatics concerns itself with the analysis of protein
sequence to predict the secondary structure, the tertiary structure,
and the function of the protein, as well as its relationship to other
proteins.

• Different secondary structures tend to have subtle differences in
chemical environments, resulting in amino acid preferences.

• In addition, amino acid preferences are seen at particular locations in
proteins due to the functional role they play, for example as catalytic
residues or stabilizing the overall protein structure.
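An amino acid preference for a secondary structure can be expressed as a propensity: how often a residue type occurs in that structure relative to how often it occurs overall. The sketch below computes this from a sequence and a matching per-residue structure string. The one-letter state labels (`H` for helix, `C` for coil) and the toy input are assumptions made for illustration, not data from the slides.

```python
from collections import Counter
from typing import Dict


def propensities(sequence: str, structure: str, state: str = "H") -> Dict[str, float]:
    """Propensity of each residue type for one secondary-structure state.

    propensity = (fraction of the state's residues that are this type)
                 / (fraction of all residues that are this type)
    Values above 1 mean the residue is over-represented in that state.
    """
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    overall = Counter(sequence)
    in_state = Counter(aa for aa, ss in zip(sequence, structure) if ss == state)
    n_state = sum(in_state.values())
    if n_state == 0:
        return {}
    total = len(sequence)
    return {
        aa: (in_state[aa] / n_state) / (overall[aa] / total)
        for aa in overall
    }


# Toy input: alanine only in helix, glycine only in coil.
print(propensities("AAGG", "HHCC", state="H"))
```

With real data the counts would come from many proteins of known structure; a handful of residues, as here, only demonstrates the arithmetic.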
Evolution has aided sequence analysis
• Protein sequence similarity is a
powerful tool for characterizing
protein function and structure since
an enormous amount of information
is conserved throughout the
evolutionary process.
• Proteins that have a common
ancestor are referred to as being
homologous.
• Sequence alignment and database search techniques can identify
homologous proteins.

• Homologous proteins usually have a similar three-dimensional
structure with related active sites and binding domains. Therefore
homologous proteins will also often have related functions, although
this is not always the case.

• Most amino acids that change during evolution are found in regions
that are not structurally or functionally important, such as many of
the loop (or variable) regions.

• If the homologous protein is also functionally related then the amino
acids involved in function are often conserved during evolution,
which helps in identifying the function of a new protein.
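Once sequences have been aligned, similarity and conservation are easy to measure. The sketch below assumes the alignment has already been produced by some alignment tool, with `-` as the gap character; the function names and the definition of identity used (matches divided by columns where both sequences have a residue) are choices made for this illustration, and other definitions are in use.

```python
from typing import List


def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent of identical residues over columns where neither row has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


def conserved_columns(alignment: List[str], gap: str = "-") -> List[int]:
    """0-based indices of columns where every sequence has the same residue."""
    return [
        i
        for i, column in enumerate(zip(*alignment))
        if len(set(column)) == 1 and column[0] != gap
    ]


print(percent_identity("ACDE", "ACDF"))
print(conserved_columns(["ACDE", "ACDF", "GCDE"]))
```

Columns that stay identical across many homologs are candidates for functionally or structurally important residues, which is the reasoning described in the bullets above.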
Visualization and computer manipulation of
protein structures
• There are a number of programs available that read the coordinate
file and convert it to a visible three-dimensional representation of the
protein. The protein can be rotated, specific regions highlighted, and
some measurements can be calculated.
• Some of these programs are very powerful and can be of great use in
analyzing the structural properties and molecular function, as well as
allowing for the manual modification of the molecule.
• Some of the programs are free or low cost, such as Chimera, Yasara,
and DeepView. Others are extremely powerful programs that allow
the user to carry out computationally intensive modifications to the
molecule, but are expensive.
• Molecular representations.
• The different representations that can
be used to illustrate molecules, from
very simple ones that only use the Cα
or backbone atoms to spacefilling
models of all atoms in the structure.
• There are many styles for viewing molecular structures, including
those with atomic-level detail such as space-filling models, ball and
stick models, and wireframe models (also called stick models or
skeletal models), as well as surface models.

• However, it is often desirable to have a simplified model of the
protein, such as backbone or Cα models and schematic (cartoon)
models. Such models can be displayed on a computer screen in
different styles and colors.

• Molecular models are usually based upon an atomic coordinate file,
which in general gives the (x,y,z) coordinates of each atom.
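To make the idea of a coordinate file concrete, the sketch below reads ATOM records in the fixed-column PDB text format and extracts the Cα coordinates needed for a simple backbone trace. The `Atom` type and function names are illustrative; only a few fields are parsed, and HETATM records, alternate locations and multiple models are ignored.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Atom(NamedTuple):
    name: str       # atom name, e.g. "CA" for the alpha carbon
    res_name: str   # three-letter residue name, e.g. "GLY"
    chain: str      # chain identifier
    res_seq: int    # residue sequence number
    x: float
    y: float
    z: float


def parse_atoms(lines: Iterable[str]) -> List[Atom]:
    """Parse ATOM records using the PDB fixed-column layout."""
    atoms = []
    for line in lines:
        if not line.startswith("ATOM"):
            continue
        atoms.append(
            Atom(
                name=line[12:16].strip(),      # columns 13-16
                res_name=line[17:20].strip(),  # columns 18-20
                chain=line[21],                # column 22
                res_seq=int(line[22:26]),      # columns 23-26
                x=float(line[30:38]),          # columns 31-38
                y=float(line[38:46]),          # columns 39-46
                z=float(line[46:54]),          # columns 47-54
            )
        )
    return atoms


def ca_trace(atoms: Iterable[Atom]) -> List[Tuple[float, float, float]]:
    """Coordinates of the alpha carbons only, in file order."""
    return [(a.x, a.y, a.z) for a in atoms if a.name == "CA"]
```

A typical use would be `ca_trace(parse_atoms(open(path)))`, where `path` is a hypothetical local PDB file. Visualization programs do essentially this before drawing a Cα or backbone model, and then apply their own rendering.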
• ChimeraX demonstration

• 2bbv.pdb (black beetle virus, an RNA virus; the black beetle is Heteronychus arator)
