CADD Unit 4 2
CADD Unit 4 2
INFORMAT
wystems to
store, organize and and nprocems lata,
or the use of
conputer
care industry, especiall.
,
lustry, cspecially when
Informatics,
convenience
to the health
efficiency and of pharmaceuticals.
brings and management
comes
to the tracking
informatics is
defined by continuum of i Cuen
within the
Pharmacy
medication-related
data and knowledge
use
thcare
and disseminatione
on
its acquisition,
storage, analysis, the
including care and health outcomes
systems palient
delivery of optimal me<dicalion-related
Pharmacists issued a practice d e .
called the
The Amerncan
Statement of Health-System
Soxietyon the Pharmaclst's Role in Informaties. The su ormati
The staterement
document
pharmaCy inf nenN
of the pharmacist and the
the respmsibilities broad area
society outlines five hroad
reaffimed
informatics. The areas of
healthcare
in
for this role:
responsibility
)Informmation management
2)Knowledgedelivery
3) Data analytics
4) Chnical intormatics
5) Change management
7.1.1. Bioinformatics
methods and software
Bioinfomatics is interdisciplinary field that develops
an
7.1.1.1. Aim
ngeneral,
there are three aims of bioinfo
formatics
efirst aim of bioinformatics is to store the biological data organised in
) Of a database. This allows the researchers an easy access to existing
and submits new entries. These data must be annoted
formation a
to give
able meaning or to assign its functional characteristics. The databases
talso be able to correlate between different hierarchies of information
must
1.1.1.2. Goal
means to
of bioinformatics thus are to provide scientists with a
The goals
explain:
)Normal biological processes.
to diseases.
2) Malfunctions in these processes which lead
3) Approaches to improving drug discovery.
within bioinformatics:
There are three important sub-disciplines with which to assess
The development of new algorithms and statistics
) data sets;
relationships among members of large nucleotide
and interpretation of various types of data including
2) The analysis
domains, and protein structures;
and amino acid sequences, protein
of tools that enable efficient
access
availability Biological
online Database: reached.
stage of
biological was one of Developing biological
data is in form of research, but now by the primary
concerns at
be known
text, table and having many biological database
initia
that how to pictures and many other
retrieve exact ana
may be of text data from a formats. It
should
data retrieval retrieval,
importance. sequence retrieval or it suitable database.
may also include Database
structural
4) Sequence Alignment and
with compare to other Database Searching:
relevant and similar
biological research to understand Alignment of sequence
sequence is
predict structure and function basedrelation between two very much needed in
sequences and also to
alignment of sequences use of BLAST ison sequence similarity. For basic
sequences involved in very common. Based on
sequencing, these alignments can be number of
pairwise alignment or multiple sequence classified into
5) Predictive Methods using DNA alignment.
classified into three major Sequences: Gene-finding strategies can be
i) categories:
Content-based methods rely on the overall bulk
in making determination. properties of a sequence
Characteristics considered
often particular codons are used, the here include how
periodicity of repeats. and the
compositional complexity of the sequence. Because different
use
synonymous codons with different frequency, such dues can organisms
insight into determining regions that are more likely to be exons. provide
11) In site-based methods, the focus turns to the presence or absence of a
specific sequence, pattern, or consensus. These methods are used to
detect features such as donor and acceptor splice sites,
binding sites for
transcription factors, polyA tracts, and start and stop codons.
l1) Comparative methods can make determinations based on
sequence
homology. Here, translated sequences are subjected to database searches
against protein sequences to determine whether a previously
characterised coding region corresponds to a region in the query
Sequence. Although this is conceptually the most straightforward of the
methods, it is restrictive because most newly discovered genes do not
have gene products that match anything in the protein databases. Tools
associated with these are Grail, Genscan, Fgenes, procrustes and many
others developed with bioinformatics.
118 Computer Aided Drug Desip
esign
6) Predictive Methods Using Protein Sequences: There are tools based
predictive methods using protein sequences, such as PSLPred, NRn
PSEAPred. There are other methods also based on motif level, residue
signal level, peptide level, domain level and profile based. level,
7) Sequences Assembly and Finishing Methods: At present. present,
he
sequencing process is often talked of as consisting of two parts, name
assembly and finishing. but in practice there is considerable ove
between the two. Assembly is the process of attempting to order and
the readings, and finishing is the task of checking and editino
assembled data. the
This includes performing new sequencing experiments to fill gaps or to COve
segments where the data is poor and adjudicating between conflici
readings when editing. ting
8) Phylogenetic Analysis: Phylogenetic analysis is also one of the important
implementation of bioinformatics in biological research. Phylogenetic
analysis is study of ancestral history of an organism. Here after sequence
and structural similarities, it should be tried to relate organism's ancestral
history to show how origin of organism was related to each other and
what was order of evolution".
efinitions
e r d i n R to FA
Brown, "The Use of
r has become a critical inforation technology ana
4 part of the drug discovery process.
The'moir ormatics is the miXing ot those inlormation resources to transform
a t a ntO rmation and information into
infc
ccording
i n g to M.
to M. Hann, R Green, "Chemoinformatics a new name for an old
y v h e m
ording
ding to G. Paris, "Chemoinformatics is a generic term that encompasses
on,
design, creation
organisation, management, retrieval, analysis, dissemination,
lal1sation and use of chemical
information."
oarding to J.Gastegeir et.al., "Chemoinformatics is the application of
icco
Knowledge Abstraction
Context
Information
Measurements,
Calculations
Data
7.1.2.1. Objective
Temoinformatics should assist the chemist to solve some of following
tundamental problems:
design molecules with desired properties, the major task of Chemist
o make a is
i n f o r m a t i o n
eeded for drug
discovery.
Some of the
moinformatics
research are:
data.
1)
Analysis ofHTS chemicals.
search
Similarity
2) c o m b i n a t o r i a l
libraries.
of
3) Design libraries.
focused
of of libraries.
4) Design similarity/diversity
of the
Comparison
5)
6) Virtual screening.
7) Docking.
n o v o design.
8) De
9) Pharmacophore Perception.
and pharmacn.
physicochemical
properties kineic
of affinities,
10) Prediction
properties.
models which
can be interpreted and guide the
Establishment of QSAR
11) drug.
of a new
further development
Chemoinformatics
of chemical data,
produced by increase in million every year)
by
the number may
are known and
compounds
database.
using a proper
for knowledge extracthon
needs a novel technique
Also, the field of chemistry structure of r
between the
relationships
from data to model complex influence of reac
activity or the o
chemical compound and biological
Chemoinformatics has wider rang SOme
condition on reactivity.
chemical
chemoinformatics
n
influence if
application and Figure 7.2 shows
specific research areas.
Environmental
effects and
Analysis and Hazards Spectroscopy
Modelling
Pharmacology
Toxicology
Regulations
In traditional drug discovery phase, the process which cost more time and
noney is replaced with lead identification and lead optimisation proce
modern drug discovery system. Each phase has an interaction component that
ransfers data, knowledge and information to one another (shown in figure 7.4).
Computer AidedDrug Desgn
122
Lead
Target Identification Lead Optimisation Pre-Clinial
Disease
Identification
and Validation
Identification
Trails
CDSCO
Approval from
Drug Authority
Clinical Trials
and
Modern Drug Discovery
Flgure 7.3:
Development Life Cycle
Lead HTS
Optimisation
Chemoinformatics
Compound ADME
design
Interaction Process
Figure 74:
Structure Chemmetrics
CASD
Representation
Figure 7.5
mainly where
1) Computer-Assisted Synthesis Design (CASD): It is applied mainiydi
d in
artificial intelligence technique can be applied. This technique is appxile
textile
cal and
various clinica analysis purpose. Structure epresentation deals
ction Representation, Structure Descriptors, Molecul Modelling.
t o r
with Reactic
Connection Table
12 3456 7 8 910 11
0 1 00 0 0 0 0 0 2 0
1 0 2 0 0 |0 0 0 00 0
0 2 0|1|0||0|0|1|00 0
4
0 0 1|0| 0 | 0 0|0|0 0
0 0 0|1|02|1|0 0|0
6 0 0 0 0 2 0 0 0 |0
7 0 0 0 0|1|0 0 0 00
80 0 1 | 0 | 0 0 0|0 2 0
90 0 0 000 0 2 0
10 2 0 0 00 0 0 0
11 0 0 o 0 o 0 00|ol
OH]clecccl Fragment Code
0001001 10100111 Fingerprint
$244987098423150 Hash Code
Reaction Representation helps to understand the basic chemical models,
quantify chemical reactivity and extract knowledge from the reaction
ntormation. Molecular modelling is a method includes a variety of
computational schemes which are aimed at stimulating molecular
Structures, their properties and "in-silico" behaviour.
CASD Structure
Representation
L
Suructure
Reaction Molecular Structure
Searching
CASE Deseriptors
Representation Modelling