Essentials of Bioinformatics, Volume I Understanding Bioinformatics Genes To Proteins Full-Resolution Download
Essentials of Bioinformatics, Volume I Understanding Bioinformatics Genes To Proteins Full-Resolution Download
Visit the link below to download the full version of this book:
https://medipdf.com/product/essentials-of-bioinformatics-volume-i-understanding-
bioinformatics-genes-to-proteins/
vii
viii Foreword
Kaiser Jamil
School of Life Sciences and Centre for Biotechnology and Bioinformatics
Jawaharlal Nehru Institute of Advanced Studies
Hyderabad, India
Preface
ix
x Preface
straightforward to use and only demand simple inputs like nucleotide or amino acid
sequence and protein structures to analyze and return the output files.
Since majority of the chapters in this book are prepared by scientists, who utilize
different bioinformatics tools in their day-to-day research activities, we believe that
this book will mainly help young biologists keen to learn new skills in bioinformat-
ics. Our authors have taken care in simple presentation of chapter contents, easier
enough for any practicing molecular biologist to independently employ bioinfor-
matics methods in their regular research tasks without consulting computational
experts. The reader of this book is expected to be familiar with fundamental con-
cepts in biochemistry, molecular biology, and genetics. Our authors have consulted
numerous original articles and online reading materials to provide the most updated
information in preparing their chapters. Although we have taken a care to cover
major bioinformatics tools, but in case if any important tool is missed out, then it is
only due to space limitation but not a bias against any particular software program.
The chapters in this book mainly covers the introduction to bioinformatics, intro-
duction to biological databases, sequence bioinformatics, structural bioinformatics,
functional bioinformatics, computer-aided drug discovery methods, and some spe-
cial concepts like in silico PCR and molecular modeling. A total of 17 chapters are
included in this book, and most of them are relatively independent from each other.
All the chapters in each section are arranged in a logical manner where one chapter
acts as follow-ups to the next one. Since this book is basically meant for practicing
molecular biologists, few selected molecular or mathematical formulas, which are
prerequisites for understanding the corresponding concepts, are used. A general dis-
cussion about computational program is often described along with its web links.
The conclusion part is provided at the bottom of each chapter to refresh the under-
standing of readers.
We sincerely thank the Princess Al-Jawhara Center of Excellence in Research of
Hereditary Disorders (PACER-HD) and Department of Genetic Medicine, Faculty of
Medicine and Department of Biology, Faculty of Science at King Abdulaziz
University (KAU) for providing us the opportunity to teach and train the bioinformat-
ics course, because of which we have been able to bring up this book. We thank Prof.
Jumana Y. Al-Aama, director of the PACER-HD, KAU, for her excellent moral and
administrative support in letting us involved in teaching and training the bioinformat-
ics to young minds in biology and medicine. We also thank Dr. Musharraf Jelani,
Dr. Nuha Al-Rayes, Dr. Sheriff Edris, and Dr. Khalda Nasser our colleagues in the
PACER-HD for their wonderful friendship. We would also like to thank the chairman
of the Department of Biological Sciences, Prof. Khalid M. AlGhamdi, and the head
of Plant Sciences section, Dr. Hesham F. Alharby, for providing us the valuable sug-
gestions and encouragement to complete this task. Last but not the least, we would
like to acknowledge the support of Springer Nature publishing house for accepting
our book proposal, their regular follow-up, and the final publication of this book.
xi
xii Contents
Index�������������������������������������������������������������������������������������������������������������������� 389
About the Editors
xiii
xiv About the Editors
Contents
1.1 Introduction 1
1.2 The Role of Bioinformatics in Gene Expression Data Analysis 4
1.3 The Role of Bioinformatics in Gene/Genome Mapping 5
1.4 Role of Bioinformatics in Sequence Alignment and Similarity Search 6
1.5 Contribution of Bioinformatics toward Modern Cancer Research 9
1.6 The Domain of Structural Bioinformatics 11
1.7 Bioinformatics Processing of Big Data 12
1.8 Conclusion 14
References 14
1.1 Introduction
B. Banaganapalli (*)
Princess Al-Jawhara Center of Excellence in Research of Hereditary Disorders,
Department of Genetic Medicine, Faculty of Medicine, King Abdulaziz University,
Jeddah, Saudi Arabia
e-mail: [email protected]
N. A. Shaik
Department of Genetic Medicine, Faculty of Medicine, King Abdulaziz University,
Jeddah, Saudi Arabia
e-mail: [email protected]
process, and in the recent years, a wide range of sophisticated bioinformatics meth-
ods, techniques, algorithms and tools which immensely contributed toward extract-
ing the knowledge from major biological data repositories are developed. All of
these developments have potentially contributed towards the progression of biologi-
cal research in general but also facilitated the progress of new scientific domains
such as molecular medicine, targeted gene therapy, and in silico drug design
approaches to name a few. State-of-the-art bioinformatic programs used in high-
throughput sequence analysis allows researchers to accurately monitor and identify
the minute genomic alterations on genes, and these kinds of endeavors result in the
generation of massive amounts of biological data that requires highly efficient analy-
sis (Goldfeder et al. 2011; Yang et al. 2009). This once again highlights the advantage
of bioinformatics over traditional molecular techniques where one can study only a
single gene at one time (Jorge et al. 2012; Blekherman et al. 2011; Kihara et al. 2007).
From a biological data analysis perspective, bioinformatics is a robust set of data
mining paradigms that help convert textual data into human perceivable knowledge.
In every domain of scientific research, computing technologies have virtually
become indispensable and the situation is no different in case of bioinformatics
(Akalin 2006; Bork 1997; Brzeski 2002). Biological sequencing machines generate
tremendous volumes of data, and it has become critically important to archive, orga-
nize, and process such data using powerful computational methods efficiently to
extract the knowledge. Similarly, computational simulation of complex molecular
processes can be an asset as it allows researchers to gain rapid insights in silico
without the need for time consuming experimentations (Akalin 2006).
Some of the key contributions of the discipline of bioinformatics include (Akalin
2006):
• Conceptualization, design, and development of biological relational databases
for archiving, organizing, and retrieving biological data.
• Development of cutting-edge computer algorithms to model, visualize, mine and
compare biological data.
• Highly intuitive and user-friendly curation of biological data that will help bio-
logical researchers who lack IT knowledge to derive useful representation of
information.
Bioinformatic tools and techniques have become virtually ubiquitous in modern
biological research. From a molecular biology perspective involving microarrays
and sequencing experiments, bioinformatics techniques can be used for efficient
analysis of raw transcript signals and sequence data. In the field of genetics and
genomics, bioinformatics can assist in the analysis of sequencing data, annotation
of genomic landmarks, and identification of genetic mutations that could have a
direct correlation with disease pathology (Batzoglou and Schwartz 2014; Yalcin
et al. 2016; Zharikova and Mironov 2016). Furthermore, bioinformatics data mining
tools can also be highly effective in the analysis and extraction of knowledge from
biological literature so that sophisticated gene ontologies could be conceptualized
for future query and analysis. Bioinformatics tools and algorithms also find
application in the field of proteomics such as the analysis of protein expression and
its regulation. Bioinformatics can contribute toward the analysis of biological
4 B. Banaganapalli and N. A. Shaik
pathways and molecular network analysis from a systems biology approach (Hou
et al. 2016). In the field of structural biology, bioinformatics tools can help in the in
silico simulation and modeling of genomic and proteomic data that can help
researchers better understand the key molecular interactions. A key domain where
bioinformatics has immensely contributed is, in the field of biomedicine. It will not
be wrong to comment that every human disease is somehow connected to a genetic
event. In this context, the complete draft of the human genome has greatly helped in
the mapping the disease-associated genes and elucidation of their molecular func-
tion. As a result it has become possible for researchers to gain comprehensive
insights into pathogenesis at the cellular level, thus creating grounds for the devel-
opment of effective therapeutic interventions. Through the use of the state-of-the-art
bioinformatics and computational tools, it has now become possible to simulate,
identify, and establish potential drug targets that will have much greater efficacy
against diseases with minimal side effects. With the advent of different bioinformat-
ics paradigms, it has now become possible to carry out analysis of an individual’s
genetic profile leading to the innovative concept of personalized medicine. In
domains such as agriculture, bioinformatics tools can be used to alter the genomic
structure so that there is an increase in the resistance of crops toward different plant
pathogens and insects (Bolger et al. 2014; Edwards and Batley 2010).
bioinformatics has become critical with regard to analyzing and deciphering the micro-
array data to extract the desired information. Bioinformatics data mining tools and
work pipelines such as cluster analysis, and heatmap show high efficacy in microarray
data analysis (Bodrossy and Sessitsch 2004; Loy and Bodrossy 2006). A typical micro-
array experiment workflow is presented in Fig. 1.2 (Macgregor and Squire 2002).
Fig. 1.3 Illustration of large-scale mapping of a gene by genetic mapping and physical mapping
the need for genome maps. These maps allow identification of the key genetic fea-
tures that could have major therapeutic significance (Brown 2002).
In silico gene mapping (Schadt 2006), uses the publicly available genomic data-
bases to discover and map genes on the genome. In silico gene mapping techniques
using bioinformatic applications have been very successful in identifying or map-
ping QTLs (quantitative trait loci) that help understand the molecular pathways
associated with the pathogenesis of different polygenic diseases (Burgess-Herbert
et al. 2008). The excellent and informative BAC (bacterial artificial chromosome)
clone contig map for Sus scrofa is achieved with help of in silico mapping
approaches. In silico allows efficient and fast mapping of genes, which may specifi-
cally influence resistance or susceptibility of a particular animal towards a specific
disease. Moreover the availability of excellent computational resources, public
genomic and phenotypic databases like UniGene and GenBank, and tools like
BLAST considerably accelerates the mapping process, when done in silico as com-
pared to the conventional methodologies that are more labor intensive, costly, and
time consuming (Schadt 2006). The simple workflow of the gene mapping tech-
nique is illustrated in Fig. 1.3.