Class03-What Is bioinformatics-2022-SIV2001
SIV 2001
Class 3
What is bioinformatics
Biology
• Science that deals with living organisms and life processes.
• Plant, animal and microbial life of a particular region or environment.
• Properties and vital phenomena exhibited by an organism or group of
organisms.
• Elements, processes and interactions in living beings
26/10/2022
Computer
Information Science
Information Theory
Knowledge
• Knowing something with familiarity gained through
experience or association
• Understanding of a science, art, or technique
• Being aware of something
• Apprehending fact through reasoning
• Being learned.
Summary
• Data are the facts.
Bioinformatics - definitions
• Bioinformatics (NIH):
“research, development, or application of computational
tools and approaches for expanding the use of biological,
medical, behavioral or health data, including those to
acquire, store, organize, analyze, or visualize such data.”
What is bioinformatics?
[Figure: bioinformatics at the intersection of biology, molecular biology, chemistry, medicine, mathematics, physics, statistics, computer science and informatics]
Why Bioinformatics
Bioinformatics history
• Earliest bioinformatics exercise: Margaret Dayhoff (1965) compiled the
first protein sequence database, the Atlas of Protein Sequence and
Structure (now PIR).
• 1970s:
• Protein Data Bank (PDB), established in 1971 with a collection of
seven X-ray crystallographic protein structures.
• Protein Sequence Database (PSD) by Margaret Dayhoff
• Sequence alignment algorithm Needleman-Wunsch
• Sanger sequencing
• Routine sequence comparisons and database searching
• Protein structure prediction algorithm Chou and Fasman
The 1980s saw the establishment of GenBank and FASTA; BLAST followed in 1990.
Bioinformatics history
• 1980s:
• Human Genome Project started late 1980s
• The polymerase chain reaction (PCR) was described by Kary Mullis and
co-workers
• The Smith-Waterman algorithm for sequence alignment
• The SWISS-PROT database
• The FASTP algorithm was published by Lipman & Pearson
• The National Center for Biotechnology Information (NCBI)
was established
• 1990s:
• The BLAST program by Altschul et al. (1990)
• 2000 and beyond:
• A draft of the human genome (3,000 Mbp) was published
As of 15 June 2019
Bioinformatics approaches
1. Data acquisition
2. Data management/storage
3. Data retrieval
4. Data analysis/interpretation
5. Data compilation
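The five steps can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and not part of the lecture material: the toy FASTA text, the function names, and the choice of GC fraction as the "analysis" step. A real workflow would read from a sequencer or a public database and store records in a proper database system.

```python
from collections import Counter

# Hypothetical toy data standing in for sequencer or database output.
RAW_FASTA = """>seq1 toy example
ATGGCGTACGTTAGC
>seq2 toy example
GGGCCCATATAT
"""

def acquire(text):
    """1. Data acquisition: parse FASTA text into (identifier, sequence) pairs."""
    records, name, chunks = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks)))
            # Keep only the first word of the header as the identifier.
            name, chunks = line[1:].split()[0], []
        else:
            chunks.append(line.upper())
    if name is not None:
        records.append((name, "".join(chunks)))
    return records

def store(records):
    """2. Data management/storage: index the records by identifier."""
    return dict(records)

def retrieve(database, identifier):
    """3. Data retrieval: look up one sequence by its identifier."""
    return database[identifier]

def analyse(sequence):
    """4. Data analysis: sequence length and GC fraction."""
    counts = Counter(sequence)
    gc = (counts["G"] + counts["C"]) / len(sequence) if sequence else 0.0
    return {"length": len(sequence), "gc_fraction": gc}

def compile_report(database):
    """5. Data compilation: gather per-sequence results into one table."""
    return {name: analyse(seq) for name, seq in database.items()}

database = store(acquire(RAW_FASTA))
report = compile_report(database)
for name, stats in report.items():
    print(name, stats["length"], round(stats["gc_fraction"], 2))
```

Each function maps to one numbered step, so any step can be swapped out (for example, replacing the in-memory dictionary with a database) without touching the others.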
Biological information
Molecular Sequences
• Nucleic acid
• Amino acid
• Sequencing Technology
• Sanger sequencing
• NGS
• 3rd generation sequencing
• Molecular Sequence Analysis
• Sequence alignments
• Variant (SNPs) analysis
• Comparative genomics
• Genome Annotation
• Open reading frame
• Functional sites
• Genome Viewer
• ...
[Figure panels: gene prediction, genome viewer/browser, conservation pattern, comparative genomics]
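As one concrete instance of genome annotation, an open reading frame (ORF) can be located by scanning for a start codon followed in the same frame by a stop codon. The sketch below is a simplified illustration, not a production gene finder: it assumes the standard start codon ATG, scans the forward strand only, and uses a hypothetical function name and `min_codons` parameter.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(dna, min_codons=2):
    """Find ORFs in the three forward reading frames.

    Returns a list of (start, end, frame) tuples with 0-based start and
    exclusive end; the slice dna[start:end] includes the stop codon.
    min_codons is the minimum number of codons before the stop codon,
    counting the start codon itself.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Step through the sequence one codon at a time in this frame.
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # resume searching after the stop codon
    return orfs

sequence = "AAATGGCCTAAGG"
for start, end, frame in find_orfs(sequence):
    print(frame, start, end, sequence[start:end])
```

A fuller implementation would also scan the reverse complement and handle alternative start codons; those are left out here to keep the idea visible.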
Biological information
3D structure
• Double helix DNA
• RNA structure
• Protein x-ray crystallography
• 3D structure prediction
Biological information
Biological functions
• Pathway analysis
• Network analysis
• Gene Ontology
• Protein-protein interaction
• Proteomics
• Metabolomics
• Molecular modeling
• Molecular dynamics
• Phylogenetics
• Evolutionary relationships
• Drug design
• Vaccine design
• ...
Bioinformatics approaches
1. Data acquisition
2. Data management/storage
3. Data retrieval
4. Data analysis/interpretation
5. Data compilation
Input: Biological information (data)
→ BIOINFORMATICS →
Output: New biological information (new data and knowledge)
Main players
Organization Database
https://www.nih.gov/
Institutes at NIH
https://www.ncbi.nlm.nih.gov/
NCBI resources
• Databases:
• PubMed
• Bookshelf
• Sequence Read Archive (SRA)
• Online Mendelian Inheritance in Man (OMIM)
• ClinVar
• BioSystems
• Nucleotide Database
• GenBank
• Reference Sequence (RefSeq)
• Database of Short Genetic Variations (dbSNP)
• Tools
• 1000 Genomes Browser
• Basic Local Alignment Search Tool (BLAST)
• Genome Data Viewer (GDV)
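Besides the web pages, NCBI databases can be queried programmatically through the Entrez E-utilities service, which is not covered in these slides and is mentioned here only as an illustration of data retrieval. The sketch below builds a search URL for the `esearch` endpoint without sending any request. The function name and the example search term are hypothetical; the parameter names `db`, `term`, `retmax` and `retmode` are those of the E-utilities interface.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Base address of the NCBI E-utilities search endpoint.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

def build_esearch_url(database, term, max_results=5):
    """Build (but do not send) an esearch query URL for one NCBI database."""
    params = {
        "db": database,         # e.g. "nucleotide" or "pubmed"
        "term": term,           # search expression, as typed in the web form
        "retmax": max_results,  # maximum number of identifiers to return
        "retmode": "json",      # ask for a JSON response
    }
    return EUTILS_ESEARCH + "?" + urlencode(params)

# Illustrative search term only.
url = build_esearch_url("nucleotide", "human[Organism] AND insulin")
print(url)
```

Opening the printed URL in a browser, or fetching it with any HTTP client, should return a list of matching record identifiers. NCBI's usage guidelines on request rates apply to automated access.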
https://www.embl.org/
• EMBL Heidelberg, Germany - Main laboratory
• EMBL Hamburg, Germany - Structural biology
• EMBL Barcelona, Spain - Tissue biology and disease modelling
• EMBL-EBI Hinxton, United Kingdom - European Bioinformatics Institute
• EMBL Grenoble, France - Structural biology
• EMBL Rome, Italy - Epigenetics and neurobiology
https://www.nig.ac.jp/nig/
NIG resources
Mouse
– Mouse Genetic Resources
– Mouse Genome Database
– Mouse Phenotype Database
– Japan Mouse/Rat Strain
– Microsatellite Data Base of Japan
– RefEx for Mouse or Rat
Human
– dbHERV-REs
– RefEx for human
Drosophila
– Drosophila Strains (NIG-FLY)
– Segmentation Antibodies
C. elegans
– Gene Expression Database (NEXTDB)
– cDNA library
Microorganisms
– E. coli: Strain/Vector/Antibody
– E. coli: Genome Database
– E. coli: TEC Database
– S. japonicus (JapoNet)
– Bacillus subtilis
Fish
– Zebrafish: zTRAP
– Zebrafish: Knock Out Fish Project
– Coelacanth
Aquatic organisms
– Hydra
– Xenopus laevis
– Sea urchin (Hemicentrotus pulcherrimus)
Plants
– Rice (Oryzabase)
– Liverwort (Marchantia polymorph
Projects and services
– NBRP – National BioResource Project
– SHIGEN – Shared Information of Genetic Resources
– RRC – Research Resource Circulation
– DDBJ – DNA Data Bank of Japan
Impact of bioinformatics
• Personal Genomics
• Increased vigilance and taking action to prevent disease
• Improving health care by providing individualized medical care
• Understanding the link between genomics and environment
• Novel Drug Development
• Identifying novel drug targets
• Validating drug targets
• Predicting toxicity and adverse reactions
• Improving clinical trials and testing
• Gene therapy
• Replacing the gene rather than the gene product
• Stem cell therapies
• Replacing the entire cell type or tissue to cure a disease
Impact of Bioinformatics
• Pharmacogenomics
• Personalized medicine: adjusting drugs, doses and delivery to suit
patients
• Maximize efficacy and minimize side effects
• Identify genetics of adverse reactions
• Identify patients who respond optimally
Impact of Bioinformatics
Limitations
Bioinformatics is expanding…