0% found this document useful (0 votes)

7 views

Sec1 Introduction to Bioinformatics

The document provides an introduction to bioinformatics, defining it as the development of methods for managing and analyzing biological information from genomics and high-throughput experiments. It outlines the types of genomic data, the importance of studying bioinformatics, and the role of databases in organizing biological information. Additionally, it discusses various sequence data formats, particularly the FASTA format, used for representing nucleotide and peptide sequences.

Uploaded by

mernagoodgirl666

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Sec1 Introduction to Bioinformatics

Uploaded by

mernagoodgirl666

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Introduction to

Bioinformatics

Helwan National University

Faculty of Science
Biotechnology and Genetic
Engineering Program

Second-year students 2024-2025

TA / A S M A A A S H R A F 04/07/20
Defi nition: Bioinformatics is the
development of methods for the
management and analysis of biological
information arising from genomics and high
throughput experiments.

What is
o for molecular biologists , bioinformatics
Bioinformatics can be regarded as computational
molecular biology, that uses
? computational techniques to study the
structure, function, regulation, and
interactive network of genes and proteins.
The ultimate is to analyze and predict the structure,
goal organization, function, regulation, and
dynamics of the entire genome of an
organism.
It is an interdisciplinary
field, which harnesses
computer science,
mathematics, physics, and
biology

Fig1 Interaction of disciplines that have contributed to the

formation of bioinformatics.
o Genomic data encompasses a wide
range of information, with sequence
data at its core

Data o The term "ome" in genomics refers

to the entire collection of an entity,
such as the transcriptome,
proteome, or interactome.
Fig 2 The central dogma of molecular biology and correspondence with 'omics'
TYPES OF GENOMIC DATA INCLUDE:

oDNA sequence data , which includes gene and mRNA sequences in the form of
complementary DNA (cDNA)1.

oGene- and protein-expression data , facilitated by techniques like microarrays for

studying global gene expression, also known as transcriptomics.

oProteome data

oMetabolome data .

oProtein-protein interaction data .

oProtein structural data.

oProtein-DNA interaction data.

oGene and protein network data.

oSmall noncoding RNA (ncRNA) data.

Why to study Bioinformatics
o Understanding Biological Processes: Develop a deep understanding of biological processes
through analysis and integration of gene and protein information.

o Determines the biological role of genes and proteins

o Identifies conserved genes and mutations.

o Identifies genes, proteins, and functional elements in DNA/RNA sequences

o Compares genomes of different species to study evolution.

o Enables sequence alignment, comparison, and annotation

o Integrates multi-omics data (genomics, proteomics, metabolomics).

o Predicts 3D structures of proteins and nucleic acids.

o Aids in drug discovery and molecular docking

o Tool Development: Create new bioinformatics tools and improve existing ones for various
analyses.
DATABASES

are digital repositories based on a

computerized software for storage of
information in a system and their
retrieval from the system using search
tools.

Databases
BIOLOGICAL DATABASES
The primary objective of a
database is to organize the data are libraries of biological information,
in a structured and searchable collected from scientifi c experiments,
form allowing easy retrieval of
published literature, high-throughput
useful data
experiment technology, and
computational analysis.
PRIMARY DATABASES SECONDARY DATABASES

o are archival in nature. o are curated, non-redundant

databases
o contain raw sequence data (experimental
results) with some interpretation and o are derived from the primary (archival)
explanation databases.

o the data are not curated. o Multiple entries of the same sequence
in primary databases are merged to
o There are redundancies—that is, the create a single sequence in the
same sequence might be submitted by secondary database with extensive
different laboratories, sometimes under annotation derived from all available
different names information on the sequence

o There are three primary databases that o For example:

contain all the sequence data so far  the NCBI RefSeq database:
generated: sequences including genomic DNA,
 GenBank, transcripts, and proteins
 EMBL database, also called the EMBL-Bank,  UniProtKB/Swiss-Prot: Secondary
 and DDBJ (DNA Databank of Japan) database of proteins
o All published DNA and RNA
sequences are usually deposited
in three parallel public
databases.

o Three collaborat ing databases

The INS DC collaboration

(Int ernational Nucleot ide Sequence

Dat abase Collaboration)

1. GenBank

2. DNA Database of Japan(DDBJ)

3.E uropean M olecular Biology

Laboratory (EM BL) Database
o NCBI (National Center for Biotechnology
Information)

o GenBank (Genetic Sequence Databank) at

NCBI

o EMBL (European Molecular Biology

Laboratory)
Some popular o DDBJ (DNA Data Bank of Japan)

databases
are:
Protein databases

o SwissProt

o UniProt
NCBI
o Very comprehensive biological database

o GENBANK: The nucleotide sequence

database

o Provides 42 diff erent resource NCBI

o Provides a simple and easy to use web National Center for
Biotechnology Information
interface
(Part of the U.S. National
o Search Engine for data retrieval: Entrez Institutes of Health)

o Retrieves information across all the

resources under NCBI. Example: PubMed,
taxonomy, SNP, PubChem etc.
Entrez databases

For example, most common

Entrez databases include
PubMed, Nucleotide, Protein
and Structure.
o A sequence data format is a specifi c
layout or arrangement of text
characters, symbols, keywords, and
description that identify a sequence
and contain information about its

SEQUENCE various attributes.

DATA o A variety of diff erent fi le formats have

been developed to store/analyse DNA
FORMATS and protein sequence information.

o A widely used input sequence format

for the purpose of analysis is the FASTA
format.
o FASTA (pronounced fast “A”) stands for
“fast all”.

o FASTA format is a text-based format for

representing either nucleotide sequences
or peptide sequences, in which base pairs
or amino acids are represented using

FASTA File single-letter codes.

o A sequence in FASTA format begins with a

Format single-line description, followed by lines of

sequence data.

o The description line is distinguished from

the sequence data by a greater-than (">")
symbol in the fi rst column.

o It is recommended that all lines of text be

shorter than 80 characters in length.
An Accession Number: The unique identifier for a sequence record. An
accession number applies to the complete record and is usually a
combination of a letter(s) and numbers, (unchangeable).
THANK YOU

Biological Databases Lec 2,3
No ratings yet
Biological Databases Lec 2,3
49 pages
Lesson 35 Medication Administration and Dose Calculations
No ratings yet
Lesson 35 Medication Administration and Dose Calculations
4 pages
Lecture 5- DataBase
No ratings yet
Lecture 5- DataBase
18 pages
Module 2 (Bioinformatics)
No ratings yet
Module 2 (Bioinformatics)
81 pages
Bioinformatics Database and Applications
100% (3)
Bioinformatics Database and Applications
82 pages
M Lec 01 & 02 Biological Database
No ratings yet
M Lec 01 & 02 Biological Database
50 pages
BCH 516-1
No ratings yet
BCH 516-1
32 pages
Bioinformatics PPT Section B Data Storage and Retrival Group 3
No ratings yet
Bioinformatics PPT Section B Data Storage and Retrival Group 3
36 pages
CH12
No ratings yet
CH12
8 pages
Capture D'écran . 2023-03-14 À 00.15.22
No ratings yet
Capture D'écran . 2023-03-14 À 00.15.22
54 pages
Lec2 Databases
No ratings yet
Lec2 Databases
135 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
Tics - A Brief Introduction
No ratings yet
Tics - A Brief Introduction
4 pages
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
No ratings yet
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
48 pages
2024.HF_BioInformatics_Lec3p
No ratings yet
2024.HF_BioInformatics_Lec3p
11 pages
BCH 505 Bioinformatics 3(2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3(2 2) Databases
17 pages
8024 Bio Info
No ratings yet
8024 Bio Info
28 pages
Day 1
No ratings yet
Day 1
38 pages
المحاضرة 2
No ratings yet
المحاضرة 2
16 pages
unit 1
No ratings yet
unit 1
24 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
"MBG1002 Biological Databases Week II
No ratings yet
"MBG1002 Biological Databases Week II
37 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
1. Databases
No ratings yet
1. Databases
34 pages
Database
No ratings yet
Database
16 pages
Bioinformatics: Intended Learning Outcomes
No ratings yet
Bioinformatics: Intended Learning Outcomes
9 pages
lecture1_BIOF242_shuvadeep
No ratings yet
lecture1_BIOF242_shuvadeep
38 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
9. Biological Databases
No ratings yet
9. Biological Databases
17 pages
Bio PPT
No ratings yet
Bio PPT
35 pages
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
No ratings yet
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
54 pages
04 Computer Applications in Pharmacy Full Unit IV
No ratings yet
04 Computer Applications in Pharmacy Full Unit IV
14 pages
BTH 403-BTG407 LECTURE 1
No ratings yet
BTH 403-BTG407 LECTURE 1
6 pages
Additional Note PDF
No ratings yet
Additional Note PDF
25 pages
Biol BDs Singapore
No ratings yet
Biol BDs Singapore
24 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
#1 L1 BioDatabases
No ratings yet
#1 L1 BioDatabases
89 pages
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
No ratings yet
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
5 pages
Biological Database 1
No ratings yet
Biological Database 1
50 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
Lecture Bioinfo Databases
No ratings yet
Lecture Bioinfo Databases
27 pages
Bioinformatics
No ratings yet
Bioinformatics
22 pages
Databases in Bioinformatics - An Introduction
No ratings yet
Databases in Bioinformatics - An Introduction
11 pages
Biological Databases: - Bio-Informatics
No ratings yet
Biological Databases: - Bio-Informatics
16 pages
UNIT II
No ratings yet
UNIT II
23 pages
Introduction A La Bioinformatique
No ratings yet
Introduction A La Bioinformatique
165 pages
PB Bioinfo L1 2023
No ratings yet
PB Bioinfo L1 2023
21 pages
Module_2_Reference Course content
No ratings yet
Module_2_Reference Course content
19 pages
Biological Databases: DR Z Chikwambi Biotechnology
No ratings yet
Biological Databases: DR Z Chikwambi Biotechnology
47 pages
Lecture2-DataMining for Bioinformatics
No ratings yet
Lecture2-DataMining for Bioinformatics
7 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Class04- Biological databases - 2022
No ratings yet
Class04- Biological databases - 2022
14 pages
Biological Information on Artificial Intelligence
No ratings yet
Biological Information on Artificial Intelligence
20 pages
Zoya Bioinformatics Assignment
No ratings yet
Zoya Bioinformatics Assignment
36 pages
CMSC 838T - Lecture 9: Bioinformatics Databases
No ratings yet
CMSC 838T - Lecture 9: Bioinformatics Databases
65 pages
Lecture 5 Information Retrieval From Databases
No ratings yet
Lecture 5 Information Retrieval From Databases
22 pages
Bioinformatics Overview
100% (1)
Bioinformatics Overview
18 pages
Bioinformatics lecture 1
No ratings yet
Bioinformatics lecture 1
48 pages
2006 09 01 - Lect01 - ch1 2 PDF
No ratings yet
2006 09 01 - Lect01 - ch1 2 PDF
104 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Bio-Oil Product Information
No ratings yet
Bio-Oil Product Information
9 pages
Pre-Operative Risk Assessment Post
No ratings yet
Pre-Operative Risk Assessment Post
2 pages
Applied Learning Project Tri-Fold Board Final Draft
No ratings yet
Applied Learning Project Tri-Fold Board Final Draft
1 page
Medip, IJCMPH-10601 O
No ratings yet
Medip, IJCMPH-10601 O
7 pages
Step 2: Left Hand (Within The Gown) Lifts The Right Glove by Its Cuff
No ratings yet
Step 2: Left Hand (Within The Gown) Lifts The Right Glove by Its Cuff
6 pages
Silvadene (Silver Sulfadiazine)
No ratings yet
Silvadene (Silver Sulfadiazine)
1 page
NABH 5th Edition - COP - Emergency Services
No ratings yet
NABH 5th Edition - COP - Emergency Services
14 pages
Lecturer: Mr. Sabaruddin Machmud Adjie, S.S, M.Si: General Conversation " About Indentivication"
No ratings yet
Lecturer: Mr. Sabaruddin Machmud Adjie, S.S, M.Si: General Conversation " About Indentivication"
3 pages
Religious Exemption
No ratings yet
Religious Exemption
2 pages
D-37/1, TTC MIDC, Turbhe, Navi Mumbai-400 703: Processed At: Thyrocare
No ratings yet
D-37/1, TTC MIDC, Turbhe, Navi Mumbai-400 703: Processed At: Thyrocare
3 pages
B. Infection
No ratings yet
B. Infection
27 pages
Unit 11#process of Hospitalization
No ratings yet
Unit 11#process of Hospitalization
25 pages
CMPI - Module 7 - PHARMACY CLIENT SERVICES)
No ratings yet
CMPI - Module 7 - PHARMACY CLIENT SERVICES)
42 pages
News Story 3
No ratings yet
News Story 3
2 pages
Sutures, Clips and Staples
No ratings yet
Sutures, Clips and Staples
19 pages
Eassy On Diet and Fitness
No ratings yet
Eassy On Diet and Fitness
2 pages
DENTAL ENGLISH 2 - Inglés I Odontología
No ratings yet
DENTAL ENGLISH 2 - Inglés I Odontología
3 pages
Activity 2 - Digestive Disorders
No ratings yet
Activity 2 - Digestive Disorders
1 page
Community Health Nursing Lecture
No ratings yet
Community Health Nursing Lecture
3 pages
Nurse Residency Programs
No ratings yet
Nurse Residency Programs
8 pages
Tugas English Conversation
No ratings yet
Tugas English Conversation
4 pages
Physical Examination
No ratings yet
Physical Examination
2 pages
Thesis
No ratings yet
Thesis
5 pages
Hospital Construction Project: By: Amrah Arif Hamza Naeem Saleha Sugrio Zaid Baig Laiba Khan
No ratings yet
Hospital Construction Project: By: Amrah Arif Hamza Naeem Saleha Sugrio Zaid Baig Laiba Khan
45 pages
Drowning Management and Prevention: Summer Hazards
No ratings yet
Drowning Management and Prevention: Summer Hazards
4 pages
Risk Assessment For Trial SOP
No ratings yet
Risk Assessment For Trial SOP
8 pages
Brooklyn Nursing Home Deaths
No ratings yet
Brooklyn Nursing Home Deaths
2 pages
The Staff Are Not Motivated Anymore Health Care Wo
No ratings yet
The Staff Are Not Motivated Anymore Health Care Wo
14 pages
Wasim Eligibility Letter
No ratings yet
Wasim Eligibility Letter
1 page

Sec1 Introduction to Bioinformatics

Uploaded by

Sec1 Introduction to Bioinformatics

Uploaded by

Introduction to

Helwan National University

Second-year students 2024-2025

Fig1 Interaction of disciplines that have contributed to the

Data o The term "ome" in genomics refers

oGene- and protein-expression data , facilitated by techniques like microarrays for

oProtein-protein interaction data .

oProtein structural data.

oProtein-DNA interaction data.

oGene and protein network data.

oSmall noncoding RNA (ncRNA) data.

o Determines the biological role of genes and proteins

o Identifies conserved genes and mutations.

o Identifies genes, proteins, and functional elements in DNA/RNA sequences

o Compares genomes of different species to study evolution.

o Enables sequence alignment, comparison, and annotation

o Integrates multi-omics data (genomics, proteomics, metabolomics).

o Predicts 3D structures of proteins and nucleic acids.

o Aids in drug discovery and molecular docking

are digital repositories based on a

o are archival in nature. o are curated, non-redundant

o There are three primary databases that o For example:

o Three collaborat ing databases

(Int ernational Nucleot ide Sequence

2. DNA Database of Japan(DDBJ)

3.E uropean M olecular Biology

o GenBank (Genetic Sequence Databank) at

o EMBL (European Molecular Biology

o GENBANK: The nucleotide sequence

o Provides 42 diff erent resource NCBI

o Retrieves information across all the

For example, most common

SEQUENCE various attributes.

DATA o A variety of diff erent fi le formats have

o A widely used input sequence format

o FASTA format is a text-based format for

FASTA File single-letter codes.

o A sequence in FASTA format begins with a

Format single-line description, followed by lines of

o The description line is distinguished from

o It is recommended that all lines of text be

You might also like