0% found this document useful (0 votes)
267 views20 pages

BIOINFORMATICS

Bioinformatics is an interdisciplinary field that stores, organizes, and analyzes vast amounts of biological data, especially sequences and structures of proteins and DNA. The goals of bioinformatics include organizing data correctly, properly analyzing it, and interpreting it in a biologically meaningful way. Machine learning is a popular technique used in bioinformatics for tasks like classifying gene expression profiles, predicting protein structure, sequencing DNA, and more. Common machine learning methods used include neural networks, decision trees, support vector machines, and natural language processing.

Uploaded by

ANGELIKA ABORITA
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
267 views20 pages

BIOINFORMATICS

Bioinformatics is an interdisciplinary field that stores, organizes, and analyzes vast amounts of biological data, especially sequences and structures of proteins and DNA. The goals of bioinformatics include organizing data correctly, properly analyzing it, and interpreting it in a biologically meaningful way. Machine learning is a popular technique used in bioinformatics for tasks like classifying gene expression profiles, predicting protein structure, sequencing DNA, and more. Common machine learning methods used include neural networks, decision trees, support vector machines, and natural language processing.

Uploaded by

ANGELIKA ABORITA
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

BIOINFO

RMATIC PREPARED BY:

GERINNE CABOBOY

S
JECEL NIVERA
LOVELYN PIGAR
MA.JANNA VICTORIA B. SISTER
WHAT IS BIOINFORMATICS?

■ Bioinformatics is an interdisciplinary field that stores, organize


and analyze the vast amount of biological data which is available
in the form of sequences and structures of proteins.
■ Paulien Hogeweg was the first person who gave the name
Bioinformatics to this field in 1979.
AIMS OF BIOINFORMATICS

Organizing Data in the correct manner


Proper Analysis of the Data
Interpreting the data in a biologically meaningful
manner
BIOINFORMAT
ICS:
RELATION TO
Machine learning is a subset of the broader field of
artificial intelligence (AI). It enables systems to
independently learn from data and execute tasks
that they are not explicitly programmed to handle.
Its goal is to give machines the ability to perform
tasks that require human intelligence, such as
diagnosing, planning, and predicting.
There are two main types of machine learning:
Supervised learning relies on labeled datasets to teach algorithms an existing
classification system and how to make predictions based on it. This ML type is used
to train decision trees and neural networks.

Unsupervised learning doesn’t use labels. Instead, algorithms try to uncover data
patterns on their own. In other words, they learn things that we can’t teach them
directly. This is comparable to how the human brain works.
It’s also possible to combine labeled and
unlabeled data during training, which will
result in semi-supervised learning. This
ML type can be useful when you don’t
have enough high-quality labeled data for
a supervised learning approach, but you
still want to use it to direct the learning
process.
What are the most popular machine learning techniques used in
bioinformatics?

Natural Language Decision tree


Process classifiers
a set of techniques that can A decision tree algorithm will be used
understand unstructured human to split dataset features through a cost
language. function.

Support Vector
Neural Network Machine
A neural network is a method in artificial intelligence
that teaches computers to process data in a way that is a type of deep learning algorithm that performs
inspired by the human brain. It is a type of machine supervised learning for classification or regression
learning process, called deep learning, that uses of data groups. In AI and machine learning,
interconnected nodes or neurons in a layered structure supervised learning systems provide both input
that resembles the human brain. and desired output data, which are labeled for
classification.
Natural Language Process

NLP can search through volumes of biology


research, aggregate information on a given topic
from various sources, and translate research findings
from one language to another. In addition to mining
research papers, NLP solutions can parse relevant
biomedical databases.

NLP can benefit the bioinformatics field in the


following ways:
● Interpreting genetic variants
● Analyzing DNA expression arrays
● Annotating protein functions
● Looking for new drug targets
Neural Network

This is a multi-layered structure consisting of nodes/neurons as its


building blocks. Neurons in adjacent layers are connected to each
other via links, but neurons of the same layer are not interlinked. The
input layer neurons receive information, process it, and pass it along
as an input to the next layer. And this process continues until the
processed information reaches the output layer. The most basic
neural network is called perceptron. It consists of one neuron that
acts as a classifier. This neuron receives an input and places it in one
of two classes using a linear discrimination function. In larger neural
networks, there is no limit on the number of layers or the number of
nodes in one layer.

Neural Network can benefit the bioinformatics field in the following


ways:
● Classifying gene expression profiles
● Predicting protein structure
● Sequencing DNA
Decision tree classifiers

This is one of the most popular classical supervised learning


classifiers. These algorithms apply a recursive approach to build a
flowchart-like tree model, where each node represents a test on a
feature. First, the algorithm determines the top node — the root —
and then builds the tree recursively considering one parameter at a
time. The final node in each sequence is called “the leaf node.” It
represents the final classification and holds the class label. Decision
tree models demand high computational power during training, but
afterwards they can perform classifications without extensive
computing. The main advantage these classifiers bring to the
bioinformatics field is that they generate understandable rules and
explainable results.
Support Vector Machine

This approach is useful in computational


identification of functional RNA genes.
It can select the optimal set of genes for
cancer detection based on their
expression data.
Applications of Artificial
Intelligence in Bioinformatics
01
Facilitating gene
editing experiments
Gene editing refers to
manipulations on an organism’s
genetic composition by deleting,
inserting, and replacing a part of
its DNA sequence.
02
Identifying protein
structure
Proteomics is a study of proteins, their interactions,
composition, and their role in the human body. This
field involves heavy biological datasets and is
computationally expensive. Therefore, technologies
like machine learning in bioinformatics are essential
here.
03
Spotting genes
associated with
diseases
Researchers increasingly use machine learning in
bioinformatics to identify genes that are likely to be
involved in particular diseases. This is achieved by
analyzing gene expression microarrays and RNA
sequencing.
04
Traversing the knowledge base in
search of meaningful patterns
Advanced sequencing technology doubles genomic databases each
2.5 years, and researchers are looking for a way to extract useful
insights from this accumulated knowledge. Machine learning in
bioinformatics can sift through biomedical publications and reports
to identify different genes and proteins and search for their
functionality. It can also aid in annotating protein databases and
complement them with the information it retrieves from the
literature.
05
Repurposing drugs
Drug repurposing, or reprofiling, is a technique scientists use to
discover new applications of existing drugs that they were not
intended for. Researchers adopt AI in bioinformatics to perform
drug analysis on relevant databases, such as BindingDB and
DrugBank. There are three major directions for drug
repurposing:

• Drug-target interaction looks into the drug’s ability to bind directly to


the target protein

• Drug-drug interaction investigates how medications act when they are


taken in combinations

• Protein-protein interaction looks into the surface of interacting


intracellular proteins, and attempts to discover hotspots and allosteric
sites.
THANKS!
CREDITS: This presentation template was created by
Slidesgo, including icons by Flaticon, and infographics &
images by Freepik

You might also like