Background:: A Modeling and Simulation Web Tool For Plant Biologists

1) The document introduces PlantSimLab, a web-based application that allows plant biologists to construct dynamic mathematical models of molecular networks, interrogate them similar to laboratory experiments, and use the models to generate biological hypotheses without requiring mathematical modeling expertise. 2) DIRECT is a new method for predicting RNA structural contacts that incorporates a Restricted Boltzmann Machine to augment sequence covariation information with structural features, achieving better accuracy than other methods especially for long-range contacts. 3) An ensemble feature selection strategy is proposed to identify a 100-miRNA signature for cancer classification from a dataset of 8023 samples. The signature distinguishes tumor from normal tissues and provides better accuracy than other feature selection methods when tested across

Uploaded by

sk3 khan

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views

Background:: A Modeling and Simulation Web Tool For Plant Biologists

Uploaded by

sk3 khan

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

A modeling and simulation web tool for plant biologists

Background: At the molecular level, nonlinear networks of heterogeneous molecules

control many biological processes, so that systems biology provides a valuable approach in
this field, building on the integration of experimental biology with mathematical modeling.
One of the biggest challenges to making this integration a reality is that many life scientists
do not possess the mathematical expertise needed to build and manipulate mathematical
models well enough to use them as tools for hypothesis generation. Available modeling
software packages often assume some modeling expertise. There is a need for software
tools that are easy to use and intuitive for experimentalists.

Outcomes: This paper introduces PlantSimLab, a web-based application

developed to allow plant biologists to construct dynamic mathematical models
of molecular networks, interrogate them in a manner similar to what is done in
the laboratory, and use them as a tool for biological hypothesis generation. It is
designed to be used by experimentalists, without direct assistance from
mathematical modelers.

RNA contact predictions by integrating structural patterns

Background: It is widely believed that tertiary nucleotide-nucleotide interactions are
essential in determining RNA structure and function. Currently, direct coupling analysis
(DCA) infers nucleotide contacts in a sequence from its homologous sequence alignment
across different species. DCA and similar approaches that use sequence information alone
typically yield a low accuracy, especially when the available homologous sequences are
limited. Therefore, new methods for RNA structural contact inference are desirable because
even a single correctly predicted tertiary contact can potentially make the difference
between a correct and incorrectly predicted structure. Here we present a new method
DIRECT (Direct Information REweighted by Contact Templates) that incorporates a
Restricted Boltzmann Machine (RBM) to augment the information on sequence co-
variations with structural features in contact inference.

Results:Benchmark tests demonstrate that DIRECT achieves better overall performance

than DCA approaches. Compared to mfDCA and plmDCA, DIRECT produces a substantial
increase of 41 and 18%, respectively, in accuracy on average for contact prediction. DIRECT
improves predictions for long-range contacts and captures more tertiary structural features .
Automatic discovery of 100-miRNA signature for cancer
classification using ensemble feature selection
Background: MicroRNAs (miRNAs) are noncoding RNA molecules heavily involved in
human tumors, in which few of them circulating the human body. Finding a tumor-
associated signature of miRNA, that is, the minimum miRNA entities to be measured for
discriminating both different types of cancer and normal tissues, is of utmost importance.
Feature selection techniques applied in machine learning can help however they often
provide naive or biased results.

Results: An ensemble feature selection strategy for miRNA signatures is proposed. miRNAs
are chosen based on consensus on feature relevance from high-accuracy classifiers of
different typologies. This methodology aims to identify signatures that are considerably
more robust and reliable when used in clinically relevant prediction tasks. Using the
proposed method, a 100-miRNA signature is identified in a dataset of 8023 samples,
extracted from TCGA. When running eight-state-of-the-art classifiers along with the 100-
miRNA signature against the original 1046 features, it could be detected that global
accuracy differs only by 1.4%. Importantly, this 100-miRNA signature is sufficient to
distinguish between tumor and normal tissues. The approach is then compared against
other feature selection methods, such as UFS, RFE, EN, LASSO, Genetic Algorithms, and EFS-
CLA. The proposed approach provides better accuracy when tested on a 10-fold cross-
validation with different classifiers and it is applied to several GEO datasets across different
platforms with some classifiers showing more than 90% classification accuracy, which
proves its cross-platform applicability.

Shared data science infrastructure for genomics data

Background: Creating a scalable computational infrastructure to analyze the wealth of
information contained in data repositories is difficult due to significant barriers in
organizing, extracting and analyzing relevant data. Shared data science infrastructures like
Boag is needed to efficiently process and parse data contained in large data repositories.
The main features of Boag are inspired from existing languages for data intensive computing
and can easily integrate data from biological data repositories.

Outcomes: As a proof of concept, Boa for genomics, Boag, has been implemented to
analyze RefSeq’s 153,848 annotation (GFF) and assembly (FASTA) file metadata. Boag
provides a massive improvement from existing solutions like Python and MongoDB, by
utilizing a domain-specific language that uses Hadoop infrastructure for a smaller storage
footprint that scales well and requires fewer lines of code. We execute scripts through Boag
to answer questions about the genomes in RefSeq. We identify the largest and smallest
genomes deposited, explore exon frequencies for assemblies after 2016, identify the most
commonly used bacterial genome assembly program, and address how animal genome
assemblies have improved since 2016. Boag databases provide a significant reduction in
required storage of the raw data and a significant speed up in its ability to query large
datasets due to automated parallelization and distribution of Hadoop infrastructure during
computations.

Additional Neural Matrix Factorization model for

computational drug repositioning
Background: Computational drug repositioning, which aims to find new applications for
existing drugs, is gaining more attention from the pharmaceutical companies due to its low
attrition rate, reduced cost, and shorter timelines for novel drug discovery. Nowadays, a
growing number of researchers are utilizing the concept of recommendation systems to
answer the question of drug repositioning. Nevertheless, there still lie some challenges to be
addressed: 1) Learning ability deficiencies; the adopted model cannot learn a higher level of
drug-disease associations from the data. 2) Data sparseness limits the generalization ability
of the model. 3)Model is easy to overfit if the effect of negative samples is not taken into
consideration.

Outcomes: In this study, we propose a novel method for computational drug

repositioning, Additional Neural Matrix Factorization (ANMF). The ANMF model makes use
of drug-drug similarities and disease-disease similarities to enhance the representation
information of drugs and diseases in order to overcome the matter of data sparsity. By
means of a variant version of the autoencoder, we were able to uncover the hidden features
of both drugs and diseases. The extracted hidden features will then participate in a
collaborative filtering process by incorporating the Generalized Matrix Factorization (GMF)
method, which will ultimately give birth to a model with a stronger learning ability. Finally,
negative sampling techniques are employed to strengthen the training set in order to
minimize the likelihood of model overfitting. The experimental results on the Gottlieb and
Cdataset datasets show that the performance of the ANMF model outperforms state-of-the-
art methods.

Applied Computer-Aided Drug Design: Models and Methods
From Everand
Applied Computer-Aided Drug Design: Models and Methods
Igor José dos Santos Nascimento
No ratings yet
Bioinformatics Project On Drug Discovery and Drug Designing
No ratings yet
Bioinformatics Project On Drug Discovery and Drug Designing
10 pages
IEEE 2012 Titles Abstract
No ratings yet
IEEE 2012 Titles Abstract
14 pages
Bioinformatics Definition
No ratings yet
Bioinformatics Definition
11 pages
Artificial Intelligence: A Multidisciplinary Approach towards Teaching and Learning
From Everand
Artificial Intelligence: A Multidisciplinary Approach towards Teaching and Learning
Tahmeena Khan
No ratings yet
Computer-Aided Drug Discovery Methods: A Brief Introduction
From Everand
Computer-Aided Drug Discovery Methods: A Brief Introduction
Manos C. Vlasiou
No ratings yet
Protein Remote Homology Detection-Methods and Evaluation Metrics
No ratings yet
Protein Remote Homology Detection-Methods and Evaluation Metrics
7 pages
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Methods of Gene Prediction
No ratings yet
Methods of Gene Prediction
5 pages
2020 - Computational Methods and Software Tools For Functional Analysis of MiRNA Data
No ratings yet
2020 - Computational Methods and Software Tools For Functional Analysis of MiRNA Data
16 pages
Computational Validation and Analysis of Semi-Quantitative Data Using In-Silico Approaches
No ratings yet
Computational Validation and Analysis of Semi-Quantitative Data Using In-Silico Approaches
5 pages
Smart Business Problems and Analytical Hints in Cancer Research
From Everand
Smart Business Problems and Analytical Hints in Cancer Research
Zemelak Goraga
No ratings yet
Deep Learning for Cb
No ratings yet
Deep Learning for Cb
16 pages
17373.selected Works in Bioinformatics by Xuhua Xia PDF
No ratings yet
17373.selected Works in Bioinformatics by Xuhua Xia PDF
190 pages
Cancer Info
No ratings yet
Cancer Info
11 pages
Acssynbio 9b00523
No ratings yet
Acssynbio 9b00523
14 pages
IC@N Research Projects SCSE May 2022 Updated (LATEST)
No ratings yet
IC@N Research Projects SCSE May 2022 Updated (LATEST)
7 pages
NNNNNNN
No ratings yet
NNNNNNN
11 pages
draftDNAPressChapter v8
No ratings yet
draftDNAPressChapter v8
77 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
85 pages
Fuzzy Art Map Algorithm: Implementation of For Data Mining in Bio Informatics
No ratings yet
Fuzzy Art Map Algorithm: Implementation of For Data Mining in Bio Informatics
11 pages
Computational Intelligence and its Applications
From Everand
Computational Intelligence and its Applications
Vikash Yadav
No ratings yet
Plagiarism1 - Report
No ratings yet
Plagiarism1 - Report
8 pages
Tutorial R
No ratings yet
Tutorial R
456 pages
DeepECA- An End-To-End Learning Framework for Protein Contact Prediction From a Multiple Sequence Alignment
No ratings yet
DeepECA- An End-To-End Learning Framework for Protein Contact Prediction From a Multiple Sequence Alignment
17 pages
Bioinformatics Unveiled
From Everand
Bioinformatics Unveiled
Joan Melody
No ratings yet
Computational biology
No ratings yet
Computational biology
19 pages
Knowledge-Based Bioinformatics: From Analysis to Interpretation
From Everand
Knowledge-Based Bioinformatics: From Analysis to Interpretation
Gil Alterovitz
No ratings yet
Computational Genomics
No ratings yet
Computational Genomics
5 pages
Ahora Si Este Es El Bueno
No ratings yet
Ahora Si Este Es El Bueno
8 pages
Tutorials in Chemoinformatics
From Everand
Tutorials in Chemoinformatics
Alexandre Varnek
No ratings yet
Mastering Parallel Programming with R
From Everand
Mastering Parallel Programming with R
Simon R. Chapple
No ratings yet
Application of Data Mining in Bioinformatics
No ratings yet
Application of Data Mining in Bioinformatics
5 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
draft_manuscript_8_28_2024
No ratings yet
draft_manuscript_8_28_2024
20 pages
Applied Machine Learning and Multi-criteria Decision-making in Healthcare
From Everand
Applied Machine Learning and Multi-criteria Decision-making in Healthcare
Ilker Ozsahin
No ratings yet
micrornaNaming
No ratings yet
micrornaNaming
12 pages
Thawing Frozen Robust Multi-Array Analysis (fRMA) : Software Open Access
No ratings yet
Thawing Frozen Robust Multi-Array Analysis (fRMA) : Software Open Access
7 pages
rRNAdB (1)
No ratings yet
rRNAdB (1)
27 pages
Mol - Modelling of BCL-2 Final
No ratings yet
Mol - Modelling of BCL-2 Final
37 pages
Integration of Omics Approaches and Systems Biology for Clinical Applications
From Everand
Integration of Omics Approaches and Systems Biology for Clinical Applications
Antonia Vlahou
No ratings yet
Research 2
No ratings yet
Research 2
6 pages
Complete Download Bioinformatics for Comparative Proteomics 1st Edition Chuming Chen PDF All Chapters
100% (3)
Complete Download Bioinformatics for Comparative Proteomics 1st Edition Chuming Chen PDF All Chapters
85 pages
A Review of Software For Predicting Gene Function
No ratings yet
A Review of Software For Predicting Gene Function
14 pages
Random walk with restart
No ratings yet
Random walk with restart
22 pages
On Bioinformatic Resources
No ratings yet
On Bioinformatic Resources
7 pages
Targeted Projection Pursuit
No ratings yet
Targeted Projection Pursuit
2 pages
Probabilistic_variable-length_segmentation_of_prot
No ratings yet
Probabilistic_variable-length_segmentation_of_prot
17 pages
An Approach of Hybrid Clustering Technique For Maximizing Similarity of Gene Expression
No ratings yet
An Approach of Hybrid Clustering Technique For Maximizing Similarity of Gene Expression
14 pages
Simple R Tools For Genetic Markers Research
No ratings yet
Simple R Tools For Genetic Markers Research
3 pages
Artificial Intelligence and Natural Algorithms
From Everand
Artificial Intelligence and Natural Algorithms
Rijwan Khan
No ratings yet
Multi Layers Networks
No ratings yet
Multi Layers Networks
24 pages
Generating Structural Data Analysis
No ratings yet
Generating Structural Data Analysis
8 pages
Para, Gene Dis
No ratings yet
Para, Gene Dis
11 pages
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Distributed and Sequantial Algorithms for Bioinformatics
No ratings yet
Distributed and Sequantial Algorithms for Bioinformatics
376 pages
Clustering Molecular Dynamics Trajectories - I. Characterizing The Performance of Different Clustering Algorithms
No ratings yet
Clustering Molecular Dynamics Trajectories - I. Characterizing The Performance of Different Clustering Algorithms
44 pages
Untitled document (3)
No ratings yet
Untitled document (3)
3 pages
Deep Learning in Mining Biological Data
100% (1)
Deep Learning in Mining Biological Data
33 pages
Fiji For Biological Image Analysis
No ratings yet
Fiji For Biological Image Analysis
7 pages
List of Lab Experiments - CSE314 PDF
No ratings yet
List of Lab Experiments - CSE314 PDF
1 page
Address Aggregation / Route Summarization / Supernetting: Step 1
No ratings yet
Address Aggregation / Route Summarization / Supernetting: Step 1
1 page
Mollecular and Cellular Biology: Lecture - 2
No ratings yet
Mollecular and Cellular Biology: Lecture - 2
31 pages
Sequence Alignment: Lecture - 4
No ratings yet
Sequence Alignment: Lecture - 4
19 pages
Gene Duplication and Read Mapping: Lecture - 5
No ratings yet
Gene Duplication and Read Mapping: Lecture - 5
20 pages
DNA Sequencing: Lecture - 3
No ratings yet
DNA Sequencing: Lecture - 3
25 pages
Lecture - 1: Nafis Neehal, Lecturer, Department of CSE, DIU
No ratings yet
Lecture - 1: Nafis Neehal, Lecturer, Department of CSE, DIU
21 pages
Lab1-Design A Lexical Analyzer
No ratings yet
Lab1-Design A Lexical Analyzer
1 page
Problems To Be Completed Manually
No ratings yet
Problems To Be Completed Manually
2 pages
Maxam and Gilbert DNA Sequencing
No ratings yet
Maxam and Gilbert DNA Sequencing
5 pages
Editors, Contents, Cover Details Tibtec
No ratings yet
Editors, Contents, Cover Details Tibtec
1 page
S. Blair Hedges, Sudhir Kumar - The Timetree of Life (2009)
No ratings yet
S. Blair Hedges, Sudhir Kumar - The Timetree of Life (2009)
574 pages
Diniz Et Al - 2018
No ratings yet
Diniz Et Al - 2018
30 pages
Essential Cell Biology 5th Edition PDF
No ratings yet
Essential Cell Biology 5th Edition PDF
43 pages
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
No ratings yet
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
54 pages
2020 Book AdvancesInBioinformaticsAndCom
No ratings yet
2020 Book AdvancesInBioinformaticsAndCom
284 pages
DNA Chromatography (Douglas T. Gjerde, Christopher P. Hanna & David Hornby)
No ratings yet
DNA Chromatography (Douglas T. Gjerde, Christopher P. Hanna & David Hornby)
239 pages
M.F.SC & PHD Programs in - Syllabus: Fish Biotechnology
No ratings yet
M.F.SC & PHD Programs in - Syllabus: Fish Biotechnology
24 pages
Evolution and Taxonomy of The Grasses (Poaceae) : A Model Family For The Study of Species-Rich Groups
No ratings yet
Evolution and Taxonomy of The Grasses (Poaceae) : A Model Family For The Study of Species-Rich Groups
40 pages
Dissertation Martin Hölzer, 2017 PDF
No ratings yet
Dissertation Martin Hölzer, 2017 PDF
253 pages
A Protocol For Extraction and Purification of High-Quality and Quantity Bacterial DNA Applicable For Genome Sequencing: A Modified Version of The Marmur Procedure.
No ratings yet
A Protocol For Extraction and Purification of High-Quality and Quantity Bacterial DNA Applicable For Genome Sequencing: A Modified Version of The Marmur Procedure.
6 pages
The Human Genome Project
No ratings yet
The Human Genome Project
4 pages
Techniques Used in Molecular Biology-1
No ratings yet
Techniques Used in Molecular Biology-1
72 pages
Genetic and Phylogenetic Analysis of A Novel Parvovirus Isolated From Chickens in Guangxi, China
No ratings yet
Genetic and Phylogenetic Analysis of A Novel Parvovirus Isolated From Chickens in Guangxi, China
5 pages
The P-Glucuronidase (Gus)
No ratings yet
The P-Glucuronidase (Gus)
17 pages
Molecular Biology Dissertation Topics
100% (2)
Molecular Biology Dissertation Topics
5 pages
Molecular Biology of The Cell, Sixth Edition Chapter 8: Analyzing Cells, Molecules, and Systems
100% (1)
Molecular Biology of The Cell, Sixth Edition Chapter 8: Analyzing Cells, Molecules, and Systems
60 pages
Genomic Dna by Ligation SQK lsk110 GDE - 9108 - v110 - Revx - 10nov2020 Minion
No ratings yet
Genomic Dna by Ligation SQK lsk110 GDE - 9108 - v110 - Revx - 10nov2020 Minion
26 pages
AI Can Help To Speed Up Drug Discovery - But Only If We Give It The Right Data
No ratings yet
AI Can Help To Speed Up Drug Discovery - But Only If We Give It The Right Data
4 pages
Introduction To Molecular Introduction To Molecular Biology Biology
No ratings yet
Introduction To Molecular Introduction To Molecular Biology Biology
18 pages
Effects of COVID-19 Restrictions To The Recreational Behavior
No ratings yet
Effects of COVID-19 Restrictions To The Recreational Behavior
65 pages
2023 06 05 23290958v1 Full
No ratings yet
2023 06 05 23290958v1 Full
26 pages
Journal of Plant Physiology: Sciencedirect
No ratings yet
Journal of Plant Physiology: Sciencedirect
10 pages
KAPA HiFi HotStart ReadyMix TDS
No ratings yet
KAPA HiFi HotStart ReadyMix TDS
4 pages
نوتس بايوتكنولوجي
No ratings yet
نوتس بايوتكنولوجي
2 pages
3 Molecular Evidence - BioNinja
No ratings yet
3 Molecular Evidence - BioNinja
4 pages
Microbiological Indoor Air Quality of Hospital Buildings With D
No ratings yet
Microbiological Indoor Air Quality of Hospital Buildings With D
9 pages
Highly Secure DNA-based Audio Steganography: Shyamasree C M, Sheena Anees
No ratings yet
Highly Secure DNA-based Audio Steganography: Shyamasree C M, Sheena Anees
6 pages
Nubel 1997 PCR Primers To Amplify 16S rRNA Genes From Cyanobacteria
No ratings yet
Nubel 1997 PCR Primers To Amplify 16S rRNA Genes From Cyanobacteria
6 pages

Background:: A Modeling and Simulation Web Tool For Plant Biologists

Uploaded by

Background:: A Modeling and Simulation Web Tool For Plant Biologists

Uploaded by

A modeling and simulation web tool for plant biologists

Background: At the molecular level, nonlinear networks of heterogeneous molecules

Outcomes: This paper introduces PlantSimLab, a web-based application

RNA contact predictions by integrating structural patterns

Results:Benchmark tests demonstrate that DIRECT achieves better overall performance

Shared data science infrastructure for genomics data

Additional Neural Matrix Factorization model for

Outcomes: In this study, we propose a novel method for computational drug

You might also like