Metagenomics Classification: Project Synopsis
Project Synopsis
Version 1.0
(ECS799)
Degree
BACHELOR OF TECHNOLOGY (CSE)
August, 2020
Table of Contents
1 Project Title.........................................................................................................................................3
2 Domain................................................................................................................................................3
3 Problem Statement.............................................................................................................................3
4 Project Description..............................................................................................................................3
4.1 Scope of the Work.......................................................................................................................3
4.2 Project Modules...........................................................................................................................3
5 Implementation Methodology............................................................................................................3
6 Technologies to be used......................................................................................................................4
6.1 Software Platform........................................................................................................................4
6.2 Hardware Platform......................................................................................................................4
6.3 Tools............................................................................................................................................4
7 Advantages of this Project...................................................................................................................4
8 Future Scope and further enhancement of the Project.......................................................................4
9 Team Details........................................................................................................................................4
10 Conclusion.......................................................................................................................................5
11 References.......................................................................................................................................5
Title: Page 2 of 15
TMU-CCSIT Version 1.1 T001-Project Synopsis
1 Project Title
This project is based on metagenomics classification, a problem from biology; hence it is titled Metagenomics Classification.
2 Domain
This is a research project that uses Deep Learning to achieve the required results.
3 Problem Statement
This project aims to build an inference engine using GeNet, a deep representation for metagenomics classification described in this research paper (https://arxiv.org/pdf/1901.11015.pdf), to replace the Kraken and Centrifuge DNA classification methods. Those methods require large databases, which makes them unaffordable and untransferable, and they become unreliable as the amount of noise in the data increases.
4 Project Description
To counter the above-mentioned problem, a deep learning system is required that can learn from the noise distribution of the input reads. Moreover, a classification model learns a mapping from an input read to class probabilities, and thus does not require a database at run-time. Deep learning systems also provide representations of DNA sequences that can be leveraged for downstream tasks.
A DNA sequence is also called a read; it is represented by the characters (G, T, A, C) and varies from organism to organism.
Taxonomy is the classification of an organism in this order: Kingdom, Phylum, Class, Order, Family, Genus, Species.
We predict the taxonomy by passing a read to seven models simultaneously; each model classifies one rank of the above taxa. The combined results of these models classify the read.
There are six kingdoms in the biological system: Plants, Animals, Protists, Fungi, Archaebacteria, Eubacteria.
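The per-rank combination described above can be sketched as follows. This is a minimal illustration in Python; the labels and probabilities are invented for the example, not real model output.

```python
# Sketch: combine per-rank predictions from seven independent models.
# The labels and probabilities below are illustrative, not real model output.

RANKS = ["kingdom", "phylum", "class", "order", "family", "genus", "species"]

def argmax_label(probs):
    """Return the label with the highest predicted probability."""
    return max(probs, key=probs.get)

def classify_read(per_rank_probs):
    """per_rank_probs: dict mapping rank -> {label: probability}."""
    return {rank: argmax_label(per_rank_probs[rank]) for rank in RANKS}

# Hypothetical output of the seven models for one read:
example = {
    "kingdom": {"Eubacteria": 0.9, "Archaebacteria": 0.1},
    "phylum":  {"Firmicutes": 0.7, "Proteobacteria": 0.3},
    "class":   {"Bacilli": 0.8, "Clostridia": 0.2},
    "order":   {"Lactobacillales": 0.6, "Bacillales": 0.4},
    "family":  {"Lactobacillaceae": 0.75, "Streptococcaceae": 0.25},
    "genus":   {"Lactobacillus": 0.65, "Pediococcus": 0.35},
    "species": {"L. acidophilus": 0.55, "L. casei": 0.45},
}
print(classify_read(example)["kingdom"])  # -> Eubacteria
```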
This project is vast, so it is divided into sub-projects by kingdom, and each sub-project needs eight models (seven taxa plus the organism name). This report covers only Eubacteria.
Collection of Data
This is the first step of the project. It requires data that contains reads together with the taxonomy each read belongs to.
Data is collected from these resources for each kingdom:
1. NCBI (https://www.ncbi.nlm.nih.gov/) for all kingdoms.
2. DairyDB (https://github.com/marcomeola/DAIRYdb) for bacteria.
3. PlantGDB (http://www.plantgdb.org/) for plants.
4. RVDB (https://rvdb.dbi.udel.edu/) for viruses.
5. PlutoF (https://www.gbif.org/dataset/search) for fungi.
6. Greengenes (https://greengenes.secondgenome.com/) for archaea.
All these data are available in FASTA file format, which needs preprocessing to filter out the required records and store them in CSV format.
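The FASTA-to-CSV step can be sketched in plain Python as below. This is only an illustration with a tiny inline FASTA string; the real pipeline reads the downloaded files, and the record headers shown are invented.

```python
import csv
import io

def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, seq = None, []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(seq)
            header, seq = line[1:], []
        elif line:
            seq.append(line)
    if header is not None:
        yield header, "".join(seq)

# Tiny invented FASTA fragment standing in for a downloaded file:
fasta = ">seq1 Lactobacillus\nGATC\nGGTA\n>seq2 Bacillus\nTTAC\n"
records = list(parse_fasta(fasta))
# records == [("seq1 Lactobacillus", "GATCGGTA"), ("seq2 Bacillus", "TTAC")]

# Write the filtered records to CSV (an in-memory buffer here):
buf = io.StringIO()
writer = csv.writer(buf)
writer.writerow(["id", "read"])
writer.writerows(records)
```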
Data Preprocessing
After filtering out the required data using Python, we prepare train, validation and test CSVs. Each column of the main CSV file is treated as the target labels of a particular model, and there are n labels in each column.
Data Balancing
I took 35 rows for each label: labels with fewer than 35 rows were removed, and labels with more than 35 rows were truncated to 35. I created a CSV of 35 rows for each label and put it under the folder of its column.
Next I prepared the train, validation and test CSV files: from each label's CSV I took 20 rows as training data, 10 rows as validation data and 5 rows as test data for that column.
This processing creates a balanced dataset, which helps the model learn each label equally; an imbalanced dataset decreases accuracy.
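The balancing and splitting described above can be sketched like this. The function and data names are illustrative; the 35-row cap and the 20/10/5 split come from the text.

```python
import random

ROWS_PER_LABEL = 35
TRAIN, VALID, TEST = 20, 10, 5  # split described in the text

def balance_and_split(rows_by_label, seed=0):
    """Keep labels with >= 35 rows, truncate to 35, then split 20/10/5.
    rows_by_label: dict mapping label -> list of rows."""
    rng = random.Random(seed)
    splits = {"train": [], "valid": [], "test": []}
    for label, rows in rows_by_label.items():
        if len(rows) < ROWS_PER_LABEL:
            continue                      # drop under-represented labels
        rows = rows[:ROWS_PER_LABEL]      # truncate over-represented labels
        rng.shuffle(rows)
        splits["train"] += [(r, label) for r in rows[:TRAIN]]
        splits["valid"] += [(r, label) for r in rows[TRAIN:TRAIN + VALID]]
        splits["test"]  += [(r, label) for r in rows[TRAIN + VALID:]]
    return splits

# Toy data: label "C" has too few rows and is dropped.
data = {"A": list(range(40)), "B": list(range(35)), "C": list(range(10))}
s = balance_and_split(data)
print(len(s["train"]), len(s["valid"]), len(s["test"]))  # -> 40 20 10
```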
The first layer of the model is a 2D convolutional layer which takes the matrix form of a read as input.
The ResNet blocks in the figure are residual blocks, also called 'skip connections'. They allow gradients to flow through the network directly, without passing through non-linear activation functions. As network depth increases, accuracy saturates and then degrades rapidly; residual blocks are used here to remove that problem.
Each residual block contains two convolutional layers with pooling and batch normalization. Finally, the input is added to the output of the second convolutional layer as the skip connection, which reduces the gradient degradation problem.
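The skip connection can be illustrated with a toy forward pass in NumPy. The two lambda functions below merely stand in for the conv/batch-norm sub-layers; this is a sketch of the residual pattern, not the actual GeNet block.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_block(x, f1, f2):
    """Toy residual block: y = relu(f2(relu(f1(x)))) + x.
    f1 and f2 stand in for the conv + batch-norm sub-layers; their output
    shapes must match x so the skip connection can add the input back."""
    out = relu(f2(relu(f1(x))))
    return out + x  # skip connection

x = np.ones(4)
y = residual_block(x, lambda v: 2 * v, lambda v: v - 1)
# f1: 2*1 = 2 -> relu 2; f2: 2-1 = 1 -> relu 1; + x -> 2
print(y)  # -> [2. 2. 2. 2.]
```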
Pooling layers provide an approach to down-sampling feature maps by summarizing the presence of features in patches of the feature map. Two common pooling methods are average pooling and max pooling, which summarize the average presence of a feature and the most activated presence of a feature, respectively. This model uses average pooling.
The batch normalization layer normalizes each input channel across a mini-batch, to speed up training of convolutional neural networks and reduce sensitivity to network initialization.
ReLU refers to the Rectified Linear Unit, the most commonly deployed activation function for the outputs of CNN neurons. It introduces non-linearity into the model's learning process, which helps it learn features more efficiently and makes it robust.
ReLU is simple to compute, and therefore faster than the sigmoid function, and it avoids the vanishing gradient problem.
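ReLU and average pooling are both small enough to show directly. This is a 1D NumPy sketch for illustration only; the model itself operates on 2D feature maps.

```python
import numpy as np

def relu(x):
    """ReLU: pass positives through, clamp negatives to zero."""
    return np.maximum(x, 0.0)

def avg_pool_1d(x, size):
    """Average pooling: mean over non-overlapping windows of `size`."""
    return x[: len(x) // size * size].reshape(-1, size).mean(axis=1)

feature_map = np.array([-2.0, 4.0, 6.0, 2.0])
activated = relu(feature_map)          # -> [0. 4. 6. 2.]
pooled = avg_pool_1d(activated, 2)     # -> [2. 4.]
print(pooled)
```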
Model Evaluation
I use a test dataset of 875 rows to evaluate the combined accuracy of all the models.
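One way to read "combined result" is that a test row counts as correct only when every taxon is predicted correctly, which is the interpretation sketched below with dummy data; the real evaluation code may differ.

```python
# Sketch: a read counts as correct only if all seven taxa are predicted
# correctly, so combined accuracy is stricter than any single model's.

def combined_accuracy(predictions, targets):
    """predictions/targets: lists of 7-tuples (one entry per taxon)."""
    hits = sum(1 for p, t in zip(predictions, targets) if p == t)
    return hits / len(targets)

# Four dummy test rows; the last prediction gets one taxon wrong.
targets = [("Eubacteria",) * 7] * 4
preds = [("Eubacteria",) * 7] * 3 + [("Eubacteria",) * 6 + ("wrong",)]
print(combined_accuracy(preds, targets))  # -> 0.75
```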
Inference Output
'GATGAACGCTGGCGGCGTGCCTAACACATGCAAGTCGAGCGGAGTTTAACTGGAAGCACTTGTGCGACCGGATAAACTTA
GCGGCGGACGGGTGAGTAACACGTGAGCAACCTACCTATCGCAGGGGAACAACATTGGGAAACCAGTGCTAATACCGCAT
AACATCTTTTGGGGGCATCCCCGGAAGATCAAAGGATTTCGATCCGGCGACAGATGGGCTCGCGTCCGATTAGCTAGTTG
GTAAGGTAAAAGCTTACCAAGGCAACGATCGGTAGCCGAACTGAGAGGTTGATCGGCCACATTGGGACTGAGACACGGCC
CAGGCTCCTACGGGAGGCAGCAGTGGGGAATATTGGGCAATGGGGGAAACCCTGACCCAGCAACGCCGCGTGAAGGAAGA
AGGCCTTCGGGTTGTAAACTTCTTTGATCAGGGACGAAACAAATGACGGTACCTGAAGAACAAGTCACGGCTAACTACGT
GCCAGCAGCCGCGGTAATACGTAGGTGACAAGCGTTATCCGGATTTACTGGGTGTAAAGGGCGTGTAGGCGGTTTCGTAA
GTTGGATGTGAAATTCTCAGGCTTAACCTGAGAGGGTCATCCAAAACTGCAAAACTTGAGTACTGGAGAGGATAGTGGAA
TTCCTAGTGTAGCGGTAAAATGCGTAGATATTAGGAGGAACACCAGTGGCGAAGGCGACTATCTGGACAGTAACTGACGC
TGAGGCGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAATACTAGGTGTAGG
GGGTATCGACCCCCCCTGTGCCGCAGCTAACGCAATAAGTATTCCACCTGGGGAGTACGACCGCAAGGTTGAAACTCAAA
GGAATTGACGGGGGCCCGCACAAGCAGTGGAGTATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCAGGGCTTGAC
ATCCTCTGACGGCTGTAGAGATACAGCTTTCCCTTCGGGGACAGAGAGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGT
CGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCCTATGGTCAGTTGCCAGCACGTAATGGTGGGCACTCTGGCA
AGACTGCCGTTGATAAAACGGAGGAAGGTGGGGACGACGTCAAATCATCATGCCCCTTATGTCCTGGGCTACACACGTAC
TACAATGGCAACAACAGAGGGCAGCCAGGTCGCGAGGCCGAGCGAATCCCAAAATGTTGTCTCAGTTCAGATTGCAGGCT
GCAACTCGCCTGCATGAAGTCGGAATTGCTAGTAATGGCAGGTCAGCATACTGCCGTGAATACGTTCCCGGGTCTTGTAC
ACACCGCCCGTCACACCATGAGAGTTTGTAACACCCGAAGTCAGTAGTCTGACCGTAAGGAGGGCGCTGCCGAAGGTGGG
ACAGATAATTGGGGTG'
5 Implementation Methodology
Dataset-
Each read is a string of the characters (G, T, A, C) and varies from organism to organism.
Vector Representation-
Each character of a read is encoded as a number, and the resulting numeric sequence is converted into a 2D array. After that we perform normalization and move to the next step.
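One possible encoding is sketched below. The base-to-integer mapping, the fixed matrix shape, and the normalization scheme are all assumptions for illustration; the source does not specify them.

```python
import numpy as np

# Assumed encoding: map each base to an integer, pad/truncate to a fixed
# length, reshape to 2D, then normalize to [0, 1].
BASE_TO_INT = {"G": 0, "T": 1, "A": 2, "C": 3}

def encode_read(read, shape=(4, 4)):
    """Encode a read string as a normalized 2D matrix of the given shape."""
    length = shape[0] * shape[1]
    codes = [BASE_TO_INT[b] for b in read[:length]]
    codes += [0] * (length - len(codes))            # pad short reads
    matrix = np.array(codes, dtype=float).reshape(shape)
    return matrix / 3.0                             # normalize to [0, 1]

m = encode_read("GATGAACGCTGGCGGC")
print(m.shape)  # -> (4, 4)
```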
Learning Process-
A GeNet architecture model, based on a convolutional neural network, is used for each taxon.
Trained Models-
There are eight models (seven taxa plus the organism name) whose results are combined.
Layer 1: {768, 384}
Layer 2: {384, 768}
Layer 3: {768, total classes}
I used a dropout layer after the first two FC layers and at the end of the residual blocks, so that each neuron learns effectively: during training, neurons are dropped with a probability of 20%.
I chose three FC layers without much variance in the number of neurons, making the network deeper rather than wider, because very wide layers memorize the output and do not generalize to different data.
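The three FC layers with 20% dropout can be sketched as a NumPy forward pass. The weight initialization and the placeholder class count are assumptions; only the layer shapes {768, 384}, {384, 768}, {768, classes} come from the text.

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(x, p=0.2, training=True):
    """Inverted dropout: zero each activation with probability p."""
    if not training:
        return x
    mask = rng.random(x.shape) >= p
    return x * mask / (1.0 - p)

def relu(x):
    return np.maximum(x, 0.0)

n_classes = 10  # placeholder; the real value depends on the taxon
W1 = rng.standard_normal((768, 384)) * 0.01
W2 = rng.standard_normal((384, 768)) * 0.01
W3 = rng.standard_normal((768, n_classes)) * 0.01

def head(x, training=True):
    h = dropout(relu(x @ W1), training=training)   # Layer 1: {768, 384}
    h = dropout(relu(h @ W2), training=training)   # Layer 2: {384, 768}
    return h @ W3                                  # Layer 3: {768, classes}

logits = head(rng.standard_normal(768))
print(logits.shape)  # -> (10,)
```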
The graph above shows spikes when the Adam optimizer trains the model with a learning rate of 0.001.
The graph above shows a downward slope when the SGD optimizer trains the model with a learning rate of 0.001.
Step 1-
I started training with the SGD optimizer and a learning rate of 0.001. SGD (Stochastic Gradient Descent) selects a few samples from the whole dataset at random for each iteration, so it performs many iterations and lets the model learn slowly; the 0.001 learning rate helps it move toward the global minimum, which corresponds to correct predictions for the input.
Step 2-
After training with a 0.001 learning rate, I trained with a 0.0001 learning rate so the model learns even more slowly and picks up more features, as the gradient moves gradually toward the global minimum.
Step 3-
In this step I replaced the SGD optimizer with Adam, which is a combination of RMSprop and stochastic gradient descent with momentum: it uses squared gradients to scale the learning rate, like RMSprop, and takes advantage of momentum by using a moving average of the gradient instead of the gradient itself, like SGD with momentum. Adam converges faster than SGD; with a learning rate of 0.001 it moves the model toward the global minimum. This step sometimes increases the accuracy abruptly.
Step 4- Repeat the above steps until the accuracy is high.
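The learning-rate hand-off in Steps 1 and 2 can be illustrated with plain gradient descent on a toy one-dimensional loss. This sketch omits the Adam phase and the real model entirely; it only shows how a coarse rate followed by a finer rate approaches the minimum.

```python
# Toy sketch of the schedule above: gradient descent on f(w) = w**2,
# first with lr 0.001, then lr 0.0001 (standing in for the SGD phases).

def grad(w):
    """Derivative of the toy loss f(w) = w**2."""
    return 2.0 * w

def train(w, lr, steps):
    for _ in range(steps):
        w -= lr * grad(w)
    return w

w = 10.0
w = train(w, lr=0.001, steps=500)    # Step 1: coarse progress
w = train(w, lr=0.0001, steps=500)   # Step 2: finer progress
print(0.0 < w < 10.0)  # -> True (closer to the minimum at 0)
```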
The freezing and unfreezing method improves the accuracy of the model by a further 3 to 4%.
I freeze the earlier layers of the model, truncate the last classification layer, and add two new FC layers, {768, 384} and {384, 192}, with a new classification layer {192, total classes} at the end.
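Freezing can be sketched as updating only the new head's parameters during the optimizer step. The parameter dict, shapes of the backbone weight, and the placeholder class count are assumptions; only the new head shapes {768, 384}, {384, 192}, {192, classes} come from the text.

```python
import numpy as np

rng = np.random.default_rng(1)

# A parameter dict stands in for the real network.
n_classes = 10  # placeholder
params = {
    "backbone.W": rng.standard_normal((768, 768)) * 0.01,   # frozen
    "head.W1": rng.standard_normal((768, 384)) * 0.01,      # new FC {768, 384}
    "head.W2": rng.standard_normal((384, 192)) * 0.01,      # new FC {384, 192}
    "head.W3": rng.standard_normal((192, n_classes)) * 0.01,  # new classifier
}
trainable = {name: name.startswith("head.") for name in params}

def sgd_step(params, grads, lr=0.001):
    """Update only unfrozen parameters; frozen ones keep their values."""
    for name in params:
        if trainable[name]:
            params[name] -= lr * grads[name]

frozen_before = params["backbone.W"].copy()
head_before = params["head.W1"].copy()
grads = {name: np.ones_like(w) for name, w in params.items()}
sgd_step(params, grads)
print(np.array_equal(params["backbone.W"], frozen_before))  # -> True
```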
6 Technologies to be used
6.1 Software Platform
a) Google Colab
b) Anaconda Distribution
6.2 Hardware Platform
Laptop
7 Advantages of this Project
Unlike Kraken and Centrifuge, this approach does not require a large database; large databases make those methods unaffordable and untransferable, and they become challenging as the amount of noise in the data increases.
9 Team Details
Course Name: Industrial Project
Student ID    Student Name    Role         Signature
TCA1709030    Samyak Jain     Developer
TCA1709021    Samyak Jain     Developer
10 Conclusion
The overall accuracy does not go above 28%, because the dataset is small and few rows have all taxa predicted correctly. It can only be improved by using more data.
11 References
https://arxiv.org/pdf/1901.11015.pdf
https://www.ncbi.nlm.nih.gov/
https://github.com/marcomeola/DAIRYdb
http://www.plantgdb.org/
https://rvdb.dbi.udel.edu/
https://www.gbif.org/dataset/search
https://greengenes.secondgenome.com/