Rushikesh Kamble (Roll No. 20) Siddhesh Ghosalkar (Roll No. 14) Pradnya Dhondge (Roll No. 07)
Bachelor of Engineering
This is to certify that the project work entitled "IDENTIFY & CLASSIFY ONLINE TOXIC
COMMENTS" is the bonafide work of "Rushikesh Kamble", "Siddhesh Ghosalkar" and
"Pradnya Dhondge" (Group No. 25), submitted to the University of Mumbai in partial
fulfillment of the requirement for the award of the degree of "BACHELOR OF
ENGINEERING" in "COMPUTER ENGINEERING".
I declare that this written submission represents my ideas in my own words and
where others' ideas or words have been included, I have adequately cited and
referenced the original sources. I also declare that I have adhered to all principles
of academic honesty and integrity and have not misrepresented or fabricated or
falsified any idea/data/fact/source in my submission. I understand that any
violation of the above will be cause for disciplinary action by the Institute and can
also evoke penal action from the sources which have thus not been properly cited
or from whom proper permission has not been taken when needed.
Pradnya Dhondge (07)
Siddhesh Ghosalkar (14)
Rushikesh Kamble (20)

Date:
TABLE OF CONTENTS
1. Abstract
2. Introduction
3. Preprocessing Text
4. Algorithms
5. Implementation
6. Conclusion
7. References
1. Abstract
3.1 TOKENIZATION
Tokenization is the process of splitting a text corpus into distinct
tokens and assigning each token a number. As a computer cannot
understand a language directly, this method maps all the words to
distinct numbers, which makes the text easier for the computer to
process. The result of this process is a dictionary of fixed size that
contains a mapping from words to numbers.
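The word-to-index mapping described above can be sketched in plain Python (a minimal illustration; the function names and the toy corpus are assumptions, not the report's actual code):

```python
# Minimal sketch of word-level tokenization: build a fixed-size
# dictionary mapping each distinct word to an integer index.
from collections import Counter

def build_vocab(corpus, vocab_size):
    """Map the vocab_size most frequent words to indices 1..vocab_size."""
    counts = Counter(word for text in corpus for word in text.lower().split())
    # Index 0 is conventionally reserved for padding / unknown words.
    return {word: idx
            for idx, (word, _) in enumerate(counts.most_common(vocab_size), start=1)}

def tokenize(text, vocab):
    """Convert a comment to a sequence of integer tokens (0 = unknown)."""
    return [vocab.get(word, 0) for word in text.lower().split()]

corpus = ["this comment is fine", "this comment is toxic"]
vocab = build_vocab(corpus, vocab_size=10)
print(tokenize("this comment is toxic", vocab))  # [1, 2, 3, 5]
```

The fixed `vocab_size` cap means rare words fall outside the dictionary and map to the unknown index.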
3.2 VECTORIZATION
Vectorization is a technique in which words are converted to feature
vectors. This paper uses Term Frequency Inverse Document Frequency
(TF-IDF) vectorization, which converts the words in a document to a
vector that can be used as input to the estimator. TF-IDF captures how
important a word is to a document by assigning a score to each word in
the document.
3.3 WORD EMBEDDINGS
Every word in the dataset is embedded into a feature vector by creating
an embedding matrix: a lookup table of words and their corresponding
embeddings. Embeddings usually refer to n-dimensional dense vectors.
The embedding matrix is of shape (vocab_size, embed_size), where
vocab_size is the number of words in the dictionary obtained from the
tokenization method and embed_size is the number of features into
which each word is embedded. Many pre-trained word embeddings with
different embedding sizes are available, such as GloVe (Global Vectors
for Word Representation), word2vec, and fasttext-crawl. This paper uses
fasttext-crawl-300d-2m for the embedding matrix, which is then passed
to the different algorithms.
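Building the (vocab_size, embed_size) matrix can be sketched as follows. The tiny `pretrained` dict stands in for vectors actually loaded from a file such as fasttext-crawl-300d-2m; the toy values and embed_size of 3 are assumptions for illustration only:

```python
# Sketch of building an embedding matrix of shape (vocab_size, embed_size):
# row i holds the embedding of the word whose token index is i.
import random

def build_embedding_matrix(vocab, pretrained, embed_size):
    vocab_size = len(vocab) + 1          # +1 for the padding/unknown index 0
    matrix = [[0.0] * embed_size for _ in range(vocab_size)]
    for word, idx in vocab.items():
        if word in pretrained:
            matrix[idx] = pretrained[word]
        else:
            # Words missing from the pre-trained file get small random vectors.
            matrix[idx] = [random.uniform(-0.05, 0.05) for _ in range(embed_size)]
    return matrix

vocab = {"toxic": 1, "comment": 2}
pretrained = {"toxic": [0.1, -0.2, 0.3]}
matrix = build_embedding_matrix(vocab, pretrained, embed_size=3)
print(len(matrix), len(matrix[0]))  # vocab_size and embed_size: 3 3
```

With the real fasttext-crawl-300d-2m file, embed_size would be 300 and vocab_size the dictionary size from tokenization.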
4. ALGORITHMS
4.4 FASTTEXT
Fasttext is a text classification library developed by Facebook. It can
be used to learn word embeddings and to build supervised or
unsupervised classification models from those embeddings. Fasttext
provides its own word embeddings, fasttext-crawl, trained on around
600 billion tokens. These word embeddings are open and can be
downloaded by anyone for their own use.
Fasttext has multiple pre-trained models to choose from depending on
the nature of the problem. In this paper we use the default supervised
classifier model.
5. IMPLEMENTATION
5.2 FASTTEXT
The fasttext library takes its input in a text format, so all the
comments from the train data are converted to a text document in which
each training example starts with '__label__' followed by the
respective label of the comment and then the comment itself. This text
file is fed into the fasttext model; after fine-tuning the
hyperparameters, setting the number of epochs to 5 and the learning
rate to 0.1, the model achieved an accuracy of 95.4%.
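The file preparation described above can be sketched as follows. The helper name, toy comments, and label strings are assumptions for illustration; the commented-out training call mirrors the reported hyperparameters (epoch=5, lr=0.1) and assumes the fasttext package is installed:

```python
# Sketch of preparing the fasttext training file: each line is
# "__label__<label> <comment text>", one training example per line.

def to_fasttext_lines(comments, labels):
    """Format (comment, label) pairs as fasttext supervised training lines."""
    return [f"__label__{label} {comment.strip()}"
            for comment, label in zip(comments, labels)]

comments = ["you are awesome", "you are an idiot"]
labels = ["clean", "toxic"]
lines = to_fasttext_lines(comments, labels)
print(lines[1])  # __label__toxic you are an idiot

# Training would then look like:
# with open("train.txt", "w") as f:
#     f.write("\n".join(lines))
# import fasttext
# model = fasttext.train_supervised(input="train.txt", epoch=5, lr=0.1)
```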
6. CONCLUSION
With the Internet being a platform accessible to everyone, it is
important to make sure that people with different ideas are heard
without the fear of toxic and hateful remarks. After analyzing various
approaches to the problem of classifying toxic comments online, it is
found that the CNN model works slightly better than LSTM and NB-SVM,
with an accuracy of 98.13%. Future scope for this analysis would be
integrating such classification algorithms into social media platforms
to automatically classify and censor toxic comments.
7. REFERENCES
[1] Siwei Lai, Liheng Xu, Kang Liu, Jun Zhao, "Recurrent Convolutional
Neural Networks for Text Classification", Proceedings of the
Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas,
2015.
[2] Sida Wang and Christopher D. Manning, "Baselines and Bigrams:
Simple, Good Sentiment and Topic Classification", Proceedings of the
50th Annual Meeting of the Association for Computational Linguistics,
2012.
[3] Mujahed A. Saif, Alexander N. Medvedev, Maxim A. Medvedev, and
Todorka Atanasova, "Classification of online toxic comments using the
logistic regression and neural networks models", AIP Conference
Proceedings 2048, 060011 (2018).
[4] Sepp Hochreiter and Jürgen Schmidhuber, "Long Short-Term Memory",
Neural Computation 9(8):1735-1780, 1997.
[5] Navaney, P., Dubey, G., & Rana, A., "SMS Spam Filtering Using
Supervised Machine Learning Algorithms", 2018 8th International
Conference on Cloud Computing, Data Science & Engineering (Confluence),
2018.