Deep Learning Algorithms for Cyberbullying Detection in Social Media Platforms
ABSTRACT Social media platforms are among the most widely used means of communication. However,
some individuals exploit these platforms for nefarious purposes, with ‘‘cyberbullying’’ being particularly
prevalent. Cyberbullying, which involves using electronic means to harass or harm others, is especially
common among young people. Consequently, this study aims to propose a model for detecting cyberbullying
using a deep learning algorithm. Three datasets from Twitter, Instagram, and Facebook were utilized to
predict instances of bullying using the Long Short-Term Memory (LSTM) method. The resulting model detects cyberbullying effectively, addressing challenges faced by previous detection techniques, and achieved accuracies of approximately 96.64%, 94.49%, and 91.26% for the Twitter, Instagram, and Facebook datasets, respectively.
machines (SVM) and extreme gradient boosting (XGB) are commonly used as classifiers in this domain, where vectorized representations of the textual data are typically essential. Commonly employed approaches include bag-of-words models in conjunction with TF-IDF (term frequency-inverse document frequency). Progress in embedding methodologies rooted in deep learning has brought forth tools such as fastText, GloVe, word2vec, and transformer-based approaches, which have been employed to acquire more complex representations. These embedding-based representations, pre-trained using representation learning tools, can be utilized with both traditional and sophisticated classifiers. This expands the array of techniques available for identifying hate speech, providing a diverse set of potential solutions applicable to various real-world situations [6].

The increase in cyberbullying on social networking sites and the diversity of its forms have had negative effects on victims. These effects are numerous, including harm to physical and mental health, such as anxiety, depression, impaired thinking, and low self-esteem, and in some cases cyberbullying has led to suicide [27]. With the emergence of these negative effects and the rise of bullying on social media sites, it has become necessary to find a solution that reduces and prevents the phenomenon of cyberbullying [28].

A comparative study published by [29], covering cyberbullying detection in social media over the last five years, presented a group of previous studies that used machine learning and deep learning algorithms to detect and classify cyberbullying. From their observations of the results of these related works, the authors concluded that, to achieve better results in future research, deep learning algorithms (the BiLSTM classifier and BERT) are recommended, while among machine learning algorithms, SVM and NB are the preferred classifiers.

The survey of previous studies revealed a number of limitations, such as the handling of multi-class cyberbullying categories, the need for experiments with larger datasets for aggression detection, and the lack of datasets drawn from multiple social media platforms. The primary aim of this research was therefore to develop a detection model that enhances the performance of classifiers on a large generic dataset by combining feature extraction techniques. The study presents an LSTM deep learning detection model capable of identifying cyberbullying content in user comments across three distinct social media platforms in real time. The experimental setup for the new datasets closely follows the methodology outlined in a prior publication [7] in an attempt to address these limitations, allowing us to examine how well the suggested model performs on the chosen datasets and how adaptable it is to other datasets; using other datasets makes the model more flexible in its detection of bullying. Numerous experiments were conducted with a wide range of time steps on a real-world dataset, while adhering to a time-conscious evaluation technique that demonstrates the enhancement in performance compared to standard references.

II. RELATED WORKS
Several methods offer models for cyberbullying detection. In the study of [8], a model is suggested to provide a dual definition of cyberbullying by utilizing a creative CNN idea for content analysis as well as a method to deal with arrangements of lower accuracy. When compared to other studies, the collected data are shown to provide superior precision and categorization.

A systematic review of n=186 entries from internet databanks was published by [9]. In this article, 10 literature reviews were chosen to assess and debate the data regarding the effectiveness of ML in preventing cyberbullying. To predict cyberbullying, most models take advantage of content-based features; the most prevalent algorithms are support vector machines, naive Bayes, and convolutional neural networks, and the majority of these features are based on text from social media posts. ML is a cutting-edge preventative technique that might enhance and complement adolescent education programs and serve as the foundation for the creation of technology-based automated screening methods.

According to studies by [10], a technique to detect cyberbullying was created using fuzzy logic, in which the communication between two users is continuously observed and each message's emotional content is identified. Depending on the emotions, each user's behaviour is classified as either decent or bullying. The user's account is automatically terminated and reported if the amount of observed bullying exceeds a predetermined threshold value. The authors concluded that, if used alongside social networking sites, the technique could be a helpful tool for avoiding online harassment. The created algorithm can also be used for surveillance and for studying human behaviour.

A novel pre-trained BERT model was developed by [11] and assessed using two social media datasets. One dataset featured a comparatively small network layer at the top functioning as a classifier, while the other had a larger network layer at the top serving as a classifier. The primary objective of the study was to detect instances of cyberbullying on various social media platforms. When compared to earlier methods, this one performs better in terms of dimensions and model training.

A study [12] presented a new model known as DEA-RNN, which combines Elman-type recurrent neural networks (RNNs) with a refined dolphin echolocation algorithm (DEA). A dataset of 10,000 tweets was used for evaluation, comparing the model's performance with various advanced algorithms such as bidirectional long short-term memory (Bi-LSTM), RNN, SVM, multinomial naive Bayes (MNB), and random forests (RF). Results from the experiments indicated that DEA-RNN outperformed all other methods across different scenarios, achieving an average accuracy of 90.45%, precision of 89.52%, recall of 88.98%, F1-score of 89.25%, and specificity of 90.94%.
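The bag-of-words/TF-IDF weighting discussed above can be illustrated with a short, self-contained sketch. The documents, tokenization, and function name here are illustrative only, not the paper's implementation:

```python
# Minimal TF-IDF sketch (illustrative):
#   tf(t, d)  = count of t in d / number of terms in d
#   idf(t)    = log(N / number of documents containing t)
import math
from collections import Counter

docs = [
    "you are awesome",
    "nobody likes you loser",
    "great game last night",
]

def tfidf(docs):
    N = len(docs)
    tokenized = [d.split() for d in docs]
    # Document frequency: in how many documents each term appears.
    df = Counter(t for toks in tokenized for t in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        n = len(toks)
        vectors.append({t: (c / n) * math.log(N / df[t]) for t, c in tf.items()})
    return vectors

vecs = tfidf(docs)
# "you" appears in 2 of the 3 documents, so its idf (and weight) is low;
# "loser" is document-specific, so it receives a higher weight.
print(vecs[1]["loser"] > vecs[1]["you"])  # True
```

Vectors produced this way (in practice by a library vectorizer over a large corpus) are what classical classifiers such as SVM or XGB consume.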
Precision can be described as the ratio of true positives to the sum of true positives and false positives.

Precision = TP / (TP + FP)    (5)

Recall, in the context of information retrieval, refers to the ratio of correctly retrieved results to the total number of results that should have been retrieved. In binary classification, recall is also known as sensitivity. It can be interpreted as the likelihood that a relevant document will be returned by a search query.

Recall = TP / (TP + FN)    (6)

The F-measure, also known as the F1 score or F score, quantifies the accuracy of a test by calculating the weighted harmonic mean of the test's precision and recall.

F-measure = 2 * TP / (2 * TP + FP + FN)    (7)

F. RESULTS AND DISCUSSION
The laptop used for all experiments was a DELL 3000 series with a 2.20 GHz Intel(R) Core(TM) i5-5200U processor, 12.0 GB of RAM, and a 64-bit operating system. The models were run multiple times across various epochs to ensure consistent evaluation parameters. To validate and ensure the reliability of the results, the performance of the cyberbullying detection scheme was assessed using metrics such as accuracy, precision, recall, and F1 measure on a dataset with minimal skewness.

Data splitting refers to the process of dividing the available data into two segments, typically for the purpose of cross-validation. One portion is dedicated to developing a predictive model, while the other is utilized to evaluate the model's performance. Splitting data into training and testing sets is a pivotal stage in the assessment of data mining models. Typically, a significant portion of the data is designated for training, with a smaller fraction set aside for testing.

In this research, the data was partitioned into various ratios for training and testing: 80% training and 20% testing, 70% and 30%, 60% and 40%, and 50% and 50%. The sets were constructed using a randomized array, enabling the model to adapt across various data samples. This methodology also aids in revealing the model's dependability and the uniformity of outcomes through multiple iterations. The random state is produced through numpy.random to facilitate random selection during the division of data, ensuring consistent and replicable splits.

The obtained accuracy was at its best for the split ratio of 80% training and 20% testing: 96.64, 95.49, and 91.42 for the Twitter, Instagram, and Facebook datasets, respectively, as illustrated in Table 1, which reveals that the performance of the suggested method decreased as the amount of training data decreased across the remaining split ratios.

TABLE 1. Performance of the proposed model in terms of the various data set split ratios.

Similar results were obtained by [17]. The researchers indicated that their model was trained with 4,590,756 parameters, with distinct input and output specifically designed for textual data; the model attains 85% accuracy using sequentially dense LSTM layers. Additionally, the researchers in the study conducted by [24] utilized an 80:20 ratio for dividing their data and indicated that their model consistently performed well in the early detection of cyberbullying. They utilized various characteristics to differentiate between positive and negative cases, setting low thresholds to facilitate early detection and using simpler features, such as profile owner traits, for the negative model. A study by [21] introduced an innovative approach to detecting cyberbullying, incorporating three advanced deep learning structures: a multichannel architecture with BiGRU, a transformer block, and CNN models. The effectiveness of their methodology was evaluated, with the experimental results highlighting the value of their strategy in categorizing short messages (tweets). When the dataset was divided into 75% for training and 25% for testing, an accuracy rate of approximately 88% was attained.

Once the model is defined, it undergoes a compilation process in order to enable execution on the Keras backend, which is based on TensorFlow. This compilation of the model includes the incorporation of optimizers, loss functions, and metrics. Optimizers play a crucial role in updating the weights of the model during the training phase. An important parameter utilized in this process is the number of epochs, which denotes how many times the model encounters the training dataset. This hyperparameter specifies the number of iterations the learning algorithm will make through the complete training dataset; during each epoch, all samples in the training dataset are utilized to adjust the internal model parameters. It is important to note that an epoch consists of at least one batch.

The accuracy of the model improved with each subsequent epoch; however, it reached a point of stability after 40 epochs, as demonstrated in Table 2. The model loss during different epochs for the used datasets is shown in Fig. 2, Fig. 3, and Fig. 4. The cross-entropy loss evaluated throughout the various epochs demonstrated effective convergence, suggesting an optimal level of performance for the model.

FIGURE 3. Accuracy and loss of LSTM model for the dataset2 (Instagram).
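The evaluation metrics of Eqs. (5)-(7) can be verified with a short computation. The confusion counts below are illustrative only, not results from the paper:

```python
# Precision, recall, and F-measure from confusion counts, per Eqs. (5)-(7).
def precision(tp, fp):
    return tp / (tp + fp)

def recall(tp, fn):
    return tp / (tp + fn)

def f_measure(tp, fp, fn):
    # Equivalent to the harmonic mean 2*P*R / (P + R).
    return 2 * tp / (2 * tp + fp + fn)

tp, fp, fn = 80, 10, 20  # illustrative counts
print(round(precision(tp, fp), 3))      # 0.889
print(round(recall(tp, fn), 3))         # 0.8
print(round(f_measure(tp, fp, fn), 3))  # 0.842
```

Note that Eq. (7) and the harmonic-mean form agree term by term, since 2PR/(P+R) simplifies to 2TP/(2TP+FP+FN).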
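The randomized splitting procedure described in the results section (a shuffled index array with a fixed random state, giving replicable splits such as 80:20) can be sketched as follows. The helper name and toy data are illustrative, and the standard-library `random` module stands in for the numpy.random calls the paper refers to:

```python
# Reproducible train/test split via a shuffled index array (illustrative sketch).
import random

def train_test_split(samples, labels, test_ratio=0.2, seed=42):
    idx = list(range(len(samples)))
    random.Random(seed).shuffle(idx)  # fixed seed -> consistent, replicable split
    cut = int(len(idx) * (1 - test_ratio))
    train_idx, test_idx = idx[:cut], idx[cut:]
    return ([samples[i] for i in train_idx], [labels[i] for i in train_idx],
            [samples[i] for i in test_idx], [labels[i] for i in test_idx])

comments = [f"comment {i}" for i in range(10)]  # toy stand-in for user comments
labels = [i % 2 for i in range(10)]             # toy bullying / non-bullying labels
X_tr, y_tr, X_te, y_te = train_test_split(comments, labels, test_ratio=0.2)
print(len(X_tr), len(X_te))  # 8 2
```

Varying `test_ratio` (0.2, 0.3, 0.4, 0.5) reproduces the four split configurations compared in Table 1, while the fixed seed keeps each split identical across repeated runs.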