A Survey On Bias Detection in Online News Using Deep Learning
A Survey On Bias Detection in Online News Using Deep Learning
Department of Computer Science and Engineering Depart ment of Computer Science and Engineering
Delhi T echnological University, Delhi, India Delhi T echnological University, Delhi, India
[email protected] [email protected]
2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC) | 978-1-6654-5630-2/23/$31.00 ©2023 IEEE | DOI: 10.1109/ICAAIC56838.2023.10140917
Department of Computer Science and Engineering Department of Computer Science and Engineering
Delhi T echnological University, Delhi, India Delhi T echnological University, Delhi, India
[email protected] [email protected]
Abstract— The detection of bias in online news has on the bias in various news sources. It can aid in the creation
become a critical and sensitive area of research in recent of tools for news consumers to lessen any negative effects
years, largely due to the growing use of online platforms, of media bias on them which today can be performed by
such as social media, and the proliferation of news
using various deep learning and machine learning based
sources in digital format. This article provides a
comprehensive review of existing studies on online bias architectures using mBERT, LSTM, CNN, etc. The article
detection using natural language processing, including includes models which are being implemented to enhance
an analysis of the methodologies employed, an overview the quality of the news and detect biases in the news. A
of available datasets, and suggestions for further Neural Network model, Headline attention network is used
research in this field. The article examines techniques to increase the accuracy of bias identification in comparison
such as data pre-processing, feature extraction, to baseline model LSTM. With the increase of news in
classification, and prediction in detail. Various deep
various languages, detection of the bias became extremely
learning algorithms, such as BERT and Long Short-
Term Memory (LSTM), as well as machine learning difficult with the existing models and therefore the mBERT
algorithms, such as logistic regression, Recursive Neural model is implemented. The paper also reviews additional
Network models, and Naive Bayes, can be used to detect models that are applied to improve the model's accuracy.
bias in news headlines and articles. The article concludes Automatic bias detection for journalists can enhance their
by discussing the potential impact of bias detection on writing through more unbiased reporting. It might enable
journalism and society, as well as future research
balanced search for news aggregator programs like Google
directions.
News, similar to what is available on All Sides . The survey
Keywords—Bias Detection, Deep Learning, Natural includes five sections where section 1 gives introduction of
Language Processing, Long S hort-Term Memory, Machine the paper. Section 2 gives a brief introduction about various
Learning technologies used in the research. In Section 3, numerous
I. INTRODUCTION data sets examined by researchers are described, along with
an overview of recent work, as well as various approaches
The core purpose of news is to inform people about current
of feature extraction. This section also talks about the
affairs and developing global challenges. The intent of news
different algorithms that are implemented to achieve best
is to let the public know about the events, issues and
accuracy. This research study explores the outcomes of
different happenings that are unfolding everywhere in the
numerous models and provides a comparative analysis of
world. Today, with increasing acceptance of extensive
the techniques in section 4. This research study is concluded
social media platforms it has gathered more than millions of
in section 5 by briefly outlining the future direction of study.
users of different age groups and from all over the world,
with this extensive access of news which are available on II. BACKGROUND
online websites, users are able to browse and access news A. Deep Learning (DL)
data more quickly and easily. These websites are required to Deep learning uses artificial neural networks, a branch of
provide accurate, unbiased news that includes facts. On machine learning, to model and resolve complicated issues.
several levels, identifying and maybe reducing media bias in AI enables machines to learn from massive volumes of data
the news is important for society. Automated bias detection and carry out operations that ordinarily demand for human
can help policy regulators and related entities keep an eye intellect, like picture recognition, interpretation of natural
language, and decision-making. Each layer in the neural language and identify any potentially biased content. To do
network's topology extracts increasingly abstract this, the LSTM network can be trained using a sizable
information from the incoming data. A network can learn corpus of biased news items. The network can then be used
more intricate and complex properties the more layers it to analyze new articles and identify any language patterns
has.Some very effective applications of deep learning can be that may be associated with biased reporting. LSTM can be
found in computer vision, speech recognition, and natural a useful tool for bias detection in news, as it is able to
language processing. It can therefore be quite useful in identify subtle patterns in language use that may be difficult
identifying and reducing bias; nevertheless, it necessitates for humans to detect. By using LSTM, it is possible to
careful consideration of a number of variables, such as the analyze large quantities of news articles and identify any
selection of fairness metrics, data preparation, model potentially biased content, which can help to promote more
architecture, and others. The analysis of news articles, fair and balanced reporting.
headlines, and other media information to find potential
biases is a common practice in news bias detection.
CNNs are a type of deep neural network that are commonly headline, description, date and time, URL of the
employed to analyze data with a grid based formation, such article, full article, article type short, mid, long
as pictures. They typically consist of an input layer, values to show the length of the article
convolution layer, max-pooling layer, and fully connected ● News Category Dataset [9] - This is an open source
layer, also known as a dense layer, are some of the layers of that contains around 2 lakh news headlines from
a CNN that are described in [Fig . 2]. An image is entered 2012 to 2022 . Each record consists of these
into the input layer. The output of the convolution layer features : Editorial article belongs to which group,
relies on the kernel or filter value, which is then fed into the headline, authors, Link, short description, date. This
subsequent layers. The pooling layer is utilized to enhance dataset has a total of 42 categories in which it is
processing speed and reduce dimensionality. categorized like politics, wellness, entertainment,
travel, style and beauty and many more.
Fig.7 Statistics about the dataset[3] A multilingual version of BERT that has been trained on
100 languages including Hindi known as XLM-RoBERTa
was used .To prepare the input for the transformer network,
When either of these methods is applied, the results in a [SEP] token was used, and the standard cross entropy loss
Table 2 demonstrate considerable improvements when was used to train the model.In order to improve the model's
compared to the baseline approach without de-biasing, TLP ability to identify bias on the dataset the advantage of the
showed an increase of 14.12 points in accuracy and 12.73 model's pre-training in Hindi is taken.A ML approach SVM
points in macro-F1. The use of triplet loss to mitigate source is also used to transform the textual data into a set of
bias and incorporation of media-level representation from features using TF-IDF. Using the Radial Basis Function, the
Twitter followers resulted in 30.51 and 28.76 absolute classification model is applied to the changed features.
improvements in macro F1 compared to the baseline Twitter
bios + Article with TLP vs Article (baseline) on the K (x, x′) = exp (−γ∥x − x′∥2)
challenging media-based split, respectively.
Here γ is a free parameter. A count matrix is created, and
the model is trained using TF-IDF. once the matrix has been
normalized .The dataset was tested using both the
approaches and yielded the following validation accuracy
scores: 80.2% for mBERT , 83% for XLM-RoBERTa,
79.2% for XLM-RoBERTa (Hindi), 78.9 % for IndicBERT,
78.7% for SVM, 77.1% for Logistic Regression , 3: 78.7%
for Random Forest.And It was observed that XLM -
RoBERTa a multilingual deep learning models perform
better than the machine learning approaches since the results
obtained were 83% accuracy, 72.1% MCC and 76.4% F1-
score.The main issue was that the models could not tell if
the news stories and headlines were neutral to the BJP.
news and misinformation. Relative research of the recent articles in various languages from recent studies in this field
work is shown in the table. of study. There are more available for research purposes.
The requirement for a sizable data set with labels for
TABLE II training models is one of the major issues with deep
learning-based approaches for bias detection. Most recent
COM PARISON AM ONG SOM E OF THE RECENT WORK IN works used inequivalent open-source datasets, where the
ONLINE BIAS DETECTION proportion of biased news was much smaller than that of
unbiased news. Even with small and unbalanced datasets,
numerous techniques produced results with extremely high
sensitivity and specificity. This demonstrates that overfitting
is present to some level and that insufficient data exists to
allow for generalization of the models to the task at hand.
Study Year of Method used Remarks
To some degree, data augmentation approaches may be able
Publication to solve this problem. The training of these models might be
enhanced with more data. The grade of the enhanced data
89.54% may increase by incorporating more labeled data. Thus it
[2] 2020 Headline
accuracy would be possible to create a deep learning model that is
Attention
precise.
Network.
REFERENCES
70%
[1] Minh Vu “Political News Bias Detection using Machine Learning”
Twitter bios + (BERT)
accuracy Department of Computer Science Earlham College 801 National Road
[3] 2019 Article with TLP
West Richmond, Indiana 47374
72%
(LSTM ) [2] R. R. Reddy, S. R. Duggenpudi, R. mamidi,” Detecting Political Bias in
accuracy News Articles Using Headline Attention” 10.18653/v1/W19-4809, ].
[1] 2018 Multilayer 81% [5] W. Marusarz, “T he 2022 Definitive Guide to Natural Language
Perceptron model accuracy Processing (NLP)”, November 15, 2022.
[14] Kamen cay, P., M. Benco, T. Mizdo ˇ s and R. Radil. “A new method
for face recognition using convolutional neural network.” Advances in
Electrical and Electronic Engineering 15 (2017): 663 -672.