BS Thesis MidSem Report
Bachelor of Science
in
Aditya Manoj
21347
February, 2023
Abstract
Clickbait is a widespread problem in online content, deceiving users with dramatic headlines that fall short of their claims. Conventional clickbait detection techniques concentrate mostly on textual analysis, which frequently fails to detect complex multimodal clickbait that incorporates deceptive images, videos, or thumbnails. To improve classification accuracy, this study presents a multimodal approach to clickbait detection that makes use of both textual and visual cues. Our model uses natural language processing (NLP) techniques for textual analysis and deep learning-based image processing to extract visual patterns frequently linked to misleading material. To train and assess the model, we use publicly accessible datasets such as the Clickbait17 and YouTube Clickbait datasets.
Our results show that the multimodal strategy outperforms unimodal approaches by a large margin, successfully capturing a variety of clickbait tactics. By making clickbait detection systems more reliable and scalable, our research helps preserve content integrity and increase user confidence in online platforms.
Keywords: Clickbait Detection, Multimodal Learning, Natural Language
Processing, Deep Learning, Image Processing, Fake News Prevention.
Contents
1 Introduction 1
1.1 Introduction to Clickbait and Its Challenges . . . . . . . . . . . . 1
1.2 Motivation for Multimodal Clickbait Detection . . . . . . . . . . . 1
1.3 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.4 Objectives of the Research . . . . . . . . . . . . . . . . . . . . . . 2
1.5 Scope of the Study . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.6 Contributions of This Study . . . . . . . . . . . . . . . . . . . . . 3
1.7 Structure of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . 4
2 Background 5
2.1 Definition and Characteristics of Clickbait . . . . . . . . . . . . . 5
2.2 Evolution of Clickbait in Online Media . . . . . . . . . . . . . . . 5
2.3 Impact of Clickbait on Digital Ecosystems . . . . . . . . . . . . . 6
2.4 Existing Approaches to Clickbait Detection . . . . . . . . . . . . . 6
2.4.1 Text-Based Clickbait Detection . . . . . . . . . . . . . . . 6
2.4.2 Image-Based Clickbait Detection . . . . . . . . . . . . . . 7
2.4.3 Multimodal Clickbait Detection . . . . . . . . . . . . . . . 7
2.5 Challenges in Clickbait Detection . . . . . . . . . . . . . . . . . . 7
2.6 Research Gaps and Motivation . . . . . . . . . . . . . . . . . . . . 8
3 Methodology 9
3.1 Dataset Collection and Preprocessing . . . . . . . . . . . . . . . . 9
3.1.1 Data Sources . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.1.2 Text Preprocessing . . . . . . . . . . . . . . . . . . . . . . 9
3.1.3 Image Preprocessing . . . . . . . . . . . . . . . . . . . . . 10
3.2 Feature Extraction . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3.2.1 Text Feature Extraction . . . . . . . . . . . . . . . . . . . 11
3.2.2 Image Feature Extraction . . . . . . . . . . . . . . . . . . 11
3.3 Model Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3.3.1 Fusion Techniques . . . . . . . . . . . . . . . . . . . . . . 12
3.4 Training and Evaluation . . . . . . . . . . . . . . . . . . . . . . . 12
3.4.1 Training Strategy . . . . . . . . . . . . . . . . . . . . . . . 12
3.4.2 Evaluation Metrics . . . . . . . . . . . . . . . . . . . . . . 12
3.5 Accuracy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.6 Precision . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.7 Recall (Sensitivity) . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.8 F1-Score . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.9 ROC-AUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
5 Conclusion 19
5.1 Summary of Research . . . . . . . . . . . . . . . . . . . . . . . . . 19
5.2 Key Findings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
6 Work Plan 21
6.1 Final Remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Chapter 1
Introduction
• The increasing reliance on thumbnails, memes, and infographics to
mislead users.
• To employ state-of-the-art deep learning models for classification,
such as transformer-based NLP models (BERT, RoBERTa) and
CNN-based image models (ResNet, Vision Transformers).[4][8][9]
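As a concrete illustration of this objective, the minimal sketch below loads one pretrained text encoder and one pretrained image backbone of the kinds listed above. It assumes the Hugging Face transformers library and torchvision, neither of which is prescribed by this report, and the checkpoint names are illustrative.

```python
# Minimal sketch: loading the kinds of pretrained backbones named above.
# Assumes Hugging Face `transformers` and `torchvision`; checkpoints are illustrative.
from transformers import AutoTokenizer, AutoModel, ViTModel
import torchvision.models as tv_models

# Transformer-based NLP encoder (BERT here; RoBERTa would use "roberta-base").
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
text_encoder = AutoModel.from_pretrained("bert-base-uncased")

# CNN-based image encoder (ResNet-50) and a Vision Transformer alternative.
resnet = tv_models.resnet50(weights=tv_models.ResNet50_Weights.DEFAULT)
vit = ViTModel.from_pretrained("google/vit-base-patch16-224-in21k")
```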
This work lays the foundation for future research in automated misinformation
detection, particularly in the realm of multimodal content analysis.
• Conducts an extensive comparative study against existing unimodal
and multimodal detection methods.
Chapter 2
Background
Clickbait has become a major concern in digital content, where misleading headlines, exaggerated claims, and deceptive visuals are used to attract user engagement. This practice manipulates user behavior, contributes to misinformation, and undermines the integrity of content across digital platforms. Addressing this issue requires an understanding of the history, characteristics, and current research in clickbait detection. This chapter provides a comprehensive overview of clickbait, its evolution, its impact on digital ecosystems, existing detection methodologies, and open research challenges. [2] [6]
• Traditional Media Era: Sensationalized headlines in newspapers aimed
at boosting sales.
• Transformer-based models like BERT and GPT fine-tuned for clickbait
classification.
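To make this approach concrete, the sketch below fine-tunes a pretrained BERT model as a binary clickbait classifier. It is an assumed setup using the Hugging Face transformers Trainer on a tiny in-memory example dataset, not the exact configuration used in this work.

```python
# Illustrative sketch: fine-tuning BERT as a binary clickbait classifier.
# Assumes Hugging Face `transformers`; the two example headlines are made up.
import torch
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)  # 0 = non-clickbait, 1 = clickbait

headlines = ["You won't BELIEVE what happened next",
             "Central bank raises interest rates by 25 basis points"]
labels = [1, 0]
enc = tokenizer(headlines, truncation=True, padding=True, return_tensors="pt")

class HeadlineDataset(torch.utils.data.Dataset):
    """Wraps tokenized headlines and labels for the Trainer."""
    def __init__(self, encodings, labels):
        self.encodings, self.labels = encodings, labels
    def __len__(self):
        return len(self.labels)
    def __getitem__(self, idx):
        item = {k: v[idx] for k, v in self.encodings.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="clickbait-bert",
                           num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=HeadlineDataset(enc, labels),
)
trainer.train()
```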
2.6 Research Gaps and Motivation
While existing studies have made progress in detecting clickbait, there remain
significant gaps:
Chapter 3
Methodology
Figure 3.1: Flowchart of Multimodal Clickbait Detection
• Data Augmentation - Includes flipping, rotation, and contrast adjust-
ments to improve model generalization.
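A possible augmentation pipeline covering these operations is sketched below, assuming torchvision; the specific parameter values are illustrative rather than taken from our experiments.

```python
# Assumed augmentation pipeline for thumbnail images (torchvision);
# parameter values are illustrative only.
from torchvision import transforms

train_transforms = transforms.Compose([
    transforms.Resize((224, 224)),            # match ResNet/ViT input size
    transforms.RandomHorizontalFlip(p=0.5),   # flipping
    transforms.RandomRotation(degrees=15),    # rotation
    transforms.ColorJitter(contrast=0.3),     # contrast adjustment
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
```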
• Image Processing Module - Utilizes deep vision models (ResNet, Vision Transformers) for visual analysis.
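One way to combine the text and image modules is a late-fusion head that concatenates the pooled embeddings from the two encoders before classification. The sketch below is an assumed PyTorch design for such a head, not the exact architecture evaluated in this report; the hidden size, dropout rate, and pooling choice are illustrative.

```python
# Assumed late-fusion design: concatenate pooled text and image embeddings,
# then classify with a small MLP head. Encoder choices follow the modules above.
import torch
import torch.nn as nn

class LateFusionClassifier(nn.Module):
    def __init__(self, text_encoder, image_encoder, text_dim=768, image_dim=768):
        super().__init__()
        self.text_encoder = text_encoder    # e.g. BERT/RoBERTa from `transformers`
        self.image_encoder = image_encoder  # e.g. a ViT model from `transformers`
        self.classifier = nn.Sequential(
            nn.Linear(text_dim + image_dim, 256),
            nn.ReLU(),
            nn.Dropout(0.2),
            nn.Linear(256, 2),              # clickbait vs. non-clickbait
        )

    def forward(self, input_ids, attention_mask, pixel_values):
        text_out = self.text_encoder(input_ids=input_ids,
                                     attention_mask=attention_mask)
        image_out = self.image_encoder(pixel_values=pixel_values)
        # Use each encoder's [CLS]-token embedding as its pooled representation.
        text_vec = text_out.last_hidden_state[:, 0]
        image_vec = image_out.last_hidden_state[:, 0]
        fused = torch.cat([text_vec, image_vec], dim=-1)
        return self.classifier(fused)
```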
3.5 Accuracy
Accuracy measures the overall correctness of the model and is defined as:
\[
\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{3.1}
\]
where TP and TN represent the correctly predicted clickbait and non-clickbait samples, respectively, while FP and FN denote the false positives and false negatives.
3.6 Precision
Precision measures how many of the predicted clickbait samples are actually
clickbait and is given by:
\[
\text{Precision} = \frac{TP}{TP + FP} \tag{3.2}
\]
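3.7 Recall (Sensitivity)
Recall measures how many of the actual clickbait samples are correctly identified and is given by:
\[
\text{Recall} = \frac{TP}{TP + FN} \tag{3.3}
\]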
3.8 F1-Score
The F1-score is the harmonic mean of precision and recall and is given by:
\[
\text{F1-Score} = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \tag{3.4}
\]
3.9 ROC-AUC
ROC-AUC evaluates the model’s ability to distinguish between classes and is
calculated based on the true positive rate (TPR) and false positive rate (FPR):
\[
\text{TPR} = \frac{TP}{TP + FN}, \qquad \text{FPR} = \frac{FP}{FP + TN} \tag{3.5}
\]
The AUC value ranges from 0 to 1, where a higher score indicates better model
performance.
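In practice, all of the metrics above can be computed directly from the model's predictions. The sketch below uses scikit-learn, an assumed tooling choice not specified in this report, on small illustrative label and score vectors.

```python
# Computing the evaluation metrics defined above with scikit-learn
# (assumed tooling; labels and scores here are illustrative only).
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score)

y_true  = [1, 0, 1, 1, 0, 0, 1, 0]   # 1 = clickbait, 0 = non-clickbait
y_pred  = [1, 0, 1, 0, 0, 1, 1, 0]   # hard predictions
y_score = [0.92, 0.10, 0.78, 0.41, 0.22, 0.63, 0.85, 0.30]  # predicted P(clickbait)

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1-Score :", f1_score(y_true, y_pred))
print("ROC-AUC  :", roc_auc_score(y_true, y_score))
```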
Chapter 4
Results and Discussion
This chapter presents the results of our multimodal clickbait detection experiments and discusses their implications. Various deep learning models, including BERT, RoBERTa, CLIP, and Vision Transformers (ViT), were evaluated on benchmark datasets such as Clickbait17 and YouTube Clickbait. The models were assessed on classification metrics including accuracy, precision, recall, and F1-score.
We also analyze the impact of feature fusion techniques, dataset variations, and adversarial robustness on model performance, comparing unimodal and multimodal approaches.
Model                 Dataset              Accuracy   Precision   F1-Score
BERT + ViT (Exp 2)    YouTube Clickbait    89%        0.90        0.88
BERT + ViT (Exp 3)    YouTube Clickbait    90%        0.91        0.89
RoBERTa + CLIP        YouTube Clickbait    91%        0.92        0.90
Model                 Dataset                     Accuracy   Precision   F1-Score
ViT + CLIP            Clickbait17-Test-170720     85%        0.91        0.84
ViT-Exp1              Clickbait17-Train-170331    78%        0.83        0.76
ViT-Exp2              Clickbait17-Train-170331    81%        0.86        0.79
ViT-Exp3              Clickbait17-Train-170331    84%        0.89        0.82
ViT-Exp4              Clickbait17-Train-170331    86%        0.91        0.84
ViT-Exp5              Clickbait17-Train-170331    88%        0.93        0.86
RoBERTa + CLIP        Clickbait-Train-170331      89%        0.94        0.88
BERT + ViT (Exp 2)    YouTube-Clickbait           90%        0.92        0.89
BERT + ViT (Exp 3)    YouTube-Clickbait           91%        0.93        0.90
RoBERTa + CLIP        YouTube-Clickbait           92%        0.94        0.91
4.3 Conclusion
This chapter presented an extensive evaluation of different multimodal clickbait detection models, comparing performance across datasets and architectures. The results confirm that multimodal deep learning significantly enhances clickbait detection accuracy, particularly with transformer-based language models and vision-language fusion techniques.
The next chapter will discuss future improvements and potential applications
of this research.
Chapter 5
Conclusion
• Multimodal models outperform unimodal approaches, indicating
that a combination of text and visual analysis significantly enhances click-
bait detection accuracy.
Chapter 6
Work Plan
Bibliography
[3] Carmela Comito, Luciano Caroprese, and Ester Zumpano. Multimodal fake
news detection on social media: a survey of deep learning techniques. Social
Network Analysis and Mining, 13(1):101, 2023.
[4] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert:
Pre-training of deep bidirectional transformers for language understanding.
In Proceedings of the 2019 conference of the North American chapter of
the association for computational linguistics: human language technologies,
volume 1 (long and short papers), pages 4171–4186, 2019.
[5] Ayse Geçkil, Ahmet Anil Müngen, Esra Gündogan, and Mehmet Kaya.
A clickbait detection method on news sites. In 2018 IEEE/ACM Inter-
national Conference on Advances in Social Networks Analysis and Mining
(ASONAM), pages 932–937. IEEE, 2018.
[7] Mini Jain, Peya Mowar, Ruchika Goel, and Dinesh K Vishwakarma. Click-
bait in social media: detection and analysis of the bait. In 2021 55th annual
conference on information sciences and systems (CISS), pages 1–6. IEEE,
2021.
[8] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of NAACL-HLT, volume 1, Minneapolis, Minnesota, 2019.
[9] Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi
Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov.
Roberta: A robustly optimized bert pretraining approach. arXiv preprint
arXiv:1907.11692, 2019.
[10] Abhishek Mallik and Sanjay Kumar. Word2vec and lstm based deep learn-
ing technique for context-free fake news detection. Multimedia Tools and
Applications, 83(1):919–940, 2024.
[11] Qing Meng, Bo Liu, Xiangguo Sun, Hui Yan, Chengyu Liang, Jiuxin Cao,
Roy Ka-Wei Lee, and Xing Bao. Attention-fused deep relevancy matching
network for clickbait detection. IEEE Transactions on Computational Social
Systems, 10(6):3120–3131, 2022.