0% found this document useful (0 votes)

27 views9 pages

Project Synopsis Report Format

Uploaded by

Varnika Tomar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views9 pages

Project Synopsis Report Format

Uploaded by

Varnika Tomar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

A

Project
Synopsis Report (KCS 753)
On

Title of Project
Under the Supervision
of
(supervisor's name with Designation)

Submitted by:
Student Name (Uni. Roll No)

Session: 2024-25

Department of Computer Science and Engineering

Dronacharya Group of Institutions, Greater Noida, Uttar Pradesh, India
201306

TABLE OF CONTENTS
1. Introduction
2. Background/ Relevant Work\Existing System\ Literature Review
3. Proposed work
4. Methodology/Experimental Work
5. Conclusion and Future Scope
1. INTRODUCTION
News has been the provider of information since centuries. In traditional times, there were news
agencies which were the source of news and hence, reliability and confidentiality remained with
the official organizations itself. In recent times, internet grew rapidly from rural to urban areas.
With the growth of internet, more users from all over the world got access to internet and to
spread the information in their way [1].

According to Economic Times report of 2019, there are 627 million internet users in India which
means India is home to world’s second largest internet user base [2]. However, with the
increasing popularity of social media, the internet becomes ideal breeding ground for fake news.
A research by BBC shows that nearly 72% Indians struggled to distinguish between fake and
real news [3]. Websites like The Onion[4], News Thump[5], The Poke News[6], and The Mash
News[7] are among the top rankers of ‘Fake’ or ‘misleading’ news propagator [8]. Hence, many
online fact checking resources like Snopes[9], FactCheck.org[10], Factmata.com[11],
PolitiFact.com[12] and many more grew rapidly. Social networking sites such as Facebook,
Whatsapp, and Google addressed this particular concern but the efforts hardly contributed in
solving the issue.

Approaches to detect Fake News:

I. Detection Approaches Based on Machine Learning: Support Vector Machines

(SVMs), Random forests, logistic regression models, Conditional Random Field (CRF)
classifiers, Hidden Markov Models (HMMs) [13].

II. Detection approaches based on deep learning: The two most widely implemented
paradigms in modern artificial neural networks are Recurrent Neural Networks (RNN)
and Convolutional Neural Networks (CNN) [13].

This model will detect fake news by checking the credibility

of the news provider, comment sentiment analysis and
content of the provided news. We will be using Natural Language Processing
for pre-processing the dataset and machine learning approach to fight fake news.
Figure 1: Fact Checker [14]

2. BACKGROUND
There are many models for fact checking and detecting fake news. PolitiFact[12] - A fact-
checking website operated by Poynter Institute in St. Petersburg, Florida which uses Truth-O-
Meter to determine truthfulness of a statement/article/event/Image/video. But the fact checking
is limited to political news and hence fails to cover broad spectrum of news. According to a
survey paper, Facebook fake news sources can be encountered using BS Detector[15]. Another
fact checking website, Factmata[11] provides platform to get better understanding of the content
by providing scores content on nine signals, including Hate speech and Political bias, to give us
a deep understanding of credibility and safety of any content on web. Messenger for businesses
Flock has launched Fake news detector that aims to stop false and misleading information from
being introduced in their environment [16].

In India, fact check has recently been launched by India Today, Times of India, and AFP India
but these resources do not provide platform for users to check whether the news article they are
viewing is fake or real. AltNews [17] has been successful in India to provide platform for user to
clear their doubt, though it is yet to get more efficient and reliable.

Models like Fact Finder, only check whether the news is fake or real. On the other hand,
AltNews website or app works on fake news and publish viral fake news articles. Our model,
performs both work simultaneously.

3. PROPOSED WORK
In this paper a model is build based on pre-processing data with the use of NLTK library,
removing all the stopwords such as “the”, “is”, and “are” and only using those words which are
unique and provide us with relevant information. We also removed punctuations, numbers and
converted our dataset into lowercase letters. Also we have used Count Vectorizer or TF-IDF
matrix which tallies to how often the word in used in a given article in our dataset, Figure 2
depicts the process from collecting News Articles Dataset to using News Classification
Algorithm. Since the problem concerns with text classification and information extraction, we
have used Naïve Bayes classifier for text-based classification. For training and testing, we have
used Multinomial NB and Passive Aggressive Classifier with 33% training dataset. We will also
remove rare words occurring in our corpus with the help of Count Vectorizer [18-20].

The goal of the project is to make a website and app for user so that whenever he/she selects a
text, the app reflects with floating window and provides user with the percentage of fake and
real news of the selected text. The advantage with the app or website is that without opening or
uploading any content in the app, the app will detect fake news.

Figure 2: Process Flow Diagram

4. METHODOLOGY
In this section, the methodology of proposed model has been described. Figure 3 represents
work flow of methods involved in creating the model. The major steps involved in building the
model are:

a) Corpus of Text Document

b) Text wrangling and pre-processing
c) Parsing and Basic Exploratory Data Analysis
d) Text representation using relevant feature engineering techniques
e) Modeling
f) Evaluation and Deployment

Figure 3: Methodology

4.1. Scraping News Articles for Data Retrieval

Currently, the model has been trained using a dataset from Kaggle [21] with 6335 rows and 4
columns. News articles will be scraped from, in shorts [22], with the help of Python libraries
along with NLTK and spacy. A typical news article is also in the HTML section as depicted in
the following image:

Figure 4: The landing page for technology news articles and its corresponding HTML structure [23]

The specific HTML tags can also be used which contain the textual content [24]. Hence, with
the help of libraries such as BeautifulSoup and requests, useful content will be scraped.

Collected dataset contains 6335 rows and 4 columns; the head of the dataset has been depicted
in the following Figure 5:

Figure 5: Dataset of real and fake news articles

4.2. Text Wrangling, Cleaning and Pre-processing
Here, the nltk and spacy packages both have been leveraged to process the data. Stopwords can
be used to process data and remove the most common words used in our dataset such as “and”,”
the” and “is”. Along with stop words, HTML tags, accented text, expand contractions,
punctuations, numbers, and special characters are also needed to be removed since they do not
provide relevant information. Lemmatizing and stemming text are done with the help of
functions such as lemmatize_text() and simple_stemmer() respectively.

With the help of TF-IDF vectorizer, word importance in a given article in the entire corpus is
determined. [25]

4.3. Data Visualization and Feature Extraction

For better understanding of the dataset, we use matplotlib and seaborn libraries for visualization
and plotting graphs. Using stripplot() method, present in seaborn library statistical plot as
depicted in Figure 6 was formed which shows 0~5000, datasets are REAL while from
5000~10000, datasets are FAKE. CountVectoriser library to remove the rare words was
imported.

Figure 6: Dataset Visualization of Fake news and Real news using Seaborn
X-axis represents label(fake or real), y-axis represents Index

4.4. Modeling and Grid Search

With the help of Multinomial NB and Passive Aggressive Classifier, 33% of the dataset was
trained and testing rest 67%. Using confusion matrix, highest accuracy model will be achieved.
[26]

4.5. Experimental and Result Analysis

Let’s consider the result as positive, when the classifier classifies news articles as fake:

• The number of True Positives is the number of news articles correctly classified as Fake
News;

• The number of False Positives is the number of news articles incorrectly classified as
Fake News;

• The number of True Negatives is the number of news articles correctly classified as True
News;

• The number of True Positives is the number of news articles incorrectly classified as True
News;
The precision of a classifier is calculated as follows:

Precision = tp / (tp + fp)

where:
tp – number of true positive examples;
fp – number of false positive examples.
The recall of a classifier is calculated as follows:

Recall = tp / (tp + fn), (27)

where fn is a number of false negative examples.

As depicted in figure 7, confusion matrix helps in evaluating the quality of the output of a
classifier, in this case being, Multinomial NB and Passive Aggressive Classifier, on the fake or
real news dataset. Diagonal elements of the matrix represents number of points where predicted
label is equal to true label while off-diagonal matrix of the matrix represents number of points
where prediction of the model fails.

The figure shows the matrix without normalization. Here the results of the matrix changes as the
classification models or vectorizers are changed.

In Matrix 1, combination of Multinomial NB and Tf-Idf Vectoriser

In Matrix 2, combination of Multinomial NB and Count Vectoriser

In Matrix 3, combination of Passive Aggressive Classifier and Tf-Idf Vectoriser

In Matrix 4, combination of Passive Aggressive Classifier and Hashing Vectoriser

Figure 7: Confusion Matrix, without normalization

The precision for the given classifying model is 0.902; recall on the other hand is 0.486.

The precision of the model represents the relevant instances among the retrieved instances,
while recall is the fraction of total amount of relevant instances that were actually retrieved.

5. CONCLUSION AND FUTURE SCOPE

In this project, the proposed model is Fake News Detection which differentiates the text by text
classification algorithms to tell whether the news is ‘fake’ or ‘real’. For training, 33% dataset
has been used, and 67% data has been used for testing the FND model. The model predicted
fake and real news successfully with 90.2% accuracy.

In future, VADER for sentiment analysis can be used which is more efficient algorithm and a
text classification model that provides us with highest accuracy. Also, existing Fake News
Detection models have worked for news and politics only, scope in Stock Markets, where shares
rise and fall very frequently, still persists.

REFERENCES

1. Kuriakose, Ammu, et al. "ALIKAH-A Clickbait and Fake News Detection System using Natural
Language Processing." 2019 3rd International Conference on Trends in Electronics and Informatics
(ICOEI). IEEE, 2019.

2. “India has second highest number of Internet users after China” - economictimes.com, 2019[Online].
Available : https://economictimes.indiatimes.com

3. “Ordinary Indians are fueling the country’s fake-news crisis” – qz.com, 2018[Online]. Available:
https://qz.com/india

4. “The Onion” – theonion.com [Online]. Available: https://www.theonion.com/

5. “News Thump” – newsthump.com [Online]. Available: https://newsthump.com/
6. “Poke News” – pokenews.com [Online]. Available:
https://thepoke.co.uk/category/news/

7. “Mash News” – mashnews.com [Online].

Available: https://www.thedailymash.co.uk/news

8. “Top 50 Fake News Websites And Blogs on the Web in 2019” – blog.feedspot.com, 2019[Online].
Available: https://blog.feedspot.com/fake_news_blogs/

9. “Snopes” – snopes.com [Online]. Available: https://www.snopes.com/

10. “FACTCHECK.ORG” – factcheck.org [Online]. Available: https://www.factcheck.org/
11. “FACTMATA” – factmata.com [Online]. Available: https://factmata.com/
12. “Fact Checking U.S. Politics | PolitiFact ” – politifact.com [Online].
Available: https://politifact.com/
13. Bondielli, Alessandro, and Francesco Marcelloni. "A survey on fake news and rumour detection
techniques." Information Sciences 497 (2019): 38-55.

14. “Protecting the EU Elections From Misinformation and Expanding Our Fact-Checking Program to
New Languages” – aboutfb.com[Online]. Available: https://about.fb.com/news

15. "B.S. Detector - Browser extension to identify fake news sites", Bsdetector.tech, 2018. [Online].
Available: http://bsdetector.tech/.
16. “Messenger platform Flock launches feature to identify fake news”, economictimes.com, 2019 [Online].
Available: https://m.economictimes.com/small-biz
17. “Alt News”, altnews.com [Online]. Available: https://www.altnews.in/
18. N. J. Conroy, V. L. Rubin, and Y. Chen, “Automatic deception detection: Methods for finding fake
news,” Proceedings of the Association for Information Science and Technology, vol. 52, no. 1, pp. 1–4,
2015.
19. S. Feng, R. Banerjee, and Y. Choi, “Syntactic stylometry for deception detection,” in Proceedings of the
th
50 Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2,
Association for Computational Linguistics, 2012, pp. 171–175.
20. Shlok Gilda,Department of Computer Engineering, Evaluating Machine Learning Algorithms for Fake
News Detection,2017 IEEE 15th Student Conference on Research and Development (SCOReD)
21. “Kaggle”, kaggle.com [Online]. Available: https://kaggle.com
22. “inshorts - stay informed”, inshorts.com [Online]. Available: https://inshorts.com
23. “A Practitioner's Guide to Natural Language Processing (Part I) — Processing & Understanding Text”,
towardsdatascience.com, 2019 [Online]. Available: https://towardsdatascience.com
24. M. Pagliardini, P. Gupta, and M. Jaggi, “Unsupervised learning of sentence embeddings using
compositional n-gram features,” arXiv preprint arXiv:1703.02507, 2017.
25. H. Rashkin, E. Choi, J. Y. Jang, S. Volkova, Y. Choi, and P. G. Allen, “Truth of Varying Shades:
Analyzing Language in Fake News and Political Fact-Checking,” in Proceedings of the 2017
Conference on Empirical Methods in Natural Language Processing, 2017, pp. 2931–2937.
26. M. Balmas, “When Fake News Becomes Real: Combined Exposure to Multiple News Sources and
Political Attitudes of Inefficacy, Alienation, and Cynicism,” Communic. Res., vol. 41, no. 3, pp. 430–
454, 2014.
27. Naive Bayes classifier. (n.d.) Wikipedia. [Online]. Available:
https://en.wikipedia.org/wiki/Naive_Bayes_classifier. Accessed Feb. 6, 2017.

Remark:
Instructions for Formatting the Project Report:
1. All text fonts should be in Times New Roman.
2. The heading should be in font size = 14 with bold.
3. The text should be of Font Size=12.
3. There should be 1.5 line spacing between the Texts.
4. Figure & its caption should be center justified with font size 10.
5. The table and caption should be center justified with font size 10.
6. All the text should be Justified (Select text -> Ctrl +J) in the
synopsis.

Fake News Detection
100% (1)
Fake News Detection
25 pages
MAJOR PROJECT REPORT (1) - For Merge
No ratings yet
MAJOR PROJECT REPORT (1) - For Merge
46 pages
DAA Notes Complete
No ratings yet
DAA Notes Complete
242 pages
s134450 Fake News Detection Using Machine Learning
No ratings yet
s134450 Fake News Detection Using Machine Learning
91 pages
Ai Project
No ratings yet
Ai Project
16 pages
AI Phase5
No ratings yet
AI Phase5
26 pages
A Fake News Detection System Using Data Science and ML
No ratings yet
A Fake News Detection System Using Data Science and ML
7 pages
Fake News Detector With Real Time Web Scraping
No ratings yet
Fake News Detector With Real Time Web Scraping
11 pages
Graph Data Science with Python and Neo4j: Hands-on Projects on Python and Neo4j Integration for Data Visualization and Analysis Using Graph Data Science for Building Enterprise Strategies (English Edition)
From Everand
Graph Data Science with Python and Neo4j: Hands-on Projects on Python and Neo4j Integration for Data Visualization and Analysis Using Graph Data Science for Building Enterprise Strategies (English Edition)
Timothy Eastridge
No ratings yet
AR - VR Infrastruture Engineer
100% (1)
AR - VR Infrastruture Engineer
148 pages
Operating System Quantum
No ratings yet
Operating System Quantum
85 pages
SecurityX CAS-005 Exam Objectives
No ratings yet
SecurityX CAS-005 Exam Objectives
18 pages
ML Project Report PDF
No ratings yet
ML Project Report PDF
26 pages
Fake Phase3
No ratings yet
Fake Phase3
14 pages
Fake News Detetcion PPT 2023
No ratings yet
Fake News Detetcion PPT 2023
25 pages
IR - MINIPROJECT Final
No ratings yet
IR - MINIPROJECT Final
15 pages
D13 Manuscript
No ratings yet
D13 Manuscript
12 pages
FYP Copy
No ratings yet
FYP Copy
42 pages
Shoaib Khan - 1918922 - Report
No ratings yet
Shoaib Khan - 1918922 - Report
20 pages
A6V10316241 NK8237 Installation
No ratings yet
A6V10316241 NK8237 Installation
96 pages
Doosan B10 - 13 - 15 - 16R-5 - SB4292E - 18.09
100% (2)
Doosan B10 - 13 - 15 - 16R-5 - SB4292E - 18.09
494 pages
Fake News Detection Using Machine Learning12 2
No ratings yet
Fake News Detection Using Machine Learning12 2
65 pages
MINOR REPORT (1) Fake News Detect
No ratings yet
MINOR REPORT (1) Fake News Detect
14 pages
Fake News Detection
No ratings yet
Fake News Detection
5 pages
ML Report Fake News Detection
No ratings yet
ML Report Fake News Detection
15 pages
Fake News Proposal
No ratings yet
Fake News Proposal
18 pages
Case Study DL
No ratings yet
Case Study DL
8 pages
MAJOR PROJECT REPORT On Machine Learning Model To Determine Fake News
No ratings yet
MAJOR PROJECT REPORT On Machine Learning Model To Determine Fake News
52 pages
Identifying Fake News in Real Time 230603 103213
No ratings yet
Identifying Fake News in Real Time 230603 103213
6 pages
Fake News Detection PPT (AIB602)
No ratings yet
Fake News Detection PPT (AIB602)
11 pages
FAke News Report
No ratings yet
FAke News Report
16 pages
AI Phase2
No ratings yet
AI Phase2
6 pages
Fake News Detection PPT 1
No ratings yet
Fake News Detection PPT 1
13 pages
Detection of Fake News
No ratings yet
Detection of Fake News
17 pages
Dar Es Salaam Institutes of Technolog1
No ratings yet
Dar Es Salaam Institutes of Technolog1
8 pages
Aiml Project Report
No ratings yet
Aiml Project Report
46 pages
Fake News Detector Project Abstract
No ratings yet
Fake News Detector Project Abstract
9 pages
Fake News Paper2
No ratings yet
Fake News Paper2
6 pages
Fake News Detection
No ratings yet
Fake News Detection
9 pages
The Main Objective Is To Detect The Fake News, Which Is A Classic Text Classification
No ratings yet
The Main Objective Is To Detect The Fake News, Which Is A Classic Text Classification
57 pages
Fake News Detection With Different Model
No ratings yet
Fake News Detection With Different Model
15 pages
Daa - Mini - Project (1) Orginal
No ratings yet
Daa - Mini - Project (1) Orginal
21 pages
B.E Cse Batchno 214
No ratings yet
B.E Cse Batchno 214
47 pages
A I Project Proposal
No ratings yet
A I Project Proposal
10 pages
Fake News Mini PDF
No ratings yet
Fake News Mini PDF
12 pages
Fake News Detection Using Deep Learning
No ratings yet
Fake News Detection Using Deep Learning
5 pages
Training Report On Machine Learning
No ratings yet
Training Report On Machine Learning
27 pages
Unit-3 CFOA Notes
100% (1)
Unit-3 CFOA Notes
12 pages
JPNR 2022 04 140
No ratings yet
JPNR 2022 04 140
7 pages
Scan Jan 14, 2021
No ratings yet
Scan Jan 14, 2021
35 pages
Reserch Paper
No ratings yet
Reserch Paper
8 pages
Reserch Paperupdated
No ratings yet
Reserch Paperupdated
8 pages
Account Allocation Sheet
No ratings yet
Account Allocation Sheet
22 pages
Machine Learning Techniques For The Classification of Fake News
No ratings yet
Machine Learning Techniques For The Classification of Fake News
5 pages
Fake News Detection Report
No ratings yet
Fake News Detection Report
20 pages
Fake News Detection Using Python
No ratings yet
Fake News Detection Using Python
11 pages
Final Synopsis-Major Abhilasha, Ananya
No ratings yet
Final Synopsis-Major Abhilasha, Ananya
10 pages
Fake News Analysis
No ratings yet
Fake News Analysis
46 pages
Fake News Detection Project Report
100% (1)
Fake News Detection Project Report
8 pages
Report Content
No ratings yet
Report Content
29 pages
UNIT-2 Scientific Management
No ratings yet
UNIT-2 Scientific Management
23 pages
Learn Devops With A Grade Project
No ratings yet
Learn Devops With A Grade Project
13 pages
Adobe Scan Oct 14, 2020
No ratings yet
Adobe Scan Oct 14, 2020
23 pages
A Project Report On Fake News Detection
100% (1)
A Project Report On Fake News Detection
29 pages
Atpg Scripts
100% (1)
Atpg Scripts
3 pages
Final Year of Computer Engineering 2022-23 Semester VII Project Synopsis
No ratings yet
Final Year of Computer Engineering 2022-23 Semester VII Project Synopsis
11 pages
Fake News Detection Using Machine Learning: Nihel Fatima Baarir Abdelhamid Djeffal
No ratings yet
Fake News Detection Using Machine Learning: Nihel Fatima Baarir Abdelhamid Djeffal
6 pages
Adobe Scan Feb 17, 2021
No ratings yet
Adobe Scan Feb 17, 2021
10 pages
Async-JS.L.U01-05 (Asynchronous JavaScript)
No ratings yet
Async-JS.L.U01-05 (Asynchronous JavaScript)
43 pages
SYNOPSIS
No ratings yet
SYNOPSIS
4 pages
Telecom Knowledge and Experience Sharing - ? LTE KPI
No ratings yet
Telecom Knowledge and Experience Sharing - ? LTE KPI
8 pages
Fake News Synopsis 1
No ratings yet
Fake News Synopsis 1
6 pages
std-2 RS 9
No ratings yet
std-2 RS 9
11 pages
Fake News Synopsis 1
No ratings yet
Fake News Synopsis 1
6 pages
Fake News Detection Using Python and Machine Learning
No ratings yet
Fake News Detection Using Python and Machine Learning
6 pages
VIH Series60
100% (2)
VIH Series60
1 page
ITN260
No ratings yet
ITN260
7 pages
M 758 LMR
No ratings yet
M 758 LMR
43 pages
Adobe Scan Oct 17, 2020
No ratings yet
Adobe Scan Oct 17, 2020
4 pages
C VIGIL
No ratings yet
C VIGIL
15 pages
Machine Learning For The Classification of Fake News
No ratings yet
Machine Learning For The Classification of Fake News
4 pages
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
4 pages
Synopsis Minor Project-2
No ratings yet
Synopsis Minor Project-2
5 pages
1 Goal Programming
No ratings yet
1 Goal Programming
9 pages
Report Se
No ratings yet
Report Se
4 pages
Ôn HK1 - 3
No ratings yet
Ôn HK1 - 3
6 pages
ISTN212 Exam 2023 V2 - PRINT
No ratings yet
ISTN212 Exam 2023 V2 - PRINT
21 pages
Simple Device Discovery Protocol Specification
No ratings yet
Simple Device Discovery Protocol Specification
12 pages
Sequence The Activities
No ratings yet
Sequence The Activities
1 page
ICT 7 2nd PT Wanswer
No ratings yet
ICT 7 2nd PT Wanswer
2 pages
Queuing and Reliability Theory (MATH712) : MODULE 2: Advanced Queuing Models
No ratings yet
Queuing and Reliability Theory (MATH712) : MODULE 2: Advanced Queuing Models
14 pages
Frappe CRM Config and Automation
No ratings yet
Frappe CRM Config and Automation
3 pages
Cbse Class 10 Maths Pre Board Sample Paper For 2023 24
No ratings yet
Cbse Class 10 Maths Pre Board Sample Paper For 2023 24
7 pages
BADS (KMBA 106) - Qus Bank
No ratings yet
BADS (KMBA 106) - Qus Bank
7 pages
We Speak: Translation and Desktop Publishing
No ratings yet
We Speak: Translation and Desktop Publishing
4 pages
Provide Excellent Office Multifunction Printer in UAE - Konica Minolta Dubai
No ratings yet
Provide Excellent Office Multifunction Printer in UAE - Konica Minolta Dubai
4 pages
AP-14 Ver 1.0 EN
No ratings yet
AP-14 Ver 1.0 EN
3 pages
Manual - IP - Firewall - L7 - MikroTik Wiki
No ratings yet
Manual - IP - Firewall - L7 - MikroTik Wiki
3 pages

Project Synopsis Report Format

Uploaded by

Project Synopsis Report Format

Uploaded by

A

Department of Computer Science and Engineering

Approaches to detect Fake News:

I. Detection Approaches Based on Machine Learning: Support Vector Machines

This model will detect fake news by checking the credibility

Figure 2: Process Flow Diagram

a) Corpus of Text Document

4.1. Scraping News Articles for Data Retrieval

Figure 5: Dataset of real and fake news articles

4.3. Data Visualization and Feature Extraction

4.4. Modeling and Grid Search

4.5. Experimental and Result Analysis

Precision = tp / (tp + fp)

Recall = tp / (tp + fn), (27)

where fn is a number of false negative examples.

In Matrix 1, combination of Multinomial NB and Tf-Idf Vectoriser

In Matrix 2, combination of Multinomial NB and Count Vectoriser

In Matrix 3, combination of Passive Aggressive Classifier and Tf-Idf Vectoriser

In Matrix 4, combination of Passive Aggressive Classifier and Hashing Vectoriser

Figure 7: Confusion Matrix, without normalization

5. CONCLUSION AND FUTURE SCOPE

4. “The Onion” – theonion.com [Online]. Available: https://www.theonion.com/

7. “Mash News” – mashnews.com [Online].

9. “Snopes” – snopes.com [Online]. Available: https://www.snopes.com/

You might also like