
Fake News Detection: Using Machine Learning & Python (Predicting Website)

The document describes building a machine learning model and website to detect fake news using Python. It will use a random forest classifier trained on news articles labeled as real or fake. The workflow includes collecting a dataset of labeled news, preprocessing the text with TF-IDF, training a random forest model, and building a website using Flask, Heroku and other tools that will predict if new articles are fake or real. The goal is to accurately classify news and address the spread of misinformation.

Uploaded by

Palak Arora

FAKE NEWS DETECTION
USING MACHINE LEARNING & PYTHON (PREDICTING WEBSITE)

GROUP MEMBERS :-
HIMANSHI DIXIT (A50105218048)
JIYA SEHGAL (A50105218043)
PALAK ARORA (A50105218032)
BTECH CSE SEC A (7th SEM)

UNDER THE GUIDANCE OF
Dr. Shalini Bhaskar Bajaj, HOD, ASET, Amity University
Dr. Aman Jatain, Assistant Professor, Amity University

INDEX
- INTRODUCTION
- AIMS & OBJECTIVE
- NECESSARY?
- TECHNOLOGY
- WORKFLOW
- METHODOLOGY
- ARCHITECTURE

INTRODUCTION
Fake News
Fake news has been around for decades and is very common nowadays. However, the
dawn of the social media age has increased the generation and circulation of fake
news manyfold. Fake news can be simply described as a false or misleading article,
usually written for economic, personal or political gain.

Many researchers believe that the fake news problem can be addressed by means of
machine learning and artificial intelligence. Such articles can be detected using
machine learning and artificial intelligence techniques, including LSTM networks.
AIMS & OBJECTIVE
The project aims at investigating the principles, methodologies and algorithms for
detecting fake news articles, creators and subjects from online social networks and
evaluating the corresponding performance.
The main focus is on predicting whether a piece of news is FAKE or REAL using a
machine learning algorithm. The dataset consists of news articles, each with a label
denoting whether the news is REAL or FAKE.
The objectives are as follows:
1) To build a model to accurately classify a piece of news as REAL or FAKE.
2) To predict FAKE and REAL news using a machine learning algorithm.
3) To build a website that will predict whether the news is FAKE or REAL.
NECESSARY?
WHY WE NEED A FAKE NEWS DETECTION WEBSITE
TOOLS
TECHNOLOGY
PYTHON
Python is a high-level, interpreted and dynamically typed programming language.
It has a large number of libraries, including sklearn, pandas, Matplotlib and
many more.

MACHINE LEARNING
Machine learning (ML) is a type of artificial intelligence (AI) that allows software
applications to become more accurate at predicting outcomes without being explicitly
programmed to do so. Machine learning algorithms use historical data as input to
predict new output values.

FLASK
Flask is a micro web framework written in Python. It is classified as a
microframework because it does not require particular tools or libraries. It has no
database abstraction layer, form validation, or any other components where
pre-existing third-party libraries provide common functions.
TECHNOLOGY
HEROKU
Heroku is a cloud platform as a service supporting several programming languages.
It was one of the first cloud platforms.
It supports the following programming languages: Ruby, Java, Node.js, Scala,
Clojure, Python, PHP, and Go.

HTML & CSS (FRONT-END)
The HyperText Markup Language, or HTML, is the standard markup language for
documents designed to be displayed in a web browser.

Cascading Style Sheets (CSS) is a style sheet language used for describing the
presentation of a document written in a markup language such as HTML. CSS is a
cornerstone technology of the World Wide Web, alongside HTML.
WORKFLOW
METHODOLOGY
RANDOM FOREST CLASSIFIER
A random forest is a machine learning technique that is used to solve regression
and classification problems. A random forest algorithm consists of many decision
trees. The 'forest' generated by the random forest algorithm is trained through
bagging, also called bootstrap aggregating.

How Does Random Forest Work?

- A random forest system relies on various decision trees. Every decision tree
  consists of decision nodes, leaf nodes, and a root node.
- The leaf node of each tree is the final output produced by that specific
  decision tree.
- The selection of the final output follows the majority-voting system. In this
  case, the output chosen by the majority of the decision trees becomes the final
  output of the random forest system.
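
The bagging and majority-voting steps described for the random forest can be sketched in plain Python. This is only an illustration under stated assumptions, not the project's actual model: each "tree" is reduced to a single split (a decision stump), the two numeric features (exclamation marks and all-caps words per headline) and the toy rows are hypothetical, and the random feature selection that a full random forest also performs is left out. In practice a library implementation such as scikit-learn's RandomForestClassifier would be used.

```python
import random
from collections import Counter


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement (the 'bootstrap' in bagging)."""
    return [rng.choice(rows) for _ in rows]


def train_stump(rows):
    """Fit a one-split tree: keep the (feature, threshold) with the fewest errors."""
    best = None
    for f in range(len(rows[0][0])):
        for features, _ in rows:
            t = features[f]
            left = [label for x, label in rows if x[f] <= t]
            right = [label for x, label in rows if x[f] > t]
            left_label = Counter(left).most_common(1)[0][0]
            right_label = Counter(right).most_common(1)[0][0] if right else left_label
            errors = sum(l != left_label for l in left)
            errors += sum(l != right_label for l in right)
            if best is None or errors < best[0]:
                best = (errors, f, t, left_label, right_label)
    _, f, t, left_label, right_label = best
    # The returned function plays the role of the tree: it maps a sample to a leaf label.
    return lambda x: left_label if x[f] <= t else right_label


def majority_vote(predictions):
    """Return the label chosen by most trees."""
    return Counter(predictions).most_common(1)[0][0]


def train_forest(rows, n_trees=25, seed=0):
    """Train each tree on its own bootstrap sample of the data."""
    rng = random.Random(seed)
    return [train_stump(bootstrap_sample(rows, rng)) for _ in range(n_trees)]


def predict(forest, x):
    """Collect one vote per tree and return the majority label."""
    return majority_vote([tree(x) for tree in forest])


# Hypothetical toy data: (exclamation marks, all-caps words) -> label
rows = [
    ((0, 0), "REAL"), ((1, 0), "REAL"), ((0, 1), "REAL"),
    ((5, 4), "FAKE"), ((6, 3), "FAKE"), ((4, 5), "FAKE"),
]
forest = train_forest(rows)
print(predict(forest, (0, 0)), predict(forest, (6, 5)))
```

Because every tree sees a different bootstrap sample, individual trees can disagree; the vote is what makes the combined prediction more stable than any single tree.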
TfidfVectorizer
TF-IDF is a very common algorithm for transforming text into a meaningful numeric
representation, which is then used to fit a machine learning algorithm for
prediction.

TF (Term Frequency): The number of times a word appears in a document is its Term
Frequency. A higher value means a term appears more often than others, and so the
document is a good match when the term is part of the search terms.
IDF (Inverse Document Frequency): Words that occur many times in a document, but
also occur many times in many others, may be irrelevant. IDF is a measure of how
significant a term is in the entire corpus.

GUNICORN
Green Unicorn, commonly shortened to "Gunicorn", is a Web Server Gateway Interface
(WSGI) server implementation that is commonly used to run Python web applications.
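
Returning to the TF and IDF definitions given under TfidfVectorizer, the following minimal sketch computes both by hand. It assumes the textbook weighting, raw term count multiplied by log(N / document frequency); scikit-learn's TfidfVectorizer uses a smoothed IDF and normalizes each row by default, so its numbers will differ. The three-sentence corpus and the function names are made up for illustration.

```python
import math
from collections import Counter


def term_frequencies(document):
    """TF: the number of times each word appears in one document."""
    return Counter(document.lower().split())


def inverse_document_frequencies(documents):
    """IDF: log(N / df), where df is the number of documents containing the word."""
    n = len(documents)
    df = Counter()
    for doc in documents:
        df.update(set(doc.lower().split()))
    return {word: math.log(n / count) for word, count in df.items()}


def tf_idf(documents):
    """Return one {word: weight} dictionary per document."""
    idf = inverse_document_frequencies(documents)
    return [
        {word: tf * idf[word] for word, tf in term_frequencies(doc).items()}
        for doc in documents
    ]


# Hypothetical mini-corpus
corpus = [
    "the president signed the bill",
    "the aliens signed the treaty",
    "the market closed higher",
]
weights = tf_idf(corpus)
for doc_weights in weights:
    print({word: round(w, 3) for word, w in doc_weights.items()})
```

A word such as "the", which occurs in every document, gets an IDF of log(1) = 0 and therefore no weight, while a word that appears in only one document keeps the highest weight.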
ARCHITECTURE

VS Studio
WORKFLOW

THANK YOU
