0% found this document useful (0 votes)
84 views53 pages

Data Mining: Presentation Topic

The document discusses various machine learning algorithms and their real-life applications. It begins by explaining Naive Bayes classification algorithm and how it can be used for spam filtering and weather forecasting. It then describes the FP-Growth algorithm and its uses in knowledge discovery from databases and managing medical data. Finally, it discusses the KNN algorithm and its applications in areas such as text mining, social media analysis, medicine, and industries.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
84 views53 pages

Data Mining: Presentation Topic

The document discusses various machine learning algorithms and their real-life applications. It begins by explaining Naive Bayes classification algorithm and how it can be used for spam filtering and weather forecasting. It then describes the FP-Growth algorithm and its uses in knowledge discovery from databases and managing medical data. Finally, it discusses the KNN algorithm and its applications in areas such as text mining, social media analysis, medicine, and industries.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 53

DATA MINING

Presentation topic All


Real Life Usage of Algorithms
PRESENTERS INFO

AYESHA BUTT INFO.TECH 16031


BUSHRA ASGHAR INFO.TECH 16007
AYESHA SHAMAS INFO.TECH 16040
MARIUM MAJEED INFO.TECH 16026
TAYYABA RAFIQUE INFO.TECH 16006
NAIVE BAYES

 Naive Bayes is a very simple but powerful algorithm used for


prediction as well as classification.

 The algorithm is called Naïve because it assumes that the features in


a class are unrelated to the other features and all of them
independently contribute to the probability calculation.
REAL LIFE USAGE OF NAIVE BAYES
ALGORITM
Spam Filtering 

 Naive Bayes algorithm can be used to filter the Spam mails. A list of
keywords(on which basis a mail is decided to be a spam or not) is
made and then the mail is checked for those keywords. If the mail
contains a large number of those keywords then there will be higher
chances for it to be spam.
Spam Filtering
Spam Filtering 
WEATHER FORECAST 

 This algorithm can be used to predict the weather report based upon
the atmosphere features like (temperature, wind, clouds, Humidity,
etc), we have used the same example above.
WEATHER FORECAST
FP-Growth Algorithm
FP-GROWTH ALGORITHM
 The FP Growth algorithm operates in the following four modules.
 Preprocessing module
 FP Tree an FP Growth Module
 Association Rule Generation
 Results
FP-GROWTH ALGORITHM IN CPU

 This algorithm was proposed by Grahne and is based on the usage


of CPU time to compute frequent item sets using FP-Growth. They
observed that 80% of CPU time was used for traversing FP-Trees 
Therefore, they used an array-based data structure combined with
the FP-Tree data structure to reduce the traversal time, and
incorporates several optimization techniques.
FP-GROWTH ALGORITHM IN CPU
KNOWLEDGE DISCOVERY DATABASE
 FP Growth Algorithm is used to generate association rule
from an employee database

 Knowledge discovery in database refers to process that


retrieves knowledge from large database
IN KNOWLEDGE DISCOVERY
DATABASE
MANAGE MEDICAL DATA

 A medical data consist of multiple attributes of patient data related


with diagnostic and treatment process. This may include text,
numerical, images, and spatial data. FP growth algorithm is used to
extract knowledge from the medical data
MANAGE MEDICAL DATA
REAL LIFE USAGE OF KNN ALGORITM
APPLIACTIONS:

KNN as a data mining technique has a wide variety of


applications in classification as well as regression. Some
of the applications of this method are mentioned below:
I. TEXT MINING

 The KNN algorithm is one of the most popular


algorithms for text categorization or text mining.
SOCIAL MEDIA ANALYSIS

 There are many text mining tools designed exclusively


for analyzing the performance of social media platforms.

 These help to track and interpret the texts generated


online from the news, blogs, emails, etc.
SOCIAL MEDIA ANALYSIS
SOCIAL MEDIA ANALYSIS

 Furthermore, text mining tools can efficiently analyze


the number of posts, likes, and followers of your brand
on social media, thereby allowing you to understand the
reaction of people who are interacting with your brand
and online content.
 The analysis will enable you to understand ‘what’s hot
and what’s not’ for your target audience
II. MEDICINE

Finding diabetics ratio


 Diabetes diseases are based on age, health condition,
family tradition, and food habits. But is a particular
locality we can judge the ratio of diabetes based on the K
Nearest Neighbor Algorithm.
FINDING DIABETICS RATIO

 If you figure out the data of is age, pregnancies, glucose, blood pressure,
skin thickness, insulin, body mass index and other required data we can
easily plot the probability of diabetes at a certain age.
FINDING THE RATIO OF BREAST
CANCER
 In the medical sector, the KNN algorithm is widely used.
It is used to predict breast cancer. Here KNN algorithm
is used as the classifier. The K nearest neighbor is the
easiest algorithm to apply here. Based on the previous
history of the locality, age and other conditions KNN is
suitable for labeled data
III. INDUSTERIES
Concept Search
 Concept search is the industrial application of the K Nearest Neighbor
Algorithm. It means searching for similar documents simultaneously. The
data on the internet is increasing every single second. The main problem is
extracting concepts from the large set of databases. K-nearest neighbor
helps to find the concept from the simple approach.
RECOMMENDATION SYSTEM

 If we search any product to any online store it will show


the product. Decide that particular product it
recommends some other product. You will be astonished
after knowing that the 35% revenue of
 Amazon comes from the recommendation system.
Decide the online store, YouTube and all search engines
use the algorithms of k-nearest neighbor.
RECOMMENDATION SYSTEM
ARTIFICIAL NEURAL NETWORK

 An Artificial neural network consist of:

 Multiple layers(Input layer, Output layer and zero or


more Hidden layers)that consist of a set of neurones .

 Interconnection between nodes of different layers that


have weight assigned
 Activation functions for each neurone that convert
weighted input of neurons into output values.
ARTIFICIAL NEURAL NETWORK
RAEAL LIFE EXAMPLE ARTIFICIAL
NEURAL NETWORK

 Google translate
 Google Assistance

 Face recognition system

 Finger print recognition system


GOOGLE TRANSLATE

 Google Translates NMT system uses a large artificial neural


network capable of deep learning. By using millions of examples,
GNMT improves the quality of translation, using broader context to
deduce the most relevant translation. The result is then rearranged
and adapted to approach grammatically based human language.

 GNMT's proposed architecture of system learning was first tested


on over a hundred languages supported by Google.
GOOGLE TRANSLATE

 The following 101 languages are supported by Google


Translates Neural Machine

 Google Translate previously first translated the source


language into English and then translated the English
into the target language rather than translating directly
from one language to another.
GOOGLE TRANSLATE
GOOGLE TRANSLATE
GOOGLE TRANSLATE
FINGERPRINT RECOGNITION SYSTEM

 A fingerprint in its narrow sense is an impression left by


the friction ridges of a human finger.
 There pattern is permanent and unchangeable on each
finger during the whole life time of an individual.
 According to FBI the accuracy and reliability of
fingerprint scans are correct 99.8% of the time.
APPLICATIONS OF FINGERPRINT RECOGNITION

 The fingerprint recognition system can be easily


embedded in any system. It is used in,
 recognition of criminals in law enforcement.

 Used in providing security to cars, lockers, banks and


shops.
 To differentiate between persons.
HITS ALGORITHM
HITS ALGORITHM
 Hub
A hub is a page with many out-links.

 Authority:
An Authority Is a page with many
in-links.
APPLICATIONS OF HITS
ALGORITHM

 Hyperlink Analysis has been used in a wide variety of


applications.
 Web crawling

 Finding Web communities

 Web page categorization

.
WEB CRAWLING

  Web crawler, or spider, is a type of bot that's typically


operated by search engines like Google and Bing.
Their purpose is to index the content of websites all
across the Internet so that those websites can appear in
search engine results.
GOOGLE WEB CRAWLING
WEB COMMUNITY

 A “Web Community” can be defined as a group of pages


that address similar topics or reflect the common
interests of the creators of these pages. The similarity
can be based on content as well as the inherent link
structure. However, this description of a web community
is by no means a definition since various researchers
have defined in different ways.
WEB COMMUNITY
WEB PAGE CATEGORIZATION

 Web page categorization determines the category or class


a Web page belongs to, from a pre-determined set of
categories or classes. Topic Distillation is similar but in
Web page categorization, the categories can be based on
topics or other functionalities, e.g. home pages, content
pages, research papers etc.
Thank you

You might also like