Text Mining PPT Merged
• Lemmatization:
- Lemmatization considers the context and converts
the word to its meaningful base form, which is called
the lemma. For instance, stemming the word 'Caring'
would return 'Car', whereas lemmatizing the word
'Caring' would return 'Care'.
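The difference between the two can be sketched with a toy example. This is only an illustration under stated assumptions: `naive_stem`, `naive_lemmatize`, and `LEMMA_TABLE` are hypothetical names, the suffix list is deliberately tiny, and the hand-written table stands in for the dictionary and context analysis a real lemmatizer would use.

```python
# Toy illustration only: real systems use a full stemmer and a lexical database.

def naive_stem(word: str) -> str:
    """Strip a common suffix without checking whether the result is a real word."""
    word = word.lower()
    for suffix in ("ing", "ed", "s"):
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


# Hypothetical, hand-written lemma table standing in for a dictionary lookup.
LEMMA_TABLE = {
    "caring": "care",
    "cared": "care",
    "cares": "care",
}


def naive_lemmatize(word: str) -> str:
    """Return the dictionary base form if known, else the lowercased word."""
    word = word.lower()
    return LEMMA_TABLE.get(word, word)


print(naive_stem("Caring"))       # car
print(naive_lemmatize("Caring"))  # care
```

The stemmer cuts off the suffix blindly and produces 'car', a different word, while the lookup returns the valid base form 'care'.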
The steps to perform preprocessing of data :
• Filtering (or) Removing Stop Words:
- This is the process of removing non-essential words.
Words such as 'was', 'in', 'is', 'and', and 'the' are
called stop words and can be removed.
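A minimal sketch of stop-word filtering follows. It assumes a very small stop-word set built only from the example words on this slide, and `remove_stop_words` is a hypothetical helper name; practical stop-word lists are much longer.

```python
import re

# Small illustrative stop-word set taken from the slide's examples;
# production lists are much longer.
STOP_WORDS = {"was", "in", "is", "and", "the"}


def remove_stop_words(text):
    """Lowercase the text, split it into word tokens, and drop stop words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [token for token in tokens if token not in STOP_WORDS]


print(remove_stop_words("The cat was in the garden and the dog is asleep"))
# ['cat', 'garden', 'dog', 'asleep']
```

Only the content-bearing words remain, which reduces the vocabulary that later mining steps have to handle.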
How Does Text Mining Work?
Image mining
• Image mining systems can discover meaningful
information or image patterns from a huge collection
of images.
Video mining
• Video mining has the objective of describing interesting
patterns from large amounts of video data.
Audio mining
• Audio mining is the technique in which audio signals are
automatically analyzed and searched. This technique is
generally implemented in automatic speech recognition.
Applications of Multimedia Mining:
• Digital Library
• Traffic Video Sequences
• Medical Analysis
• Media Making and Broadcasting
• Surveillance system
Process of Multimedia Data Mining:
Architecture for Multimedia Data Mining:
We considered two main families of multimedia
retrieval systems, i.e. similarity search in multimedia
data.
• clustering,
Urban Planning
Public Health
Environmental Management
• Spatial Data Mining also contributes to environmental
management by detecting changes in the environment,
identifying the land at risk, conserving water and
biodiversity, and monitoring natural resources.
Crime Analysis
• Spatial Data Mining can be used to identify crime
hotspots, understand crime patterns and develop proper
strategies to prevent crimes and hence improve public
safety.
Web Mining
- Discovering interesting and useful information
from Web content and usage data
What is Web Mining?
• Web mining is a data mining technique to extract
knowledge from web data.
Fun fact:
More than 80% of all Google searches are initiated by the Google staff,
in the process of developing and refining its search algorithms.
How Many Websites Are There in the World
World Wide Web
• Diverse types of data
– Text
– Images
– Audio & Video
Web Mining
• Web mining is the application of data mining
techniques to discover useful information
from the World Wide Web.
Disadvantage:
• When this technology is used on personal data, it
can raise concerns. The most criticized ethical issue
in web usage mining is the invasion of privacy.
Web Structure Mining
• Web structure mining is the process of discovering
structure information from the web.
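One common form of structure information is the hyperlink graph between pages. The sketch below is an assumption-laden illustration, not a full crawler: the three-page site in `PAGES` is made up, and `LinkExtractor`, `build_link_graph`, and `in_degree` are hypothetical names. It uses only Python's standard `html.parser` module to pull out links and count how many links point at each page.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect the href targets of all <a> tags in one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def build_link_graph(pages):
    """Map each page name to the list of pages it links to."""
    graph = {}
    for name, html in pages.items():
        parser = LinkExtractor()
        parser.feed(html)
        graph[name] = parser.links
    return graph


def in_degree(graph):
    """Count how many links point at each page."""
    counts = {page: 0 for page in graph}
    for targets in graph.values():
        for target in targets:
            counts[target] = counts.get(target, 0) + 1
    return counts


# Hypothetical three-page site used only for illustration.
PAGES = {
    "a.html": '<a href="b.html">B</a> <a href="c.html">C</a>',
    "b.html": '<a href="c.html">C</a>',
    "c.html": "<p>No outgoing links</p>",
}

graph = build_link_graph(PAGES)
print(graph)
print(in_degree(graph))
```

In this toy site, c.html receives the most incoming links, which is the kind of structural signal that link-analysis methods build on.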