0% found this document useful (0 votes)
20 views5 pages

Text Analytics

Text mining is a process that extracts meaningful patterns and knowledge from unstructured text data sources like emails, news articles, web pages, and more. It involves organizing the unstructured data through feature extraction and analysis to gain useful insights. The text mining process includes problem definition, text summarization, information extraction, visualization, and both unsupervised and supervised machine learning techniques like clustering and classification. Popular Python packages for text mining include NLTK, TextBlob, Gensim, Pattern, and spaCy, which support functions like tokenization, POS tagging, entity recognition, stemming, sentiment analysis, topic modeling, and more.

Uploaded by

Arham
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views5 pages

Text Analytics

Text mining is a process that extracts meaningful patterns and knowledge from unstructured text data sources like emails, news articles, web pages, and more. It involves organizing the unstructured data through feature extraction and analysis to gain useful insights. The text mining process includes problem definition, text summarization, information extraction, visualization, and both unsupervised and supervised machine learning techniques like clustering and classification. Popular Python packages for text mining include NLTK, TextBlob, Gensim, Pattern, and spaCy, which support functions like tokenization, POS tagging, entity recognition, stemming, sentiment analysis, topic modeling, and more.

Uploaded by

Arham
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

TEXT ANALYTICS

MD ARHAM ANALYTICS
TEXT ANALYSTICS

TEXT MINING
Text mining is a process to extracting interesting and significant patterns to
explore knowledge from textual data source.
Unstructured Data examples
 Emails
 Insurance claims
 News article
 Web pages
 Patent portfolios
 Customer complaint letter
 Contracts
 Transcripts of phone calls with customers
 Technical documents

Text mining applications


 Customer profile analysis
 Trend analysis
 Information filtering
 Event tracks
 News stories classification
 Web search
 Sentiment analysis
Text mining process
Gain subject matter expertise and define text mining
Problem definition

Blogs Survey
Reviews
Unorganized state
Emails Articles
Social
media

Organization

Feature extraction

Analysis

Organized state Insight, recommendation or analytical output

Text mining process


 Text summarization
 Information extraction from text
 Information visualization
 Unsupervised learning methods - cluster
 Supervised learning methods - regression
Packages used in python
 Natural language toolkit(NLTK)
 Tokenization
 POS tagging
 Named entity recognition
 Parsing
 Stemming
 Classification
 Text bob
 Text processing Phrase extraction
 Association mining
 Sentiment analysis
 Genism
 Topic modelling
 Similarity analysis
 Pattern
 Crawling
 Pos tagging
 nGrams, sentiment analysis, WordNet
 Vector space models, clustering, classification
 Vocabulary
 Antonyms
 Part of speech
 Translation
 Pronunciation
 And hyphenation of a given word
 spaCy
 tokenization
 Pos tagging
 Parsing
 Named entity recognition
 Data Type
 List – [int, float, string] – it can change
 Tuple – it cannot change
 Set – values cannot be change it contain only unique values
 Dictionary it stores value with the key(name)

You might also like