Quick Guide: Steps To Perform Text Data Cleaning in Python

Beginner, Business Analytics, Infographic, Infographics, NLP, Python, Technique, Text, Unstructured Data

Introduction

Twitter has become an unavoidable channel for brand management. It has compelled brands to become more
responsive to their customers. On the other hand, the damage it can cause can’t be undone. The
140-character tweet has become a powerful tool for customers and users to convey messages directly to
brands.

For companies, these tweets carry a lot of information, such as sentiment, engagement, reviews, and
feedback on product features. However, mining these tweets isn’t easy. Why? Because before you mine this
data, you need to perform a lot of cleaning. Once extracted, these tweets can come with unwanted HTML
characters, bad grammar, and poor spelling, all of which make mining very difficult.

Below is the infographic, which displays the steps for cleaning tweet data before mining it. While the
example used here is Twitter, you can of course apply these methods to any text mining
problem. We’ve used Python to execute these cleaning steps.
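
As a rough illustration of the kind of cleaning described above, here is a minimal sketch using only the Python standard library. It is not the code from the infographic. The function name `clean_tweet`, the regular expressions, and the sample tweet are assumptions made up for this example, and the sketch covers only HTML characters, URLs, mentions, and hashtag symbols. It does not attempt to fix grammar or spelling.

```python
import html
import re

# Hypothetical patterns chosen for this sketch; real tweets may need stricter rules.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
WHITESPACE_RE = re.compile(r"\s+")


def clean_tweet(text: str) -> str:
    """Return a lowercased tweet with HTML entities, URLs and mentions removed."""
    text = html.unescape(text)       # "&amp;" -> "&", "&lt;" -> "<"
    text = URL_RE.sub(" ", text)     # drop links
    text = MENTION_RE.sub(" ", text) # drop @user mentions
    text = text.replace("#", "")     # keep the hashtag word, drop the symbol
    text = WHITESPACE_RE.sub(" ", text).strip()  # collapse leftover spacing
    return text.lower()


# Made-up sample tweet, for illustration only.
sample = "Love it &amp; more @brand http://t.co/x #Great"
print(clean_tweet(sample))  # prints: love it & more great
```

Each step is a separate line so that individual rules can be reordered, dropped, or replaced to suit a different text source.
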
Download the PDF version of this infographic and refer to the Python code to perform
text mining and follow your ‘Next Steps…’ -> Download Here

To view the complete article on effective steps to perform data cleaning using Python -> visit here
If you like what you just read and want to continue your analytics
learning, subscribe to our emails, follow us on Twitter, or like
our Facebook page.

Article URL - https://www.analyticsvidhya.com/blog/2015/06/quick-guide-text-data-cleaning-python/

avcontentteam
