Flipkart Web Scrapping Project

Download as pdf or txt
Download as pdf or txt
You are on page 1of 11

FLIPKART WEB SCRAPPING PROJECT

Web scraping is a technique using which the webpages from the


internet are fetched and parsed to understand and extract specific
information similar to a human being. Web scrapping consists of two
parts:
• Web Crawling→ Accessing the webpages over the internet and
pulling data from them.
• HTML Parsing→ Parsing the HTML content of the webpages
obtained through web crawling and then extracting specific information
from it.
Hence, web scrappers are applications/bots, which automatically send 2
requests to websites and then extract the desired information from the
07-11-2022

website output. Let’s take an example: how do we buy a phone online?


• Competitive pricing.
• Manufacturers monitor the market, whether the retailer is maintaining a minimum price or not.
• Sentiment analysis of the consumers, whether they are happy with the services and products or not.
• To aggregate news articles.
• To aggregate Marketing data.
• To gain financial insights from the market.
• To gather data for research.
• To generate marketing leads.
• To collect trending topics by media houses. And, the list goes on.
3
07-11-2022
WEB SCRAPPING PROCESS

4
07-11-2022
PROPOSED WORK
In this document, we’ll take the example of buying a phone online further and try to scrap the reviews
from the website about the phone that we are planning to buy. For example, if we open filpkart.com and
search for ‘iPhone’, the search result will be as follows:

5
07-11-2022
Then if we click on a product link, it will take us to to the following page:

6
07-11-2022
If we scroll down on this page, we’ll get to see the comments posted by the customers:

7
Our end goal is to build a web scraper that collects the reviews of a
07-11-2022
product from the internet.
PREREQUISITES

The things needed before we start building a python based web scraper are:
• Python installed.
• A Python IDE (Integrated Development Environment): like PyCharm, Spyder, or any other IDE
of choice.
• Flask Installed. (A simple command: pip install flask)
• MongoDB installed (Explained Later).
• Basic understanding of Python and HTML.
• Basic understanding of Git (download Git CLI from https://gitforwindows.org/ )

8
07-11-2022
APPLICATION
ARCHITECTURE

9
07-11-2022
CONCLUSION AND FUTURE SCOPE

In this project, we take a simple example of an online cell phone purchase and tries to explain the concepts
simply, extensively, and thoroughly to create a review scrapper right from scratch and then its deployment to a
cloud environment.
It is a step by step guide for creating a web scraper, in this case, a review scrapper right from scratch and then
deploying it to the heroku cloud platform.
Text scrappers are extensively used in the industry today for competitive pricing, market studies, customer
sentiment analysis, etc...

In the near future, Web scraping will be one of the important tools in the lead generation process. The
web scraping tool can make market research of the particular product/services and enormous benefits
to offer in the marketing field.

10
07-11-2022
FINAL RESULT

11
07-11-2022

You might also like