0% found this document useful (0 votes)
98 views11 pages

A Dive Into Web Scraper World

This document provides an introduction to web scraping, discussing what it is and when it would be used. It also explores the legal aspects of web scraping, noting that personal use is generally legal but commercial use can be more complicated. The document outlines some lawsuits related to web scraping, including cases involving AT&T and eBay. It also describes the basic components of a web scraper, including the web crawler and data extraction processes. The document concludes by stating the mission of the project is to improve the company placement process for students.

Uploaded by

Saksham Tandon
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
98 views11 pages

A Dive Into Web Scraper World

This document provides an introduction to web scraping, discussing what it is and when it would be used. It also explores the legal aspects of web scraping, noting that personal use is generally legal but commercial use can be more complicated. The document outlines some lawsuits related to web scraping, including cases involving AT&T and eBay. It also describes the basic components of a web scraper, including the web crawler and data extraction processes. The document concludes by stating the mission of the project is to improve the company placement process for students.

Uploaded by

Saksham Tandon
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 11

A Dive into

Web Scraper
World
Introduction
What is web scraping and when would you want to use it?

Web scraping is related to


web indexing, whose task is to
index information on the web
with the help of a bot or web
crawler.

2
IS WEB
SCRAPING
LEGAL?
There are lots of different views of
different people on the legal and illegal
aspects of Scraping the Web. In today’s
world we can see many examples of the
legal use of Web Scraper such as price
comparison websites and reviewing
Websites.
“The key part is what you
want to do with the
scraped data.”

If you use it for your own, personal


use, then it is legal as it falls under
fair use doctrine.

The complications start if you


want to use scraped data for
other, especially commercial,
purposes.
4
Numerous Lawsuits that Tie
Web Scraping

⬡ Case of AT&T’s
“Andrew Auernheimer” was convicted of hacking based on the act of
web scraping. Although the data was unprotected and publically
available via AT&T’s website, the fact that he wrote web scrapers to
harvest that data in mass amounted to “brute force attack”.

This charge is a felony violation that is on par with hacking


or denial of service attacks and carries up to a 15-year
sentence for each charge.
Numerous Lawsuits that Tie
any Web Scraping
⬡ Case of “The Ebay’s Action”
Not much could be done about the practice until in 2000 eBay filed a
preliminary injunction against Bidder’s Edge. In the injunction eBay
claimed that the use of bots on the site, against the will of the company
violated Trespass to Chattels law.

The court granted the injunction because users had to opt in


and agree to the terms of service on the site and that a large
number of bots could be disruptive to eBay’s computer
systems.

6
Designing principles and
methods

Web Scraper

Web Crawler Data Extraction


for crawling from crawled
links. links.

7
Web Crawler
A Web Crawler is an internet bot
which systematically browses
the world wide web , for the
purpose of web indexing.
Data flow diagram

webpages Web structured


scraping data

9
Mission of our project!

❖ We look to bring about a massive change in


the sorting process of companies during
placements from a student's perspective.

❖ This initiative will extensively help students find


companies which suit their domain and need
more effectively reducing a large part of
manual work involved in the current setting.

10
References
❖ https://en.wikipedia.org/wiki/Web_scraping.h
tml

❖ https://www.webharvy.com/articles/what-is-
web-scraping.html

❖ http://resources.distilnetworks.com/h/i/5382
2104-is-webscraping-illegal-depends-on-what-
the-meaning-of-theword-is-is/181642

❖ https://www.quora.com/What-is-the-legality-
of-web-scraping

❖ https://en.wikipedia.org/wiki/Web_crawler

❖ Kolari , P. and Joshi, A. , “Web mining : research


and practice , Computing in Science &
Engineering”, IEEE Transactions on Knowledge
and Data Engineering, vol. 6, no. 2,Vol. 6 , No. 4 ,
11
2004

You might also like