Assignment 4 - Updated 2 - 1 - 1

Uploaded by

Meera

ASSIGNMENT

WEB SCRAPING – ASSIGNMENT 4

• Read all the problem statements and notes carefully, and scrape the required data using any web scraping tool of
your choice.
• You have to handle commonly occurring exceptions by writing exception-handling code. For
information about Selenium exceptions, you may visit the following links:
1. https://selenium-python.readthedocs.io/api.html
2. https://www.guru99.com/exception-handling-selenium.html
3. https://stackoverflow.com/questions/38022658/selenium-python-handling-no-such-element-exception/38023345
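
The exception-handling requirement above can be sketched as a small helper that wraps each lookup and stores a placeholder when the lookup fails. This is only an illustrative sketch: `safe_extract` and the "-" placeholder are hypothetical choices, and a plain dictionary with `KeyError` stands in for a page element. With Selenium, the `exceptions` tuple would presumably hold `NoSuchElementException` from `selenium.common.exceptions`.

```python
def safe_extract(getter, default="-", exceptions=(Exception,)):
    """Call getter() and return its result, or default if it raises.

    getter     -- zero-argument callable that performs one lookup
    default    -- placeholder stored when the lookup fails (hypothetical choice)
    exceptions -- tuple of exception types to treat as "value missing"
    """
    try:
        return getter()
    except exceptions:
        return default


# Stand-in for one scraped row; a real scraper would query page elements instead.
row = {"rank": "1", "name": "Example title"}

rank = safe_extract(lambda: row["rank"], exceptions=(KeyError,))
views = safe_extract(lambda: row["views"], exceptions=(KeyError,))  # key is absent

print(rank, views)
```

Listing only the expected exception types keeps unrelated errors visible instead of silently replacing them with the placeholder.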
1. Scrape the details of the most-viewed videos on YouTube from Wikipedia.

Url = https://en.wikipedia.org/wiki/List_of_most-viewed_YouTube_videos
You need to find the following details:
A) Rank
B) Name
C) Artist
D) Upload date
E) Views
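
A tabular page such as this one can be turned into one record per row by pairing each row's cells with the header cells. The following is a minimal sketch using only Python's standard `html.parser`. `TableParser` and `SAMPLE_HTML` are hypothetical, and the sample values are placeholders rather than data from the Wikipedia page, whose real markup is more complex.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <th>/<td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Placeholder markup and values, not real data from the Wikipedia page.
SAMPLE_HTML = """
<table>
  <tr><th>Rank</th><th>Name</th><th>Artist</th><th>Upload date</th><th>Views</th></tr>
  <tr><td>1</td><td>Video A</td><td>Artist A</td><td>2020-01-01</td><td>10.0</td></tr>
  <tr><td>2</td><td>Video B</td><td>Artist B</td><td>2021-02-02</td><td>9.5</td></tr>
</table>
"""

parser = TableParser()
parser.feed(SAMPLE_HTML)

# The first row holds the column names; the rest are data rows.
header, *body = parser.rows
records = [dict(zip(header, cells)) for cells in body]

for record in records:
    print(record)
```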

2. Scrape the details of Team India's international fixtures from bcci.tv.

Url = https://www.bcci.tv/
You need to find the following details:
A) Series
B) Place
C) Date
D) Time
Note: - From the bcci.tv home page, you have to reach the international fixtures page through code.
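
Reaching a page "through code" means locating the navigation link on the home page instead of hard-coding its address. One way to sketch this with the standard library is to collect every link, match on its visible text, and resolve the `href` against the home page URL. `LinkFinder`, `find_link`, and the `/fixtures` path in `HOME_HTML` are hypothetical; the real bcci.tv markup will differ. With Selenium, a comparable step would be locating the link with `By.PARTIAL_LINK_TEXT` and clicking it.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkFinder(HTMLParser):
    """Record (link text, href) for every <a href=...> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def find_link(html, base_url, label):
    """Return the absolute URL of the first link whose text contains label."""
    finder = LinkFinder()
    finder.feed(html)
    for text, href in finder.links:
        if label.lower() in text.lower():
            return urljoin(base_url, href)
    return None


# Hypothetical home-page fragment; the "/fixtures" path is an assumption.
HOME_HTML = '<nav><a href="/news">News</a><a href="/fixtures">International Fixtures</a></nav>'

fixtures_url = find_link(HOME_HTML, "https://www.bcci.tv/", "fixtures")
print(fixtures_url)
```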

3. Scrape the details of the state-wise GDP of India from statisticstimes.com.

Url = http://statisticstimes.com/
You have to find the following details:
A) Rank
B) State
C) GSDP (18-19) at current prices
D) GSDP (19-20) at current prices
E) Share (18-19)
F) GDP ($ billion)
Note: - From the statisticstimes home page, you have to reach the economy page through code.

4. Scrape the details of trending repositories on GitHub.

Url = https://github.com/
You have to find the following details:
A) Repository title
B) Repository description
C) Contributors count
D) Language used

Note: - From the home page, you have to click on the Trending option in the Explore menu through code.

5. Scrape the details of the top 100 songs on billboard.com.

Url = https://www.billboard.com/
You have to find the following details:
A) Song name
B) Artist name
C) Last week rank
D) Peak rank
E) Weeks on board

Note: - From the home page, you have to click on the Charts option and then on the Hot 100 page link through code.

6. Scrape the details of the highest-selling novels.

A) Book name
B) Author name
C) Volumes sold
D) Publisher
E) Genre

Url = https://www.theguardian.com/news/datablog/2012/aug/09/best-selling-books-all-time-fifty-shades-grey-compare

7. Scrape the details of the most-watched TV series of all time from imdb.com.

Url = https://www.imdb.com/list/ls095964455/
You have to find the following details:
A) Name
B) Year span
C) Genre
D) Run time
E) Ratings
F) Votes
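
Fields such as year span, run time, and votes usually arrive as display text and need converting before analysis. The helpers below are a hedged sketch using regular expressions. The function names and the input formats ("(2011–2019)", "57 min", "1,234,567") are assumptions about how such a page might render these values, not confirmed details of the IMDb list.

```python
import re


def parse_year_span(text):
    """Return (start, end) years from text like '(2011–2019)'.

    end is None when the span is open-ended or a single year is given.
    """
    match = re.search(r"(\d{4})\s*[–-]?\s*(\d{4})?", text)
    if not match:
        return None
    start = int(match.group(1))
    end = int(match.group(2)) if match.group(2) else None
    return start, end


def parse_minutes(text):
    """Return the run time in minutes from text like '57 min', or None."""
    match = re.search(r"(\d+)\s*min", text)
    return int(match.group(1)) if match else None


def parse_count(text):
    """Return an int from text like '1,234,567', or None if it has no digits."""
    digits = re.sub(r"[^\d]", "", text)
    return int(digits) if digits else None


# Placeholder strings in the assumed formats, not values taken from the site.
print(parse_year_span("(2011–2019)"))
print(parse_minutes("57 min"))
print(parse_count("1,234,567"))
```

Returning `None` for unparseable text keeps missing values distinguishable from a genuine zero.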

8. Scrape the details of datasets from the UCI Machine Learning Repository.

Url = https://archive.ics.uci.edu/
You have to find the following details:
A) Dataset name
B) Data type
C) Task
D) Attribute type
E) No of instances
F) No of attributes
G) Year

Note: - From the home page, you have to go to the Show All Dataset page through code.
