Parsing The Web: Let S Find The Following Data For The First 100 Movies

The document describes parsing web data to extract the release date, movie title, and production budget for the first 100 movies from a website. It uses the Beautiful Soup and Pandas libraries in Python to make a request to the target URL, parse the HTML response, extract the data from table rows into a dictionary, add it to an info array, and convert that into a Pandas dataframe for output.

Uploaded by

Josue Sanchez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views3 pages

Parsing The Web: Let S Find The Following Data For The First 100 Movies

Uploaded by

Josue Sanchez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

PARSING THE WEB

Let´s find the following data for the first 100 movies:

Release Date Movie Production Budget

Código Implementado:
import requests
# Import the beautiful soup
from bs4 import BeautifulSoup
# Export library
import pandas as pd

TARGET_URL='https://www.the-numbers.com/movie/budgets/all'

info=[] # arreglo general

data={} # diccionoario final

myData=requests.get(TARGET_URL)
# Using beautiful soup library for parsing fetched data
soup= BeautifulSoup(myData.text, 'html.parser')
elements=soup.find_all("tr")
for elem in elements:
valores = []
dat = {}
itemtd=elem.find_all("td")
if itemtd:
valores.append(itemtd[1].text)
valores.append(itemtd[2].text)
valores.append(itemtd[3].text)

#se almacena la data en diccionarios con clave numérica por posición

dat[itemtd[0].text]=valores

#se agrega al arreglo general para crear el diccionario final

info.append(dat)

data["peliculas"]=info # se agraga para clave valor al diccionario data

dataFrame = pd.DataFrame.from_dict(data)
print(dataFrame)

Resultado al ejecutar el código:

Beautiful Soup Tutorial
100% (2)
Beautiful Soup Tutorial
56 pages
Web Scraping Cheat Sheet (2021), Python For Web Scraping by Frank Andrade Geek Culture - Medium
100% (3)
Web Scraping Cheat Sheet (2021), Python For Web Scraping by Frank Andrade Geek Culture - Medium
26 pages
Python Module-4
No ratings yet
Python Module-4
109 pages
Web Scraping With BeautifulSoup
100% (1)
Web Scraping With BeautifulSoup
8 pages
Practical Introduction To Web Scraping in Python
100% (1)
Practical Introduction To Web Scraping in Python
14 pages
Lecture03 Data II
No ratings yet
Lecture03 Data II
42 pages
Implementing Web Scraping in Python With Beautifulsoup
No ratings yet
Implementing Web Scraping in Python With Beautifulsoup
6 pages
Efficient Python Tricks and Tools For Data Scientists
100% (1)
Efficient Python Tricks and Tools For Data Scientists
23 pages
Tools For Data Science Notes
No ratings yet
Tools For Data Science Notes
16 pages
Beautiful Soup Documentation: Getting Help
100% (1)
Beautiful Soup Documentation: Getting Help
56 pages
SESION 10 (Pandas 2)
No ratings yet
SESION 10 (Pandas 2)
120 pages
Beautiful Soup Documentation - Beautiful Soup 4.9.0 Documentation
No ratings yet
Beautiful Soup Documentation - Beautiful Soup 4.9.0 Documentation
59 pages
Data Wrangling & Visualization - II
No ratings yet
Data Wrangling & Visualization - II
41 pages
Beautiful Soup
No ratings yet
Beautiful Soup
61 pages
Web Engineering Laboratory 3: Web2py Database Access
No ratings yet
Web Engineering Laboratory 3: Web2py Database Access
12 pages
DAP Module4
No ratings yet
DAP Module4
109 pages
Beautiful Soup Documentation
No ratings yet
Beautiful Soup Documentation
61 pages
Beautifulsoup: Web Scraping With Python
No ratings yet
Beautifulsoup: Web Scraping With Python
43 pages
Extracting Data From HTML Table
No ratings yet
Extracting Data From HTML Table
12 pages
Getting Data II Solutions
No ratings yet
Getting Data II Solutions
9 pages
Beautiful Soup Documentation
No ratings yet
Beautiful Soup Documentation
53 pages
DAP - Module 4
No ratings yet
DAP - Module 4
57 pages
Beautiful Soup Documentation - Beautiful Soup 4.13.0 Documentation
No ratings yet
Beautiful Soup Documentation - Beautiful Soup 4.13.0 Documentation
54 pages
Importing Data in Python Ii: Importing Flat Files From The Web
No ratings yet
Importing Data in Python Ii: Importing Flat Files From The Web
22 pages
02 Omdb-Api
No ratings yet
02 Omdb-Api
27 pages
Beautiful Soup Documentation - Beautiful Soup 4.4.0 Documentation
No ratings yet
Beautiful Soup Documentation - Beautiful Soup 4.4.0 Documentation
49 pages
On Python Project VI Semester: Academic Year: 2018-2019
No ratings yet
On Python Project VI Semester: Academic Year: 2018-2019
7 pages
Chapter1 PDF
No ratings yet
Chapter1 PDF
22 pages
Movie Management System
No ratings yet
Movie Management System
12 pages
01 Python 02 Data Sourcing
No ratings yet
01 Python 02 Data Sourcing
9 pages
Citl Exp 8
No ratings yet
Citl Exp 8
7 pages
Beautifulsoap4 Experiments
No ratings yet
Beautifulsoap4 Experiments
7 pages
Python Using AI
No ratings yet
Python Using AI
9 pages
A Guide To Web Scraping in Python Using Beautiful Soup
No ratings yet
A Guide To Web Scraping in Python Using Beautiful Soup
6 pages
Movie Ticket
No ratings yet
Movie Ticket
15 pages
Beautiful Soup
No ratings yet
Beautiful Soup
40 pages
Python Programs
No ratings yet
Python Programs
20 pages
Webscraping1 1 PDF
No ratings yet
Webscraping1 1 PDF
10 pages
Web Scrapping
No ratings yet
Web Scrapping
9 pages
Web Scraping
No ratings yet
Web Scraping
11 pages
2025events Scraper
No ratings yet
2025events Scraper
5 pages
MK
No ratings yet
MK
4 pages
Simple Web Scraping Example Using BeautifulSoup in
No ratings yet
Simple Web Scraping Example Using BeautifulSoup in
4 pages
Web Scraping Using Python (Step by Step Tutorial) - Pythonista Planet
No ratings yet
Web Scraping Using Python (Step by Step Tutorial) - Pythonista Planet
11 pages
Message
No ratings yet
Message
3 pages
SDFG
No ratings yet
SDFG
4 pages
Scraperskank
No ratings yet
Scraperskank
3 pages
Beginner Guide To Web Scraping of Data
No ratings yet
Beginner Guide To Web Scraping of Data
14 pages
Python Cheat Sheet - The Basics CC
No ratings yet
Python Cheat Sheet - The Basics CC
2 pages
Programming 2 Lectures
No ratings yet
Programming 2 Lectures
52 pages
Sahil Malhotra 16 BCE 0113 Web Mining L51+L52: 1. Universal Crawling 1.1. CODE
No ratings yet
Sahil Malhotra 16 BCE 0113 Web Mining L51+L52: 1. Universal Crawling 1.1. CODE
11 pages
Python Cheat Sheet - The Basics Coursera
No ratings yet
Python Cheat Sheet - The Basics Coursera
2 pages
Web Scraping and Data Collection CheatSheet 1731972399
No ratings yet
Web Scraping and Data Collection CheatSheet 1731972399
10 pages
Retrieving Data From The Web
No ratings yet
Retrieving Data From The Web
9 pages
Api and Data Structure
No ratings yet
Api and Data Structure
3 pages
Essential n8n Playbook
From Everand
Essential n8n Playbook
Leandro Calado
No ratings yet
Python and SQLite Development
From Everand
Python and SQLite Development
Agus Kurniawan
No ratings yet
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
From Everand
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
Abdelfattah Ragab
No ratings yet
Firebase Storage for Angular: A reliable file upload solution for your applications
From Everand
Firebase Storage for Angular: A reliable file upload solution for your applications
Abdelfattah Ragab
No ratings yet
How to a Developers Guide to 4k: Developer edition, #3
From Everand
How to a Developers Guide to 4k: Developer edition, #3
Xinc Cyberwizard
No ratings yet

Parsing The Web: Let S Find The Following Data For The First 100 Movies

Uploaded by

Parsing The Web: Let S Find The Following Data For The First 100 Movies

Uploaded by

PARSING THE WEB

Release Date Movie Production Budget

info=[] # arreglo general

#se almacena la data en diccionarios con clave numérica por posición

#se agrega al arreglo general para crear el diccionario final

data["peliculas"]=info # se agraga para clave valor al diccionario data

Resultado al ejecutar el código:

You might also like