Data Science Laboratory Worksheet
Data Science Laboratory Worksheet
Instructions:
Step 1: Select Your Target Page on TripAdvisor
1. Choose a Category:
o Popular Destinations: Go to the TripAdvisor destinations section
https://www.tripadvisor.com/TravelersChoice-Destinations
o Let it automatically detect data tables. Make sure it captures essential columns like
Name, Rating, Location, Review Count, etc.
3. Name your CSV file based on the category you scraped (e.g.,
tripadvisor_hotels.csv, tripadvisor_restaurants.csv).
</ CS 102: Data Preparations
• For Restaurants: Name, Cuisine Type, Rating, Price Range, Total Reviews.
o Standardize Text:
▪ Ensure consistent casing for text columns (e.g., restaurant names, locations).
o Split Columns:
▪ For columns like Price Range (e.g., "$20-$50"), split it into Min Price and Max
▪ Get average, median, and mode of numeric columns like Rating and Review
▪ For example, group hotels or restaurants by Location and calculate the average
rating for each.
5. Visualization Tasks
Create simple visualizations using matplotlib or seaborn.
Submission
2. Reflective Questions (Submit your answer using private comment after submitting your files
in google classroom)
Example:
o Describe two insights you gained from the data wrangling and analysis
Prepared by:
CHRISTIAN LESTER D. GIMENO