pandas__prac
Introduction
This document contains exercises designed to help you practice data analysis using
pandas. Each exercise includes tasks that involve data exploration, cleaning,
manipulation, and visualization. The datasets can be downloaded from the provided links.
Tasks:
1. Loading Data:
2. Basic Exploration:
3. Data Cleaning:
4. Feature Engineering:
• Analyze survival rates by gender (Sex) and embarkation port (Embarked).
• Create a pivot table showing survival rates by Pclass and Sex.
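One possible way to approach the two grouping tasks above is sketched here. The small DataFrame is made-up sample data, not the real dataset, and the column names (Survived, Pclass, Sex, Embarked) are assumed to match the file you load.

```python
import pandas as pd

# Made-up sample rows standing in for the real data (assumption).
df = pd.DataFrame({
    "Survived": [1, 0, 1, 0, 1, 0],
    "Pclass":   [1, 3, 2, 3, 1, 2],
    "Sex":      ["female", "male", "female", "male", "male", "female"],
    "Embarked": ["C", "S", "S", "S", "C", "Q"],
})

# The mean of a 0/1 column is the survival rate within each group.
rate_by_sex_port = df.groupby(["Sex", "Embarked"])["Survived"].mean()

# Pivot table: one row per passenger class, one column per sex.
# Combinations with no passengers show up as NaN.
rate_pivot = df.pivot_table(
    values="Survived", index="Pclass", columns="Sex", aggfunc="mean"
)

print(rate_by_sex_port)
print(rate_pivot)
```

With the real data, replace the sample DataFrame with the result of pd.read_csv on the downloaded file.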
6. Visualizations:
Tasks:
1. Data Cleaning:
3. Visualizations:
Tasks:
1. Loading Data:
2. Data Cleaning:
• Handle missing temperature values by filling them with the rolling mean (window = 12 months).
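A minimal sketch of the rolling-mean fill, assuming the data holds one row per month so that a 12-row window covers 12 months. The series name temp and the sample values are hypothetical.

```python
import pandas as pd

# Made-up monthly temperatures with two gaps (assumption: one row per month).
dates = pd.date_range("2000-01-01", periods=6, freq="MS")
temp = pd.Series([10.0, 12.0, None, 14.0, None, 16.0], index=dates)

# Trailing 12-row window; min_periods=1 lets the mean be computed from
# whatever non-missing values the window holds so far.
rolling_mean = temp.rolling(window=12, min_periods=1).mean()

# Only the missing entries are replaced; observed values stay untouched.
filled = temp.fillna(rolling_mean)

print(filled)
```

A gap at the very start of the series has no earlier values to average, so it would remain missing under this approach. If the data covers several cities or countries, the same step would likely need to run per group, for example through groupby(...).transform.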
• Remove records with invalid country names (e.g., empty strings).
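The filter below is one way to drop such records. The column names Country and Temp are hypothetical, and treating whitespace-only and missing names as invalid is an assumption that goes slightly beyond the empty-string example in the task.

```python
import pandas as pd

# Made-up rows; "Country" and "Temp" are hypothetical column names.
df = pd.DataFrame({
    "Country": ["Brazil", "", "  ", None, "Kenya"],
    "Temp": [25.1, 18.0, 19.5, 20.2, 23.7],
})

# Turn missing names into "" and strip surrounding whitespace so that
# empty, blank, and missing values can all be caught by one comparison.
country = df["Country"].fillna("").str.strip()
valid = country != ""

clean = df[valid].reset_index(drop=True)
print(clean)
```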
4. Visualizations:
Tasks:
1. Loading Data:
3. Feature Engineering:
4. Visualizations:
• Plot the price trend of the top 3 performing stocks over time.
• Create a histogram of daily returns for a selected stock.
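The data preparation behind these two plots might look like the sketch below. The column names (Date, Ticker, Close), the ticker symbols, and the prices are all made up. "Top performing" is read here as the largest return from first to last close, which is an assumption because the task does not define it.

```python
import pandas as pd

# Made-up long-format prices: one row per (date, ticker) pair.
prices = pd.DataFrame({
    "Date": pd.to_datetime(["2024-01-01", "2024-01-02", "2024-01-03"] * 4),
    "Ticker": ["AAA"] * 3 + ["BBB"] * 3 + ["CCC"] * 3 + ["DDD"] * 3,
    "Close": [100.0, 110.0, 120.0, 50.0, 50.0, 55.0,
              20.0, 19.0, 18.0, 10.0, 12.0, 15.0],
})

# Wide layout: one column per ticker, indexed by date.
wide = prices.pivot(index="Date", columns="Ticker", values="Close").sort_index()

# Overall return from first to last close, used to rank the stocks.
total_return = wide.iloc[-1] / wide.iloc[0] - 1
top3 = total_return.nlargest(3).index.tolist()
top3_prices = wide[top3]

# Daily returns for one selected stock; the first day has no previous
# close, so its NaN is dropped.
daily_returns = wide["AAA"].pct_change().dropna()

print(top3)
print(daily_returns)
```

With matplotlib installed, top3_prices.plot() draws the price trend and daily_returns.plot.hist(bins=30) draws the histogram.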
5. Advanced Analysis:
Exercise 5: Netflix Data Analysis
Dataset: Netflix Movies and TV Shows Dataset (Use netflix_titles.csv).
Tasks:
2. Data Cleaning:
4. Visualizations:
• Create a bar chart for the top 5 countries producing Netflix content.
• Plot the distribution of movie durations using a histogram.
• Visualize the trend of Netflix content added over time.
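The three plots above each need a small aggregation first; a hedged sketch follows. The sample rows are made up. The column names (type, country, duration, date_added) and the value formats ("90 min", "September 25, 2021") are assumptions about the file, so check them against df.head() after loading.

```python
import pandas as pd

# Made-up rows; column names and value formats are assumptions.
df = pd.DataFrame({
    "type": ["Movie", "TV Show", "Movie", "Movie", "TV Show"],
    "country": ["United States", "India", "United States, India", None, "Japan"],
    "duration": ["90 min", "2 Seasons", "120 min", "75 min", "1 Season"],
    "date_added": ["September 25, 2021", "January 1, 2020", "March 3, 2020",
                   "July 7, 2021", "May 5, 2021"],
})

# Top countries: a title may list several countries separated by commas,
# so split and explode before counting.
country_counts = (
    df["country"].dropna().str.split(",").explode().str.strip().value_counts()
)
top5 = country_counts.head(5)

# Movie durations: pull the leading number out of strings like "90 min".
movies = df[df["type"] == "Movie"]
movie_minutes = movies["duration"].str.extract(r"(\d+)")[0].astype(int)

# Content added per year: parse the date text, then count titles by year.
added = pd.to_datetime(
    df["date_added"].str.strip(), format="%B %d, %Y", errors="coerce"
)
per_year = added.dt.year.value_counts().sort_index()

print(top5)
print(movie_minutes.tolist())
print(per_year)
```

With matplotlib installed, top5.plot.bar(), movie_minutes.plot.hist(), and per_year.plot() produce the bar chart, histogram, and trend line respectively.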
5. Advanced Analysis: