The document outlines a data warehousing project using Python, focusing on data cleaning and transformation techniques. It includes steps for handling missing values, removing duplicates, creating new columns, and visualizing data distributions. The visualizations cover age group distributions, average purchase amounts by country, and sign-ups by month, utilizing libraries such as pandas and seaborn.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0 ratings0% found this document useful (0 votes)
3 views
Lab9
The document outlines a data warehousing project using Python, focusing on data cleaning and transformation techniques. It includes steps for handling missing values, removing duplicates, creating new columns, and visualizing data distributions. The visualizations cover age group distributions, average purchase amounts by country, and sign-ups by month, utilizing libraries such as pandas and seaborn.