0% found this document useful (0 votes)
54 views2 pages

Lab 5 6 Data Cleaning Preparation and Visualization

The document outlines two laboratory activities focused on data cleaning and visualization in Excel. Activity 5 involves cleaning raw data by removing duplicates, handling missing values, correcting errors, and standardizing text, while Activity 6 emphasizes creating and interpreting various charts to visualize sales data. Both activities include specific instructions and expected outputs for effective data management and analysis.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views2 pages

Lab 5 6 Data Cleaning Preparation and Visualization

The document outlines two laboratory activities focused on data cleaning and visualization in Excel. Activity 5 involves cleaning raw data by removing duplicates, handling missing values, correcting errors, and standardizing text, while Activity 6 emphasizes creating and interpreting various charts to visualize sales data. Both activities include specific instructions and expected outputs for effective data management and analysis.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Laboratory Activity 5: Data Cleaning and Preparation in Excel

Objective:

 Learn how to clean and preprocess raw data in MS Excel.


 Identify and handle missing values, duplicates, and errors.
 Apply basic data transformation techniques.

Instructions:

1. Download the Dataset: Open the provided dataset (Lab5_RawData.xlsx).


2. Remove Duplicates: Identify and remove any duplicate records.
3. Handle Missing Values: Use Excel functions (e.g., IF, ISBLANK, AVERAGE, MEDIAN) to
fill missing data appropriately.
4. Correct Data Errors: Identify and correct inconsistent data (e.g., incorrect date formats,
inconsistent spelling of categories).
5. Apply Filters and Sorting: Use Sort & Filter to organize data for better readability.
6. Standardize Text Data: Convert text to proper case and remove unnecessary spaces
using Excel functions (TRIM, PROPER).

Dataset (Lab5_RawData.xlsx) - Sample Columns:

Customer Name Age Gender Purchase Date of Region


ID Amount Purchase
1001 John Doe 25 Male 120.5 01-02-2023 North
1002 Jane Smith Female 89.9 2023/02/15 East
1003 Alice King 30 Female 15-03-2023 South
1001 JOHN 25 Male 120.5 01-02-2023 North
DOE
1005 Mike Chan 40 Male 200 03-04-2023 West

Expected Output:

 No duplicates.
 Missing Age values filled using median age.
 Missing Purchase Amount values filled with the average.
 Standardized text format (e.g., “John Doe” instead of “JOHN DOE”).
 Correct date formatting.
Laboratory Activity 6: Data Visualization in Excel

Objective:

 Create meaningful visual representations of data using Excel charts.


 Interpret insights from visualized data.

Instructions:

1. Download the Dataset: Open the provided dataset (Lab6_SalesData.xlsx).


2. Create Charts:
o Sales Trend Analysis: Create a line chart to show monthly sales trends.
o Regional Sales Performance: Use a bar chart to compare total sales per region.
o Customer Demographics: Create a pie chart showing gender distribution.
3. Customize Charts:
o Add proper titles, axis labels, and legends.
o Format data labels for better readability.
4. Interpret Insights:
o Identify the month with the highest and lowest sales.
o Determine which region generates the most revenue.
o Observe the proportion of male vs. female customers.

Dataset (Lab6_SalesData.xlsx) - Sample Columns:

Month Total Male Female North South East West


Sales Customers Customers Sales ($) Sales ($) Sales Sales
($) ($) ($)
January 15000 45 55 4000 3000 5000 3000
Februar 18000 50 50 4500 4000 6000 3500
y
March 22000 55 45 6000 5000 7000 4000
April 17000 47 53 4500 3500 5000 4000

Expected Output:

 A line chart showing increasing/decreasing sales trends.


 A bar chart comparing regional sales.
 A pie chart illustrating customer demographics.

You might also like