0% found this document useful (0 votes)
12 views2 pages

Q1 Dev Ans

Uploaded by

24cset03
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views2 pages

Q1 Dev Ans

Uploaded by

24cset03
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Question 1: What does EDA stand for?

Answer: EDA stands for Exploratory Data Analysis.

Explain the purpose of grouping datasets.

Answer: Grouping datasets involves organizing data into categories based on one or more
attributes. The purpose is to apply aggregate functions to each group, making it easier to
analyze and compare subsets of the data.

Describe a basic approach to merging two datasets.

Answer: To merge two datasets, you would align them based on a common key (or keys) and
combine them using a join operation, ensuring that the data in the key columns match and
handling any discrepancies such as missing values.

Construct a strategy for using a pivot table to analyze sales data by region and product.

Answer: To analyze sales data using a pivot table:

1. Load Data: Import sales data into a pivot table tool.


2. Define Rows and Columns: Set regions as rows and products as columns.
3. Apply Aggregations: Use aggregation functions (e.g., sum) to calculate total sales for
each product in each region.
4. Analyze Results: Review the pivot table to identify top-selling products and regions
with the highest sales.

Answer: To analyze sales data using a pivot table:

1. Load Data: Import sales data into a pivot table tool.


2. Define Rows and Columns: Set regions as rows and products as columns.
3. Apply Aggregations: Use aggregation functions (e.g., sum) to calculate total sales for
each product in each region.
4. Analyze Results: Review the pivot table to identify top-selling products and regions
with the highest sales.

: Design a basic workflow for merging and analyzing data from multiple sources.

Answer: A basic workflow for merging and analyzing data might include:

1. Data Collection: Gather data from multiple sources.


2. Data Cleaning: Clean and preprocess data to ensure consistency.
3. Merging: Use a common key to merge datasets into a unified dataset.
4. Transformation: Reshape and pivot the data as needed to prepare it for analysis.
5. Analysis: Apply statistical methods and visualizations to analyze the merged data.
6. Reporting: Summarize findings and insights in a report.

7. Why are visual aids important in EDA?


8. Answer: Why are visual aids important in EDA?

9. Visual aids are important in EDA because they help in understanding complex data
patterns, identifying trends and outliers, and communicating findings clearly through
graphical representation.

You might also like