0% found this document useful (0 votes)
46 views

Assignment Questions - Data Analysis and Visualization Using Power BI and Tableau

Uploaded by

gparmar
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views

Assignment Questions - Data Analysis and Visualization Using Power BI and Tableau

Uploaded by

gparmar
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

ASSIGNMENT QUESTIONS:

1. Data Cleaning and Preprocessing:


Discuss the importance of handling missing data in a dataset. Provide three methods for handling
missing data and explain each with an example.
Explain the concept of outliers in data. How can outliers impact statistical analysis? Discuss two
techniques for detecting outliers.

2. Data Normalization and Standardization:


Define data normalization and standardization. Provide a stepbystep explanation of MinMax
normalization and Zscore normalization with numerical examples.
In what scenarios would you prefer normalization over standardization or vice versa? Justify
your answer.

3. Descriptive Statistics:
Calculate the mean, median, and mode for the following dataset: [10, 15, 20, 25, 30, 35, 40].
Discuss the use of the interquartile range (IQR) as a measure of variability. Calculate the IQR for
the dataset: [22, 25, 28, 30, 35, 40, 45].

4. Data Visualization Basics:


Create a bar chart to represent the sales of three products (A, B, C) in a given month. Provide
appropriate labels and titles.
Explain the significance of color theory in data visualization. How can color be effectively used
to convey information in a chart?

5. Exploratory Data Analysis:


Given a dataset of customer purchases, filter the data to show purchases made in the last quarter
of the year. Provide a brief interpretation of the results.
Sort a dataset of employee salaries in ascending order. Discuss the potential insights gained from
examining the sorted data.

NUMERICAL AND PRACTICAL BASED QUESTIONS:

1. Handling Missing Data:


Consider a dataset with missing values. Use pandas in Python to handle missing data through
imputation. Provide code snippets and explain the rationale behind your choices.

2. Data Normalization:
Given a dataset of exam scores ranging from 60 to 90, perform MinMax normalization to scale
the values between 0 and 1. Provide the normalized dataset.

3. Descriptive Statistics:
Using Python's statistical libraries, calculate the standard deviation for a dataset representing
daily temperatures over a month.

4. Data Visualization:
Using Matplotlib or any other suitable library, create a line chart to visualize the trend in stock
prices over the last six months. Include appropriate labels and a legend.

You might also like