0% found this document useful (0 votes)
6 views3 pages

Practice For Probability Theory

The document discusses various types of data, including numerical, categorical, ordinal, and nominal data, along with their classifications. It covers measures of central tendency such as mean, median, and mode, and their implications in datasets with outliers. Additionally, it explains percentiles, quartiles, and the IQR method for identifying outliers, providing examples and calculations throughout.

Uploaded by

h0ver251206
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views3 pages

Practice For Probability Theory

The document discusses various types of data, including numerical, categorical, ordinal, and nominal data, along with their classifications. It covers measures of central tendency such as mean, median, and mode, and their implications in datasets with outliers. Additionally, it explains percentiles, quartiles, and the IQR method for identifying outliers, providing examples and calculations throughout.

Uploaded by

h0ver251206
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Types of Data

1.​ Define numerical data and explain the difference between discrete and continuous data
with an example.
2.​ What is Categorical data ?Discuss about ordinal, and nominal data.
3.​ Can an ordinal variable ever be treated as numerical data? If yes, under what
conditions?
4.​ Classify the following variables as nominal, ordinal, discrete, or continuous:
○​ Customer satisfaction ratings (1-5 stars)
○​ Number of books read in a year
○​ Blood pressure readings of patients
○​ Names of different smartphone brands
5.​ A store collects customer data:
○​ Order amount
○​ Preferred product category
○​ Customer rating (1-5)
○​ Number of orders per month​
Classify each as nominal, ordinal, discrete, or continuous

Measures of Central Tendency (Mean, Median, Mode)


Dataset: {15, 18, 20, 22, 25, 30, 30, 35, 40, 42, 45, 50, 50}

1.​ Compute the mean, median, and mode.


2.​ If the dataset had an additional value 60, how would the mean, median, and mode
change?

3. If all values in a dataset are unique, does it have a mode? Why or why not?

4.​ A university professor conducted a mathematics test for a class of 35 students. After
grading, the professor categorized the students into different score ranges based on their
performance. The number of students for each score is shown below:​
Calculate the average (mean) test score of the students. What does this value indicate
about the overall class performance

5.​
6.​ A dataset’s mean is 35, but it has extreme values of 10 and 90.
○​ Which measure (mean, median, or mode) would best represent the data?

7. A company's salaries are: 2,5000, 2,7000, 3,0000, 3,2000, 5,0000, 1,20,000.

○​ Which measure of central tendency is best to choose ?


○​ If the highest salary (1,20,000.) is removed, how does this affect the mean and
median?

Percentiles, Quartiles, and IQR


Dataset: {5, 10, 12, 18, 22, 28, 30, 35, 40, 45, 50, 55, 60}

1.​ What is a percentile, and how is it different from a quartile?


2.​ Why is IQR a better measure than range in datasets with outliers?
3.​ What do you understand by following ways of finding percentile - Inclusive, Exclusive and
Midpoint Adjustment ?
4.​ A student scored in the 85th percentile on a standardized test.What does this indicate
about their performance compared to other test-takers?
5.​ Compute Q1 (25th percentile), Q2 (50th percentile), and Q3 (75th percentile) from given
data .Calculate the Interquartile Range (IQR) as well .
6.​ What does the 50th percentile correspond to in a dataset?
7.​ At Newton High School, the final exam scores for a class of 15 students are recorded as
follows:

Scores: {45, 50, 55, 60, 62, 65, 70, 75, 80, 82, 85, 90, 92, 95, 98}

●​ Find the score at the 70th percentile.


●​ Interpret the result—what does it mean for a student who scores at this percentile?
●​ If a scholarship is awarded to students scoring at or above the 90th percentile, what is
the minimum score required?
8.​ A marathon runner finishes in the 60th percentile. If there were 500 runners, how many
finished behind them?
9.​ A hospital analyzes patient recovery times:
○​ Q1 = 3 days, Median = 6 days, Q3 = 10 days.
○​ If a patient takes 25 days to recover, should it be considered an outlier

Outliers & IQR Method


Dataset: {10, 12, 15, 20, 25, 30, 30, 35, 40, 50, 100}

1.​ Identify the outliers using the IQR method.


2.​ Compute the upper and lower fences for detecting outliers.
3.​ A company detects fraudulent transactions where most values range between $50 and
$300, but one transaction is $5000.
○​ How can IQR help in fraud detection?
4.​ What are the consequences of removing outliers from a dataset?
5.​ A hospital records patient recovery times. Most recover in 5-7 days, but a few take 30+
days.
6.​ A manufacturing company analysed its processing times (in minutes) for completing
tasks. The boxplot below summarizes the processing times for a random sample of
tasks.

What is the Lower Bound (LB) of the given boxplot? Would you consider 5 minutes of
processing time as an outlier?

You might also like