Question Bank of Quiz - Retest

The document is a final test consisting of 100 questions covering various topics in statistics, data analysis, and programming in R and Python. Questions include identifying data types, reshaping arrays, creating plots, and understanding statistical concepts. It requires participants to select the correct answers from multiple-choice options.

Uploaded by

Shreya Khasne

final test 01

The test contains 100 questions


1) Which of these is an example of ordinal data?
Answer by ticking the correct answers

[ ] Blood type (A, B, AB, O)

[ ] Grades in school (A, B, C)

[ ] Number of pets owned

[ ] Temperature in Celsius

2) How do you reshape a NumPy array arr to have 3 rows and 4 columns?
Answer by ticking the correct answers

[ ] arr.resize(3,4)

[ ] arr.reshape(3,4)

[ ] arr.reshape((3,4))

[ ] arr.shape(3,4)

3) How do you create an array with values ranging from 1 to 10 in NumPy?
Answer by ticking the correct answers

[ ] np.range(1,10)

[ ] np.arange(1,11)

[ ] np.linspace(1,10)

[ ] np.list(1,10)
4) The normal distribution is:
Answer by ticking the correct answers

[ ] Skewed left

[ ] Bell-shaped and symmetric

[ ] Uniformly distributed

[ ] Bimodal

5) Which distribution shape can a histogram help identify?


Answer by ticking the correct answers

[ ] Normal

[ ] Skewed

[ ] Bimodal

[ ] All of the above

6) A dataset with multiple modes is called:


Answer by ticking the correct answers

[ ] Unimodal

[ ] Bimodal

[ ] Multimodal

[ ] Nonmodal
7) The mode represents:
Answer by ticking the correct answers

[ ] The average value of a dataset

[ ] The value that occurs most frequently

[ ] The middle value of the dataset

[ ] The spread of data

8) In a normal distribution, the mean, median, and mode are:


Answer by ticking the correct answers

[ ] Different

[ ] Equal

[ ] Random

[ ] Always zero

9) How can you assign a default value to a function argument in R?


Answer by ticking the correct answers

[ ] By assigning it in the function body

[ ] Using the default() function

[ ] Assigning a value in the argument list

[ ] Using the set() function


10) In ggplot2, which function is used for a histogram?
Answer by ticking the correct answers

[ ] geom_histogram()

[ ] geom_col()

[ ] geom_bar()

[ ] geom_density()

11) What is the output of if (FALSE) print("Hello")?


Answer by ticking the correct answers

[ ] Hello

[ ] FALSE

[ ] NULL

[ ] No output

12) In ggplot2, which geom function is used to create a line plot?


Answer by ticking the correct answers

[ ] geom_bar()

[ ] geom_line()

[ ] geom_histogram()

[ ] geom_col()
13) A null hypothesis is:
Answer by ticking the correct answers

[ ] A statement of no effect or no difference

[ ] Always true

[ ] A claim of significant effect

[ ] Randomly chosen

14) How do you create an infinite loop in R?


Answer by ticking the correct answers

[ ] while(TRUE) { ... }

[ ] for (i in 1:Inf) { ... }

[ ] repeat { ... }

[ ] All of the above

15) In base R, which argument in heatmap() controls clustering?


Answer by ticking the correct answers

[ ] scale

[ ] clustering

[ ] hclustfun

[ ] col
16) What does np.array([1, 2, 3]) return?
Answer by ticking the correct answers

[ ] A list

[ ] A NumPy array

[ ] A tuple

[ ] A dictionary

17) Which type of plot is most useful for detecting outliers?


Answer by ticking the correct answers

[ ] Box plot

[ ] Line plot

[ ] Histogram

[ ] Scatter plot

18) Type I error occurs when:


Answer by ticking the correct answers

[ ] Rejecting a true null hypothesis

[ ] Accepting a true null hypothesis

[ ] Rejecting a false null hypothesis

[ ] No error is made
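
As a hedged illustration of what a Type I error means in practice, the sketch below simulates many studies in which the null hypothesis is true by construction and counts how often a two-sided z-test still rejects it. The significance level, sample size, trial count, and the helper name rejects_null are assumptions chosen for illustration, not part of the quiz.

```python
import random
from statistics import NormalDist, mean

random.seed(0)  # fixed seed so the sketch is repeatable

ALPHA = 0.05    # significance level (assumed for illustration)
N = 30          # sample size per simulated study
TRIALS = 4000   # number of simulated studies
z_crit = NormalDist().inv_cdf(1 - ALPHA / 2)  # two-sided critical value


def rejects_null(sample, mu0=0.0, sigma=1.0):
    """Two-sided z-test with known sigma: True if H0 (mean == mu0) is rejected."""
    z = (mean(sample) - mu0) / (sigma / len(sample) ** 0.5)
    return abs(z) > z_crit


# Every sample is drawn with true mean 0, so H0 is true in every trial
# and each rejection counted here is a Type I error.
false_rejections = sum(
    rejects_null([random.gauss(0.0, 1.0) for _ in range(N)])
    for _ in range(TRIALS)
)
type_i_rate = false_rejections / TRIALS
print(f"Estimated Type I error rate: {type_i_rate:.3f}")
```

Under these assumptions the estimated rate should land close to the chosen significance level, which is the long-run share of true null hypotheses that the test rejects.
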
19) In ggplot2, how do you convert a bar chart into a pie chart?
Answer by ticking the correct answers

[ ] Add coord_polar(theta = "y")

[ ] Use geom_pie()

[ ] Apply facet_wrap()

[ ] Change geom_col() to geom_point()

20) What visualization is best for checking if a dataset follows a normal distribution?
Answer by ticking the correct answers

[ ] Histogram

[ ] Scatter plot

[ ] Bar chart

[ ] Pie chart
21) Scenario: Employee Age Study

A survey is conducted to study the age distribution of employees in a company. The ages are measured in whole years (e.g., 25, 30, 35).

Question: What type of data is represented by the ages of employees?

Answer by ticking the correct answers

[ ] Continuous

[ ] Nominal

[ ] Discrete

[ ] Ordinal

22) Which chart should be used to analyze the relationship between three numerical variables?
Answer by ticking the correct answers

[ ] Scatter plot with color mapping

[ ] Pie chart

[ ] Histogram

[ ] Box plot
23) If you want to visualize the proportion of missing values in a dataset, which type of plot is most useful?
Answer by ticking the correct answers

[ ] Bar chart

[ ] Heatmap

[ ] Histogram

[ ] Line plot

24) Data such as "Yes" or "No" is:


Answer by ticking the correct answers

[ ] Discrete data

[ ] Continuous data

[ ] Nominal data

[ ] Ordinal data

25) Which Seaborn function is best for visualizing categorical data?


Answer by ticking the correct answers

[ ] sns.barplot()

[ ] sns.countplot()

[ ] sns.scatterplot()

[ ] Both sns.barplot() and sns.countplot()
26) How do you generate a random number between 0 and 1 in NumPy?
Answer by ticking the correct answers

[ ] np.random.rand()

[ ] np.random.random(10,15)

[ ] np.random.randint(0,1)

[ ] None

27) Which visualization is best for showing the distribution of a numerical variable?
Answer by ticking the correct answers

[ ] Histogram

[ ] Bar Chart

[ ] Line Plot

[ ] Scatter Plot

28) What is the best visualization for correlation between multiple numerical variables?
Answer by ticking the correct answers

[ ] Heatmap

[ ] Pie Chart

[ ] Line Plot

[ ] Histogram
29) How do you add a legend to a Matplotlib plot?
Answer by ticking the correct answers

[ ] plt.legend()

[ ] plt.add_legend()

[ ] plt.show_legend()

[ ] plt.make_legend()

30) Which function is used to create violin plots in Seaborn?


Answer by ticking the correct answers

[ ] sns.violinplot()

[ ] sns.boxplot()

[ ] sns.stripplot()

[ ] sns.scatterplot()

31) Which method is used to drop rows with missing values?


Answer by ticking the correct answers

[ ] df.dropna()

[ ] df.fillna()

[ ] df.remove_na()

[ ] df.dropna(axis=1)
32) What does sns.pairplot(df) do?
Answer by ticking the correct answers

[ ] Creates scatter plots for all pairwise relationships

[ ] Plots a single histogram

[ ] Shows a bar chart of categorical values

[ ] Draws a heatmap

33) How do you change the color palette in Seaborn?


Answer by ticking the correct answers

[ ] sns.set_palette("pastel")

[ ] sns.set_theme("colorful")

[ ] sns.color_map("red")

[ ] sns.set_color("blue")

34) What is the best plot for time-series data?


Answer by ticking the correct answers

[ ] Line Chart

[ ] Bar Chart

[ ] Pie Chart

[ ] Scatter Plot
35) How do you generate a random integer between 10 and 100?
Answer by ticking the correct answers

[ ] np.random.randint(10,100)

[ ] np.random.random(10,100)

[ ] np.random.uniform(10,100)

[ ] np.random.normal(10,100)

36) What does plt.xlabel("X-axis") do?


Answer by ticking the correct answers

[ ] Adds a title

[ ] Labels the X-axis

[ ] Labels the Y-axis

[ ] Adds a legend

37) Which function creates a heatmap in Seaborn?


Answer by ticking the correct answers

[ ] sns.heatmap()

[ ] sns.correlationplot()

[ ] sns.matrixplot()

[ ] sns.gridplot()
38) What argument is used to change the line color in plt.plot()?
Answer by ticking the correct answers

[ ] color

[ ] fill

[ ] linecolor

[ ] shade

39) Which Seaborn function is used to create a histogram?


Answer by ticking the correct answers

[ ] sns.histplot()

[ ] sns.distplot()

[ ] sns.barplot()

[ ] sns.scatterplot()

40) What does arr[1:4] return in NumPy?


Answer by ticking the correct answers

[ ] Elements from index 1 to 4

[ ] Elements from index 1 to 3

[ ] Elements from index 0 to 3

[ ] Elements from index 2 to 4
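
As a hedged illustration of the slice convention this question relies on, the sketch below uses a plain Python list. One-dimensional basic slicing in NumPy follows the same rule: the start index is included and the stop index is excluded. The variable names and values are illustrative only.

```python
# Illustrative values; a plain list stands in for a 1-D NumPy array here.
values = [10, 20, 30, 40, 50, 60]

# A slice start:stop includes the start index and excludes the stop index.
selected = values[1:4]

# Indices actually covered by the slice 1:4.
covered_indices = list(range(1, 4))

print(selected)         # [20, 30, 40]
print(covered_indices)  # [1, 2, 3]
```
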


41) How do you reset the index of a Pandas DataFrame?
Answer by ticking the correct answers

[ ] df.index_reset()

[ ] df.reindex()

[ ] df.reset_index()

[ ] df.drop_index()

42) How do you load built-in datasets in Seaborn?


Answer by ticking the correct answers

[ ] sns.datasets.load_dataset()

[ ] sns.load_dataset()

[ ] sns.get_data()

[ ] sns.read_data()

43) Which argument in geom_density() controls the transparency of the curve?
Answer by ticking the correct answers

[ ] alpha

[ ] color

[ ] size

[ ] linetype
44) In base R, what function is used to create multiple box plots in one plot?
Answer by ticking the correct answers

[ ] boxplot(var1, var2, ...)

[ ] plot.boxplot()

[ ] multi.boxplot()

[ ] box(var1, var2, ...)

45) What does the diagonal in a pair plot represent?


Answer by ticking the correct answers

[ ] Box plots

[ ] Histograms of each variable

[ ] Correlation values

[ ] Scatter plots

46) Which chart is best suited for showing time-series data?


Answer by ticking the correct answers

[ ] Heatmap

[ ] Line chart

[ ] Scatter plot

[ ] Bar chart
47) How do you select a single column from a Pandas DataFrame?
Answer by ticking the correct answers

[ ] df.column_name

[ ] df['column_name']

[ ] df.column['name']

[ ] df[[column_name]]

48) Which method displays the first 5 rows of a DataFrame?


Answer by ticking the correct answers

[ ] df.head()

[ ] df.first()

[ ] df.display()

[ ] df.show()

49) Which parameter controls point size in geom_point()?


Answer by ticking the correct answers

[ ] size

[ ] pointsize

[ ] width

[ ] alpha
50) Which chart is best suited for showing trends over time?
Answer by ticking the correct answers

[ ] Histogram

[ ] Line plot

[ ] Pie chart

[ ] Box plot

51) Which function is used to create a pie chart in base R?


Answer by ticking the correct answers

[ ] barplot()

[ ] pie()

[ ] hist()

[ ] plot()

52) What function is used to create a bar chart in base R?


Answer by ticking the correct answers

[ ] barplot()

[ ] hist()

[ ] plot()

[ ] pie()
53) What will the following code output?

greet <- function(name = "Guest") {
  paste("Hello,", name)
}

greet()

Answer by ticking the correct answers

[ ] Error

[ ] Hello, Guest

[ ] Null

[ ] Guest
54) What is the output of the following code?

add <- function(x, y) { x + y }

add(3, 5)

Answer by ticking the correct answers

[ ] 8

[ ] 15

[ ] Error

[ ] 3

55) Which of these returns the value for the first condition that evaluates to TRUE?


Answer by ticking the correct answers

[ ] ifelse()

[ ] switch()

[ ] case_when()

[ ] else

56) What will be the result of 3^2 + 2 * 3 in R?


Answer by ticking the correct answers

[ ] 15

[ ] 18

[ ] 21

[ ] 27
57) What is the data type of c(TRUE, FALSE, TRUE)?
Answer by ticking the correct answers

[ ] Numeric

[ ] Logical

[ ] Character

[ ] Complex

58) What does df.fillna(0) do?


Answer by ticking the correct answers

[ ] Replaces all missing values with 0

[ ] Removes all missing values

[ ] Deletes the entire DataFrame

[ ] Drops rows with missing values

59) What function in Seaborn is used for KDE (Kernel Density Estimation) plots?
Answer by ticking the correct answers

[ ] sns.kdeplot()

[ ] sns.histplot()

[ ] sns.densityplot()

[ ] sns.scatterplot()
60) What function returns the shape of a NumPy array?
Answer by ticking the correct answers

[ ] shape()

[ ] arr.shape

[ ] arr.size

[ ] arr.dimension

61) What is the main advantage of a scatter plot?


Answer by ticking the correct answers

[ ] Shows categorical relationships

[ ] Displays correlations between two numerical variables

[ ] Highlights median values

[ ] Represents time series data

62) What is the correct function for density plots in ggplot2?


Answer by ticking the correct answers

[ ] geom_density()

[ ] geom_histogram()

[ ] geom_boxplot()

[ ] geom_col()
63) Which function is used in ggplot2 for bar charts?
Answer by ticking the correct answers

[ ] geom_bar()

[ ] geom_point()

[ ] geom_line()

[ ] geom_histogram()

64) What does the return() function do in R?


Answer by ticking the correct answers

[ ] Exits the program

[ ] Exits the function and returns a value

[ ] Returns to the start of a loop

[ ] Returns nothing

65) What happens when break is used in a loop?


Answer by ticking the correct answers

[ ] Skips to the next iteration

[ ] Exits the loop

[ ] Stops the R session

[ ] Restarts the loop


66) What does the %in% operator do in R?
Answer by ticking the correct answers

[ ] Performs element-wise addition

[ ] Checks for membership

[ ] Combines two vectors

[ ] Assigns a value

67) Which function converts a numeric vector into a character vector?
Answer by ticking the correct answers

[ ] as.numeric()

[ ] as.character()

[ ] as.logical()

[ ] as.vector()

68) Which method creates an array of zeros in NumPy?


Answer by ticking the correct answers

[ ] np.zeros()

[ ] np.ones()

[ ] np.empty()

[ ] np.full()
69) What is the correct syntax for a for loop in R?
Answer by ticking the correct answers

[ ] for (i in 1:5) { print(i) }

[ ] for i in range(1:5):

[ ] for i from 1 to 5:

[ ] loop (i in 1:5) { print(i) }

70) Which measure is most affected by outliers?


Answer by ticking the correct answers

[ ] Mean

[ ] Median

[ ] Mode

[ ] Interquartile range
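
A small sketch, using made-up numbers, of how one extreme value moves the mean and the median by very different amounts. The data and variable names are assumptions for illustration only.

```python
from statistics import mean, median

# Made-up values for illustration.
salaries = [30, 32, 35, 36, 40]
with_outlier = salaries + [400]  # one extreme value added

# How far each measure moves once the extreme value is included.
mean_shift = mean(with_outlier) - mean(salaries)
median_shift = median(with_outlier) - median(salaries)

print(f"mean shift:   {mean_shift:.1f}")
print(f"median shift: {median_shift:.1f}")
```
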

71) Which function creates a box plot in base R?


Answer by ticking the correct answers

[ ] hist()

[ ] boxplot()

[ ] barplot()

[ ] density()
72) Which is not a measure of central tendency?
Answer by ticking the correct answers

[ ] Mean

[ ] Median

[ ] Mode

[ ] Standard deviation

73) The alternative hypothesis represents:


Answer by ticking the correct answers

[ ] The status quo

[ ] The presence of an effect or difference

[ ] No relationship in data

[ ] A sample statistic

74) Simple random sampling ensures:


Answer by ticking the correct answers

[ ] Equal chance for every population member to be selected

[ ] Selection based on convenience

[ ] Grouping data into clusters

[ ] Proportional selection of subgroups


75) Which of the following is an example of a random variable?
Answer by ticking the correct answers

[ ] Number of heads in 10 coin tosses

[ ] A fixed value like 3.14

[ ] A qualitative description like "red"

[ ] None of the above

76) In a normal distribution, about 99.7% of data falls within how many standard deviations?
Answer by ticking the correct answers

[ ] 1

[ ] 2

[ ] 3

[ ] 4

77) What is the total area under a normal distribution curve?


Answer by ticking the correct answers

[ ] 0.5

[ ] 1

[ ] 2

[ ] 10
78) The standard normal distribution has a mean of:
Answer by ticking the correct answers

[ ] 1

[ ] 0

[ ] -1

[ ] Undefined

79) Approximately what percentage of data falls within 1 standard deviation of the mean in a normal distribution?
Answer by ticking the correct answers

[ ] 50%

[ ] 68%

[ ] 95%

[ ] 99%
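
The coverage figures used in the normal-distribution questions can be checked numerically. The sketch below, which relies only on Python's standard library, computes the probability of falling within k standard deviations of the mean; the helper name coverage is an assumption introduced here.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1.
std_normal = NormalDist(mu=0.0, sigma=1.0)


def coverage(k):
    """Probability that a normal value lies within k standard deviations of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)


for k in (1, 2, 3):
    print(f"within {k} sd: {coverage(k):.4f}")
# within 1 sd: 0.6827
# within 2 sd: 0.9545
# within 3 sd: 0.9973
```

Because the coverage depends only on k, the same proportions hold for any normal distribution, not just the standard one.
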

80) Which measure of dispersion is most robust to outliers?


Answer by ticking the correct answers

[ ] Standard deviation

[ ] Range

[ ] Variance

[ ] Interquartile range (IQR)
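
To make the idea of robustness concrete, the sketch below compares three measures of dispersion on made-up data before and after the largest value is replaced by an extreme one. The data and the helper names iqr and value_range are assumptions for illustration; quartiles come from statistics.quantiles with its default method.

```python
from statistics import pstdev, quantiles

# Made-up values for illustration.
data = [10, 12, 13, 15, 16, 18, 19, 21]
with_outlier = data[:-1] + [210]  # largest value replaced by an extreme one


def iqr(values):
    """Interquartile range: third quartile minus first quartile."""
    q1, _, q3 = quantiles(values, n=4)
    return q3 - q1


def value_range(values):
    """Range: difference between the highest and lowest values."""
    return max(values) - min(values)


for label, values in (("original", data), ("with outlier", with_outlier)):
    print(
        f"{label:>12}: range={value_range(values)}, "
        f"sd={pstdev(values):.1f}, IQR={iqr(values)}"
    )
```

In this example the range and standard deviation grow sharply while the IQR does not change, because the quartiles are computed from the middle of the sorted data.
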


81) Range is defined as:
Answer by ticking the correct answers

[ ] The difference between the highest and lowest values

[ ] The average of the dataset

[ ] The most frequently occurring value

[ ] The middle value of the dataset

82) The classification of data into male and female is an example of:
Answer by ticking the correct answers

[ ] Nominal data

[ ] Ordinal data

[ ] Interval data

[ ] Continuous data

83) Which measure of dispersion is most sensitive to outliers?


Answer by ticking the correct answers

[ ] Range

[ ] Interquartile range

[ ] Standard deviation

[ ] Median
84) If the mean of 10 numbers is 15, the sum of the numbers is:
Answer by ticking the correct answers

[ ] 150

[ ] 15

[ ] 10

[ ] 100
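
The relationship behind this question can be written as a short derivation. The symbols $n$, $\bar{x}$, and $x_i$ are notation introduced here for illustration: $n$ is the number of values, $\bar{x}$ their mean, and $x_i$ the individual values.

```latex
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i
\quad\Longrightarrow\quad
\sum_{i=1}^{n} x_i = n\,\bar{x} = 10 \times 15 = 150
```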

85) What is IBM Watson Studio primarily used for?


Answer by ticking the correct answers

[ ] Cloud storage

[ ] Data science and AI model development

[ ] Web hosting

[ ] File management

86) Which programming languages are supported in IBM Watson Studio for data visualization?
Answer by ticking the correct answers

[ ] Python and R

[ ] Java and C++

[ ] HTML and CSS

[ ] Swift and Kotlin


87) Which tool in IBM Watson Studio is specifically used for interactive data visualization?
Answer by ticking the correct answers

[ ] Watson Assistant

[ ] Data Refinery

[ ] Watson Discovery

[ ] AutoAI

88) IBM Watson Studio uses which popular Python libraries for visualization?
Answer by ticking the correct answers

[ ] Matplotlib and Seaborn

[ ] NumPy and Pandas

[ ] TensorFlow and PyTorch

[ ] SQL and MongoDB

89) What is the primary advantage of using IBM Watson for data visualization?
Answer by ticking the correct answers

[ ] Only experts can use it

[ ] No programming is required for basic visualizations

[ ] It only works with structured data

[ ] It does not support interactive charts


90) Which type of chart is best for showing trends over time in Watson Studio?
Answer by ticking the correct answers

[ ] Bar Chart

[ ] Line Chart

[ ] Pie Chart

[ ] Scatter Plot

91) When analyzing the distribution of a single numeric variable, which visualization should you use?
Answer by ticking the correct answers

[ ] Histogram

[ ] Pie Chart

[ ] Box Plot

[ ] Both Histogram and Box Plot

92) What type of visualization is most effective for comparing multiple categories in IBM Watson?
Answer by ticking the correct answers

[ ] Bar Chart

[ ] Scatter Plot

[ ] Heatmap

[ ] Violin Plot
93) Which type of visualization is best for showing relationships between two continuous variables?
Answer by ticking the correct answers

[ ] Scatter Plot

[ ] Pie Chart

[ ] Bar Graph

[ ] Treemap

94) Heatmaps in IBM Watson Studio are commonly used for:


Answer by ticking the correct answers

[ ] Showing relationships between categorical variables

[ ] Visualizing correlation between numerical variables

[ ] Creating pie charts

[ ] Displaying time-series data

95) IBM Watson Studio can integrate data from which sources?
Answer by ticking the correct answers

[ ] Cloud databases

[ ] CSV and Excel files

[ ] APIs and IoT devices

[ ] All of the above


96) What is an advantage of using Watson's AI-powered visualizations?
Answer by ticking the correct answers

[ ] It predicts patterns in the data

[ ] It replaces human analysts completely

[ ] It does not require any data preparation

[ ] It only supports pre-defined charts

97) IBM Watson can suggest the best visualization type based on:
Answer by ticking the correct answers

[ ] Data structure and relationships

[ ] Random selection

[ ] User preferences only

[ ] Pre-defined templates

98) Can users customize visualizations in Watson Studio?


Answer by ticking the correct answers

[ ] Yes, users can modify colors, labels, and axes

[ ] No, visualizations are auto-generated

[ ] Only developers can modify them

[ ] It depends on the Watson plan


99) What type of visualization is recommended for detecting outliers?
Answer by ticking the correct answers

[ ] Box Plot

[ ] Pie Chart

[ ] Line Chart

[ ] Area Chart

100) In Watson Studio, what feature allows users to create dashboards with multiple charts?
Answer by ticking the correct answers

[ ] Watson Assistant

[ ] IBM Cognos Analytics

[ ] Data Refinery

[ ] AutoML
