0% found this document useful (0 votes)
42 views48 pages

Aiml MCQS

Uploaded by

SPARSH MITTAL
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views48 pages

Aiml MCQS

Uploaded by

SPARSH MITTAL
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 48

Question Description Answer Choice 1 Answer Choice 2 Answer Choice 3 Answer Choice 4 Correct Answer Choice Tags

What is the primary purpose of


Exploratory Data Analysis (EDA) To make data more To discover patterns and To discover patterns
in Data Science? complicated To simplify complex data insights in data To visualize data and insights in data EDA
What does the term "outlier" A point that falls within A data point that is part of A data point that falls far A data point that is A data point that falls
refer to in the context of EDA? the data range the main cluster from the rest missing far from the rest EDA
What does the term "mode"
refer to in the context of data The most frequently The most frequently
distribution analysis in EDA? occurring value The average value The spread of data The presence of outliers occurring value EDA
What is the main objective of
conducting Exploratory Data To make data more To discover patterns and To discover patterns
Analysis (EDA) in Data Science? complicated To simplify complex data insights in data To visualize data and insights in data EDA
In the context of EDA, what A point that falls within A data point that is part of A data point that falls far A data point that is A data point that falls
does the term "outlier" signify? the data range the main cluster from the rest missing far from the rest EDA
What is the meaning of the
term "mode" in the context of
analyzing data distributions The most frequently The most frequently
during EDA? occurring value The average value The spread of data The presence of outliers occurring value EDA
To visualize
What is the primary purpose of To show the distribution To visualize relationships To display categorical To represent time series relationships between
a scatter plot in EDA? of a single variable between variable data data variable EDA
Which of the following
information is not given by five-
number summary? Mean Median Mode All of the mentioned Mode EDA
What is the primary function of To visualize
a scatter plot in Exploratory To show the distribution To visualize relationships To display categorical To represent time series relationships between
Data Analysis (EDA)? of a single variable between variable data data variable EDA
The command to install Matp
Matplotlib library in python is Install pip matplotlib install matplotlib Pip matplotlib Pip install matplolib Pip install matplolib lotlib
A database management A data visualization Matp
What is Matplotlib? A programming language A data visualization library system An operating system library lotlib
How can you add a title to a By using the title() By using the label() By using the text() By using the legend() By using the title() Matp
Matplotlib plot? function function function function function lotlib
What is the purpose of the
xlabel() and ylabel() functions in To label the x and y axes To change the color of a To label the x and y Matp
Matplotlib? To add a legend to a plot To add a title to a plot of a plot plot axes of a plot lotlib
What is the purpose of the To create multiple
subplot() function in To create multiple plots To change the color of a To add annotations to a plots in the same Matp
Matplotlib? in the same figure To adjust the size of a plot plot plot figure lotlib
What is the purpose of the Matp
scatter() function in Matplotlib? To create line plots To create bar plots To create scatter plots To create a pie charts To create scatter plots lotlib
What is the purpose of the To change the x and y To change the color of a To label different lines or To change the x and y Matp
axis() function in Matplotlib? To add grid lines to a plot limits of a plot plot markers on a plot limits of a plot lotlib
Which module is used for data Matp
visualization in Python? NumPy Pandas Matplotlib Scikit-learn Matplotlib lotlib
Matplotlib is a ____ library for
the Python programming Matp
language. data science mathematics numpy plotting plotting lotlib
Which function is used to set a Matp
label for the x-axis in pyplot? xlabel() set_x() x-axis() xaxi() xlabel() lotlib
What command is used to
install the Matplotlib library in Matp
Python? Install pip matplotlib install matplotlib Pip matplotlib Pip install matplolib Pip install matplolib lotlib
A database management A data visualization Matp
Define Matplotlib. A programming language A data visualization library system An operating system library lotlib
How do you incorporate a title By using the title() By using the label() By using the text() By using the legend() By using the title() Matp
into a Matplotlib plot? function function function function function lotlib
What role do the xlabel() and
ylabel() functions serve in To label the x and y axes To change the color of a To label the x and y Matp
Matplotlib? To add a legend to a plot To add a title to a plot of a plot plot axes of a plot lotlib
What is the function of the To create multiple
subplot() function in To create multiple plots To change the color of a To add annotations to a plots in the same Matp
Matplotlib? in the same figure To adjust the size of a plot plot plot figure lotlib
Describe the purpose of the Matp
scatter() function in Matplotlib. To create line plots To create bar plots To create scatter plots To create a pie charts To create scatter plots lotlib
What does the axis() function To change the x and y To change the color of a To label different lines or To change the x and y Matp
achieve in Matplotlib? To add grid lines to a plot limits of a plot plot markers on a plot limits of a plot lotlib
What is the primary module
utilized for data visualization in Matp
Python? NumPy Pandas Matplotlib Scikit-learn Matplotlib lotlib
Matplotlib serves as a(n) ____
library for the Python Matp
programming language. data science mathematics numpy plotting plotting lotlib
Which function in pyplot is
employed to assign a label to Matp
the x-axis? xlabel() set_x() x-axis() xaxi() xlabel() lotlib
To add grid lines to a graph in
Matplotlib, which function is Matp
used? grid() gridlines() grid_lines() grids() grid() lotlib
Which function in Matplotlib is
employed to introduce grid Matp
lines to a graph? grid() gridlines() grid_lines() grids() grid() lotlib
The main focus of ML is
ML is a type of artificial to allow computer
intelligence that extract systems learn from
Machine Learning (ML) is patterns out of raw data by experience without being
What is true about Machine that field of computer using an algorithm or explicitly programmed or
Learning? science method. human intervention. All of the above All of the above ML
Among the following option
identify the one which is not a Semi unsupervised Semi unsupervised
type of learning learning Supervised learning Reinforcement Learning Unsupervised learning learning ML
Machine Learning is a subset of
which of the following. Artificial Intelligence Deep Learning Data Learning None of the above Artificial Intelligence ML
Which of the following are
common classes of problems in
machine learning? Regression Classification Clustering All of the above All of the above ML
Machine learning is
the autonomous
The autonomous The autonomous The selective acquisition acquisition of
acquisition of knowledge acquisition of knowledge of knowledge through the The selective acquisition knowledge through
through the use of through the use of manual use of computer of knowledge through the the use of computer
What is Machine learning? computer programs programs programs use of manual programs programs. ML
Out of the following options,
which one is not categorized as Semi unsupervised Semi unsupervised
a type of learning? learning Supervised learning Reinforcement Learning Unsupervised learning learning ML
In which broader domain does
Machine Learning operate as a
subset? Artificial Intelligence Deep Learning Data Learning None of the above Artificial Intelligence ML
Which problem classes are
commonly encountered in
Machine Learning? Regression Classification Clustering All of the above All of the above ML
Which of the following
functions is typically used to
add elements to a plot in the
base graphics system? lines() hist() plot() boxplot() boxplot() ML
A Python package which stands
for 'Numerical Python' is named
and used as _____ in Python Num
programs. Scipy Pandas NumPy Pyplot NumPy py
Which of the following method
creates a new array object that Num
looksat the same data? view copy paste all of the mentioned view py
_____ is the another name of Num
NumPy arrays. Python arrays Parrays NumPy Data Arrays Ndarray Ndarray py
What is the purpose of NumPy To do numerical None of the mentioned Num
in Python? calculations To do scientific computing Both A and B above Both A and B py
Which of the following is not a
legal way to create NumPy Num
arrays? numpy.array() numpy.linspace() numpy.arange() numpy.newarray() numpy.newarray() py
To horizontally combine two
ndarrays,which function may be Num
used? hstack() vstack() hsplit() vsplit() hstack() py
Tovertically combine two
ndarrays,which function may be Num
used? hstack() vstack() hsplit() vsplit() vsplit() py
Which method among the
following creates a new array
object that references the same Num
data? view copy paste all of the mentioned view py
Another term for NumPy arrays Num
is _____. Python arrays Parrays NumPy Data Arrays Ndarray Ndarray py
What is the role of NumPy in To do numerical None of the mentioned Num
Python? calculations To do scientific computing Both A and B above Both A and B py
To creates a sequence of
numbers,Numpy provides a
funtion _____ analogous to
range that returns arrays Num
instead of lists. arange aspace aline all of the mentioned arange py
Consider the statement AR+2.
What should be the type of AR
so that above statement
produces no error and gets Num
executed successfully? List Numpyarray Ndarray Both b and c Both b and c py
For generating a sequence of
numbers, NumPy offers a
function _____, which is similar
to range but produces arrays Num
instead of lists. arange aspace aline all of the mentioned arange py
Returns the number of Returns the number of
Returns the number of dimensions of the dimensions of the
elements in the Returns the Series as underlying data, by Returns a list of the axis underlying data, by Num
Why ndim is used? underlying data. ndarray. definition 1. labels definition 1. py
What will be the minimum
number of arguments require to None of the above None of the above Pand
pass in pandas series? 2 3 4 mentioned mentioned as
Amongst which of the following
is / are used to analyze the data None of the above Pand
in pandas. DataFrame Series Both A and B mentioned Both A and B as
Amongst which of the following
can be used to create various Numpy ndarrays and All of the above All of the above Pand
inputs using pandas DataFrame. Lists,dict Series Another DataFrame mentioned mentioned as
Indexing in Series is similar to Pand
that for NumPy arrays. TRUE FALSE TRUE as
DataFrame accepts many Pand
different kinds of input. TRUE FALSE TRUE as
Which of the following thing Pand
can be data in Pandas? a python dict an ndarray a scalar value all of the mentioned all of the mentioned as
Pand
PANDAS stands for ____ Panel Data Analysis Panel Data Analyst Panel Data Panel Dashboard Panel Data as
Which of the following
command is used to install Pand
pandas? pip install pandas install pandas pip pandas None of the above pip install pandas as
head( ) function return
__________ n rows and tail
function return _____________ Pand
n rows from a pandas object. last, first first, second last, seven first, last first, last as
We can have duplicate indexes Pand
in Series?(T/F) TRUE FALSE TRUE as
Which function is used to read a
CSV file into a Pandas Pand
DataFrame? read_csv() load_csv() open_csv() import_csv() read_csv() as
What is the correct way to
access the first five rows of a Pand
DataFrame called df in Pandas? df.head(5) df.first(5) df[:5] df.top(5) df.head(5) as
It creates subgroups It sorts the DataFrame It merges two It reshapes the It creates subgroups
What does the groupby() based on the unique based on a specified DataFrames based on a DataFrame into a based on the unique Pand
function in Pandas do? values in a column column common column specified format values in a column as
Which function is used to create Pand
a histogram in Seaborn? sns.lineplot() sns.scatterplot() sns.histplot() sns.barplot() sns.histplot() as
How do you sort values in a
DataFrame based on a specific df.sort_values(by='column df.order_by('column_na df.sort_values(by='col Pand
column? df.sort('column_name') _name') sort_df('column_name') me') umn_name') as
Which method is used to fill
missing values in a DataFrame Pand
in Pandas? fill_nulls() replace_nulls() fillna() complete_nulls() fillna() as
How can you arrange the values
in a DataFrame based on a df.sort_values(by='column df.order_by('column_na df.sort_values(by='col Pand
specific column? df.sort('column_name') _name') sort_df('column_name') me') umn_name') as
What method is employed to
populate missing values in a Pand
DataFrame in Pandas? fill_nulls() replace_nulls() fillna() complete_nulls() fillna() as
Is indexing in Series similar to Pand
that in NumPy arrays? TRUE FALSE TRUE as
Does DataFrame accommodate Pand
various types of input? TRUE FALSE TRUE as
What types of objects can serve Pand
as data in Pandas? a python dict an ndarray a scalar value all of the mentioned all of the mentioned as
What is the full form of the Pand
acronym PANDAS? Panel Data Analysis Panel Data Analyst Panel Data Panel Dashboard Panel Data as
Which command installs Pand
pandas? pip install pandas install pandas pip pandas None of the above pip install pandas as
Pandas key data structure is Pand
called? Keyframe DataFrame Statistics Econometrics DataFrame as
Amongst which of the following pandas.DataFrame( data,
is a correct syntax for panda's Pandas.DataFrame(data, index, columns, dtype, pandas.DataFrame(data, pandas.DataFrame( data, Pandas.DataFrame(dat Pand
dataframe? index, dtype, copy) copy) index, dtype, copy) index, rows, dtype, copy) a, index, dtype, copy) as
Amongst which of the following
is / are not an iterative function All of the above Pand
for dataframe? iterrows() itercolumns() iteritems() mentioned itercolumns() as
When we create a series from
dictionary then the keys of Pand
dictionary become _______ Index of the series Value of the series Caption of the series None of the series Index of the series as
A dataframe object can A column of dataframe
Which of the folowing is not be created by passing A dataframe is size A dataframe index can be can have data of different A dataframe is size Pand
true about dataframe? dictionaries. immutable string types immutable as
Mr. Kumar is working in an IT
company. He stored the salaries
of all the employees of January
month in Series ‘Jan_Sal’ and
salaries of February month in
Series ‘Feb_Sal’. Now he wants
to add the salaries of both
months. He has written the
following statement. Identify print(Feb_Sal plus print(Feb_Sal + Pand
the correct one. print(Feb_Sal + Jan_Sal) print(Feb_Sal_add_Jan_Sal) Jan_Sal) None of the above Jan_Sal) as
print(S1[-1]) will return
___________ Pand
#’S1′ is a series last element of series ‘S1’ first element of series ‘S1’ Key Error all elements of series ‘S1’ Key Error as
__________ function is used to
sort a Series object on the basis Pand
of values. sort.values( ) sort_values( ) sort_value( ) sort_Values sort_values( ) as
Which of the following attribute Pand
of Series returns the tuple? size shape values index shape as
What is the data type of series
‘S1’ given below ? Pand
S1=pd.Series([11, 12.5, “ok”]) int64 float64 object object64 object as
Which of the following
statement is correct to add NaN S1=pd.Series([10, S1=pd.Series([10, None, Pand
value in series? np.NaN,11]) 11]) Both of the above None of the above Both of the above as
Concatenates Concatenates
What does the pd.concat() Joins DataFrames Joins DataFrames DataFrames along a Combines DataFrames DataFrames along a Pand
function do in Pandas? vertically horizontally specified axis using SQL-style joins specified axis as
How do you drop a column df.remove_column(column df.drop(columns=[column delete_column(column_n df.drop(columns=[colu Pand
from a DataFrame in Pandas? df.drop(column_name) _name) _name]) ame) mn_name]) as
How do you eliminate a column df.remove_column(column df.drop(columns=[column delete_column(column_n df.drop(columns=[colu Pand
from a DataFrame in Pandas? df.drop(column_name) _name) _name]) ame) mn_name]) as
What do the keys of a
dictionary become when Pand
creating a series from it? Index of the series Value of the series Caption of the series None of the series Index of the series as
A natural language
A machine learning A data visualization library processing library in A web development A machine learning Sckit
What is Scikit-learn? library in Python in Python Python framework in Python library in Python learn
Which of the following is an
example of a supervised Principal component Sckit
learning algorithm? K-means clustering Decision tree analysis (PCA) Apriori algorithm Decision tree learn
Which of the following is an
example of a unsupervised
learning algorithm in Scikit- Support vector machines Sckit
learn? Linear Regression K-means clustering (SVM) Random Forest K-means clustering learn
A natural language
A machine learning A data visualization library processing library in A web development A machine learning Sckit
Define Scikit-learn. library in Python in Python Python framework in Python library in Python learn
Can you provide an example of
a supervised learning Principal component Sckit
algorithm? K-means clustering Decision tree analysis (PCA) Apriori algorithm Decision tree learn
Could you provide an example Support vector machines Sckit
of an unsupervised learning Linear Regression K-means clustering (SVM) Random Forest K-means clustering learn
algorithm available in Scikit-
learn?
Which measure of central
tendency is best used for
categorical data? Mode Median Mean Standard Deviation Mode Stats
What is the median of the data
set: 12, 15, 17, 20, 25? 15 17 20 22 15 Stats
What does the mode of a The most frequently The most frequently
dataset represent? occurring value The middle value The arithmetic average The spread of data points occurring value Stats
Which measure of central
tendency should be used for a
dataset with outliers to get a
representative value? Median Mode Mean Variance Median Stats
It is the measure of variability,
based on dividing a data set into
quartiles. Deviation Standard Deviation Range Inter Quartile Range Inter Quartile Range Stats
The value most recurrent in the
sample set is known as
________. Mean Median Mode Standard Deviation Mode stats
__________Statistics uses the
data to provide descriptions of
the population, either through
numerical calculations or graphs
or tables. Descriptive Quantitative Inferential Qualitative Descriptive stats
What is the median of the
following set of scores?
18, 6, 12, 10, 14 ? 10 14 18 12 12 stats
An approximate indicator
The square root of the of how numbers vary
The standard deviation is: variance A measure of variability from the mean All of the above All of the above stats
The _______ is often the
preferred measure of central
tendency if the data are
severely skewed. Mean Median Mode Range Median stats
Which measure of central
tendency is most suitable for
analyzing categorical data? Mode Median Mean Standard Deviation Mode Stats
Calculate the median for the
dataset: 12, 15, 17, 20, 25. 15 17 20 22 15 Stats
What information does the The most frequently The most frequently
mode provide about a dataset? occurring value The middle value The arithmetic average The spread of data points occurring value Stats
Which measure of central
tendency is recommended for
datasets with outliers to ensure
a representative value? Median Mode Mean Variance Median Stats
Describe the concept related to
dividing a dataset into quartiles
to gauge variability. Deviation Standard Deviation Range Inter Quartile Range Inter Quartile Range Stats
What term describes the most
frequently occurring value in a
sample set? Mean Median Mode Standard Deviation Mode stats
How does Descriptive Statistics
contribute to understanding a
population's characteristics? Descriptive Quantitative Inferential Qualitative Descriptive stats
Calculate the median for the
given set of scores: 18, 6, 12,
10, 14. 10 14 18 12 12 stats
An approximate indicator
Define the concept of standard The square root of the of how numbers vary
deviation. variance A measure of variability from the mean All of the above All of the above stats
In the presence of severe
skewness in data, which
measure of central tendency is
commonly preferred? Mean Median Mode Range Median stats
To make predictions To summarize and describe To infer conclusions To test hypotheses and To summarize and
What is the primary goal of about the future based the main features of a about a population based relationships between describe the main
descriptive statistics? on past data dataset on a sample variables features of a dataset Stats
Which measure of central
tendency is most affected by
extreme outliers? Mean Median Mode Range Mean Stats
Which of the following is a
measure of variability in a
dataset? Mean Mode Range Mode Range Stats
Which of the following
measures is the most resistant
to outliers? Median Mean Mode Range Median Stats
If a dataset has a positively
skewed distribution, which
measure of central tendency is
likely to be the largest? Mean Median Mode Range Mean Stats
How would you define the To make predictions To summarize and describe To infer conclusions To test hypotheses and To summarize and
primary objective of descriptive about the future based the main features of a about a population based relationships between describe the main
statistics? on past data dataset on a sample variables features of a dataset Stats
Which measure of central
tendency is particularly
influenced by extreme outliers? Mean Median Mode Range Mean Stats
Which metric assesses the
variability within a dataset? Mean Mode Range Mode Range Stats
Which measure of central
tendency is the most robust
against outliers? Median Mean Mode Range Median Stats
In a positively skewed
distribution, which measure of
central tendency tends to be
the largest? Mean Median Mode Range Mean Stats
Which EDA technique is used to
identify and visualize outliers in
a Dataset? Bar Chart Histogram Box Plot Scatter Plot Box Plot EDA
In EDA, what does the term The symmetry of a data The shape of a data The shape of a data
"skewness" refer to? distribution The spread of data distribution The presence of outliers distribution EDA
Which technique in EDA is
utilized to identify and visualize
outliers within a dataset? Bar Chart Histogram Box Plot Scatter Plot Box Plot EDA
Within EDA, how is "skewness" The symmetry of a data The shape of a data The shape of a data
defined? distribution The spread of data distribution The presence of outliers distribution EDA
To display
What is the purpose of a To visualize the To display relationships To calculate summary relationships between
correlation matrix in EDA? distribution of data between variables To identify missing values statistics variables EDA
What is the purpose of a
correlation matrix in To display
Exploratory Data Analysis To visualize the To display relationships To calculate summary relationships between
(EDA)? distribution of data between variables To identify missing values statistics variables EDA
What is the purpose of the
xlim() and ylim() functions in To set the background To set the x and y limits To add annotations to a To set the x and y Matp
Matplotlib? To set the size of a plot color of a plot of a plot plot limits of a plot lotlib
Set the background color of the
legend using the ______ Matp
function in Matplotlib. bgcolor() set_bgcolor() facecolor() set_facecolor() set_facecolor() lotlib
Matplotlib is designed to be as Matp
usable as ___ SciPy MATLAB AI All of the above MATLAB lotlib
Which method/attribute is used
to check the installed version of Matp
matplotlib? __version__ version() Both A and B None of the above __version__ lotlib
Which argument keyword can
be used to change the style of Matp
the plotted line? line lines linestyles linestyle linestyle lotlib
Which plot is also known as the Matp
'Whisker plot' in Matplotlib? Bar Pie Histogram Box plot Box plot lotlib
Which is used for plotting a Matp
horizontal line? hline() ahline() xhline() axhline() axhline() lotlib
What is the function of the
xlim() and ylim() functions in To set the background To set the x and y limits To add annotations to a To set the x and y Matp
Matplotlib? To set the size of a plot color of a plot of a plot plot limits of a plot lotlib
Specify the function in
Matplotlib used to adjust the Matp
background color of the legend. bgcolor() set_bgcolor() facecolor() set_facecolor() set_facecolor() lotlib
How is Matplotlib designed to Matp
function in terms of usability? SciPy MATLAB AI All of the above MATLAB lotlib
What method or attribute can
be employed to verify the Matp
installed version of Matplotlib? __version__ version() Both A and B None of the above __version__ lotlib
Which keyword argument is
employed to alter the style of Matp
the plotted line? line lines linestyles linestyle linestyle lotlib
Identify the plot also referred to
as the 'Whisker plot' in Matp
Matplotlib. Bar Pie Histogram Box plot Box plot lotlib
What method is used to plot a Matp
horizontal line? hline() ahline() xhline() axhline() axhline() lotlib
To display histogram with well df.plot(type = 'hist' , edge df.plot(type = 'hist' , df.plot(type = 'hist' , line df.plot(type = 'hist' , df.plot(type = 'hist' , Matp
defined edge we can write = 'red') edgecolor = 'red') = 'red') linecolor = 'red') edgecolor = 'red') lotlib
Select the correct statement to plt.boxplot(data , vert = plt.boxplot(data , horiz = plt.boxplot(data , vert Matp
display horizontal box plot plt.box(data , vert = False) plt.box(data , horiz = True) False) True) = False) lotlib
What is the default color for Matp
Matplotlib plots? Red Blue Green Black Blue lotlib
Which is the correct code
statement to draw a plot plt.plot(xpoints, ypoints, plt.plot(xpoints, ypoints, plt.plot(xpoints, Matp
without a line? plt.plot(xpoints, ypoints) plt.plot(xpoints, ypoints, 0) False) 'o') ypoints, 'o') lotlib
How can we generate a
histogram with clearly defined df.plot(type = 'hist' , edge df.plot(type = 'hist' , df.plot(type = 'hist' , line df.plot(type = 'hist' , df.plot(type = 'hist' , Matp
edges? = 'red') edgecolor = 'red') = 'red') linecolor = 'red') edgecolor = 'red') lotlib
Choose the correct statement
for displaying a horizontal box plt.boxplot(data , vert = plt.boxplot(data , horiz = plt.boxplot(data , vert Matp
plot. plt.box(data , vert = False) plt.box(data , horiz = True) False) True) = False) lotlib
What is the default color used Matp
for Matplotlib plots? Red Blue Green Black Blue lotlib
What is the correct code syntax
for generating a plot without a plt.plot(xpoints, ypoints, plt.plot(xpoints, ypoints, plt.plot(xpoints, Matp
line? plt.plot(xpoints, ypoints) plt.plot(xpoints, ypoints, 0) False) 'o') ypoints, 'o') lotlib
What is the term known as on
which the machine learning
algorithms build a model based
on sample data? Data Training Training Data Transfer Data None of the above Training Data ML
Among the following options
identify the one which is false It is used for the It is used for the It discovers casual It discovers casual
regarding regression. prediction interpretation It related input to outputs relationships relationships ML
Test set is used to test Test set is used to test
the accuracy of the It is a set of data is used to the accuracy of the
hypotheses generated by discover the potentially hypotheses generated
What is ‘Test set’? the learner. predictive relationship. Both A & B None of above by the learner. ML
Spam detection,
Spam detection, Pattern Pattern detection,
Which of the following are detection, Natural Image classification, Real- Autonomous car driving, Bioinformatics, Speech Natural Language
supervised learning applications Language Processing time visual tracking Logistic optimization recognition Processing ML
Axes, legends, and other
What is the role of exploratory They are made for formal They are typically made details are clean and They are used in place of They are typically
graphs in data analysis? presentations very quickly exactly detailed formal modeling made very quickly ML
Spam detection,
Which of the following Spam detection, Pattern Pattern detection,
represent applications of detection, Natural Image classification, Real- Autonomous car driving, Bioinformatics, Speech Natural Language
supervised learning? Language Processing time visual tracking Logistic optimization recognition Processing ML
What is the term used to
describe the process by which
machine learning algorithms
construct a model based on
sample data? Data Training Training Data Transfer Data None of the above Training Data ML
Which statement among the
following is incorrect when It is used for the It is used for the It discovers casual It discovers casual
considering regression? prediction interpretation It related input to outputs relationships relationships ML
Test set is used to test Test set is used to test
the accuracy of the It is a set of data is used to the accuracy of the
Can you define the concept of a hypotheses generated by discover the potentially hypotheses generated
'Test set'? the learner. predictive relationship. Both A & B None of above by the learner. ML
In pandas, Index values must Pand
be? unique hashable Both A and B None of the above Both A and B as
import pandas as pd
data =
[['Anuj',21],['Rama',25],['Kapil',2
2]]
df = Name Age Name Age Name Age Name Age Name Age
pd.DataFrame(data,columns=[' 0 Anuj 21 0 Anuj 21 0 Kapil 22 0 Rama 25 0 Anuj 21
Name','Age']) 1 Rama 25 1 Kapil 22 1 Rama 25 1 Anuj 21 1 Rama 25 Pand
print (df) 2 Kapil 22 2 Rama 25 2 Anuj 21 2 Kapil 22 2 Kapil 22 as
Pandas Series is size
__________ and value Pand
___________ Mutable, Mutable Immutable, Immutable Immutable, Mutable Mutable, Immutable Immutable, Mutable as
Anshuman wants to create a
series named ‘S1’. He has
written the following codes. His
friend Shubham checked the
code and said that one of the
code given below is not
working. As a friend of S1=pd.Series(data=[11, S1=pd.Series(data=[11
Anshuman, help him to find the 12, 5, S1=pd.Series([11, 12, 5, S1=pd.Series([11, 12, 5, S1=pd.Series(data=[11, , 12, 5, 6,9], Pand
incorrect code. 6,9],index=[1,2,3,4,5]) 6,9],index=[1,2,3,4,5]) 6,9],[1,2,3,4,5]) 12, 5, 6,9], [1,2,3,4,5]) [1,2,3,4,5]) as
Which of the following is Pand
parameter of Series( ) function? data index dtype All of the above All of the above as
By default Series have _______ Pand
data labels starting from ___. character, ‘a’ numeric, one numeric, zero character, zero numeric, zero as
Choose the correct statement :
Statement1 : A Numpy array
requires homogeneous data.
Statement2 : Pandas DataFrame Both the statements are Both the statements are Both the statements Pand
can have heterogeneous data. Statement1 is correct Statement2 is correct correct wrong are correct as
Pandas DataFrame is size Pand
________ and value ________ Mutable, Mutable Immutable, Immutable Immutable, Mutable Mutable, Immutable Mutable, Mutable as
A one-dimensional
A one-dimensional A two-dimensional labeled A three-dimensional labeled array capable
labeled array capable of array capable of holding labeled array capable of A one-dimensional array of holding data of any Pand
What is a Series in Pandas? holding data of any type data of any type holding data of any type without labels type as
df.rename(columns={'
How do you rename a column df.rename(columns={'old df.column_rename('old_n old_name': Pand
in a DataFrame? rename_column() change_column_name() _name': 'new_name'}) ame', 'new_name') 'new_name'}) as
Which of the following is NOT a
type of plot available in Pand
Seaborn? Line plot Scatter plot Bar plot Tree plot Tree plot as
How do you apply a function
element-wise to a DataFrame in Using the apply() Pand
Pandas? Using a loop Using the apply() function Using list comprehension Using the map() function function as
Reshapes a DataFrame by Changes the
What does the resample() Reshapes a DataFrame by sampling columns Changes the frequency of Reverses the order of frequency of time Pand
function do in Pandas? sampling rows randomly randomly time series data rows in a DataFrame series data as
How can you apply a function to
each element in a DataFrame in Using the apply() Pand
Pandas? Using a loop Using the apply() function Using list comprehension Using the map() function function as
What functionality does the Reshapes a DataFrame by Changes the
resample() function provide in Reshapes a DataFrame by sampling columns Changes the frequency of Reverses the order of frequency of time Pand
Pandas? sampling rows randomly randomly time series data rows in a DataFrame series data as
import pandas as pd
s=pd.Series([1,2,3,4,5
],index=
['a','b','c','d','e']) None of the above Pand
print(s['f']) KeyError IndexError Value Error mentioned KeyError as
Import pandas as pd
Import numpy as np
df =
pd.DataFrame(np.array([[4,6,9],
[5,1,3]])) SyntaxError : invalid None of the above SyntaxError : invalid Pand
print(df.shape) syntax KeyError IndexError mentioned syntax as
What are the different types of Pand
attributes? Nominal Ordinal Spacial All of the above All of the above as
Write the output of the
following:
import pandas as pd
S1=pd.Series(data=[11, 12,
None,
6,9,7],index=[1,12,3,4,2,4]) Pand
print(S1.count()) 4 6 5 Error 5 as
Which of the following
statement return Filtered
result?import pandas as pd
S1=pd.Series([11, 12, 5, 6,9])
print(S1) #Statement 1
print(S1>7) #Statement 2 Pand
print(S1[S1>7]) #Statement 3 Statement 1 Statement 2 Statement 3 None of the above Statement 3 as
What type of error is returned,
when the length of index and
the length of data in Series() Pand
function is not same? Key Error Value Error Syntax Error Name Error Value Error as
Two common ways for
accessing the elements of a
series are _________ and Labelled Indexing, Pand
_______ Indexing, Concatenation Positional Indexing Indexing, Slicing Slicing, Cutting Indexing, Slicing as
‘data’ in the following code
could be _____________ Pand
S1 = pd.Series(data) Python sequence Scalar value Python dictionary All of the above All of the above as
Reshapes a DataFrame Combines multiple
What does the pivot_table() Creates a pivot chart Creates a pivot table from by pivoting the rows and DataFrames into a single Creates a pivot table Pand
function do in Pandas? based on DataFrame data a DataFrame columns pivot table from a DataFrame as
Converts a DataFrame to Converts a DataFrame
What is the purpose of the a comma-separated Converts a CSV file to a Converts a DataFrame to Converts a DataFrame to to a comma-separated Pand
to_csv() function in Pandas? values (CSV) file DataFrame an Excel file a JSON file values (CSV) file as
What is the purpose of the Reshapes a DataFrame Combines multiple
pivot_table() function in Creates a pivot chart Creates a pivot table from by pivoting the rows and DataFrames into a single Creates a pivot table Pand
Pandas? based on DataFrame data a DataFrame columns pivot table from a DataFrame as
Converts a DataFrame to Converts a DataFrame
What task does the to_csv() a comma-separated Converts a CSV file to a Converts a DataFrame to Converts a DataFrame to to a comma-separated Pand
function serve in Pandas? values (CSV) file DataFrame an Excel file a JSON file values (CSV) file as
What is the purpose of the
transform() method in Scikit- To train a model using a To make predictions using To evaluate the To preprocess the data To preprocess the Sckit
learn? given dataset a trained model performance of a model for modeling data for modeling learn

The parameter______ allows


specifying the percentage of
elements to put into the Sckit
test/training set test_size training_size All above None of these All above learn
Describe the function of the
transform() method in Scikit- To train a model using a To make predictions using To evaluate the To preprocess the data To preprocess the Sckit
learn. given dataset a trained model performance of a model for modeling data for modeling learn
How does the parameter ____
facilitate the specification of the
proportion of elements
assigned to the test/training Sckit
set? test_size training_size All above None of these All above learn
The spread or
The spread or dispersion dispersion of data
What does the standard of data points around the The central value of the The total range of the The frequency of each points around the
deviation measure in a dataset? mean dataset dataset data point in the dataset mean Stats
Which descriptive statistic gives
the highest weight to outliers? Mean Median Mode Variance Mean Stats
In a dataset, if the mean is
greater than the median, what The distribution cannot
can you infer about the data The distribution is The distribution is The distribution is be determined from The distribution is
distribution? positively skewed negatively skewed symmetric these statistics positively skewed Stats
Which of the following
represents the fiftieth
percentile, or the middle point
in a set of numbers arranged in
order of magnitude? Mode Median Mean Variance Median stats
The spread or
What aspect of a dataset does The spread or dispersion dispersion of data
the standard deviation of data points around the The central value of the The total range of the The frequency of each points around the
quantify? mean dataset dataset data point in the dataset mean Stats
Among descriptive statistics,
which one assigns the greatest
importance to outliers? Mean Median Mode Variance Mean Stats
If the mean exceeds the median The distribution cannot
in a dataset, what inference can The distribution is The distribution is The distribution is be determined from The distribution is
be made about its distribution? positively skewed negatively skewed symmetric these statistics positively skewed Stats
Which percentile represents the
fiftieth percentile or the median
in a dataset ordered by
magnitude? Mode Median Mean Variance Median stats
In a dataset, if the mean is 30,
the median is 25, and the mode The distribution cannot
is 30, what can you infer about The distribution is The distribution is The distribution is be determined from The distribution is
the distribution? positively skewed negatively skewed symmetric these statistics positively skewed Stats
A measure indicating
What does the term A measure indicating the The most frequently the relative standing
"percentile" refer to in relative standing of a data The spread of data points occurring value in a of a data point in a
descriptive statistics? point in a dataset The mean of a dataset in a dataset dataset dataset Stats
___________ are used when
you want to visually examine
the relationship between two
quantitative variables. Bar graphs Pie graphs Line graphs Scatterplots Scatterplots stats
Given a dataset with a mean of
30, a median of 25, and a mode The distribution cannot
of 30, what can be deduced The distribution is The distribution is The distribution is be determined from The distribution is
about the distribution? positively skewed negatively skewed symmetric these statistics positively skewed Stats
A measure indicating
What does the term A measure indicating the The most frequently the relative standing
"percentile" denote in relative standing of a data The spread of data points occurring value in a of a data point in a
descriptive statistics? point in a dataset The mean of a dataset in a dataset dataset dataset Stats
What visual tool is utilized to
explore the relationship
between two quantitative
variables? Bar graphs Pie graphs Line graphs Scatterplots Scatterplots stats
NumPy main object is the
homogeneous In NumPy ,dimensions are NumPy array class is Num
Point out the correct statement. multidimensional array called axes called ndarray all of the mentioned all of the mentioned py
A NumPy array's dimensions Num
arenknown as _____ ndimension dimensions axes nsize axes py
The number of elements along
each axis of an ndarray can be
determined through ndarray's Num
_____ Shape Rank Itemsize Size Shape py
NumPy main object is the
homogeneous In NumPy ,dimensions are NumPy array class is Num
Identify the correct statement. multidimensional array called axes called ndarray all of the mentioned all of the mentioned py
The dimensions of a NumPy Num
array are referred to as _____. ndimension dimensions axes nsize axes py
Ar[2:7:2] will produce the result Num
as _____ array([6,10,14]) array([10,14,16]) array([2,6,10]) array([4,6,10]) array([6,10,14]) py
import pandas as pd
import numpy as np
s=
pd.Series(np.random.randn(4)) Num
print s.ndim 0 1 2 3 1 py
In EDA, what is the term for a
graphical representation that
displays the distribution of a
continuous variable? Pie Chart Histogram Bar Chart Scatter Plot Histogram EDA
Which of the following graph
can be used for simple
summarization of data? Scatterplot Overlaying Barplot All of the mentioned Barplot EDA
Color and shape are used to add
dimensions to graph data. TRUE FALSE TRUE EDA
In EDA, what term is used for a
visual representation that
illustrates the spread of a
continuous variable? Pie Chart Histogram Bar Chart Scatter Plot Histogram EDA
In EDA, which measure of
central tendency is typically
represented by the height of a
box in a box plot? Mean Median Mode Variance Median EDA
Which EDA technique is used to
visualize the relationship
between two continous
variables with a line connecting
data points? Scatter Plot Histogram Bar Chart Box Plot Scatter Plot EDA
Which of the following is NOT a
common EDA technique for
visualizing data distributions? Box Plot Histogarm Scatter Plot Bar Chart Bar Chart EDA
To visualize
What is the primary purpose of To show the distribution To visualize relationships To display categorical To represent time series relationships between
a scatter plot in EDA? of a single variable between variables data data variables EDA
In EDA, what is the purpose of a
Q-Q plot (Quantile-Quantile To visualize the To compare two different To check for normality in To display time series To check for normality
plot)? distribution of data datasets data data in data EDA
What is the primary purpose of To visualize the To display relationships To compare multiple To compare multiple
a bar chart in EDA? distribution of data between variables categories To identify missing values categories EDA
In EDA, which statistical metric
is typically depicted by the
height of a box in a box plot? Mean Median Mode Variance Median EDA
Which EDA technique is
employed to illustrate the
relationship between two
continuous variables with a line
connecting data points? Scatter Plot Histogram Bar Chart Box Plot Scatter Plot EDA
Among the following, which is
NOT a common technique in
EDA for visualizing data
distributions? Box Plot Histogarm Scatter Plot Bar Chart Bar Chart EDA
What is the main objective of
using a scatter plot in To visualize
Exploratory Data Analysis To show the distribution To visualize relationships To display categorical To represent time series relationships between
(EDA)? of a single variable between variables data data variables EDA
What is the purpose of a Q-Q
plot (Quantile-Quantile plot) in
Exploratory Data Analysis To visualize the To compare two different To check for normality in To display time series To check for normality
(EDA)? distribution of data datasets data data in data EDA
What is the primary function of
a bar chart in Exploratory Data To visualize the To display relationships To compare multiple To compare multiple
Analysis (EDA)? distribution of data between variables categories To identify missing values categories EDA
Which of the following does not Matp
visualize data? Charts Maps Shapes Graphs Shapes lotlib
By specifying the color By specifying the color
How can you change the color By using the color() By using the palette() parameter in the plot() parameter in the plot() Matp
of a Matplotlib plot? function By using the hue() function function function function lotlib
How can you change the size of By using the figure() By using the subplot() By using the size() By using the xlim() and By using the figure() Matp
a Matplotlib plot? function function function ylim() functions function lotlib
What is the purpose of the
imshow() function in Matp
Matplotlib? To display images To create line plots To create scatter plots To create bar charts To display images lotlib
Which Matplotlib function is Matp
used to set the title in a graph? graph_title() graph_titles() titles() title() title() lotlib
import
Which is the correct import import matplotlib.pyplot import pyplot from matplotlib.pyplot as Matp
statement to import pyplot? as plt matplotlib Both A and B None of the above plt lotlib
Which function is used to create Matp
histograms in matplotlib? histograms() histogram() histgraph() hist() hist() lotlib
Among the options provided,
which one does not represent a Matp
method of visualizing data? Charts Maps Shapes Graphs Shapes lotlib
By specifying the color By specifying the color
How do you alter the color of a By using the color() By using the palette() parameter in the plot() parameter in the plot() Matp
Matplotlib plot? function By using the hue() function function function function lotlib
How can you adjust the size of a By using the figure() By using the subplot() By using the size() By using the xlim() and By using the figure() Matp
Matplotlib plot? function function function ylim() functions function lotlib
Which Matplotlib function is
responsible for setting the title Matp
in a graph? graph_title() graph_titles() titles() title() title() lotlib
import
What is the appropriate import import matplotlib.pyplot import pyplot from matplotlib.pyplot as Matp
statement for importing pyplot? as plt matplotlib Both A and B None of the above plt lotlib
What function is employed to
construct histograms in Matp
Matplotlib? histograms() histogram() histgraph() hist() hist() lotlib
Which of the following type of Matp
chart is not supported by pyplot Histogram Boxplot Pie All are correct All are correct lotlib
Plot which is used to given Matp
statistical summary is Bar Line Histogram Box Plot Box Plot lotlib
Which of the following is
correct syntax to craete plt.hist(x, bins = plt.hist(x, bins = Matp
histogarm with bins specified? plt.hist(x, bins = 10) [10,11,12,13,14]) range(10,15)) All are correct All are correct lotlib
Which of the following is not a Matp
type of Matplotlib plot? Line Plot Scatter Plot Pie Chart Bar Chart Pie Chart lotlib
What is the purpose of the To label the x and y axes To add annotations to a To label different lines or To label different lines Matp
legend() function in Matplotlib? of a plot To add a title to a plot plot markers on a plot or markers on a plot lotlib
How to add labels to a plot in Matp
Matplotlib? xlabel() and ylabel() labelx() and labely() x_label() and y_label() labels() xlabel() and ylabel() lotlib
Which function is used to draw Matp
points (markers) in a diagram? write() draw() plot() paint() plot() lotlib
Which parameter is used to
define the position/alignment
of the plot title in the title() Matp
function? pos align loc All of the above loc lotlib
Which type of chart is not Matp
available/supported in pyplot? Histogram Boxplot Pie All are correct All are correct lotlib
What type of plot is typically
used to present statistical Matp
summaries? Bar Line Histogram Box Plot Box Plot lotlib
Which syntax is correct for
creating a histogram with plt.hist(x, bins = plt.hist(x, bins = Matp
specified bins? plt.hist(x, bins = 10) [10,11,12,13,14]) range(10,15)) All are correct All are correct lotlib
Among the following, which is
not a category of plot available Matp
in Matplotlib? Line Plot Scatter Plot Pie Chart Bar Chart Pie Chart lotlib
Explain the significance of the To label the x and y axes To add annotations to a To label different lines or To label different lines Matp
legend() function in Matplotlib. of a plot To add a title to a plot plot markers on a plot or markers on a plot lotlib
What method is utilized to
incorporate labels onto a plot in Matp
Matplotlib? xlabel() and ylabel() labelx() and labely() x_label() and y_label() labels() xlabel() and ylabel() lotlib
Which function is utilized to
plot points (markers) on a Matp
diagram? write() draw() plot() paint() plot() lotlib
Which parameter determines
the position or alignment of the Matp
plot title in the title() function? pos align loc All of the above loc lotlib
ML is a field of AI consisting of Improve their Over time with
learning algorithms that? performance At executing some task experience All of the above All of the above ML
Identify the kind of learning
algorithm for “facial identities
for facial expressions”. Prediction Recognition patterns Recognizing anomalies Generating patterns Recognition patterns ML
Identify the type of learning in
which labeled training data is Semi unsupervised
used. learning Supervised learning Reinforcement Learning Unsupervised learning Supervised learning ML
The main focus of ML is
ML is a type of artificial to allow computer
intelligence that extract systems learn from
Machine Learning (ML) is patterns out of raw data by experience without being
What characteristics define that field of computer using an algorithm or explicitly programmed or
Machine Learning? science method. human intervention. All of the above All of the above ML
Determine the learning
algorithm utilized for
"identifying facial expressions
from facial identities." Prediction Recognition patterns Recognizing anomalies Generating patterns Recognition patterns ML
Name the type of learning that
involves the utilization of Semi unsupervised
labeled training data. learning Supervised learning Reinforcement Learning Unsupervised learning Supervised learning ML
if there is only a discrete
number of possible outcomes
(called categories), the process
becomes a______. Regression Classification Modelfree categories Classification ML
When there are only a finite
number of potential outcomes
(referred to as categories), the
process is termed as a ______. Regression Classification Modelfree categories Classification ML
To creates an ndarray from a
Python list, which of the
following functions would you Num
use? Numpy.reshape() numpy.reshape() Numpy.array() numpy.array() numpy.array() py
Num
NumPy arrays can be ___. Indexed Sliced Iterated all of the mentioned all of the mentioned py
We can find the Operational activities on None of the mentioned We can find the Num
Using ndim we can find - dimension of the array Size of array matrix above dimension of the array py
Which of the following can
store different types of
elements ,i.e., heterogeneous Num
elements in them? List DataFrame Tuple all of the mentioned all of the mentioned py
Which of the following can
store only homogeneous
elements (elements having Num
same data type)? List Ndarray Tuple DataFrame Ndarray py
When converting a Python list
into an ndarray, which function Num
would you utilize? Numpy.reshape() numpy.reshape() Numpy.array() numpy.array() numpy.array() py
Enumerate the potential
operations that can be Num
performed on NumPy arrays. Indexed Sliced Iterated all of the mentioned all of the mentioned py
The _____ function returns its
argument with modified
shape,whereas the _____
method modifies the array Num
itself. reshape,resize resize,reshape reshape2,resize all of the mentioned reshape,resize py
Array([8,10,12,14]) is the result Num
of array slice ____ Ar[3:7] Ar[:5] Ar[:7] Ar[:-3] Ar[3:7] py
Array([10,12,14,16]) is the Num
result of array slice ____ Ar[:4] Ar[4:8] Ar[4:] Both b and c Both b and c py
From and ndarray
numpy.arange(9.0), what can
be the possible shapes of arrays
that can be created using Num
reshape()? (9,1) , (3,3) , (1,9) (3,3) (9,1) , (1,9) None of these (9,1) , (3,3) , (1,9) py
import pandas as pd
import numpy as np
s=
pd.Series(np.random.randn(2)) Num
print s.size 0 1 2 3 2 py
The _____ function modifies the
shape of its argument and
returns it, while the _____ Num
method alters the array directly. reshape,resize resize,reshape reshape2,resize all of the mentioned reshape,resize py
A Dataframe object is value Pand
mutable. TRUE FALSE TRUE as
Amongst which of the following
is / are not correct to access
individual item from dataframe Pand
'df' df.iat[2,2] df.iat[2,2] df.loc[2,2] df.at[2,2] df[0,0] as
When you print/display any
series tehn the left most
column is showing _______ Pand
value. index Data Value None of the above Index as
Which of the following method
is used to subtract the two Pand
series? subtract( ) subtraction( ) diff( ) sub( ) sub( ) as
Python libraries contain a
collection of built-in Pand
_____________ Data Modules Packages Data Structure Modules as
Which of the following
statement will modify the first Pand
three values of Series ‘S1’? S1[0, 1, 2] = 100 S1[0 : 3] = 100 S1[ : 3] = 100 All of the above All of the above as
How do you select a single df.select_column(column_ Pand
column in a DataFrame? df[column_name] name) df.loc[:, column_name] All of the above All of the above as
Which of the following methods
is used to check for missing Pand
values in a DataFrame? check_missing() isna() find_missing() isnull() isnull() as
How do you drop rows with
missing values from a Pand
DataFrame? drop_missing() remove_null() dropna() delete_null() dropna() as
Which method is used to
perform string operations on a Pand
DataFrame column in Pandas? str_ops() apply_str() str accessor string_ops() str accessor as
Which method facilitates
performing string operations on Pand
a DataFrame column in Pandas? str_ops() apply_str() str accessor string_ops() str accessor as
What value does the leftmost
column display when Pand
printing/displaying any series? index Data Value None of the above Index as
import pandas as pd
pd.Series([1,2], index= None of above Pand
['a','b','c']) Synatx Error Index Error Value Error mentioned Value Error as
A panel is a ___ container of Pand
data 1D 2D 3D Infinite 3D as
Pand
Axis 1, in panel represent? minor_axis major_axis items None of the above major_axis as
While performing mathematical
operations on series, index
matching is implemented and
all missing values are filled in Pand
with ___________ by default NaN None 0 1 NaN as
____________ statement will
assigns a name to the Series Pand
‘S1’. S1.name = “Empl” S1_name = “Empl” S1[name] = “Empl” S1.indexname = “Empl” S1.name = “Empl” as
Roshan has written few points
about iloc( ) function of Series in
Python. His friend Suman told
that one of the written
statement is not correct. Help This method include the This method include
him to find the incorrect In iloc( ) method, we have last element of the range S1.iloc[3] will display S1.iloc[:3] will display first the last element of the Pand
statement. to pass an integer index. passed. fourth value of Series ‘S1’ three values of Series ‘S1’ range passed. as
Raman performed addition of
series ‘S1’ and ‘S2’ and store the
result in series ‘S3’. Both the Pand
series ‘S1’ and ‘S2’ have five 2 3 4 5 5 as
mismatching index value. How
many NaN will be there in ‘S3’?
Series ‘S1’ has five values with
index value (0, 1, 2, 3, 4) and
series ‘S2’ has five values with
index (2, 3, 4, 5, 6). What will be
the total number of values in Pand
‘S3’ if S3 = S1 + S2 5 6 7 8 7 as
Rosy wants to display the series
‘S1’ in reverse order. Help her Pand
to find the correct code. S1[ : : 1] S1[ : : -1] S1[ -1 : :] S1[ : 1 1] S1[ : : -1] as
Can a Series have duplicate Yes, Only series with Yes, Only series with Pand
index value? Yes NO integer values character values Yes as
How many times value ’10’ will
be displayed in the given series
‘S1’?
import pandas as pd
S1=pd.Series(10, index =
range(1, 10, 3)) Pand
print(S1) 1 3 5 4 3 as
How many elements will be
there in given series ‘S1’?
import pandas as pd
S1=pd.Series('python practice') Pand
print(S1) 0 1 2 15 1 as
Which of the following
statement will return 10 values
from the bottom/end of the Pand
Series ‘S1’? S1.tail( ) S1.tail(10) S1.head(10) S1(10) S1.tail(10) as
Which method is used to
perform element-wise
mathematical operations
between two DataFrames of the Pand
same shape in Pandas? multiply() add() subtract() All of the above All of the above as
What is the purpose of the fit() To train a model using a To make predictions using To evaluate the To visualize the data To train a model using Sckit
method in Scikit-learn? given dataset a trained model performance of a model using a plot a given dataset learn
What is the purpose of the To train a model using a To make predictions using To evaluate the To visualize the data To make predictions Sckit
predict() method in Scikit-learn? given dataset a trained model performance of a model using a plot using a trained model learn
What is the objective of the fit() To train a model using a To make predictions using To evaluate the To visualize the data To train a model using Sckit
method in Scikit-learn? given dataset a trained model performance of a model using a plot a given dataset learn
Explain the purpose of the To train a model using a To make predictions using To evaluate the To visualize the data To make predictions Sckit
predict() method in Scikit-learn. given dataset a trained model performance of a model using a plot using a trained model learn
Which library is used to create Seab
statistical graphics in Python? Pandas Numpy Seaborn Matplotlib Seaborn orn
Which of the following is a
measure of central tendency? Standard deviation Variance Median Range Median Stats
Which of the following is a
measure of skewness in a
dataset? Kurtosis Variance Median Range Kurtosis Stats
Which descriptive statistic gives
a measure of the "peakedness"
of a dataset? Kurtosis Variance Median Range Kurtosis Stats
The difference between The difference
What does the range of a the largest and smallest The spread of data points The most frequently between the largest
dataset represent? values around the mean occurring value The middle value and smallest values Stats
In a symmetrical dataset, which
measure of central tendency
will be approximately equal? Mean and median Mean and mode Median and mode Mean, median, and mode Mean and median Stats
In a perfectly symmetrical
dataset, which measure of
central tendency will be at the
center? Mean Median Mode Variance Mean Stats
Which measure of variability is
often used to describe the
spread of data in a bell-shaped,
normal distribution? Standard deviation Range IQR (Interquartile Range) Variance Standard deviation Stats
As a general rule, the _______
is the best measure of central
tendency because it is more
precise. Mean Median Mode Range Mean stats
Which of the following is NOT a
measure of variability? Median Variance Standard Deviation Range Median stats
Identify a metric used as a
measure of central tendency. Standard deviation Variance Median Range Median Stats
What does skewness indicate in
a dataset? Kurtosis Variance Median Range Kurtosis Stats
How does a measure of kurtosis
assess the "peakedness" of a
dataset? Kurtosis Variance Median Range Kurtosis Stats
The difference between The difference
What aspect of a dataset does the largest and smallest The spread of data points The most frequently between the largest
its range signify? values around the mean occurring value The middle value and smallest values Stats
In a dataset with symmetrical
distribution, which measure of Mean and median Mean and mode Median and mode Mean, median, and mode Mean and median Stats
central tendency tends to be
roughly equivalent?
In a perfectly symmetrical
dataset, which measure of
central tendency occupies the
central position? Mean Median Mode Variance Mean Stats
Which measure of variability is
commonly utilized to
characterize the dispersion of
data in a bell-shaped, normal
distribution? Standard deviation Range IQR (Interquartile Range) Variance Standard deviation Stats
As a general guideline, which
measure of central tendency
offers greater precision? Mean Median Mode Range Mean stats
Among the options provided,
which one is not considered a
measure of variability? Median Variance Standard Deviation Range Median stats
What does the interquartile The spread of data
range (IQR) measure in a The spread of data points The spread of data points The spread of data points The total range of the points around the
dataset? around the median around the mean around the mode dataset median Stats
The spread or
The spread or dispersion dispersion of data
In descriptive statistics, what of data points around the The central value of the The total range of the The frequency of each points around the
does the "variance" measure? mean dataset dataset data point in the dataset mean Stats
Approximately what percentage
of scores fall within one
standard deviation of the mean
in a normal distribution? 34% 95% 99% 68% 68% stats
What aspect of a dataset does The spread of data
the interquartile range (IQR) The spread of data points The spread of data points The spread of data points The total range of the points around the
evaluate? around the median around the mean around the mode dataset median Stats
The spread or
In descriptive statistics, what The spread or dispersion dispersion of data
concept does "variance" of data points around the The central value of the The total range of the The frequency of each points around the
quantify? mean dataset dataset data point in the dataset mean Stats
Approximately, what
percentage of scores lie within
one standard deviation of the
mean in a normal distribution? 34% 95% 99% 68% 68% stats
Which type of neural network Artifi
layer is responsible for combining cial
Fully connected layer Dropout layer Normalization layer Activation layer A
inputs through weighted Neur
summation? al
Netw
orks
Artifi
cial
Which layer in a convolutional
Neur
neural network (CNN) is Convolutional layer Pooling layer Fully connected layer Activation layer A
al
responsible for feature extraction?
Netw
orks
Artifi
cial
Which layer in a convolutional
Neur
neural network (CNN) is Convolutional layer Pooling layer Fully connected layer Activation layer A
al
responsible for feature extraction?
Netw
orks
Artifi
cial
In a feedforward neural network, Neur
Neurons Weights Activation functions All of the above D
what does each layer consist of? al
Netw
orks
Artifi
cial
Which of the following is a
Neur
commonly used activation ReLU Sigmoid Tanh All of the above D
al
function in neural networks?
Netw
orks
Artifi
cial
Which technique is used to
Neur
prevent overfitting in neural Dropout Batch Normalization Weight regularization All of the above D
al
networks?
Netw
orks
Artifi
cial
Which technique is used to
Neur
prevent overfitting in neural Dropout Batch Normalization Weight regularization All of the above D
al
networks?
Netw
orks
K-
Which distance metric is Mean
commonly used in K-Means Euclidean Manhattan Cosine All of the above A s
clustering? Clust
ering
K-
Which method is commonly used Mean
to determine the optimal number Elbow method Silhouette method Hopkins statistic All of the above A s
of clusters in K-Means clustering? Clust
ering
K-
Which of the following is NOT a Mean
common distance metric used in Mahalanobis Chebyshev Hamming All of the above C s
K-Means clustering? Clust
ering
K-
Mean
Which type of dataset is K-Means
Non-convex Linearly separable Homogeneous All of the above A s
clustering unsuitable for?
Clust
ering
Princi
pal
Which method is commonly used Com
to select the number of principal Elbow method Scree plot method Silhouette method All of the above B pone
components in PCA? nt
Analy
sis
Princi
pal
Com
Which of the following is a
Image compression Text classification Speech recognition All of the above D pone
possible application of PCA?
nt
Analy
sis
Artifi
cial
In a feedforward neural network, Neur
Neurons Weights Activation functions All of the above D
what does each layer consist of? al
Netw
orks
K-
Which of the following methods
Mean
can help improve the performance Using a different distance Increasing the number of
Applying feature scaling All of the above D s
of K-Means clustering on non- metric clusters
Clust
spherical clusters?
ering
Artifi
What is the primary purpose of Measures the difference
Updates the weights and Applies the activation cial
the loss function in a neural between predicted and Regularizes the network A
biases function Neur
network? actual values
al
Netw
orks
Artifi
cial
What is the primary purpose of Measures the difference
Updates the weights and Applies the activation Neur
the loss function in a neural between predicted and Regularizes the network A
biases function al
network? actual values
Netw
orks
K-
In K-Means clustering, what
Mean
happens if a data point is Assigned to the cluster with Assigned based on class
Assigned randomly Assigned to the first cluster A s
equidistant from two or more the lowest index labels
Clust
cluster centroids?
ering
Princi
pal
Assumes linear Com
Which of the following is NOT a Requires a large amount of Assumes normally
relationships between Sensitive to outliers C pone
limitation of PCA? data distributed data
variables nt
Analy
sis
K-
Mean
How does K-Means clustering By maximizing silhouette Automatically using an
Specified by the user Based on class labels A s
determine the number of clusters? score algorithm
Clust
ering
Artifi
cial
Which method is used to update
Neur
the weights and biases in a neural Gradient Descent Activation function Forward pass Backpropagation D
al
network during training?
Netw
orks
Artifi
cial
Which method is used to update
Neur
the weights and biases in a neural Gradient Descent Activation function Forward pass Backpropagation D
al
network during training?
Netw
orks
K-
Mean
In K-Means clustering, how are Based on the smallest
Randomly Based on class labels Based on the largest cluster A s
cluster centroids initially chosen? cluster
Clust
ering
Princi
How are principal components Based on the amount of
Randomly Based on their index Based on their eigenvalues A pal
ordered in PCA? variance they capture
Com
pone
nt
Analy
sis
Princi
pal
Com
How does PCA handle By creating uncorrelated By removing correlated By combining correlated By replacing correlated
A pone
multicollinearity in the dataset? variables variables variables variables
nt
Analy
sis
Princi
pal
By projecting data onto a Com
How does PCA reduce the By removing redundant By increasing the number By transforming data into a
lower-dimensional A pone
dimensionality of the data? features of features higher-dimensional space
subspace nt
Analy
sis
Artifi
cial
What does the term "weight
Setting initial values for Neur
initialization" refer to in neural Adjusting weights and biases Learning rate optimization Calculating gradients A
weights al
networks?
Netw
orks
Princi
pal
Weights representing the
Com
What is the interpretation of the contribution of each Variance explained by each Importance of each feature
Cluster assignment A pone
loadings in PCA? original feature to the principal component in the dataset
nt
principal component
Analy
sis
Princi
pal
Which method is used to Com
reconstruct the original data from Inverse transform Direct transform Residual analysis Correlation analysis A pone
principal components in PCA? nt
Analy
sis
K-
Which method is commonly used Mean
to choose the initial cluster K-Means++ Random initialization Hierarchical clustering DBSCAN A s
centroids in K-Means clustering? Clust
ering
Princi
pal
What is the effect of increasing Com
Captures more variance in Reduces the variance in the Makes the data less Decreases computational
the number of principal A pone
the data data interpretable complexity
components in PCA? nt
Analy
sis
Princi
pal
Which factor does NOT influence Com
Amount of variance Desired level of
the choice of the number of Number of observations Dimensionality of the data A pone
explained interpretability
principal components in PCA? nt
Analy
sis
Artifi
In a neural network, which term
cial
refers to the number of times the
Neur
entire dataset is passed forward Epoch Learning rate Momentum Dropout rate A
al
and backward through the
Netw
network?
orks
Artifi
In a neural network, which term
cial
refers to the number of times the
Neur
entire dataset is passed forward Epoch Learning rate Momentum Dropout rate A
al
and backward through the
Netw
network?
orks
Princi
pal
What is the relationship between Eigenvalues are inversely Eigenvalues have no Com
Eigenvalues represent the Eigenvalues are equal to
eigenvalues and explained proportional to explained relation to explained A pone
explained variance explained variance
variance in PCA? variance variance nt
Analy
sis
Princi
pal
What is the maximum number of Com
Equal to the number of Less than or equal to the Less than or equal to the Equal to the number of
principal components that can be A pone
original features number of observations number of original features observations
generated in PCA? nt
Analy
sis
K-
Which algorithm is often used as Mean
an initialization technique for K- K-Means++ DBSCAN Agglomerative clustering Hierarchical clustering A s
Means clustering? Clust
ering
K-
Mean
Which type of data is K-Means
Numeric Categorical Text Image A s
clustering particularly suited for?
Clust
ering
Princi
pal
Com
What is the main drawback of Increased computational
Loss of interpretability Loss of information Increased overfitting B pone
PCA? complexity
nt
Analy
sis
Princi
pal
Combines correlated Com
How does PCA handle Increases dimensionality of
features into principal Removes correlated features Ignores correlated features A pone
multicollinearity in the dataset? the data
components nt
Analy
sis
Princi
pal
What is the primary benefit of
Com
performing PCA before feeding Reducing computational
Reducing overfitting Improving interpretability Increasing model accuracy B pone
data into a machine learning complexity
nt
model?
Analy
sis
Princi
pal
Com
Which statistical assumption does Independence of
Normality Linearity Homoscedasticity B pone
PCA rely on? observations
nt
Analy
sis
Princi
pal
Com
What happens if the eigenvalues Indicates that the data is Indicates that the data has Indicates that the data has Indicates that the data is
A pone
in PCA are all very close to zero? not suitable for PCA high variance low variance categorical
nt
Analy
sis
K-
How does K-Means clustering
Mean
performance change when It may degrade due to the It improves due to feature
It remains unaffected It converges faster A s
dealing with high-dimensional curse of dimensionality redundancy
Clust
data?
ering
K-
Mean
How does K-Means clustering It requires conversion into It treats categorical data as
It ignores categorical data It creates dummy variables A s
handle categorical data? numerical format continuous
Clust
ering
K-
Which clustering algorithm is Mean
They have similar It depends on the size of
computationally faster: K-Means K-Means Hierarchical A s
computational complexity the dataset
or hierarchical clustering? Clust
ering
Princi
pal
It ensures that all features Com
What is the effect of feature It reduces the need for It increases the number of
contribute equally to It has no effect on PCA A pone
scaling on PCA? normalization principal components
variance nt
Analy
sis
K-
Mean
What is the effect of scaling on K- It can improve clustering It may lead to biased It increases the number of
It has no effect on clustering A s
Means clustering? performance results clusters
Clust
ering
K-
Mean
What is the effect of feature It helps to equalize the It reduces the number of
It may distort the clusters It accelerates convergence A s
scaling on K-Means clustering? influence of features clusters
Clust
ering
K-
How does the computational
Mean
complexity of K-Means clustering
It increases linearly It increases exponentially It decreases linearly It remains constant A s
change with the number of
Clust
clusters (k)?
ering
Princi
pal
Com
In PCA, what does each principal Linear combination of
Individual feature Cluster centroid Label of data points A pone
component represent? original features
nt
Analy
sis
Artifi
cial
What is the basic building block of Neur
Neuron Weight Activation function Layer A
a neural network? al
Netw
orks
Artifi
cial
What is the primary building block
Neur
of an artificial neural network Neuron Weight Activation function Layer A
al
(ANN)?
Netw
orks
Artifi
cial
What does the term
Adjusting weights and Neur
"backpropagation" refer to in Forward pass Activation of neurons Learning rate optimization A
biases al
neural networks?
Netw
orks
Artifi
cial
What does the term
Adjusting weights and Neur
"backpropagation" refer to in Forward pass Activation of neurons Learning rate optimization A
biases al
neural networks?
Netw
orks
K-
Mean
What is the main drawback of K- Sensitivity to initial Limited to linearly
Requires labeled data Slow convergence A s
Means clustering? centroids separable data
Clust
ering
Artifi
cial
Which type of neural network is
Neur
typically used for simple Perceptron RNN CNN LSTM A
al
classification tasks?
Netw
orks
Princi
pal
Amount of variance Com
What does the term "explained Difference between actual Importance of each feature
captured by the principal Magnitude of eigenvalues A pone
variance" refer to in PCA? and predicted values in the dataset
components nt
Analy
sis
K-
Mean
What is the objective of K-Means Minimize the within-cluster Maximize the between- Minimize the number of Maximize the number of
A s
clustering? sum of squares cluster sum of squares clusters iterations
Clust
ering
Which algorithm is often used as
K-
an alternative to K-Means
DBSCAN Agglomerative clustering Spectral clustering Mean Shift A Mean
clustering for non-convex
s
clusters?
Clust
ering
Artifi
cial
What is the main function of the
Introduces non-linearity to Determines the number of Neur
activation function in a neural Sets the learning rate Measures the loss A
the model neurons al
network?
Netw
orks
K-
In K-Means clustering, what is the Mean
Measures the compactness Measures the separation Measures the number of Measures the noise level in
role of the inertia or within-cluster A s
of the clusters between clusters clusters the data
sum of squares? Clust
ering
Artifi
cial
What problem does the vanishing
Gradient becomes very Gradient explodes during Model becomes too Neur
gradient problem refer to in Model fails to converge A
small during training training complex to train al
neural networks?
Netw
orks
Artifi
cial
What problem does the vanishing
Gradient becomes very Gradient explodes during Model becomes too Neur
gradient problem refer to in Model fails to converge A
small during training training complex to train al
neural networks?
Netw
orks
Artifi
cial
What is the purpose of the input Neur
Receives input data Makes predictions Applies activation function None of the above A
layer in a neural network? al
Netw
orks
K-
What happens if K-Means
Mean
clustering is run with a large Each data point becomes
Clusters merge into one The algorithm crashes None of the above A s
number of clusters, close to the its own cluster
Clust
number of data points?
ering
K-
Which step of the K-Means Mean
algorithm may result in empty Initialization Assignment Update centroids None of the above C s
clusters? Clust
ering
How does K-Means clustering It's recommended to K-
Treat missing values as a
deal with missing values in the impute missing values Ignore missing values None of the above B Mean
separate cluster
dataset? before clustering s
Clust
ering
Which step of the K-Means K-
algorithm involves updating the Mean
cluster centroids based on the Update centroids Assignment Initialization None of the above A s
mean of the data points assigned Clust
to each cluster? ering
Princi
pal
Measure of variance Com
What is the significance of Number of features in the
captured by each principal Magnitude of eigenvectors None of the above A pone
eigenvalues in PCA? dataset
component nt
Analy
sis
Princi
pal
In PCA, how is the percentage of Com
Divide the eigenvalue by Divide the eigenvalue by the Multiply the eigenvalue by
explained variance calculated for None of the above A pone
the sum of all eigenvalues total variance 100
each principal component? nt
Analy
sis
Princi
pal
What is the relationship between
Fewer principal Equal number of principal Com
the number of principal More principal components
components than original components and original None of the above C pone
components and the number of than original features
features features nt
original features?
Analy
sis
Princi
pal
Com
How does PCA handle missing Removes observations with
Imputes missing values Ignores missing values None of the above B pone
values in the dataset? missing values
nt
Analy
sis
K-
Which step of the K-Means
Mean
algorithm involves assigning each
Assignment Initialization Update centroids None of the above A s
data point to the nearest cluster
Clust
centroid?
ering
K-
What is the primary disadvantage
Mean
of the K-Means++ initialization
Slower initialization time More sensitive to outliers Less accurate centroids None of the above A s
method compared to random
Clust
initialization?
ering
K-
Which step of the K-Means Mean
algorithm can be computationally Assignment Initialization Update centroids None of the above C s
expensive for large datasets? Clust
ering
K-
What is the major challenge of
Mean
using the Elbow Method to Subjectivity in selecting the
Computationally expensive Sensitivity to outliers None of the above A s
determine the optimal number of "elbow point"
Clust
clusters?
ering
K-
Which step of the K-Means
Mean
algorithm is responsible for
Assignment Initialization Update centroids None of the above A s
updating the cluster membership
Clust
of data points?
ering
Princi
pal
Com
What is the purpose of scaling the Ensures all features have Prevents numerical Reduces computational
None of the above B pone
data before performing PCA? equal importance instabilities complexity
nt
Analy
sis
Princi
pal
In PCA, how is the amount of Com
Eigenvalue of the principal Squared loading of each Cumulative percentage of
variance explained by each None of the above C pone
component feature explained variance
principal component calculated? nt
Analy
sis
Princi
pal
Negative influence of Com
What is the interpretation of Negative correlation Positive correlation
features on the principal None of the above A pone
negative loadings in PCA? between features between features
component nt
Analy
sis
Princi
pal
Com
What is the computational
O(n^3) O(n^2) O(n log n) O(n) B pone
complexity of PCA?
nt
Analy
sis
Princi
pal
Which matrix is diagonalized in Com
PCA to obtain eigenvalues and Covariance matrix Correlation matrix Identity matrix Orthogonal matrix A pone
eigenvectors? nt
Analy
sis
Princi
pal
Com
What does the term "eigenvector" Direction of maximum Magnitude of variance Weighted combination of Orthogonal transformation
C pone
represent in PCA? variance in the data explained original features matrix
nt
Analy
sis
Princi
pal
Outliers may influence Outliers decrease the Com
What is the impact of outliers on Outliers increase
principal component Outliers do not affect PCA number of principal A pone
PCA? interpretability
directions components nt
Analy
sis
Princi
pal
PCA can be applied to both Com
Which of the following statements PCA requires data to be PCA can only be applied to PCA is not affected by
numerical and categorical A pone
about PCA is true? normally distributed high-dimensional data outliers
data nt
Analy
sis
Princi
pal
PCA reduces PCA maximizes class Com
How does PCA differ from Linear PCA is unsupervised, while PCA is supervised, while LDA
dimensionality, while LDA separability, while LDA A pone
Discriminant Analysis (LDA)? LDA is supervised is unsupervised
maximizes class separability reduces dimensionality nt
Analy
sis
Artifi
cial
What is the primary difference
Perceptron has one layer, Perceptron uses softmax Perceptron is a type of Perceptron is supervised, Neur
between a perceptron and a A
MLP has multiple layers activation, MLP uses ReLU CNN, MLP is a type of RNN MLP is unsupervised al
multi-layer perceptron (MLP)?
Netw
orks
Artifi
Which layer in a neural network
cial
performs the element-wise Activation layer Convolutional layer Fully connected layer Pooling layer A
Neur
activation function?
al
Netw
orks
Artifi
cial
Which of the following is NOT a Neur
Normalization layer Dropout layer Activation layer Pooling layer A
type of neural network layer? al
Netw
orks
Artifi
cial
Which layer in a neural network
Neur
performs the element-wise Activation layer Convolutional layer Fully connected layer Pooling layer A
al
activation function?
Netw
orks
Artifi
cial
Which of the following is NOT a Neur
Normalization layer Dropout layer Activation layer Pooling layer A
type of neural network layer? al
Netw
orks
Artifi
cial
What is the process of adjusting
Neur
weights and biases in a neural Training Forward pass Backpropagation Prediction C
al
network to minimize errors?
Netw
orks
Artifi
cial
What is the primary purpose of
Controls the size of weight Determines the number of Neur
the learning rate in training a Activates neurons Prevents overfitting A
updates layers al
neural network?
Netw
orks
Artifi
cial
What is the role of the learning Controls the size of weight Determines the number of Neur
Activates neurons Prevents overfitting A
rate in training a neural network? updates layers al
Netw
orks
K-
Mean
Which of the following is NOT a Sensitive to initial cluster Struggles with non-linear
Assumes spherical clusters Prone to overfitting D s
limitation of K-Means clustering? centroids data
Clust
ering
What is the main function of the Shifts the activation Adds flexibility to the Provides additional Artifi
Helps prevent overfitting C
bias in a neural network? function horizontally model capacity to learn cial
Neur
al
Netw
orks
Artifi
cial
What is the main function of the Shifts the activation Adds flexibility to the Provides additional Neur
Helps prevent overfitting C
bias in a neural network? function horizontally model capacity to learn al
Netw
orks
Artifi
cial
Which type of neural network is
Recurrent Neural Network Convolutional Neural Multilayer Perceptron Radial Basis Function Neur
most suitable for time-series A
(RNN) Network (CNN) (MLP) Network (RBFN) al
prediction?
Netw
orks
Artifi
cial
Which type of neural network is
Recurrent Neural Network Convolutional Neural Multilayer Perceptron Radial Basis Function Neur
best suited for sequential data A
(RNN) Network (CNN) (MLP) Network (RBFN) al
processing?
Netw
orks
Artifi
cial
Which type of neural network
Convolutional Neural Recurrent Neural Network Multilayer Perceptron Radial Basis Function Neur
architecture is suitable for image A
Network (CNN) (RNN) (MLP) Network (RBFN) al
recognition tasks?
Netw
orks
Artifi
cial
Which type of neural network is
Recurrent Neural Network Convolutional Neural Multilayer Perceptron Radial Basis Function Neur
most suitable for time-series A
(RNN) Network (CNN) (MLP) Network (RBFN) al
prediction?
Netw
orks
Artifi
cial
Which type of neural network is
Recurrent Neural Network Convolutional Neural Multilayer Perceptron Radial Basis Function Neur
best suited for sequential data A
(RNN) Network (CNN) (MLP) Network (RBFN) al
processing?
Netw
orks
Princi
What is the primary goal of pal
Principal Component Analysis Dimensionality reduction Feature extraction Classification Regression A Com
(PCA)? pone
nt
Analy
sis
Artifi
cial
What is the purpose of the
Converts raw scores into Speeds up the training Adds non-linearity to the Neur
softmax function in a neural Regularizes the network A
probabilities process model al
network?
Netw
orks
Artifi
cial
What is the purpose of the
Converts raw scores into Speeds up the training Adds non-linearity to the Neur
softmax function in a neural Regularizes the network A
probabilities process model al
network?
Netw
orks
How many coefficients do you
need to estimate in a simple
linear regression model (One
independent variable)? 1 2 3 4 2 LR
How many coefficients must
you estimate in a simple linear
regression model with one
independent variable? 1 2 3 4 2 LR
How many variables are
required to represent a linear
regression model? 3 2 1 4 3 LR
How many variables are
necessary to articulate a linear
regression model? 3 2 1 4 3 LR
(TP * TN - FP * FN) /
How is the Matthews (TP * TN - FP * FN) / sqrt((TP + FP) * (TP + Class
Correlation Coefficient (MCC) (TP + TN) / (TP + TN + FP sqrt((TP + FP) * (TP + FN) * (2 * Precision * Recall) / (TP / (TP + FP)) * (TP / (TP FN) * (TN + FP) * (TN + ificati
calculated? + FN) (TN + FP) * (TN + FN)) (Precision + Recall) + FN)) FN)) on
Which evaluation metric is Class
more appropriate when dealing ificati
with imbalanced datasets? Accuracy Precision Recall F1-score F1-score on
Which evaluation metric is Class
more suitable for dealing with ificati
imbalanced datasets? Accuracy Precision Recall F1-score F1-score on
Which of the following is a Class
disadvantage of using accuracy It does not consider the It is sensitive to class It is not affected by It is sensitive to class ificati
as an evaluation metric? true negatives. imbalance. It is difficult to interpret. misclassification errors. imbalance. on
The learner is trying to predict Multivariate Logistic Multivariate Linear
housing prices based on the size Regression Logistic Regression Linear Regression Regression Linear Regression LR
of each house. What type of
regression is this?g
What type of regression is it
when the learner attempts to
forecast housing prices based Multivariate Logistic Multivariate Linear
on each house's size? Regression Logistic Regression Linear Regression Regression Linear Regression LR
In linear regression, what is the
name of the line that best fits
the data points? Best-Fit Line Regression Line Trend Line Correlation Line Regression Line LR
Which of the following metrics
can be used to evaluate the
performance of a binary Class
classifier at various threshold ificati
levels? Precision Recall ROC-AUC F1-score ROC-AUC on
In practice, Line of best fit or Sum of the absolute value Sum of the square of Sum of the square of Sum of the square of
regression line is found when Sum of residuals (∑(Y – of residuals (∑|Y-h(X)|) is residuals ( ∑ (Y-h(X))2) is residuals ( ∑ (Y-h(X))2) is residuals ( ∑ (Y-h(X))2)
_____________ h(X))) is minimum maximum minimum maximum is minimum LR
What triggers the discovery of Sum of the absolute value Sum of the square of Sum of the square of Sum of the square of
the Line of Best Fit or regression Sum of residuals (∑(Y – of residuals (∑|Y-h(X)|) is residuals ( ∑ (Y-h(X))2) is residuals ( ∑ (Y-h(X))2) is residuals ( ∑ (Y-h(X))2)
line in practice? h(X))) is minimum maximum minimum maximum is minimum LR
The residual
represents the
The residual represents The residual represents the difference between
the difference between difference between the The residual represents the predicted and
the predicted and actual actual and predicted the value of the The residual represents actual values, while
values, while the values, while the predicted dependent variable, while the slope of the the predicted value is
What is the difference between predicted value is the value is the value the predicted value is the regression line, while the the value estimated by
the residual and the predicted value estimated by the estimated by the value of the independent predicted value the regression
value in linear regression? regression equation. regression equation. variable. represents the intercept. equation. LR
The residuals have a
When performing linear The residuals have a non- The residuals have a The residuals have a varying variance
regression, what does it mean if linear relationship with constant variance across all varying variance across across different levels
the residuals exhibit the independent levels of the independent The residuals have a non- different levels of the of the independent
heteroscedasticity? variables. variables. normal distribution. independent variables. variables. LR
What does the relationship
between residuals and There is a relationship There is a relationship Class
predicted values signify in linear means our model is not There is a relationship means our model is ificati
regression? good means our model is good Can’t say None of these not good on
What does relationship means
between residuals and There is a relationship There is a relationship Class
predicted values in linear means our model is not There is a relationship means our model is ificati
regression? good means our model is good Can’t say None of these not good on
Class
ificati
on
(TP * TN - FP * FN) / Evalu
How is the Matthews (TP * TN - FP * FN) / sqrt((TP + FP) * (TP + ation
Correlation Coefficient (MCC) (TP + TN) / (TP + TN + FP + sqrt((TP + FP) * (TP + FN) * (2 * Precision * Recall) / (TP / (TP + FP)) * (TP / (TP FN) * (TN + FP) * (TN + Metr
calculated? FN) (TN + FP) * (TN + FN)) (Precision + Recall) + FN)) FN)) ics
Logis
tic
Within logistic regression, what Regr
is the output range produced by essio
the sigmoid function? [0, 1] [-1, 1] [0, ∞) (-∞, ∞) [0, 1] n
Logis
tic
In logistic regression, what is Regr
the output range of the sigmoid essio
function? [0, 1] [-1, 1] [0, ∞) (-∞, ∞) [0, 1] n
Regr
essio
n
Evalu
ation
What is the range of the R- Metr
squared value? [0, 1] [0, ∞) [-1, 1] [-∞, ∞) [0, 1] ics
Regr
essio
n
Evalu
ation
What is the range of the Mean Metr
Absolute Error (MAE)? [0, ∞) [0, 1] [-∞, ∞) [-1, 1] [0, ∞) ics
Regr
essio
80% of the variance in n
80% of the variance in the the dependent Evalu
dependent variable is 80% of the observations 80% of the observations variable is explained ation
What is the interpretation of an explained by the are correctly predicted by fall within the confidence 80% of the observations by the independent Metr
R-squared value of 0.8? independent variables the model interval have the same value variables ics
Rand
om
What does the term "forest" A collection of decision A group of random A set of rules for A collection of Fores
refer to in Random Forest? trees A single decision tree variables classification decision trees t
Rand
om
What does the term "forest" A collection of decision A group of random A set of rules for A collection of Fores
denote in Random Forest? trees A single decision tree variables classification decision trees t
A decision tree with a A decision tree with a Decis
What is a decision stump in single decision node and A decision tree with a A decision tree with a A decision tree with single decision node ion
decision trees? two leaf nodes single leaf node single root node multiple levels of nodes and two leaf nodes Tree
A flowchart-like
structure where each
A flowchart-like structure internal node
where each internal node represents a test on a
represents a test on a feature, each branch
feature, each branch represents the
represents the outcome outcome of the test,
of the test, and each leaf and each leaf node Decis
node represents a class A neural network represents a class ion
What is a decision tree? label or a decision A linear regression model A clustering algorithm architecture label or a decision Tree
A flowchart-like tree
structure where each
internal node
A flowchart-like tree represents a decision
structure where each based on a feature,
internal node represents each branch
a decision based on a represents the
feature, each branch outcome of the
represents the outcome A model that uses decision, and each leaf
of the decision, and each A linear model that uses gradient descent to A clustering algorithm node represents the Decis
What is a decision tree in leaf node represents the decision boundaries to minimize the loss that groups similar data final decision or ion
machine learning? final decision or outcome classify data points function points together outcome Tree
A flowchart-like tree
structure where each
internal node
A flowchart-like tree represents a decision
structure where each based on a feature,
internal node represents each branch
a decision based on a represents the
feature, each branch outcome of the
represents the outcome A model that uses decision, and each leaf
of the decision, and each A linear model that uses gradient descent to A clustering algorithm node represents the Decis
What does a decision tree leaf node represents the decision boundaries to minimize the loss that groups similar data final decision or ion
signify in machine learning? final decision or outcome classify data points function points together outcome Tree
A type of supervised An unsupervised learning A reinforcement learning A type of supervised Rand
What is Random Forest? learning algorithm algorithm A clustering algorithm algorithm learning algorithm om
Fores
t
Rand
om
What does Random Forest A type of supervised An unsupervised learning A reinforcement learning A type of supervised Fores
entail? learning algorithm algorithm A clustering algorithm algorithm learning algorithm t
Logis
tic
What commonly used Regr
evaluation metric is associated Mean Squared Error essio
with logistic regression? (MSE) Accuracy R-squared F1-score Accuracy n
Which of the following
measures is frequently Class
employed for binary Mean Absolute Error Root Mean Squared Error ificati
classification problems? (MAE) Accuracy R-squared (RMSE) Accuracy on
Which of the following metrics Class
is commonly used for binary Mean Absolute Error Root Mean Squared Error ificati
classification problems? (MAE) Accuracy R-squared (RMSE) Accuracy on
Class
ificati
on
Evalu
Which of the following metrics ation
is commonly used for binary Mean Absolute Error Root Mean Squared Error Metr
classification problems? (MAE) Accuracy R-squared (RMSE) Accuracy ics
Logis
tic
Which of the following Regr
evaluation metrics is commonly Mean Squared Error essio
used for logistic regression? (MSE) Accuracy R-squared F1-score Accuracy n
Accuracy is calculated as
Which of the following Accuracy considers both the ratio of true positives Accuracy considers Class
statements about accuracy is Accuracy is suitable for false positives and false to the total number of Accuracy can be used for both false positives ificati
true? imbalanced datasets. negatives. predictions. regression tasks. and false negatives. on
Accuracy is calculated as
Which of the subsequent Accuracy considers both the ratio of true positives Accuracy considers Class
assertions about accuracy is Accuracy is suitable for false positives and false to the total number of Accuracy can be used for both false positives ificati
correct? imbalanced datasets. negatives. predictions. regression tasks. and false negatives. on
Class
Accuracy is calculated as ificati
Which of the following Accuracy considers both the ratio of true positives Accuracy considers on
statements about accuracy is Accuracy is suitable for false positives and false to the total number of Accuracy can be used for both false positives Evalu
true? imbalanced datasets. negatives. predictions. regression tasks. and false negatives. ation
Metr
ics
Which of the following methods Rand
can be used to evaluate the om
performance of a Random Accuracy, Precision, Accuracy, Precision, Fores
Forest? Recall, F1-score Mean Squared Error (MSE) R-squared Silhouette Score Recall, F1-score t
Which of the following metrics Class
is suitable for multiclass ificati
classification problems? Precision Recall F1-score All of the above All of the above on
Which of the following
techniques can be used to Decis
prevent overfitting in decision ion
trees? Pruning Feature selection Regularization All of the above All of the above Tree
Class
ificati
on
Evalu
Which of the following metrics ation
is suitable for multiclass Metr
classification problems? Precision Recall F1-score All of the above All of the above ics
Which of the following metrics Class
is appropriate for multiclass ificati
classification problems? Precision Recall F1-score All of the above All of the above on
What is the criterion used to Decis
measure impurity in decision ion
trees? Gini impurity Entropy Misclassification error All of the above All of the above Tree
What criterion is employed to Decis
gauge impurity within decision ion
trees? Gini impurity Entropy Misclassification error All of the above All of the above Tree
Regr
essio
All of the variance in n
All of the variance in the None of the variance in the the dependent Evalu
dependent variable is dependent variable is variable is explained ation
What is the interpretation of an explained by the explained by the The model has perfect by the independent Metr
R-squared value of 1? independent variables independent variables prediction The model is underfitting variables ics
What occurs with bias and
variance as the size of training Class
data increases in linear Bias increases and Bias decreases and Bias increases and Bias increases and ificati
regression? Variance increases Variance increases Variance decreases Can’t Say Variance decreases on
Class
What will happen with bias and Bias increases and Bias decreases and Bias increases and Bias increases and ificati
variance as you increase the Variance increases Variance increases Variance decreases Can’t Say Variance decreases on
size of training data in linear
regression?
Rand
By adjusting class om
How does Random Forest By adjusting class weights By oversampling the By undersampling the By assigning equal weights during Fores
handle class imbalance? during training minority class majority class weights to all classes training t
Rand
By adjusting class om
How does Random Forest By adjusting class weights By oversampling the By undersampling the By assigning equal weights during Fores
address class imbalance? during training minority class majority class weights to all classes training t
Rand
By imputing missing By ignoring om
How does Random Forest By averaging the values By replacing missing values By ignoring observations values based on other observations with Fores
handle missing values? from other trees with the mode with missing values features missing values t
Rand
By imputing missing By ignoring om
How does Random Forest By averaging the values By replacing missing values By ignoring observations values based on other observations with Fores
manage missing values? from other trees with the mode with missing values features missing values t
In a simple linear regression
model (One independent
variable), If we change the input
variable by 1 unit. How much
output variable will change? by 1 no change by intercept by its slope by its slope LR
In a simple linear regression
model with one independent
variable, how much does the
output variable change when
the input variable is adjusted by
one unit? by 1 no change by intercept by its slope by its slope LR
Logis
tic
Regr
How is the likelihood function By minimizing the sum of By maximizing the sum of By maximizing the log- By minimizing the log- By maximizing the log- essio
optimized in logistic regression? squared residuals squared residuals likelihood function likelihood function likelihood function n
Logis
tic
How is the likelihood function Regr
maximized in logistic By minimizing the sum of By maximizing the sum of By maximizing the log- By minimizing the log- By maximizing the log- essio
regression? squared residuals squared residuals likelihood function likelihood function likelihood function n
By measuring the By measuring the ratio of By measuring the By measuring the
decrease in impurity after correctly classified probability of incorrectly By measuring the amount decrease in impurity Decis
How is information gain a dataset is split on an instances to the total classifying a randomly of information gained by after a dataset is split ion
calculated in decision trees? attribute number of instances chosen element splitting a node on an attribute Tree
By measuring the By measuring the ratio of By measuring the By measuring the
How is information gain decrease in impurity after correctly classified probability of incorrectly By measuring the amount decrease in impurity Decis
computed within decision a dataset is split on an instances to the total classifying a randomly of information gained by after a dataset is split ion
trees? attribute number of instances chosen element splitting a node on an attribute Tree
Rand
By using feature By ignoring noisy features om
How does Random Forest By randomly selecting By removing noisy features importance to filter out and relying on ensemble By randomly selecting Fores
handle noisy features? features for each tree during preprocessing noisy features learning features for each tree t
How does Random Forest
handle noisy features? Rand
By using feature By ignoring noisy features om
By randomly selecting By removing noisy features importance to filter out and relying on ensemble By randomly selecting Fores
features for each tree during preprocessing noisy features learning features for each tree t
By assigning a numerical
How does the decision tree By splitting the variable value to each level and By ignoring the variable By splitting the Decis
handle categorical variables into binary dummy treating it as a continuous during the tree By converting the variable variable into binary ion
with multiple levels? variables variable construction into ordinal levels dummy variables Tree
By assigning a numerical
How does a decision tree By splitting the variable value to each level and By ignoring the variable By splitting the Decis
handle categorical variables into binary dummy treating it as a continuous during the tree By converting the variable variable into binary ion
with multiple levels? variables variable construction into ordinal levels dummy variables Tree
By replacing missing By using surrogate
By using surrogate splits By imputing missing values values with a random splits to make Decis
How does a decision tree to make decisions based with the mean or median By ignoring observations value from the feature decisions based on ion
address missing values? on available data of the feature with missing values distribution available data Tree
By replacing missing By using surrogate
By using surrogate splits By imputing missing values values with a random splits to make Decis
How does a decision tree to make decisions based with the mean or median By ignoring observations value from the feature decisions based on ion
handle missing values? on available data of the feature with missing values distribution available data Tree

You might also like