Exploratory Data Analysis (EDA) To make data more To discover patterns and To discover patterns in Data Science? complicated To simplify complex data insights in data To visualize data and insights in data EDA What does the term "outlier" A point that falls within A data point that is part of A data point that falls far A data point that is A data point that falls refer to in the context of EDA? the data range the main cluster from the rest missing far from the rest EDA What does the term "mode" refer to in the context of data The most frequently The most frequently distribution analysis in EDA? occurring value The average value The spread of data The presence of outliers occurring value EDA What is the main objective of conducting Exploratory Data To make data more To discover patterns and To discover patterns Analysis (EDA) in Data Science? complicated To simplify complex data insights in data To visualize data and insights in data EDA In the context of EDA, what A point that falls within A data point that is part of A data point that falls far A data point that is A data point that falls does the term "outlier" signify? the data range the main cluster from the rest missing far from the rest EDA What is the meaning of the term "mode" in the context of analyzing data distributions The most frequently The most frequently during EDA? occurring value The average value The spread of data The presence of outliers occurring value EDA To visualize What is the primary purpose of To show the distribution To visualize relationships To display categorical To represent time series relationships between a scatter plot in EDA? of a single variable between variable data data variable EDA Which of the following information is not given by five- number summary? 
Mean Median Mode All of the mentioned Mode EDA What is the primary function of To visualize a scatter plot in Exploratory To show the distribution To visualize relationships To display categorical To represent time series relationships between Data Analysis (EDA)? of a single variable between variable data data variable EDA The command to install Matp Matplotlib library in python is Install pip matplotlib install matplotlib Pip matplotlib Pip install matplolib Pip install matplolib lotlib A database management A data visualization Matp What is Matplotlib? A programming language A data visualization library system An operating system library lotlib How can you add a title to a By using the title() By using the label() By using the text() By using the legend() By using the title() Matp Matplotlib plot? function function function function function lotlib What is the purpose of the xlabel() and ylabel() functions in To label the x and y axes To change the color of a To label the x and y Matp Matplotlib? To add a legend to a plot To add a title to a plot of a plot plot axes of a plot lotlib What is the purpose of the To create multiple subplot() function in To create multiple plots To change the color of a To add annotations to a plots in the same Matp Matplotlib? in the same figure To adjust the size of a plot plot plot figure lotlib What is the purpose of the Matp scatter() function in Matplotlib? To create line plots To create bar plots To create scatter plots To create a pie charts To create scatter plots lotlib What is the purpose of the To change the x and y To change the color of a To label different lines or To change the x and y Matp axis() function in Matplotlib? To add grid lines to a plot limits of a plot plot markers on a plot limits of a plot lotlib Which module is used for data Matp visualization in Python? NumPy Pandas Matplotlib Scikit-learn Matplotlib lotlib Matplotlib is a ____ library for the Python programming Matp language. 
data science mathematics numpy plotting plotting lotlib Which function is used to set a Matp label for the x-axis in pyplot? xlabel() set_x() x-axis() xaxi() xlabel() lotlib What command is used to install the Matplotlib library in Matp Python? Install pip matplotlib install matplotlib Pip matplotlib Pip install matplolib Pip install matplolib lotlib A database management A data visualization Matp Define Matplotlib. A programming language A data visualization library system An operating system library lotlib How do you incorporate a title By using the title() By using the label() By using the text() By using the legend() By using the title() Matp into a Matplotlib plot? function function function function function lotlib What role do the xlabel() and ylabel() functions serve in To label the x and y axes To change the color of a To label the x and y Matp Matplotlib? To add a legend to a plot To add a title to a plot of a plot plot axes of a plot lotlib What is the function of the To create multiple subplot() function in To create multiple plots To change the color of a To add annotations to a plots in the same Matp Matplotlib? in the same figure To adjust the size of a plot plot plot figure lotlib Describe the purpose of the Matp scatter() function in Matplotlib. To create line plots To create bar plots To create scatter plots To create a pie charts To create scatter plots lotlib What does the axis() function To change the x and y To change the color of a To label different lines or To change the x and y Matp achieve in Matplotlib? To add grid lines to a plot limits of a plot plot markers on a plot limits of a plot lotlib What is the primary module utilized for data visualization in Matp Python? NumPy Pandas Matplotlib Scikit-learn Matplotlib lotlib Matplotlib serves as a(n) ____ library for the Python Matp programming language. data science mathematics numpy plotting plotting lotlib Which function in pyplot is employed to assign a label to Matp the x-axis? 
xlabel() set_x() x-axis() xaxi() xlabel() lotlib To add grid lines to a graph in Matplotlib, which function is Matp used? grid() gridlines() grid_lines() grids() grid() lotlib Which function in Matplotlib is employed to introduce grid Matp lines to a graph? grid() gridlines() grid_lines() grids() grid() lotlib The main focus of ML is ML is a type of artificial to allow computer intelligence that extract systems learn from Machine Learning (ML) is patterns out of raw data by experience without being What is true about Machine that field of computer using an algorithm or explicitly programmed or Learning? science method. human intervention. All of the above All of the above ML Among the following option identify the one which is not a Semi unsupervised Semi unsupervised type of learning learning Supervised learning Reinforcement Learning Unsupervised learning learning ML Machine Learning is a subset of which of the following. Artificial Intelligence Deep Learning Data Learning None of the above Artificial Intelligence ML Which of the following are common classes of problems in machine learning? Regression Classification Clustering All of the above All of the above ML Machine learning is the autonomous The autonomous The autonomous The selective acquisition acquisition of acquisition of knowledge acquisition of knowledge of knowledge through the The selective acquisition knowledge through through the use of through the use of manual use of computer of knowledge through the the use of computer What is Machine learning? computer programs programs programs use of manual programs programs. ML Out of the following options, which one is not categorized as Semi unsupervised Semi unsupervised a type of learning? learning Supervised learning Reinforcement Learning Unsupervised learning learning ML In which broader domain does Machine Learning operate as a subset? 
Artificial Intelligence Deep Learning Data Learning None of the above Artificial Intelligence ML Which problem classes are commonly encountered in Machine Learning? Regression Classification Clustering All of the above All of the above ML Which of the following functions is typically used to add elements to a plot in the base graphics system? lines() hist() plot() boxplot() boxplot() ML A Python package which stands for 'Numerical Python' is named and used as _____ in Python Num programs. Scipy Pandas NumPy Pyplot NumPy py Which of the following method creates a new array object that Num looksat the same data? view copy paste all of the mentioned view py _____ is the another name of Num NumPy arrays. Python arrays Parrays NumPy Data Arrays Ndarray Ndarray py What is the purpose of NumPy To do numerical None of the mentioned Num in Python? calculations To do scientific computing Both A and B above Both A and B py Which of the following is not a legal way to create NumPy Num arrays? numpy.array() numpy.linspace() numpy.arange() numpy.newarray() numpy.newarray() py To horizontally combine two ndarrays,which function may be Num used? hstack() vstack() hsplit() vsplit() hstack() py Tovertically combine two ndarrays,which function may be Num used? hstack() vstack() hsplit() vsplit() vsplit() py Which method among the following creates a new array object that references the same Num data? view copy paste all of the mentioned view py Another term for NumPy arrays Num is _____. Python arrays Parrays NumPy Data Arrays Ndarray Ndarray py What is the role of NumPy in To do numerical None of the mentioned Num Python? calculations To do scientific computing Both A and B above Both A and B py To creates a sequence of numbers,Numpy provides a funtion _____ analogous to range that returns arrays Num instead of lists. arange aspace aline all of the mentioned arange py Consider the statement AR+2. 
What should be the type of AR so that above statement produces no error and gets Num executed successfully? List Numpyarray Ndarray Both b and c Both b and c py For generating a sequence of numbers, NumPy offers a function _____, which is similar to range but produces arrays Num instead of lists. arange aspace aline all of the mentioned arange py Returns the number of Returns the number of Returns the number of dimensions of the dimensions of the elements in the Returns the Series as underlying data, by Returns a list of the axis underlying data, by Num Why ndim is used? underlying data. ndarray. definition 1. labels definition 1. py What will be the minimum number of arguments require to None of the above None of the above Pand pass in pandas series? 2 3 4 mentioned mentioned as Amongst which of the following is / are used to analyze the data None of the above Pand in pandas. DataFrame Series Both A and B mentioned Both A and B as Amongst which of the following can be used to create various Numpy ndarrays and All of the above All of the above Pand inputs using pandas DataFrame. Lists,dict Series Another DataFrame mentioned mentioned as Indexing in Series is similar to Pand that for NumPy arrays. TRUE FALSE TRUE as DataFrame accepts many Pand different kinds of input. TRUE FALSE TRUE as Which of the following thing Pand can be data in Pandas? a python dict an ndarray a scalar value all of the mentioned all of the mentioned as Pand PANDAS stands for ____ Panel Data Analysis Panel Data Analyst Panel Data Panel Dashboard Panel Data as Which of the following command is used to install Pand pandas? pip install pandas install pandas pip pandas None of the above pip install pandas as head( ) function return __________ n rows and tail function return _____________ Pand n rows from a pandas object. 
last, first first, second last, seven first, last first, last as We can have duplicate indexes Pand in Series?(T/F) TRUE FALSE TRUE as Which function is used to read a CSV file into a Pandas Pand DataFrame? read_csv() load_csv() open_csv() import_csv() read_csv() as What is the correct way to access the first five rows of a Pand DataFrame called df in Pandas? df.head(5) df.first(5) df[:5] df.top(5) df.head(5) as It creates subgroups It sorts the DataFrame It merges two It reshapes the It creates subgroups What does the groupby() based on the unique based on a specified DataFrames based on a DataFrame into a based on the unique Pand function in Pandas do? values in a column column common column specified format values in a column as Which function is used to create Pand a histogram in Seaborn? sns.lineplot() sns.scatterplot() sns.histplot() sns.barplot() sns.histplot() as How do you sort values in a DataFrame based on a specific df.sort_values(by='column df.order_by('column_na df.sort_values(by='col Pand column? df.sort('column_name') _name') sort_df('column_name') me') umn_name') as Which method is used to fill missing values in a DataFrame Pand in Pandas? fill_nulls() replace_nulls() fillna() complete_nulls() fillna() as How can you arrange the values in a DataFrame based on a df.sort_values(by='column df.order_by('column_na df.sort_values(by='col Pand specific column? df.sort('column_name') _name') sort_df('column_name') me') umn_name') as What method is employed to populate missing values in a Pand DataFrame in Pandas? fill_nulls() replace_nulls() fillna() complete_nulls() fillna() as Is indexing in Series similar to Pand that in NumPy arrays? TRUE FALSE TRUE as Does DataFrame accommodate Pand various types of input? TRUE FALSE TRUE as What types of objects can serve Pand as data in Pandas? a python dict an ndarray a scalar value all of the mentioned all of the mentioned as What is the full form of the Pand acronym PANDAS? 
Panel Data Analysis Panel Data Analyst Panel Data Panel Dashboard Panel Data as Which command installs Pand pandas? pip install pandas install pandas pip pandas None of the above pip install pandas as Pandas key data structure is Pand called? Keyframe DataFrame Statistics Econometrics DataFrame as Amongst which of the following pandas.DataFrame( data, is a correct syntax for panda's Pandas.DataFrame(data, index, columns, dtype, pandas.DataFrame(data, pandas.DataFrame( data, Pandas.DataFrame(dat Pand dataframe? index, dtype, copy) copy) index, dtype, copy) index, rows, dtype, copy) a, index, dtype, copy) as Amongst which of the following is / are not an iterative function All of the above Pand for dataframe? iterrows() itercolumns() iteritems() mentioned itercolumns() as When we create a series from dictionary then the keys of Pand dictionary become _______ Index of the series Value of the series Caption of the series None of the series Index of the series as A dataframe object can A column of dataframe Which of the folowing is not be created by passing A dataframe is size A dataframe index can be can have data of different A dataframe is size Pand true about dataframe? dictionaries. immutable string types immutable as Mr. Kumar is working in an IT company. He stored the salaries of all the employees of January month in Series ‘Jan_Sal’ and salaries of February month in Series ‘Feb_Sal’. Now he wants to add the salaries of both months. He has written the following statement. Identify print(Feb_Sal plus print(Feb_Sal + Pand the correct one. print(Feb_Sal + Jan_Sal) print(Feb_Sal_add_Jan_Sal) Jan_Sal) None of the above Jan_Sal) as print(S1[-1]) will return ___________ Pand #’S1′ is a series last element of series ‘S1’ first element of series ‘S1’ Key Error all elements of series ‘S1’ Key Error as __________ function is used to sort a Series object on the basis Pand of values. 
sort.values( ) sort_values( ) sort_value( ) sort_Values sort_values( ) as Which of the following attribute Pand of Series returns the tuple? size shape values index shape as What is the data type of series ‘S1’ given below ? Pand S1=pd.Series([11, 12.5, “ok”]) int64 float64 object object64 object as Which of the following statement is correct to add NaN S1=pd.Series([10, S1=pd.Series([10, None, Pand value in series? np.NaN,11]) 11]) Both of the above None of the above Both of the above as Concatenates Concatenates What does the pd.concat() Joins DataFrames Joins DataFrames DataFrames along a Combines DataFrames DataFrames along a Pand function do in Pandas? vertically horizontally specified axis using SQL-style joins specified axis as How do you drop a column df.remove_column(column df.drop(columns=[column delete_column(column_n df.drop(columns=[colu Pand from a DataFrame in Pandas? df.drop(column_name) _name) _name]) ame) mn_name]) as How do you eliminate a column df.remove_column(column df.drop(columns=[column delete_column(column_n df.drop(columns=[colu Pand from a DataFrame in Pandas? df.drop(column_name) _name) _name]) ame) mn_name]) as What do the keys of a dictionary become when Pand creating a series from it? Index of the series Value of the series Caption of the series None of the series Index of the series as A natural language A machine learning A data visualization library processing library in A web development A machine learning Sckit What is Scikit-learn? library in Python in Python Python framework in Python library in Python learn Which of the following is an example of a supervised Principal component Sckit learning algorithm? K-means clustering Decision tree analysis (PCA) Apriori algorithm Decision tree learn Which of the following is an example of a unsupervised learning algorithm in Scikit- Support vector machines Sckit learn? 
Linear Regression K-means clustering (SVM) Random Forest K-means clustering learn A natural language A machine learning A data visualization library processing library in A web development A machine learning Sckit Define Scikit-learn. library in Python in Python Python framework in Python library in Python learn Can you provide an example of a supervised learning Principal component Sckit algorithm? K-means clustering Decision tree analysis (PCA) Apriori algorithm Decision tree learn Could you provide an example Support vector machines Sckit of an unsupervised learning Linear Regression K-means clustering (SVM) Random Forest K-means clustering learn algorithm available in Scikit- learn? Which measure of central tendency is best used for categorical data? Mode Median Mean Standard Deviation Mode Stats What is the median of the data set: 12, 15, 17, 20, 25? 15 17 20 22 15 Stats What does the mode of a The most frequently The most frequently dataset represent? occurring value The middle value The arithmetic average The spread of data points occurring value Stats Which measure of central tendency should be used for a dataset with outliers to get a representative value? Median Mode Mean Variance Median Stats It is the measure of variability, based on dividing a data set into quartiles. Deviation Standard Deviation Range Inter Quartile Range Inter Quartile Range Stats The value most recurrent in the sample set is known as ________. Mean Median Mode Standard Deviation Mode stats __________Statistics uses the data to provide descriptions of the population, either through numerical calculations or graphs or tables. Descriptive Quantitative Inferential Qualitative Descriptive stats What is the median of the following set of scores? 18, 6, 12, 10, 14 ? 
10 14 18 12 12 stats An approximate indicator The square root of the of how numbers vary The standard deviation is: variance A measure of variability from the mean All of the above All of the above stats The _______ is often the preferred measure of central tendency if the data are severely skewed. Mean Median Mode Range Median stats Which measure of central tendency is most suitable for analyzing categorical data? Mode Median Mean Standard Deviation Mode Stats Calculate the median for the dataset: 12, 15, 17, 20, 25. 15 17 20 22 15 Stats What information does the The most frequently The most frequently mode provide about a dataset? occurring value The middle value The arithmetic average The spread of data points occurring value Stats Which measure of central tendency is recommended for datasets with outliers to ensure a representative value? Median Mode Mean Variance Median Stats Describe the concept related to dividing a dataset into quartiles to gauge variability. Deviation Standard Deviation Range Inter Quartile Range Inter Quartile Range Stats What term describes the most frequently occurring value in a sample set? Mean Median Mode Standard Deviation Mode stats How does Descriptive Statistics contribute to understanding a population's characteristics? Descriptive Quantitative Inferential Qualitative Descriptive stats Calculate the median for the given set of scores: 18, 6, 12, 10, 14. 10 14 18 12 12 stats An approximate indicator Define the concept of standard The square root of the of how numbers vary deviation. variance A measure of variability from the mean All of the above All of the above stats In the presence of severe skewness in data, which measure of central tendency is commonly preferred? 
Mean Median Mode Range Median stats To make predictions To summarize and describe To infer conclusions To test hypotheses and To summarize and What is the primary goal of about the future based the main features of a about a population based relationships between describe the main descriptive statistics? on past data dataset on a sample variables features of a dataset Stats Which measure of central tendency is most affected by extreme outliers? Mean Median Mode Range Mean Stats Which of the following is a measure of variability in a dataset? Mean Mode Range Mode Range Stats Which of the following measures is the most resistant to outliers? Median Mean Mode Range Median Stats If a dataset has a positively skewed distribution, which measure of central tendency is likely to be the largest? Mean Median Mode Range Mean Stats How would you define the To make predictions To summarize and describe To infer conclusions To test hypotheses and To summarize and primary objective of descriptive about the future based the main features of a about a population based relationships between describe the main statistics? on past data dataset on a sample variables features of a dataset Stats Which measure of central tendency is particularly influenced by extreme outliers? Mean Median Mode Range Mean Stats Which metric assesses the variability within a dataset? Mean Mode Range Mode Range Stats Which measure of central tendency is the most robust against outliers? Median Mean Mode Range Median Stats In a positively skewed distribution, which measure of central tendency tends to be the largest? Mean Median Mode Range Mean Stats Which EDA technique is used to identify and visualize outliers in a Dataset? Bar Chart Histogram Box Plot Scatter Plot Box Plot EDA In EDA, what does the term The symmetry of a data The shape of a data The shape of a data "skewness" refer to? 
distribution The spread of data distribution The presence of outliers distribution EDA Which technique in EDA is utilized to identify and visualize outliers within a dataset? Bar Chart Histogram Box Plot Scatter Plot Box Plot EDA Within EDA, how is "skewness" The symmetry of a data The shape of a data The shape of a data defined? distribution The spread of data distribution The presence of outliers distribution EDA To display What is the purpose of a To visualize the To display relationships To calculate summary relationships between correlation matrix in EDA? distribution of data between variables To identify missing values statistics variables EDA What is the purpose of a correlation matrix in To display Exploratory Data Analysis To visualize the To display relationships To calculate summary relationships between (EDA)? distribution of data between variables To identify missing values statistics variables EDA What is the purpose of the xlim() and ylim() functions in To set the background To set the x and y limits To add annotations to a To set the x and y Matp Matplotlib? To set the size of a plot color of a plot of a plot plot limits of a plot lotlib Set the background color of the legend using the ______ Matp function in Matplotlib. bgcolor() set_bgcolor() facecolor() set_facecolor() set_facecolor() lotlib Matplotlib is designed to be as Matp usable as ___ SciPy MATLAB AI All of the above MATLAB lotlib Which method/attribute is used to check the installed version of Matp matplotlib? __version__ version() Both A and B None of the above __version__ lotlib Which argument keyword can be used to change the style of Matp the plotted line? line lines linestyles linestyle linestyle lotlib Which plot is also known as the Matp 'Whisker plot' in Matplotlib? Bar Pie Histogram Box plot Box plot lotlib Which is used for plotting a Matp horizontal line? 
hline() ahline() xhline() axhline() axhline() lotlib What is the function of the xlim() and ylim() functions in To set the background To set the x and y limits To add annotations to a To set the x and y Matp Matplotlib? To set the size of a plot color of a plot of a plot plot limits of a plot lotlib Specify the function in Matplotlib used to adjust the Matp background color of the legend. bgcolor() set_bgcolor() facecolor() set_facecolor() set_facecolor() lotlib How is Matplotlib designed to Matp function in terms of usability? SciPy MATLAB AI All of the above MATLAB lotlib What method or attribute can be employed to verify the Matp installed version of Matplotlib? __version__ version() Both A and B None of the above __version__ lotlib Which keyword argument is employed to alter the style of Matp the plotted line? line lines linestyles linestyle linestyle lotlib Identify the plot also referred to as the 'Whisker plot' in Matp Matplotlib. Bar Pie Histogram Box plot Box plot lotlib What method is used to plot a Matp horizontal line? hline() ahline() xhline() axhline() axhline() lotlib To display histogram with well df.plot(type = 'hist' , edge df.plot(type = 'hist' , df.plot(type = 'hist' , line df.plot(type = 'hist' , df.plot(type = 'hist' , Matp defined edge we can write = 'red') edgecolor = 'red') = 'red') linecolor = 'red') edgecolor = 'red') lotlib Select the correct statement to plt.boxplot(data , vert = plt.boxplot(data , horiz = plt.boxplot(data , vert Matp display horizontal box plot plt.box(data , vert = False) plt.box(data , horiz = True) False) True) = False) lotlib What is the default color for Matp Matplotlib plots? Red Blue Green Black Blue lotlib Which is the correct code statement to draw a plot plt.plot(xpoints, ypoints, plt.plot(xpoints, ypoints, plt.plot(xpoints, Matp without a line? 
plt.plot(xpoints, ypoints) plt.plot(xpoints, ypoints, 0) False) 'o') ypoints, 'o') lotlib How can we generate a histogram with clearly defined df.plot(type = 'hist' , edge df.plot(type = 'hist' , df.plot(type = 'hist' , line df.plot(type = 'hist' , df.plot(type = 'hist' , Matp edges? = 'red') edgecolor = 'red') = 'red') linecolor = 'red') edgecolor = 'red') lotlib Choose the correct statement for displaying a horizontal box plt.boxplot(data , vert = plt.boxplot(data , horiz = plt.boxplot(data , vert Matp plot. plt.box(data , vert = False) plt.box(data , horiz = True) False) True) = False) lotlib What is the default color used Matp for Matplotlib plots? Red Blue Green Black Blue lotlib What is the correct code syntax for generating a plot without a plt.plot(xpoints, ypoints, plt.plot(xpoints, ypoints, plt.plot(xpoints, Matp line? plt.plot(xpoints, ypoints) plt.plot(xpoints, ypoints, 0) False) 'o') ypoints, 'o') lotlib What is the term known as on which the machine learning algorithms build a model based on sample data? Data Training Training Data Transfer Data None of the above Training Data ML Among the following options identify the one which is false It is used for the It is used for the It discovers casual It discovers casual regarding regression. prediction interpretation It related input to outputs relationships relationships ML Test set is used to test Test set is used to test the accuracy of the It is a set of data is used to the accuracy of the hypotheses generated by discover the potentially hypotheses generated What is ‘Test set’? the learner. predictive relationship. Both A & B None of above by the learner. 
ML Spam detection, Spam detection, Pattern Pattern detection, Which of the following are detection, Natural Image classification, Real- Autonomous car driving, Bioinformatics, Speech Natural Language supervised learning applications Language Processing time visual tracking Logistic optimization recognition Processing ML Axes, legends, and other What is the role of exploratory They are made for formal They are typically made details are clean and They are used in place of They are typically graphs in data analysis? presentations very quickly exactly detailed formal modeling made very quickly ML Spam detection, Which of the following Spam detection, Pattern Pattern detection, represent applications of detection, Natural Image classification, Real- Autonomous car driving, Bioinformatics, Speech Natural Language supervised learning? Language Processing time visual tracking Logistic optimization recognition Processing ML What is the term used to describe the process by which machine learning algorithms construct a model based on sample data? Data Training Training Data Transfer Data None of the above Training Data ML Which statement among the following is incorrect when It is used for the It is used for the It discovers casual It discovers casual considering regression? prediction interpretation It related input to outputs relationships relationships ML Test set is used to test Test set is used to test the accuracy of the It is a set of data is used to the accuracy of the Can you define the concept of a hypotheses generated by discover the potentially hypotheses generated 'Test set'? the learner. predictive relationship. Both A & B None of above by the learner. ML In pandas, Index values must Pand be? 
A) unique
B) hashable
C) Both A and B
D) None of the above
Answer: Both A and B
Topic: Pandas

Question: What is the output of the following code?
    import pandas as pd
    data = [['Anuj',21],['Rama',25],['Kapil',22]]
    df = pd.DataFrame(data,columns=['Name','Age'])
    print(df)
A) Name Age / 0 Anuj 21 / 1 Rama 25 / 2 Kapil 22
B) Name Age / 0 Anuj 21 / 1 Kapil 22 / 2 Rama 25
C) Name Age / 0 Kapil 22 / 1 Rama 25 / 2 Anuj 21
D) Name Age / 0 Rama 25 / 1 Anuj 21 / 2 Kapil 22
Answer: Name Age / 0 Anuj 21 / 1 Rama 25 / 2 Kapil 22
Topic: Pandas

Question: Pandas Series is size __________ and value ___________
A) Mutable, Mutable
B) Immutable, Immutable
C) Immutable, Mutable
D) Mutable, Immutable
Answer: Immutable, Mutable
Topic: Pandas

Question: Anshuman wants to create a series named ‘S1’. He has written the following statements. His friend Shubham checked the code and said that one of the statements given below does not work. As a friend of Anshuman, help him find the incorrect code.
A) S1=pd.Series(data=[11, 12, 5, 6,9],index=[1,2,3,4,5])
B) S1=pd.Series([11, 12, 5, 6,9],index=[1,2,3,4,5])
C) S1=pd.Series([11, 12, 5, 6,9],[1,2,3,4,5])
D) S1=pd.Series(data=[11, 12, 5, 6,9], [1,2,3,4,5])
Answer: S1=pd.Series(data=[11, 12, 5, 6,9], [1,2,3,4,5])
Topic: Pandas

Question: Which of the following is a parameter of the Series( ) function?
A) data
B) index
C) dtype
D) All of the above
Answer: All of the above
Topic: Pandas

Question: By default, Series have _______ data labels starting from ___.
A) character, ‘a’
B) numeric, one
C) numeric, zero
D) character, zero
Answer: numeric, zero
Topic: Pandas

Question: Choose the correct statement. Statement1: A NumPy array requires homogeneous data. Statement2: A Pandas DataFrame can have heterogeneous data.
A) Statement1 is correct
B) Statement2 is correct
C) Both the statements are correct
D) Both the statements are wrong
Answer: Both the statements are correct
Topic: Pandas

Question: Pandas DataFrame is size ________ and value ________
A) Mutable, Mutable
B) Immutable, Immutable
C) Immutable, Mutable
D) Mutable, Immutable
Answer: Mutable, Mutable
Topic: Pandas

Question: What is a Series in Pandas?
A) A one-dimensional labeled array capable of holding data of any type
B) A two-dimensional labeled array capable of holding data of any type
C) A three-dimensional labeled array capable of holding data of any type
D) A one-dimensional array without labels
Answer: A one-dimensional labeled array capable of holding data of any type
Topic: Pandas

Question: How do you rename a column in a DataFrame?
A) rename_column()
B) change_column_name()
C) df.rename(columns={'old_name': 'new_name'})
D) df.column_rename('old_name', 'new_name')
Answer: df.rename(columns={'old_name': 'new_name'})
Topic: Pandas

Question: Which of the following is NOT a type of plot available in Seaborn?
A) Line plot
B) Scatter plot
C) Bar plot
D) Tree plot
Answer: Tree plot
Topic: Pandas

Question: How do you apply a function element-wise to a DataFrame in Pandas?
A) Using a loop
B) Using the apply() function
C) Using list comprehension
D) Using the map() function
Answer: Using the apply() function
Topic: Pandas

Question: What does the resample() function do in Pandas?
A) Reshapes a DataFrame by sampling rows randomly
B) Reshapes a DataFrame by sampling columns randomly
C) Changes the frequency of time series data
D) Reverses the order of rows in a DataFrame
Answer: Changes the frequency of time series data
Topic: Pandas

Question: How can you apply a function to each element in a DataFrame in Pandas?
A) Using a loop
B) Using the apply() function
C) Using list comprehension
D) Using the map() function
Answer: Using the apply() function
Topic: Pandas

Question: What functionality does the resample() function provide in Pandas?
A) Reshapes a DataFrame by sampling rows randomly
B) Reshapes a DataFrame by sampling columns randomly
C) Changes the frequency of time series data
D) Reverses the order of rows in a DataFrame
Answer: Changes the frequency of time series data
Topic: Pandas

Question: What is the result of the following code?
    import pandas as pd
    s=pd.Series([1,2,3,4,5],index=['a','b','c','d','e'])
    print(s['f'])
A) KeyError
B) IndexError
C) Value Error
D) None of the above mentioned
Answer: KeyError
Topic: Pandas

Question: What is the result of the following code?
    Import pandas as pd
    Import numpy as np
    df = pd.DataFrame(np.array([[4,6,9], [5,1,3]]))
    print(df.shape)
A) SyntaxError : invalid syntax
B) KeyError
C) IndexError
D) None of the above mentioned
Answer: SyntaxError : invalid syntax
Topic: Pandas

Question: What are the different types of attributes?
A) Nominal
B) Ordinal
C) Spatial
D) All of the above
Answer: All of the above
Topic: Pandas

Question: Write the output of the following:
    import pandas as pd
    S1=pd.Series(data=[11, 12, None, 6,9,7],index=[1,12,3,4,2,4])
    print(S1.count())
A) 4
B) 6
C) 5
D) Error
Answer: 5
Topic: Pandas

Question: Which of the following statements returns a filtered result?
    import pandas as pd
    S1=pd.Series([11, 12, 5, 6,9])
    print(S1) #Statement 1
    print(S1>7) #Statement 2
    print(S1[S1>7]) #Statement 3
A) Statement 1
B) Statement 2
C) Statement 3
D) None of the above
Answer: Statement 3
Topic: Pandas

Question: What type of error is returned when the length of the index and the length of the data in the Series() function are not the same?
A) Key Error
B) Value Error
C) Syntax Error
D) Name Error
Answer: Value Error
Topic: Pandas

Question: Two common ways of accessing the elements of a series are _________ and _______
A) Indexing, Concatenation
B) Labelled Indexing, Positional Indexing
C) Indexing, Slicing
D) Slicing, Cutting
Answer: Indexing, Slicing
Topic: Pandas

Question: ‘data’ in the following code could be _____________
    S1 = pd.Series(data)
A) Python sequence
B) Scalar value
C) Python dictionary
D) All of the above
Answer: All of the above
Topic: Pandas

Question: What does the pivot_table() function do in Pandas?
A) Creates a pivot chart based on DataFrame data
B) Creates a pivot table from a DataFrame
C) Reshapes a DataFrame by pivoting the rows and columns
D) Combines multiple DataFrames into a single pivot table
Answer: Creates a pivot table from a DataFrame
Topic: Pandas

Question: What is the purpose of the to_csv() function in Pandas?
A) Converts a DataFrame to a comma-separated values (CSV) file
B) Converts a CSV file to a DataFrame
C) Converts a DataFrame to an Excel file
D) Converts a DataFrame to a JSON file
Answer: Converts a DataFrame to a comma-separated values (CSV) file
Topic: Pandas

Question: What is the purpose of the pivot_table() function in Pandas?
A) Creates a pivot chart based on DataFrame data
B) Creates a pivot table from a DataFrame
C) Reshapes a DataFrame by pivoting the rows and columns
D) Combines multiple DataFrames into a single pivot table
Answer: Creates a pivot table from a DataFrame
Topic: Pandas

Question: What task does the to_csv() function serve in Pandas?
A) Converts a DataFrame to a comma-separated values (CSV) file
B) Converts a CSV file to a DataFrame
C) Converts a DataFrame to an Excel file
D) Converts a DataFrame to a JSON file
Answer: Converts a DataFrame to a comma-separated values (CSV) file
Topic: Pandas

Question: What is the purpose of the transform() method in Scikit-learn?
A) To train a model using a given dataset
B) To make predictions using a trained model
C) To evaluate the performance of a model
D) To preprocess the data for modeling
Answer: To preprocess the data for modeling
Topic: Scikit-learn
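
The pivot_table() and to_csv() questions above can be checked with a short script. This is only a minimal sketch: the `sales` DataFrame and its column names are invented for illustration and do not come from the question bank.

```python
import pandas as pd

# Hypothetical sales records, invented purely for illustration.
sales = pd.DataFrame({
    "region": ["North", "North", "South", "South"],
    "product": ["A", "B", "A", "B"],
    "units": [10, 20, 30, 40],
})

# pivot_table() creates a pivot table from the DataFrame:
# one row per region, one column per product, summed units in the cells.
pivot = sales.pivot_table(index="region", columns="product",
                          values="units", aggfunc="sum")

# to_csv() converts the DataFrame to CSV; with no path argument it
# returns the CSV text instead of writing a file.
csv_text = sales.to_csv(index=False)

print(pivot)
print(csv_text)
```

Passing a file path to to_csv() would write the same text to disk; the string form is used here only so the result is easy to inspect.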

Question: The parameter ______ allows specifying the percentage of elements to put into the test/training set
A) test_size
B) training_size
C) All above
D) None of these
Answer: All above
Topic: Scikit-learn

Question: Describe the function of the transform() method in Scikit-learn.
A) To train a model using a given dataset
B) To make predictions using a trained model
C) To evaluate the performance of a model
D) To preprocess the data for modeling
Answer: To preprocess the data for modeling
Topic: Scikit-learn

Question: How does the parameter ____ facilitate the specification of the proportion of elements assigned to the test/training set?
A) test_size
B) training_size
C) All above
D) None of these
Answer: All above
Topic: Scikit-learn

Question: What does the standard deviation measure in a dataset?
A) The spread or dispersion of data points around the mean
B) The central value of the dataset
C) The total range of the dataset
D) The frequency of each data point in the dataset
Answer: The spread or dispersion of data points around the mean
Topic: Stats

Question: Which descriptive statistic gives the highest weight to outliers?
A) Mean
B) Median
C) Mode
D) Variance
Answer: Mean
Topic: Stats

Question: In a dataset, if the mean is greater than the median, what can you infer about the data distribution?
A) The distribution is positively skewed
B) The distribution is negatively skewed
C) The distribution is symmetric
D) The distribution cannot be determined from these statistics
Answer: The distribution is positively skewed
Topic: Stats

Question: Which of the following represents the fiftieth percentile, or the middle point in a set of numbers arranged in order of magnitude?
A) Mode
B) Median
C) Mean
D) Variance
Answer: Median
Topic: Stats

Question: What aspect of a dataset does the standard deviation quantify?
A) The spread or dispersion of data points around the mean
B) The central value of the dataset
C) The total range of the dataset
D) The frequency of each data point in the dataset
Answer: The spread or dispersion of data points around the mean
Topic: Stats

Question: Among descriptive statistics, which one assigns the greatest importance to outliers?
A) Mean
B) Median
C) Mode
D) Variance
Answer: Mean
Topic: Stats

Question: If the mean exceeds the median in a dataset, what inference can be made about its distribution?
A) The distribution is positively skewed
B) The distribution is negatively skewed
C) The distribution is symmetric
D) The distribution cannot be determined from these statistics
Answer: The distribution is positively skewed
Topic: Stats

Question: Which measure represents the fiftieth percentile, or the middle point, in a dataset ordered by magnitude?
A) Mode
B) Median
C) Mean
D) Variance
Answer: Median
Topic: Stats

Question: In a dataset, if the mean is 30, the median is 25, and the mode is 30, what can you infer about the distribution?
A) The distribution is positively skewed
B) The distribution is negatively skewed
C) The distribution is symmetric
D) The distribution cannot be determined from these statistics
Answer: The distribution is positively skewed
Topic: Stats

Question: What does the term "percentile" refer to in descriptive statistics?
A) A measure indicating the relative standing of a data point in a dataset
B) The mean of a dataset
C) The spread of data points in a dataset
D) The most frequently occurring value in a dataset
Answer: A measure indicating the relative standing of a data point in a dataset
Topic: Stats

Question: ___________ are used when you want to visually examine the relationship between two quantitative variables.
A) Bar graphs
B) Pie graphs
C) Line graphs
D) Scatterplots
Answer: Scatterplots
Topic: Stats

Question: Given a dataset with a mean of 30, a median of 25, and a mode of 30, what can be deduced about the distribution?
A) The distribution is positively skewed
B) The distribution is negatively skewed
C) The distribution is symmetric
D) The distribution cannot be determined from these statistics
Answer: The distribution is positively skewed
Topic: Stats

Question: What does the term "percentile" denote in descriptive statistics?
A) A measure indicating the relative standing of a data point in a dataset
B) The mean of a dataset
C) The spread of data points in a dataset
D) The most frequently occurring value in a dataset
Answer: A measure indicating the relative standing of a data point in a dataset
Topic: Stats

Question: What visual tool is utilized to explore the relationship between two quantitative variables?
A) Bar graphs
B) Pie graphs
C) Line graphs
D) Scatterplots
Answer: Scatterplots
Topic: Stats

Question: Point out the correct statement.
A) NumPy's main object is the homogeneous multidimensional array
B) In NumPy, dimensions are called axes
C) The NumPy array class is called ndarray
D) all of the mentioned
Answer: all of the mentioned
Topic: NumPy

Question: A NumPy array's dimensions are known as _____
A) ndimension
B) dimensions
C) axes
D) nsize
Answer: axes
Topic: NumPy

Question: The number of elements along each axis of an ndarray can be determined through ndarray's _____
A) Shape
B) Rank
C) Itemsize
D) Size
Answer: Shape
Topic: NumPy

Question: Identify the correct statement.
A) NumPy's main object is the homogeneous multidimensional array
B) In NumPy, dimensions are called axes
C) The NumPy array class is called ndarray
D) all of the mentioned
Answer: all of the mentioned
Topic: NumPy

Question: The dimensions of a NumPy array are referred to as _____.
A) ndimension
B) dimensions
C) axes
D) nsize
Answer: axes
Topic: NumPy

Question: Ar[2:7:2] will produce the result as _____
A) array([6,10,14])
B) array([10,14,16])
C) array([2,6,10])
D) array([4,6,10])
Answer: array([6,10,14])
Topic: NumPy

Question: What is the output of the following code?
    import pandas as pd
    import numpy as np
    s= pd.Series(np.random.randn(4))
    print(s.ndim)
A) 0
B) 1
C) 2
D) 3
Answer: 1
Topic: NumPy

Question: In EDA, what is the term for a graphical representation that displays the distribution of a continuous variable?
A) Pie Chart
B) Histogram
C) Bar Chart
D) Scatter Plot
Answer: Histogram
Topic: EDA

Question: Which of the following graphs can be used for simple summarization of data?
A) Scatterplot
B) Overlaying
C) Barplot
D) All of the mentioned
Answer: Barplot
Topic: EDA

Question: Color and shape are used to add dimensions to graph data.
A) TRUE
B) FALSE
Answer: TRUE
Topic: EDA

Question: In EDA, what term is used for a visual representation that illustrates the spread of a continuous variable?
A) Pie Chart
B) Histogram
C) Bar Chart
D) Scatter Plot
Answer: Histogram
Topic: EDA

Question: In EDA, which measure of central tendency is typically represented by the height of a box in a box plot?
A) Mean
B) Median
C) Mode
D) Variance
Answer: Median
Topic: EDA

Question: Which EDA technique is used to visualize the relationship between two continuous variables with a line connecting data points?
A) Scatter Plot
B) Histogram
C) Bar Chart
D) Box Plot
Answer: Scatter Plot
Topic: EDA

Question: Which of the following is NOT a common EDA technique for visualizing data distributions?
A) Box Plot
B) Histogram
C) Scatter Plot
D) Bar Chart
Answer: Bar Chart
Topic: EDA

Question: What is the primary purpose of a scatter plot in EDA?
A) To show the distribution of a single variable
B) To visualize relationships between variables
C) To display categorical data
D) To represent time series data
Answer: To visualize relationships between variables
Topic: EDA

Question: In EDA, what is the purpose of a Q-Q plot (Quantile-Quantile plot)?
A) To visualize the distribution of data
B) To compare two different datasets
C) To check for normality in data
D) To display time series data
Answer: To check for normality in data
Topic: EDA

Question: What is the primary purpose of a bar chart in EDA?
A) To visualize the distribution of data
B) To display relationships between variables
C) To compare multiple categories
D) To identify missing values
Answer: To compare multiple categories
Topic: EDA

Question: In EDA, which statistical metric is typically depicted by the height of a box in a box plot?
A) Mean
B) Median
C) Mode
D) Variance
Answer: Median
Topic: EDA

Question: Which EDA technique is employed to illustrate the relationship between two continuous variables with a line connecting data points?
A) Scatter Plot
B) Histogram
C) Bar Chart
D) Box Plot
Answer: Scatter Plot
Topic: EDA

Question: Among the following, which is NOT a common technique in EDA for visualizing data distributions?
A) Box Plot
B) Histogram
C) Scatter Plot
D) Bar Chart
Answer: Bar Chart
Topic: EDA

Question: What is the main objective of using a scatter plot in Exploratory Data Analysis (EDA)?
A) To show the distribution of a single variable
B) To visualize relationships between variables
C) To display categorical data
D) To represent time series data
Answer: To visualize relationships between variables
Topic: EDA

Question: What is the purpose of a Q-Q plot (Quantile-Quantile plot) in Exploratory Data Analysis (EDA)?
A) To visualize the distribution of data
B) To compare two different datasets
C) To check for normality in data
D) To display time series data
Answer: To check for normality in data
Topic: EDA

Question: What is the primary function of a bar chart in Exploratory Data Analysis (EDA)?
A) To visualize the distribution of data
B) To display relationships between variables
C) To compare multiple categories
D) To identify missing values
Answer: To compare multiple categories
Topic: EDA

Question: Which of the following does not visualize data?
A) Charts
B) Maps
C) Shapes
D) Graphs
Answer: Shapes
Topic: Matplotlib

Question: How can you change the color of a Matplotlib plot?
A) By using the color() function
B) By using the hue() function
C) By using the palette() function
D) By specifying the color parameter in the plot() function
Answer: By specifying the color parameter in the plot() function
Topic: Matplotlib

Question: How can you change the size of a Matplotlib plot?
A) By using the figure() function
B) By using the subplot() function
C) By using the size() function
D) By using the xlim() and ylim() functions
Answer: By using the figure() function
Topic: Matplotlib

Question: What is the purpose of the imshow() function in Matplotlib?
A) To display images
B) To create line plots
C) To create scatter plots
D) To create bar charts
Answer: To display images
Topic: Matplotlib

Question: Which Matplotlib function is used to set the title in a graph?
A) graph_title()
B) graph_titles()
C) titles()
D) title()
Answer: title()
Topic: Matplotlib

Question: Which is the correct import statement to import pyplot?
A) import matplotlib.pyplot as plt
B) import pyplot from matplotlib
C) Both A and B
D) None of the above
Answer: import matplotlib.pyplot as plt
Topic: Matplotlib

Question: Which function is used to create histograms in matplotlib?
A) histograms()
B) histogram()
C) histgraph()
D) hist()
Answer: hist()
Topic: Matplotlib

Question: Among the options provided, which one does not represent a method of visualizing data?
A) Charts
B) Maps
C) Shapes
D) Graphs
Answer: Shapes
Topic: Matplotlib

Question: How do you alter the color of a Matplotlib plot?
A) By using the color() function
B) By using the hue() function
C) By using the palette() function
D) By specifying the color parameter in the plot() function
Answer: By specifying the color parameter in the plot() function
Topic: Matplotlib

Question: How can you adjust the size of a Matplotlib plot?
A) By using the figure() function
B) By using the subplot() function
C) By using the size() function
D) By using the xlim() and ylim() functions
Answer: By using the figure() function
Topic: Matplotlib

Question: Which Matplotlib function is responsible for setting the title in a graph?
A) graph_title()
B) graph_titles()
C) titles()
D) title()
Answer: title()
Topic: Matplotlib

Question: What is the appropriate import statement for importing pyplot?
A) import matplotlib.pyplot as plt
B) import pyplot from matplotlib
C) Both A and B
D) None of the above
Answer: import matplotlib.pyplot as plt
Topic: Matplotlib

Question: What function is employed to construct histograms in Matplotlib?
A) histograms()
B) histogram()
C) histgraph()
D) hist()
Answer: hist()
Topic: Matplotlib

Question: Which of the following types of chart is not supported by pyplot?
A) Histogram
B) Boxplot
C) Pie
D) All are correct
Answer: All are correct
Topic: Matplotlib

Question: The plot used to give a statistical summary is
A) Bar
B) Line
C) Histogram
D) Box Plot
Answer: Box Plot
Topic: Matplotlib

Question: Which of the following is the correct syntax to create a histogram with bins specified?
A) plt.hist(x, bins = 10)
B) plt.hist(x, bins = [10,11,12,13,14])
C) plt.hist(x, bins = range(10,15))
D) All are correct
Answer: All are correct
Topic: Matplotlib

Question: Which of the following is not a type of Matplotlib plot?
A) Line Plot
B) Scatter Plot
C) Pie Chart
D) Bar Chart
Answer: Pie Chart
Topic: Matplotlib

Question: What is the purpose of the legend() function in Matplotlib?
A) To label the x and y axes of a plot
B) To add a title to a plot
C) To add annotations to a plot
D) To label different lines or markers on a plot
Answer: To label different lines or markers on a plot
Topic: Matplotlib

Question: How to add labels to a plot in Matplotlib?
A) xlabel() and ylabel()
B) labelx() and labely()
C) x_label() and y_label()
D) labels()
Answer: xlabel() and ylabel()
Topic: Matplotlib

Question: Which function is used to draw points (markers) in a diagram?
A) write()
B) draw()
C) plot()
D) paint()
Answer: plot()
Topic: Matplotlib

Question: Which parameter is used to define the position/alignment of the plot title in the title() function?
A) pos
B) align
C) loc
D) All of the above
Answer: loc
Topic: Matplotlib

Question: Which type of chart is not available/supported in pyplot?
A) Histogram
B) Boxplot
C) Pie
D) All are correct
Answer: All are correct
Topic: Matplotlib

Question: What type of plot is typically used to present statistical summaries?
A) Bar
B) Line
C) Histogram
D) Box Plot
Answer: Box Plot
Topic: Matplotlib

Question: Which syntax is correct for creating a histogram with specified bins?
A) plt.hist(x, bins = 10)
B) plt.hist(x, bins = [10,11,12,13,14])
C) plt.hist(x, bins = range(10,15))
D) All are correct
Answer: All are correct
Topic: Matplotlib

Question: Among the following, which is not a category of plot available in Matplotlib?
A) Line Plot
B) Scatter Plot
C) Pie Chart
D) Bar Chart
Answer: Pie Chart
Topic: Matplotlib

Question: Explain the significance of the legend() function in Matplotlib.
A) To label the x and y axes of a plot
B) To add a title to a plot
C) To add annotations to a plot
D) To label different lines or markers on a plot
Answer: To label different lines or markers on a plot
Topic: Matplotlib

Question: What method is utilized to incorporate labels onto a plot in Matplotlib?
A) xlabel() and ylabel()
B) labelx() and labely()
C) x_label() and y_label()
D) labels()
Answer: xlabel() and ylabel()
Topic: Matplotlib

Question: Which function is utilized to plot points (markers) on a diagram?
A) write()
B) draw()
C) plot()
D) paint()
Answer: plot()
Topic: Matplotlib

Question: Which parameter determines the position or alignment of the plot title in the title() function?
A) pos
B) align
C) loc
D) All of the above
Answer: loc
Topic: Matplotlib

Question: ML is a field of AI consisting of learning algorithms that?
A) Improve their performance
B) At executing some task
C) Over time with experience
D) All of the above
Answer: All of the above
Topic: ML

Question: Identify the kind of learning algorithm for “facial identities for facial expressions”.
A) Prediction
B) Recognition patterns
C) Recognizing anomalies
D) Generating patterns
Answer: Recognition patterns
Topic: ML

Question: Identify the type of learning in which labeled training data is used.
A) Semi unsupervised learning
B) Supervised learning
C) Reinforcement Learning
D) Unsupervised learning
Answer: Supervised learning
Topic: ML

Question: What characteristics define Machine Learning?
A) Machine Learning (ML) is that field of computer science
B) ML is a type of artificial intelligence that extracts patterns out of raw data by using an algorithm or method.
C) The main focus of ML is to allow computer systems to learn from experience without being explicitly programmed or human intervention.
D) All of the above
Answer: All of the above
Topic: ML

Question: Determine the learning algorithm utilized for "identifying facial expressions from facial identities."
A) Prediction
B) Recognition patterns
C) Recognizing anomalies
D) Generating patterns
Answer: Recognition patterns
Topic: ML

Question: Name the type of learning that involves the utilization of labeled training data.
A) Semi unsupervised learning
B) Supervised learning
C) Reinforcement Learning
D) Unsupervised learning
Answer: Supervised learning
Topic: ML

Question: If there is only a discrete number of possible outcomes (called categories), the process becomes a ______.
A) Regression
B) Classification
C) Modelfree
D) categories
Answer: Classification
Topic: ML

Question: When there are only a finite number of potential outcomes (referred to as categories), the process is termed a ______.
A) Regression
B) Classification
C) Modelfree
D) categories
Answer: Classification
Topic: ML

Question: To create an ndarray from a Python list, which of the following functions would you use?
A) Numpy.reshape()
B) numpy.reshape()
C) Numpy.array()
D) numpy.array()
Answer: numpy.array()
Topic: NumPy

Question: NumPy arrays can be ___.
A) Indexed
B) Sliced
C) Iterated
D) all of the mentioned
Answer: all of the mentioned
Topic: NumPy

Question: Using ndim we can find -
A) We can find the dimension of the array
B) Size of array
C) Operational activities on matrix
D) None of the mentioned above
Answer: We can find the dimension of the array
Topic: NumPy

Question: Which of the following can store different types of elements, i.e., heterogeneous elements in them?
A) List
B) DataFrame
C) Tuple
D) all of the mentioned
Answer: all of the mentioned
Topic: NumPy

Question: Which of the following can store only homogeneous elements (elements having the same data type)?
A) List
B) Ndarray
C) Tuple
D) DataFrame
Answer: Ndarray
Topic: NumPy

Question: When converting a Python list into an ndarray, which function would you utilize?
A) Numpy.reshape()
B) numpy.reshape()
C) Numpy.array()
D) numpy.array()
Answer: numpy.array()
Topic: NumPy

Question: Enumerate the potential operations that can be performed on NumPy arrays.
A) Indexed
B) Sliced
C) Iterated
D) all of the mentioned
Answer: all of the mentioned
Topic: NumPy

Question: The _____ function returns its argument with modified shape, whereas the _____ method modifies the array itself.
A) reshape,resize
B) resize,reshape
C) reshape2,resize
D) all of the mentioned
Answer: reshape,resize
Topic: NumPy

Question: Array([8,10,12,14]) is the result of array slice ____
A) Ar[3:7]
B) Ar[:5]
C) Ar[:7]
D) Ar[:-3]
Answer: Ar[3:7]
Topic: NumPy

Question: Array([10,12,14,16]) is the result of array slice ____
A) Ar[:4]
B) Ar[4:8]
C) Ar[4:]
D) Both b and c
Answer: Both b and c
Topic: NumPy

Question: From an ndarray numpy.arange(9.0), what can be the possible shapes of arrays that can be created using reshape()?
A) (9,1) , (3,3) , (1,9)
B) (3,3)
C) (9,1) , (1,9)
D) None of these
Answer: (9,1) , (3,3) , (1,9)
Topic: NumPy

Question: What is the output of the following code?
    import pandas as pd
    import numpy as np
    s= pd.Series(np.random.randn(2))
    print(s.size)
A) 0
B) 1
C) 2
D) 3
Answer: 2
Topic: NumPy

Question: The _____ function modifies the shape of its argument and returns it, while the _____ method alters the array directly.
A) reshape,resize
B) resize,reshape
C) reshape2,resize
D) all of the mentioned
Answer: reshape,resize
Topic: NumPy

Question: A DataFrame object is value mutable.
A) TRUE
B) FALSE
Answer: TRUE
Topic: Pandas

Question: Amongst which of the following is / are not correct to access individual item from dataframe 'df'
A) df.iat[2,2]
B) df.iat[2,2]
C) df.loc[2,2]
D) df.at[2,2]
Answer: df[0,0]
Topic: Pandas

Question: When you print/display any series, the leftmost column shows the _______ value.
A) index
B) Data
C) Value
D) None of the above
Answer: Index
Topic: Pandas

Question: Which of the following methods is used to subtract the two series?
A) subtract( )
B) subtraction( )
C) diff( )
D) sub( )
Answer: sub( )
Topic: Pandas

Question: Python libraries contain a collection of built-in _____________
A) Data
B) Modules
C) Packages
D) Data Structure
Answer: Modules
Topic: Pandas

Question: Which of the following statements will modify the first three values of Series ‘S1’?
A) S1[0, 1, 2] = 100
B) S1[0 : 3] = 100
C) S1[ : 3] = 100
D) All of the above
Answer: All of the above
Topic: Pandas

Question: How do you select a single column in a DataFrame?
A) df[column_name]
B) df.select_column(column_name)
C) df.loc[:, column_name]
D) All of the above
Answer: All of the above
Topic: Pandas

Question: Which of the following methods is used to check for missing values in a DataFrame?
A) check_missing()
B) isna()
C) find_missing()
D) isnull()
Answer: isnull()
Topic: Pandas

Question: How do you drop rows with missing values from a DataFrame?
A) drop_missing()
B) remove_null()
C) dropna()
D) delete_null()
Answer: dropna()
Topic: Pandas

Question: Which method is used to perform string operations on a DataFrame column in Pandas?
A) str_ops()
B) apply_str()
C) str accessor
D) string_ops()
Answer: str accessor
Topic: Pandas

Question: Which method facilitates performing string operations on a DataFrame column in Pandas?
A) str_ops()
B) apply_str()
C) str accessor
D) string_ops()
Answer: str accessor
Topic: Pandas

Question: What value does the leftmost column display when printing/displaying any series?
A) index
B) Data
C) Value
D) None of the above
Answer: Index
Topic: Pandas

Question: What is the result of the following code?
    import pandas as pd
    pd.Series([1,2], index=['a','b','c'])
A) Syntax Error
B) Index Error
C) Value Error
D) None of above mentioned
Answer: Value Error
Topic: Pandas

Question: A panel is a ___ container of data
A) 1D
B) 2D
C) 3D
D) Infinite
Answer: 3D
Topic: Pandas

Question: What does axis 1 in a panel represent?
A) minor_axis
B) major_axis
C) items
D) None of the above
Answer: major_axis
Topic: Pandas

Question: While performing mathematical operations on series, index matching is implemented and all missing values are filled in with ___________ by default
A) NaN
B) None
C) 0
D) 1
Answer: NaN
Topic: Pandas

Question: The ____________ statement will assign a name to the Series ‘S1’.
A) S1.name = "Empl"
B) S1_name = "Empl"
C) S1[name] = "Empl"
D) S1.indexname = "Empl"
Answer: S1.name = "Empl"
Topic: Pandas

Question: Roshan has written a few points about the iloc( ) function of Series in Python. His friend Suman told him that one of the written statements is not correct.
Help him to find the incorrect statement.
A) In iloc( ) method, we have to pass an integer index.
B) This method includes the last element of the range passed.
C) S1.iloc[3] will display fourth value of Series ‘S1’
D) S1.iloc[:3] will display first three values of Series ‘S1’
Answer: This method includes the last element of the range passed.
Topic: Pandas

Question: Raman performed addition of series ‘S1’ and ‘S2’ and stored the result in series ‘S3’. Both the series ‘S1’ and ‘S2’ have five mismatching index values. How many NaN will be there in ‘S3’?
A) 2
B) 3
C) 4
D) 5
Answer: 5
Topic: Pandas

Question: Series ‘S1’ has five values with index values (0, 1, 2, 3, 4) and series ‘S2’ has five values with index (2, 3, 4, 5, 6). What will be the total number of values in ‘S3’ if S3 = S1 + S2
A) 5
B) 6
C) 7
D) 8
Answer: 7
Topic: Pandas

Question: Rosy wants to display the series ‘S1’ in reverse order. Help her to find the correct code.
A) S1[ : : 1]
B) S1[ : : -1]
C) S1[ -1 : :]
D) S1[ : 1 1]
Answer: S1[ : : -1]
Topic: Pandas

Question: Can a Series have duplicate index values?
A) Yes
B) NO
C) Yes, Only series with integer values
D) Yes, Only series with character values
Answer: Yes
Topic: Pandas

Question: How many times will the value ‘10’ be displayed in the given series ‘S1’?
    import pandas as pd
    S1=pd.Series(10, index = range(1, 10, 3))
    print(S1)
A) 1
B) 3
C) 5
D) 4
Answer: 3
Topic: Pandas

Question: How many elements will there be in the given series ‘S1’?
    import pandas as pd
    S1=pd.Series('python practice')
    print(S1)
A) 0
B) 1
C) 2
D) 15
Answer: 1
Topic: Pandas

Question: Which of the following statements will return 10 values from the bottom/end of the Series ‘S1’?
A) S1.tail( )
B) S1.tail(10)
C) S1.head(10)
D) S1(10)
Answer: S1.tail(10)
Topic: Pandas

Question: Which method is used to perform element-wise mathematical operations between two DataFrames of the same shape in Pandas?
A) multiply()
B) add()
C) subtract()
D) All of the above
Answer: All of the above
Topic: Pandas

Question: What is the purpose of the fit() method in Scikit-learn?
A) To train a model using a given dataset
B) To make predictions using a trained model
C) To evaluate the performance of a model
D) To visualize the data using a plot
Answer: To train a model using a given dataset
Topic: Scikit-learn

Question: What is the purpose of the predict() method in Scikit-learn?
A) To train a model using a given dataset
B) To make predictions using a trained model
C) To evaluate the performance of a model
D) To visualize the data using a plot
Answer: To make predictions using a trained model
Topic: Scikit-learn

Question: What is the objective of the fit() method in Scikit-learn?
A) To train a model using a given dataset
B) To make predictions using a trained model
C) To evaluate the performance of a model
D) To visualize the data using a plot
Answer: To train a model using a given dataset
Topic: Scikit-learn

Question: Explain the purpose of the predict() method in Scikit-learn.
A) To train a model using a given dataset
B) To make predictions using a trained model
C) To evaluate the performance of a model
D) To visualize the data using a plot
Answer: To make predictions using a trained model
Topic: Scikit-learn

Question: Which library is used to create statistical graphics in Python?
A) Pandas
B) Numpy
C) Seaborn
D) Matplotlib
Answer: Seaborn
Topic: Seaborn

Question: Which of the following is a measure of central tendency?
A) Standard deviation
B) Variance
C) Median
D) Range
Answer: Median
Topic: Stats

Question: Which of the following is a measure of skewness in a dataset?
A) Kurtosis
B) Variance
C) Median
D) Range
Answer: Kurtosis
Topic: Stats

Question: Which descriptive statistic gives a measure of the "peakedness" of a dataset?
A) Kurtosis
B) Variance
C) Median
D) Range
Answer: Kurtosis
Topic: Stats

Question: What does the range of a dataset represent?
A) The difference between the largest and smallest values
B) The spread of data points around the mean
C) The most frequently occurring value
D) The middle value
Answer: The difference between the largest and smallest values
Topic: Stats

Question: In a symmetrical dataset, which measure of central tendency will be approximately equal?
A) Mean and median
B) Mean and mode
C) Median and mode
D) Mean, median, and mode
Answer: Mean and median
Topic: Stats

Question: In a perfectly symmetrical dataset, which measure of central tendency will be at the center?
A) Mean
B) Median
C) Mode
D) Variance
Answer: Mean
Topic: Stats

Question: Which measure of variability is often used to describe the spread of data in a bell-shaped, normal distribution?
A) Standard deviation
B) Range
C) IQR (Interquartile Range)
D) Variance
Answer: Standard deviation
Topic: Stats

Question: As a general rule, the _______ is the best measure of central tendency because it is more precise.
A) Mean
B) Median
C) Mode
D) Range
Answer: Mean
Topic: Stats

Question: Which of the following is NOT a measure of variability?
A) Median
B) Variance
C) Standard Deviation
D) Range
Answer: Median
Topic: Stats

Question: Identify a metric used as a measure of central tendency.
A) Standard deviation
B) Variance
C) Median
D) Range
Answer: Median
Topic: Stats

Question: What does skewness indicate in a dataset?
A) Kurtosis
B) Variance
C) Median
D) Range
Answer: Kurtosis
Topic: Stats

Question: How does a measure of kurtosis assess the "peakedness" of a dataset?
A) Kurtosis
B) Variance
C) Median
D) Range
Answer: Kurtosis
Topic: Stats

Question: What aspect of a dataset does its range signify?
A) The difference between the largest and smallest values
B) The spread of data points around the mean
C) The most frequently occurring value
D) The middle value
Answer: The difference between the largest and smallest values
Topic: Stats

Question: In a dataset with symmetrical distribution, which measure of central tendency tends to be roughly equivalent?
A) Mean and median
B) Mean and mode
C) Median and mode
D) Mean, median, and mode
Answer: Mean and median
Topic: Stats

Question: In a perfectly symmetrical dataset, which measure of central tendency occupies the central position?
A) Mean
B) Median
C) Mode
D) Variance
Answer: Mean
Topic: Stats

Question: Which measure of variability is commonly utilized to characterize the dispersion of data in a bell-shaped, normal distribution?
A) Standard deviation
B) Range
C) IQR (Interquartile Range)
D) Variance
Answer: Standard deviation
Topic: Stats

Question: As a general guideline, which measure of central tendency offers greater precision?
A) Mean
B) Median
C) Mode
D) Range
Answer: Mean
Topic: Stats

Question: Among the options provided, which one is not considered a measure of variability?
A) Median
B) Variance
C) Standard Deviation
D) Range
Answer: Median
Topic: Stats

Question: What does the interquartile range (IQR) measure in a dataset?
A) The spread of data points around the median
B) The spread of data points around the mean
C) The spread of data points around the mode
D) The total range of the dataset
Answer: The spread of data points around the median
Topic: Stats

Question: In descriptive statistics, what does the "variance" measure?
A) The spread or dispersion of data points around the mean
B) The central value of the dataset
C) The total range of the dataset
D) The frequency of each data point in the dataset
Answer: The spread or dispersion of data points around the mean
Topic: Stats

Question: Approximately what percentage of scores fall within one standard deviation of the mean in a normal distribution?
A) 34%
B) 95%
C) 99%
D) 68%
Answer: 68%
Topic: Stats

Question: What aspect of a dataset does the interquartile range (IQR) evaluate?
A) The spread of data points around the median
B) The spread of data points around the mean
C) The spread of data points around the mode
D) The total range of the dataset
Answer: The spread of data points around the median
Topic: Stats

Question: In descriptive statistics, what concept does "variance" quantify?
A) The spread or dispersion of data points around the mean
B) The central value of the dataset
C) The total range of the dataset
D) The frequency of each data point in the dataset
Answer: The spread or dispersion of data points around the mean
Topic: Stats

Question: Approximately, what percentage of scores lie within one standard deviation of the mean in a normal distribution?
A) 34%
B) 95%
C) 99%
D) 68%
Answer: 68%
Topic: Stats

Question: Which type of neural network layer is responsible for combining inputs through weighted summation?
A) Fully connected layer
B) Dropout layer
C) Normalization layer
D) Activation layer
Answer: A
Topic: Artificial Neural Networks

Question: Which layer in a convolutional neural network (CNN) is responsible for feature extraction?
A) Convolutional layer
B) Pooling layer
C) Fully connected layer
D) Activation layer
Answer: A
Topic: Artificial Neural Networks

Question: In a feedforward neural network, what does each layer consist of?
A) Neurons
B) Weights
C) Activation functions
D) All of the above
Answer: D
Topic: Artificial Neural Networks

Question: Which of the following is a commonly used activation function in neural networks?
A) ReLU
B) Sigmoid
C) Tanh
D) All of the above
Answer: D
Topic: Artificial Neural Networks

Question: Which technique is used to prevent overfitting in neural networks?
A) Dropout
B) Batch Normalization
C) Weight regularization
D) All of the above
Answer: D
Topic: Artificial Neural Networks

Question: Which distance metric is commonly used in K-Means clustering?
A) Euclidean
B) Manhattan
C) Cosine
D) All of the above
Answer: A
Topic: K-Means Clustering

Question: Which method is commonly used to determine the optimal number of clusters in K-Means clustering?
A) Elbow method
B) Silhouette method
C) Hopkins statistic
D) All of the above
Answer: A
Topic: K-Means Clustering

Question: Which of the following is NOT a common distance metric used in K-Means clustering?
A) Mahalanobis
B) Chebyshev
C) Hamming
D) All of the above
Answer: C
Topic: K-Means Clustering

Question: Which type of dataset is K-Means clustering unsuitable for?
A) Non-convex
B) Linearly separable
C) Homogeneous
D) All of the above
Answer: A
Topic: K-Means Clustering

Question: Which method is commonly used to select the number of principal components in PCA?
A) Elbow method
B) Scree plot method
C) Silhouette method
D) All of the above
Answer: B
Topic: Principal Component Analysis

Question: Which of the following is a possible application of PCA?
A) Image compression
B) Text classification
C) Speech recognition
D) All of the above
Answer: D
Topic: Principal Component Analysis
Which of the following methods can help improve the performance of K-Means clustering on non-spherical clusters? | Using a different distance metric | Increasing the number of clusters | Applying feature scaling | All of the above | D | K-Means Clustering
What is the primary purpose of the loss function in a neural network? | Measures the difference between predicted and actual values | Updates the weights and biases | Applies the activation function | Regularizes the network | A | Artificial Neural Networks
In K-Means clustering, what happens if a data point is equidistant from two or more cluster centroids? | Assigned to the cluster with the lowest index | Assigned based on class labels | Assigned randomly | Assigned to the first cluster | A | K-Means Clustering
Which of the following is NOT a limitation of PCA? | Assumes linear relationships between variables | Requires a large amount of data | Assumes normally distributed data | Sensitive to outliers | C | Principal Component Analysis
How does K-Means clustering determine the number of clusters? | Specified by the user | By maximizing silhouette score | Automatically using an algorithm | Based on class labels | A | K-Means Clustering
Which method is used to update the weights and biases in a neural network during training? | Gradient Descent | Activation function | Forward pass | Backpropagation | D | Artificial Neural Networks
In K-Means clustering, how are cluster centroids initially chosen? | Randomly | Based on the smallest cluster | Based on class labels | Based on the largest cluster | A | K-Means Clustering
How are principal components ordered in PCA? | Based on the amount of variance they capture | Randomly | Based on their index | Based on their eigenvalues | A | Principal Component Analysis
How does PCA handle multicollinearity in the dataset? | By creating uncorrelated variables | By removing correlated variables | By combining correlated variables | By replacing correlated variables | A | Principal Component Analysis
How does PCA reduce the dimensionality of the data? | By projecting data onto a lower-dimensional subspace | By removing redundant features | By increasing the number of features | By transforming data into a higher-dimensional space | A | Principal Component Analysis
What does the term "weight initialization" refer to in neural networks? | Setting initial values for weights | Adjusting weights and biases | Learning rate optimization | Calculating gradients | A | Artificial Neural Networks
What is the interpretation of the loadings in PCA? | Weights representing the contribution of each original feature to the principal component | Variance explained by each principal component | Importance of each feature in the dataset | Cluster assignment | A | Principal Component Analysis
Which method is used to reconstruct the original data from principal components in PCA? | Inverse transform | Direct transform | Residual analysis | Correlation analysis | A | Principal Component Analysis
Which method is commonly used to choose the initial cluster centroids in K-Means clustering? | K-Means++ | Random initialization | Hierarchical clustering | DBSCAN | A | K-Means Clustering
What is the effect of increasing the number of principal components in PCA? | Captures more variance in the data | Reduces the variance in the data | Makes the data less interpretable | Decreases computational complexity | A | Principal Component Analysis
Which factor does NOT influence the choice of the number of principal components in PCA? | Number of observations | Amount of variance explained | Desired level of interpretability | Dimensionality of the data | A | Principal Component Analysis
In a neural network, which term refers to the number of times the entire dataset is passed forward and backward through the network? | Epoch | Learning rate | Momentum | Dropout rate | A | Artificial Neural Networks
What is the relationship between eigenvalues and explained variance in PCA? | Eigenvalues represent the explained variance | Eigenvalues are inversely proportional to explained variance | Eigenvalues are equal to explained variance | Eigenvalues have no relation to explained variance | A | Principal Component Analysis
What is the maximum number of principal components that can be generated in PCA? | Equal to the number of original features | Less than or equal to the number of observations | Less than or equal to the number of original features | Equal to the number of observations | A | Principal Component Analysis
Which algorithm is often used as an initialization technique for K-Means clustering? | K-Means++ | DBSCAN | Agglomerative clustering | Hierarchical clustering | A | K-Means Clustering
Which type of data is K-Means clustering particularly suited for? | Numeric | Categorical | Text | Image | A | K-Means Clustering
What is the main drawback of PCA? | Increased computational complexity | Loss of interpretability | Loss of information | Increased overfitting | B | Principal Component Analysis
How does PCA handle multicollinearity in the dataset? | Combines correlated features into principal components | Increases dimensionality of the data | Removes correlated features | Ignores correlated features | A | Principal Component Analysis
What is the primary benefit of performing PCA before feeding data into a machine learning model? | Reducing computational complexity | Reducing overfitting | Improving interpretability | Increasing model accuracy | B | Principal Component Analysis
Which statistical assumption does PCA rely on? | Normality | Linearity | Independence of observations | Homoscedasticity | B | Principal Component Analysis
What happens if the eigenvalues in PCA are all very close to zero? | Indicates that the data is not suitable for PCA | Indicates that the data has high variance | Indicates that the data has low variance | Indicates that the data is categorical | A | Principal Component Analysis
How does K-Means clustering performance change when dealing with high-dimensional data? | It may degrade due to the curse of dimensionality | It improves due to feature redundancy | It remains unaffected | It converges faster | A | K-Means Clustering
How does K-Means clustering handle categorical data? | It requires conversion into numerical format | It treats categorical data as continuous | It ignores categorical data | It creates dummy variables | A | K-Means Clustering
Which clustering algorithm is computationally faster: K-Means or hierarchical clustering? | K-Means | Hierarchical | They have similar computational complexity | It depends on the size of the dataset | A | K-Means Clustering
What is the effect of feature scaling on PCA? | It ensures that all features contribute equally to variance | It reduces the need for normalization | It increases the number of principal components | It has no effect on PCA | A | Principal Component Analysis
What is the effect of scaling on K-Means clustering? | It can improve clustering performance | It may lead to biased results | It increases the number of clusters | It has no effect on clustering | A | K-Means Clustering
What is the effect of feature scaling on K-Means clustering? | It helps to equalize the influence of features | It reduces the number of clusters | It may distort the clusters | It accelerates convergence | A | K-Means Clustering
How does the computational complexity of K-Means clustering change with the number of clusters (k)? | It increases linearly | It increases exponentially | It decreases linearly | It remains constant | A | K-Means Clustering
In PCA, what does each principal component represent? | Linear combination of original features | Individual feature | Cluster centroid | Label of data points | A | Principal Component Analysis
What is the basic building block of a neural network? | Neuron | Weight | Activation function | Layer | A | Artificial Neural Networks
What is the primary building block of an artificial neural network (ANN)? | Neuron | Weight | Activation function | Layer | A | Artificial Neural Networks
What does the term "backpropagation" refer to in neural networks? | Adjusting weights and biases | Forward pass | Activation of neurons | Learning rate optimization | A | Artificial Neural Networks
What is the main drawback of K-Means clustering? | Sensitivity to initial centroids | Limited to linearly separable data | Requires labeled data | Slow convergence | A | K-Means Clustering
Which type of neural network is typically used for simple classification tasks? | Perceptron | RNN | CNN | LSTM | A | Artificial Neural Networks
What does the term "explained variance" refer to in PCA? | Amount of variance captured by the principal components | Difference between actual and predicted values | Importance of each feature in the dataset | Magnitude of eigenvalues | A | Principal Component Analysis
What is the objective of K-Means clustering? | Minimize the within-cluster sum of squares | Maximize the between-cluster sum of squares | Minimize the number of clusters | Maximize the number of iterations | A | K-Means Clustering
Which algorithm is often used as an alternative to K-Means clustering for non-convex clusters? | DBSCAN | Agglomerative clustering | Spectral clustering | Mean Shift | A | K-Means Clustering
What is the main function of the activation function in a neural network? | Introduces non-linearity to the model | Determines the number of neurons | Sets the learning rate | Measures the loss | A | Artificial Neural Networks
In K-Means clustering, what is the role of the inertia or within-cluster sum of squares? | Measures the compactness of the clusters | Measures the separation between clusters | Measures the number of clusters | Measures the noise level in the data | A | K-Means Clustering
What problem does the vanishing gradient problem refer to in neural networks? | Gradient becomes very small during training | Gradient explodes during training | Model becomes too complex to train | Model fails to converge | A | Artificial Neural Networks
What is the purpose of the input layer in a neural network? | Receives input data | Makes predictions | Applies activation function | None of the above | A | Artificial Neural Networks
What happens if K-Means clustering is run with a large number of clusters, close to the number of data points? | Each data point becomes its own cluster | Clusters merge into one | The algorithm crashes | None of the above | A | K-Means Clustering
Which step of the K-Means algorithm may result in empty clusters? | Initialization | Assignment | Update centroids | None of the above | C | K-Means Clustering
How does K-Means clustering deal with missing values in the dataset? | Treat missing values as a separate cluster | It's recommended to impute missing values before clustering | Ignore missing values | None of the above | B | K-Means Clustering
Which step of the K-Means algorithm involves updating the cluster centroids based on the mean of the data points assigned to each cluster? | Update centroids | Assignment | Initialization | None of the above | A | K-Means Clustering
What is the significance of eigenvalues in PCA? | Measure of variance captured by each principal component | Number of features in the dataset | Magnitude of eigenvectors | None of the above | A | Principal Component Analysis
In PCA, how is the percentage of explained variance calculated for each principal component? | Divide the eigenvalue by the sum of all eigenvalues | Divide the eigenvalue by the total variance | Multiply the eigenvalue by 100 | None of the above | A | Principal Component Analysis
What is the relationship between the number of principal components and the number of original features? | More principal components than original features | Fewer principal components than original features | Equal number of principal components and original features | None of the above | C | Principal Component Analysis
How does PCA handle missing values in the dataset? | Removes observations with missing values | Imputes missing values | Ignores missing values | None of the above | B | Principal Component Analysis
Which step of the K-Means algorithm involves assigning each data point to the nearest cluster centroid? | Assignment | Initialization | Update centroids | None of the above | A | K-Means Clustering
What is the primary disadvantage of the K-Means++ initialization method compared to random initialization? | Slower initialization time | More sensitive to outliers | Less accurate centroids | None of the above | A | K-Means Clustering
Which step of the K-Means algorithm can be computationally expensive for large datasets? | Assignment | Initialization | Update centroids | None of the above | C | K-Means Clustering
What is the major challenge of using the Elbow Method to determine the optimal number of clusters? | Subjectivity in selecting the "elbow point" | Computationally expensive | Sensitivity to outliers | None of the above | A | K-Means Clustering
Which step of the K-Means algorithm is responsible for updating the cluster membership of data points? | Assignment | Initialization | Update centroids | None of the above | A | K-Means Clustering
What is the purpose of scaling the data before performing PCA? | Ensures all features have equal importance | Prevents numerical instabilities | Reduces computational complexity | None of the above | B | Principal Component Analysis
In PCA, how is the amount of variance explained by each principal component calculated? | Eigenvalue of the principal component | Squared loading of each feature | Cumulative percentage of explained variance | None of the above | C | Principal Component Analysis
What is the interpretation of negative loadings in PCA? | Negative influence of features on the principal component | Negative correlation between features | Positive correlation between features | None of the above | A | Principal Component Analysis
What is the computational complexity of PCA? | O(n^3) | O(n^2) | O(n log n) | O(n) | B | Principal Component Analysis
Which matrix is diagonalized in PCA to obtain eigenvalues and eigenvectors? | Covariance matrix | Correlation matrix | Identity matrix | Orthogonal matrix | A | Principal Component Analysis
What does the term "eigenvector" represent in PCA? | Direction of maximum variance in the data | Magnitude of variance explained | Weighted combination of original features | Orthogonal transformation matrix | C | Principal Component Analysis
What is the impact of outliers on PCA? | Outliers may influence principal component directions | Outliers increase interpretability | Outliers do not affect PCA | Outliers decrease the number of principal components | A | Principal Component Analysis
Which of the following statements about PCA is true? | PCA can be applied to both numerical and categorical data | PCA requires data to be normally distributed | PCA can only be applied to high-dimensional data | PCA is not affected by outliers | A | Principal Component Analysis
How does PCA differ from Linear Discriminant Analysis (LDA)? | PCA is unsupervised, while LDA is supervised | PCA is supervised, while LDA is unsupervised | PCA reduces dimensionality, while LDA maximizes class separability | PCA maximizes class separability, while LDA reduces dimensionality | A | Principal Component Analysis
What is the primary difference between a perceptron and a multi-layer perceptron (MLP)? | Perceptron has one layer, MLP has multiple layers | Perceptron uses softmax activation, MLP uses ReLU | Perceptron is a type of CNN, MLP is a type of RNN | Perceptron is supervised, MLP is unsupervised | A | Artificial Neural Networks
Which layer in a neural network performs the element-wise activation function? | Activation layer | Convolutional layer | Fully connected layer | Pooling layer | A | Artificial Neural Networks
Which of the following is NOT a type of neural network layer? | Normalization layer | Dropout layer | Activation layer | Pooling layer | A | Artificial Neural Networks
What is the process of adjusting weights and biases in a neural network to minimize errors? | Training | Forward pass | Backpropagation | Prediction | C | Artificial Neural Networks
What is the primary purpose of the learning rate in training a neural network? | Controls the size of weight updates | Determines the number of layers | Activates neurons | Prevents overfitting | A | Artificial Neural Networks
What is the role of the learning rate in training a neural network? | Controls the size of weight updates | Determines the number of layers | Activates neurons | Prevents overfitting | A | Artificial Neural Networks
Which of the following is NOT a limitation of K-Means clustering? | Sensitive to initial cluster centroids | Struggles with non-linear data | Assumes spherical clusters | Prone to overfitting | D | K-Means Clustering
What is the main function of the bias in a neural network? | Shifts the activation function horizontally | Adds flexibility to the model | Provides additional capacity to learn | Helps prevent overfitting | C | Artificial Neural Networks
Which type of neural network is most suitable for time-series prediction? | Recurrent Neural Network (RNN) | Convolutional Neural Network (CNN) | Multilayer Perceptron (MLP) | Radial Basis Function Network (RBFN) | A | Artificial Neural Networks
Which type of neural network is best suited for sequential data processing? | Recurrent Neural Network (RNN) | Convolutional Neural Network (CNN) | Multilayer Perceptron (MLP) | Radial Basis Function Network (RBFN) | A | Artificial Neural Networks
Which type of neural network architecture is suitable for image recognition tasks? | Convolutional Neural Network (CNN) | Recurrent Neural Network (RNN) | Multilayer Perceptron (MLP) | Radial Basis Function Network (RBFN) | A | Artificial Neural Networks
What is the primary goal of Principal Component Analysis (PCA)? | Dimensionality reduction | Feature extraction | Classification | Regression | A | Principal Component Analysis
What is the purpose of the softmax function in a neural network? | Converts raw scores into probabilities | Speeds up the training process | Adds non-linearity to the model | Regularizes the network | A | Artificial Neural Networks
How many coefficients do you need to estimate in a simple linear regression model (one independent variable)? | 1 | 2 | 3 | 4 | 2 | LR
How many coefficients must you estimate in a simple linear regression model with one independent variable? | 1 | 2 | 3 | 4 | 2 | LR
How many variables are required to represent a linear regression model? | 3 | 2 | 1 | 4 | 3 | LR
How many variables are necessary to articulate a linear regression model? | 3 | 2 | 1 | 4 | 3 | LR
How is the Matthews Correlation Coefficient (MCC) calculated? | (TP + TN) / (TP + TN + FP + FN) | (TP * TN - FP * FN) / sqrt((TP + FP) * (TP + FN) * (TN + FP) * (TN + FN)) | (2 * Precision * Recall) / (Precision + Recall) | (TP / (TP + FP)) * (TP / (TP + FN)) | (TP * TN - FP * FN) / sqrt((TP + FP) * (TP + FN) * (TN + FP) * (TN + FN)) | Classification
Which evaluation metric is more appropriate when dealing with imbalanced datasets? | Accuracy | Precision | Recall | F1-score | F1-score | Classification
Which evaluation metric is more suitable for dealing with imbalanced datasets? | Accuracy | Precision | Recall | F1-score | F1-score | Classification
Which of the following is a disadvantage of using accuracy as an evaluation metric? | It does not consider the true negatives. | It is sensitive to class imbalance. | It is difficult to interpret. | It is not affected by misclassification errors. | It is sensitive to class imbalance. | Classification
The learner is trying to predict housing prices based on the size of each house. What type of regression is this? | Multivariate Logistic Regression | Logistic Regression | Linear Regression | Multivariate Linear Regression | Linear Regression | LR
What type of regression is it when the learner attempts to forecast housing prices based on each house's size? | Multivariate Logistic Regression | Logistic Regression | Linear Regression | Multivariate Linear Regression | Linear Regression | LR
In linear regression, what is the name of the line that best fits the data points? | Best-Fit Line | Regression Line | Trend Line | Correlation Line | Regression Line | LR
Which of the following metrics can be used to evaluate the performance of a binary classifier at various threshold levels? | Precision | Recall | ROC-AUC | F1-score | ROC-AUC | Classification
In practice, the line of best fit or regression line is found when _____________ | Sum of residuals (∑(Y - h(X))) is minimum | Sum of the absolute value of residuals (∑|Y - h(X)|) is maximum | Sum of the square of residuals (∑(Y - h(X))²) is minimum | Sum of the square of residuals (∑(Y - h(X))²) is maximum | Sum of the square of residuals (∑(Y - h(X))²) is minimum | LR
What triggers the discovery of the line of best fit or regression line in practice? | Sum of residuals (∑(Y - h(X))) is minimum | Sum of the absolute value of residuals (∑|Y - h(X)|) is maximum | Sum of the square of residuals (∑(Y - h(X))²) is minimum | Sum of the square of residuals (∑(Y - h(X))²) is maximum | Sum of the square of residuals (∑(Y - h(X))²) is minimum | LR
What is the difference between the residual and the predicted value in linear regression? | The residual represents the difference between the predicted and the actual values, while the predicted value is the value estimated by the regression equation. | The residual represents the difference between the actual and predicted values, while the predicted value is the value estimated by the regression equation. | The residual represents the value of the dependent variable, while the predicted value is the value of the independent variable. | The residual represents the slope of the regression line, while the predicted value represents the intercept. | The residual represents the difference between the predicted and the actual values, while the predicted value is the value estimated by the regression equation. | LR
When performing linear regression, what does it mean if the residuals exhibit heteroscedasticity? | The residuals have a non-linear relationship with the independent variables. | The residuals have a constant variance across all levels of the independent variables. | The residuals have a non-normal distribution. | The residuals have a varying variance across different levels of the independent variables. | The residuals have a varying variance across different levels of the independent variables. | LR
What does the relationship between residuals and predicted values signify in linear regression? | A relationship means our model is not good | A relationship means our model is good | Can't say | None of these | A relationship means our model is not good | Classification
What does a relationship between residuals and predicted values mean in linear regression? | A relationship means our model is not good | A relationship means our model is good | Can't say | None of these | A relationship means our model is not good | Classification
How is the Matthews Correlation Coefficient (MCC) calculated? | (TP + TN) / (TP + TN + FP + FN) | (TP * TN - FP * FN) / sqrt((TP + FP) * (TP + FN) * (TN + FP) * (TN + FN)) | (2 * Precision * Recall) / (Precision + Recall) | (TP / (TP + FP)) * (TP / (TP + FN)) | (TP * TN - FP * FN) / sqrt((TP + FP) * (TP + FN) * (TN + FP) * (TN + FN)) | Classification Evaluation Metrics
Within logistic regression, what is the output range produced by the sigmoid function? | [0, 1] | [-1, 1] | [0, ∞) | (-∞, ∞) | [0, 1] | Logistic Regression
In logistic regression, what is the output range of the sigmoid function? | [0, 1] | [-1, 1] | [0, ∞) | (-∞, ∞) | [0, 1] | Logistic Regression
What is the range of the R-squared value? | [0, 1] | [0, ∞) | [-1, 1] | (-∞, ∞) | [0, 1] | Regression Evaluation Metrics
What is the range of the Mean Absolute Error (MAE)? | [0, ∞) | [0, 1] | (-∞, ∞) | [-1, 1] | [0, ∞) | Regression Evaluation Metrics
What is the interpretation of an R-squared value of 0.8? | 80% of the variance in the dependent variable is explained by the independent variables | 80% of the observations are correctly predicted by the model | 80% of the observations fall within the confidence interval | 80% of the observations have the same value | 80% of the variance in the dependent variable is explained by the independent variables | Regression Evaluation Metrics
What does the term "forest" refer to in Random Forest? | A collection of decision trees | A single decision tree | A group of random variables | A set of rules for classification | A collection of decision trees | Random Forest
What does the term "forest" denote in Random Forest? | A collection of decision trees | A single decision tree | A group of random variables | A set of rules for classification | A collection of decision trees | Random Forest
What is a decision stump in decision trees? | A decision tree with a single decision node and two leaf nodes | A decision tree with a single leaf node | A decision tree with a single root node | A decision tree with multiple levels of nodes | A decision tree with a single decision node and two leaf nodes | Decision Tree
What is a decision tree? | A flowchart-like structure where each internal node represents a test on a feature, each branch represents the outcome of the test, and each leaf node represents a class label or a decision | A linear regression model | A clustering algorithm | A neural network architecture | A flowchart-like structure where each internal node represents a test on a feature, each branch represents the outcome of the test, and each leaf node represents a class label or a decision | Decision Tree
What is a decision tree in machine learning? | A flowchart-like tree structure where each internal node represents a decision based on a feature, each branch represents the outcome of the decision, and each leaf node represents the final decision or outcome | A linear model that uses decision boundaries to classify data points | A model that uses gradient descent to minimize the loss function | A clustering algorithm that groups similar data points together | A flowchart-like tree structure where each internal node represents a decision based on a feature, each branch represents the outcome of the decision, and each leaf node represents the final decision or outcome | Decision Tree
What does a decision tree signify in machine learning? | A flowchart-like tree structure where each internal node represents a decision based on a feature, each branch represents the outcome of the decision, and each leaf node represents the final decision or outcome | A linear model that uses decision boundaries to classify data points | A model that uses gradient descent to minimize the loss function | A clustering algorithm that groups similar data points together | A flowchart-like tree structure where each internal node represents a decision based on a feature, each branch represents the outcome of the decision, and each leaf node represents the final decision or outcome | Decision Tree
What is Random Forest? | A type of supervised learning algorithm | An unsupervised learning algorithm | A clustering algorithm | A reinforcement learning algorithm | A type of supervised learning algorithm | Random Forest
What does Random Forest entail? | A type of supervised learning algorithm | An unsupervised learning algorithm | A clustering algorithm | A reinforcement learning algorithm | A type of supervised learning algorithm | Random Forest
What commonly used evaluation metric is associated with logistic regression? | Mean Squared Error (MSE) | Accuracy | R-squared | F1-score | Accuracy | Logistic Regression
Which of the following measures is frequently employed for binary classification problems? | Mean Absolute Error (MAE) | Accuracy | R-squared | Root Mean Squared Error (RMSE) | Accuracy | Classification
Which of the following metrics is commonly used for binary classification problems? | Mean Absolute Error (MAE) | Accuracy | R-squared | Root Mean Squared Error (RMSE) | Accuracy | Classification
Which of the following metrics is commonly used for binary classification problems? | Mean Absolute Error (MAE) | Accuracy | R-squared | Root Mean Squared Error (RMSE) | Accuracy | Classification Evaluation Metrics
Which of the following evaluation metrics is commonly used for logistic regression? | Mean Squared Error (MSE) | Accuracy | R-squared | F1-score | Accuracy | Logistic Regression
Which of the following statements about accuracy is true? | Accuracy is suitable for imbalanced datasets. | Accuracy considers both false positives and false negatives. | Accuracy is calculated as the ratio of true positives to the total number of predictions. | Accuracy can be used for regression tasks. | Accuracy considers both false positives and false negatives. | Classification
Which of the subsequent assertions about accuracy is correct? | Accuracy is suitable for imbalanced datasets. | Accuracy considers both false positives and false negatives. | Accuracy is calculated as the ratio of true positives to the total number of predictions. | Accuracy can be used for regression tasks. | Accuracy considers both false positives and false negatives. | Classification
Which of the following statements about accuracy is true? | Accuracy is suitable for imbalanced datasets. | Accuracy considers both false positives and false negatives. | Accuracy is calculated as the ratio of true positives to the total number of predictions. | Accuracy can be used for regression tasks. | Accuracy considers both false positives and false negatives. | Classification Evaluation Metrics
Which of the following methods can be used to evaluate the performance of a Random Forest? | Accuracy, Precision, Recall, F1-score | Mean Squared Error (MSE) | R-squared | Silhouette Score | Accuracy, Precision, Recall, F1-score | Random Forest
Which of the following metrics is suitable for multiclass classification problems? | Precision | Recall | F1-score | All of the above | All of the above | Classification
Which of the following techniques can be used to prevent overfitting in decision trees? | Pruning | Feature selection | Regularization | All of the above | All of the above | Decision Tree
Which of the following metrics is suitable for multiclass classification problems? | Precision | Recall | F1-score | All of the above | All of the above | Classification Evaluation Metrics
Which of the following metrics is appropriate for multiclass classification problems? | Precision | Recall | F1-score | All of the above | All of the above | Classification
What is the criterion used to measure impurity in decision trees? | Gini impurity | Entropy | Misclassification error | All of the above | All of the above | Decision Tree
What criterion is employed to gauge impurity within decision trees? | Gini impurity | Entropy | Misclassification error | All of the above | All of the above | Decision Tree
What is the interpretation of an R-squared value of 1? | All of the variance in the dependent variable is explained by the independent variables | None of the variance in the dependent variable is explained by the independent variables | The model has perfect prediction | The model is underfitting | All of the variance in the dependent variable is explained by the independent variables | Regression Evaluation Metrics
What occurs with bias and variance as the size of training data increases in linear regression? | Bias increases and Variance increases | Bias decreases and Variance increases | Bias increases and Variance decreases | Can't Say | Bias increases and Variance decreases | Classification
What will happen with bias and variance as you increase the size of training data in linear regression? | Bias increases and Variance increases | Bias decreases and Variance increases | Bias increases and Variance decreases | Can't Say | Bias increases and Variance decreases | Classification
How does Random Forest handle class imbalance? | By adjusting class weights during training | By oversampling the minority class | By undersampling the majority class | By assigning equal weights to all classes | By adjusting class weights during training | Random Forest
How does Random Forest address class imbalance? | By adjusting class weights during training | By oversampling the minority class | By undersampling the majority class | By assigning equal weights to all classes | By adjusting class weights during training | Random Forest
How does Random Forest handle missing values? | By averaging the values from other trees | By replacing missing values with the mode | By ignoring observations with missing values | By imputing missing values based on other features | By ignoring observations with missing values | Random Forest
How does Random Forest manage missing values? | By averaging the values from other trees | By replacing missing values with the mode | By ignoring observations with missing values | By imputing missing values based on other features | By ignoring observations with missing values | Random Forest
In a simple linear regression model (one independent variable), if we change the input variable by 1 unit, how much will the output variable change? | by 1 | no change | by intercept | by its slope | by its slope | LR
In a simple linear regression model with one independent variable, how much does the output variable change when the input variable is adjusted by one unit? | by 1 | no change | by intercept | by its slope | by its slope | LR
How is the likelihood function optimized in logistic regression? | By minimizing the sum of squared residuals | By maximizing the sum of squared residuals | By maximizing the log-likelihood function | By minimizing the log-likelihood function | By maximizing the log-likelihood function | Logistic Regression
How is the likelihood function maximized in logistic regression? | By minimizing the sum of squared residuals | By maximizing the sum of squared residuals | By maximizing the log-likelihood function | By minimizing the log-likelihood function | By maximizing the log-likelihood function | Logistic Regression
How is information gain calculated in decision trees? | By measuring the decrease in impurity after a dataset is split on an attribute | By measuring the ratio of correctly classified instances to the total number of instances | By measuring the probability of incorrectly classifying a randomly chosen element | By measuring the amount of information gained by splitting a node | By measuring the decrease in impurity after a dataset is split on an attribute | Decision Tree
How is information gain computed within decision trees? | By measuring the decrease in impurity after a dataset is split on an attribute | By measuring the ratio of correctly classified instances to the total number of instances | By measuring the probability of incorrectly classifying a randomly chosen element | By measuring the amount of information gained by splitting a node | By measuring the decrease in impurity after a dataset is split on an attribute | Decision Tree
How does Random Forest handle noisy features? | By randomly selecting features for each tree | By removing noisy features during preprocessing | By using feature importance to filter out noisy features | By ignoring noisy features and relying on ensemble learning | By randomly selecting features for each tree | Random Forest
How does the decision tree handle categorical variables with multiple levels? | By splitting the variable into binary dummy variables | By assigning a numerical value to each level and treating it as a continuous variable | By ignoring the variable during the tree construction | By converting the variable into ordinal levels | By splitting the variable into binary dummy variables | Decision Tree
How does a decision tree handle categorical variables with multiple levels? | By splitting the variable into binary dummy variables | By assigning a numerical value to each level and treating it as a continuous variable | By ignoring the variable during the tree construction | By converting the variable into ordinal levels | By splitting the variable into binary dummy variables | Decision Tree
How does a decision tree address missing values? | By using surrogate splits to make decisions based on available data | By imputing missing values with the mean or median of the feature | By ignoring observations with missing values | By replacing missing values with a random value from the feature distribution | By using surrogate splits to make decisions based on available data | Decision Tree
How does a decision tree handle missing values? | By using surrogate splits to make decisions based on available data | By imputing missing values with the mean or median of the feature | By ignoring observations with missing values | By replacing missing values with a random value from the feature distribution | By using surrogate splits to make decisions based on available data | Decision Tree