TDS ET With Solution PK
Sub-Section Number : 1
Sub-Section Id : 64065351871
Correct Marks : 0
Options :
6406531184609. Yes
6406531184610. No
Sub-Section Number : 2
Sub-Section Id : 64065351872
The dataset consists of geographic and demographic information about countries and their respective
GDPs. You would like to visualize this data and study the relationship between the location of
countries and their GDPs. You decide to use Power BI to visualize the dataset. But you would also
like to generate a summary of the data. Choose the most suitable answer among the given
options.
Options :
6406531184611. The summary can be generated using Quill and this is possible because Quill
can be used as an extension in Power BI.
Note: Quill is in Tableau, not Microsoft Power BI.
6406531184612. Quill can only be used for visualization. Therefore a summary of the dataset
cannot be generated.
6406531184613. Power BI does not support generation of summary. Therefore using other
visualization tools such as Tableau would work.
Note: MS Power BI can generate a summary, and so can Quill in Tableau; Quill is an extension of
Tableau for summary generation.
6406531184614. None of the options are appropriate for the generation of summary for the
given question.
Correct Marks : 1
Your project requires you to study the districts and their respective health indicators. You have a
shapefile with you that provides the required details. The objective of the project is to identify and
carve out districts that present high levels of health indicators. Choose the most suitable answer
among the given options.
Options :
6406531184615. QGIS can be used to create the shapefiles for districts with high levels of health
indicators.
Note: We can also create shapefiles based on conditions in QGIS.
6406531184616. While QGIS can be used to create shapefiles for the requirement, it cannot be
used to identify the districts with high levels of health indicators.
6406531184618. None of the options are suitable to meet the objectives of the project.
Correct Marks : 1
Question Label : Multiple Choice Question
What are the two outputs provided by the Excel Azure Machine Learning plugin?
Options :
6406531184621. Sentiment, Score
Note: Used for sentiment analysis; it gives Sentiment as neutral, positive or negative, and a
Score ranging from 0 to 1.
6406531184622. Score, Labels
Correct Marks : 1
Provided below is an incomplete code snippet that enables you to compute the distance between two
locations. Choose the most appropriate option that can be used in place of <missing line> to
compute the distance. Assume the coordinates of location 1 are stored in the variable “location1”
and the coordinates of location 2 are stored in the variable “location2”.
Options :
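The code snippet and its options did not survive in this copy, so the following is only a sketch of one way to compute such a distance, not the answer the question expects. It uses the haversine great-circle formula with the standard library alone; the function name haversine_km, the Earth radius constant and the sample coordinates are assumptions made for illustration.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an assumed constant


def haversine_km(location1, location2):
    """Great-circle distance in km between two (latitude, longitude) pairs in degrees."""
    lat1, lon1 = map(math.radians, location1)
    lat2, lon2 = map(math.radians, location2)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    a = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Hypothetical sample coordinates (roughly Chennai and Bengaluru).
location1 = (13.0827, 80.2707)
location2 = (12.9716, 77.5946)
print(round(haversine_km(location1, location2), 1))
```

The result is a straight-line distance over the Earth's surface, not a road distance.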
Correct Marks : 1
Provided below is a snippet of the code block of HTML tags from a website providing weather
forecast. Your goal is to scrape the high and low values for the 10-day temperature forecast.
Options :
6406531184627.
6406531184628.
6406531184630.
Correct Marks : 1
The dataset consists of year, annual cotton production, annual rainfall, loan interest rates and fuel
prices. You would like to compute the correlation coefficient between annual cotton production
and other variables in the dataset to analyze the effects of various variables on the target variable.
Choose the most suitable option among the following choices:
Options :
6406531184631. Excel cannot be used to compute correlation coefficients. Although we can use
excel to visualize the data using scatter plots to study the relationships.
6406531184632. The CORREL() function in Excel is not suitable for this analysis because it
doesn’t take more than two variables as inputs.
Note: The CORREL() function in Excel is suitable for this analysis; we can apply it to two
variables at a time, one pair after another.
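The pairwise approach can be sketched in Python as follows. This is a minimal illustration, not Excel's implementation: the pearson helper computes the usual Pearson correlation coefficient, and all column names and values are made up for the example.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Made-up illustrative values, not data from the question.
data = {
    "cotton_production": [10.0, 12.0, 14.0, 16.0, 18.0],
    "rainfall": [100.0, 120.0, 140.0, 160.0, 180.0],
    "interest_rate": [9.0, 8.0, 7.0, 6.0, 5.0],
    "fuel_price": [70.0, 72.0, 71.0, 75.0, 74.0],
}

# Correlate the target with every other variable, one pair at a time.
target = "cotton_production"
correlations = {
    name: pearson(data[target], values)
    for name, values in data.items()
    if name != target
}
for name, r in correlations.items():
    print(name, round(r, 3))
```

Each entry corresponds to one two-variable calculation, the same shape of call CORREL() accepts.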
Correct Marks : 1
Options :
6406531184635. Your choice of pandas-profiling library is not appropriate because it does not
provide information about outliers.
Correct Marks : 1
Which piece of code below will extract and display details of the 9 scheduled airlines in India?
Options :
Note: Instead of .text, .content will also work here.
6406531184638.
6406531184639.
6406531184640.
6406531184641.
Correct Marks : 1
Comicgen is a useful tool in narrating data stories using comics. Which of the following is not a
function of comicgen?
Options :
6406531184643. Comicgen provides options to custom create different comic characters and
their emotions and pose
6406531184644. Comicgen can be easily integrated into Google sheets or Excel to narrate your
data stories
6406531184645. You can type in your data story into comicgen to get your comic in return
Note: See the last lecture of Module 7 (narrating a story). The first three options can all be
performed in Comicgen; try it yourself.
Correct Marks : 1
Question Label : Multiple Choice Question
A very large matrix A has a lot of zero entries in it. Which function from the scipy library is useful in
efficient storage of such a matrix A?
Options :
6406531184646. compressed_mat
6406531184647. comp_mat
6406531184649. zip_mat
Correct Marks : 1
Which of the following libraries has functions and tools that are useful in the analysis of large
graphs?
Options :
6406531184652. numpy-network
6406531184653. pd-network
Correct Marks : 1
Options :
Correct Marks : 1
Which of the following libraries has functions extensively written to extract data from Wikipedia
pages?
Options :
6406531184658. BeautifulSoup
Self explanatory
6406531184659. wikimedia
6406531184660. wikipedia
6406531184661. wiki_scrape
Correct Marks : 1
A dataset provided to you has information about countries and respective populations. You plan to
visualize the data in Tableau using the map representation. But you are unable to do so because
the map representation is not activated for you to choose. What might be the issue? Provided
below is a snapshot of the dataset column names and types. Choose the most appropriate option
that would solve the problem.
Options :
6406531184663. We also need Latitude and Longitude information to activate the map
representation
Note: We have to convert the data type of columns such as country names to geographic data to use
them in maps.
6406531184665. The given information provided would not have caused any issues. It is
sufficient for map representation
Correct Marks : 1
Logical calculations in Tableau help to determine if a certain condition is true or false. Is the
following expression valid ?
Options :
6406531184666. TRUE
6406531184667. FALSE
Correct Marks : 1
__________ is helpful to understand the structure of (or inspect) a website before writing a scraping
script.
Options :
6406531184668. BeautifulSoup
6406531184669. Developer Tools
Note: We get developer tools through Inspect in the browser.
6406531184670. Airflow
6406531184671. Pycaret
Correct Marks : 1
__________ library has tools to get a webpage’s html contents into Python.
Options :
6406531184672. BeautifulSoup
6406531184673. numpy
6406531184674. requests
Note: The requests module allows you to send HTTP requests using Python. The HTTP request returns
a Response object with all the response data (content, encoding, status, etc.).
Correct Marks : 1
Which among the following code blocks will get you the latitude and longitude of “IIT Madras”?
Assume the Nominatim library is imported using the command given below:
Options :
6406531184676.
6406531184677.
6406531184678.
Note: As per the Colab file, we first create a locator with a user agent, then use the locator's
geocode method to find the location details.
6406531184679.
Correct Marks : 1
Options :
6406531184680. TRUE
6406531184681. FALSE
Correct Marks : 1
Options :
Correct Marks : 1
Options :
6406531184687. Correlation
Note: The y-axis of an autocorrelation plot shows correlation; see the week 4 lectures, where
moving averages (i.e. ARIMA) are taught.
6406531184688. Covariance
6406531184690. Variance
Correct Marks : 1
Which of the following tools cannot be used for anonymising the data?
Options :
6406531184692. Anonimatron
6406531184694. PowerBI
Note: PowerBI is used for data analysis, not anonymisation. The others are used for static and
dynamic anonymisation.
6406531184695. Amnesia
6406531184696. sdcMicro
Question Number : 387 Question Id : 640653357405 Question Type : MCQ Is Question
Mandatory : No Calculator : None Response Time : N.A Think Time : N.A Minimum Instruction
Time : 0
Correct Marks : 1
For a one-time anonymization, static anonymization is sufficient. Is this statement true or false?
Options :
6406531184697. TRUE
6406531184698. FALSE
Correct Marks : 1
We have a variable X, which can take values AA, BB, or CC. The first 4 values of this variable in a
dataset are CC, AA, BB, AA. The format of representing this information as shown in the table
below is called:
Options :
6406531184708. one-hot encoding
Note: The integer-encoded variable is removed and a new binary variable is added for each unique
integer value.
6406531184709. long format
6406531184710. integer
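The table from the question is not reproduced in this copy, so here is a small sketch of what a one-hot representation of the values CC, AA, BB, AA looks like. The one_hot helper is a hypothetical pure-Python stand-in written for illustration; it is not a library function.

```python
def one_hot(values):
    """One binary column per distinct category, in sorted category order."""
    categories = sorted(set(values))
    rows = [[1 if value == category else 0 for category in categories] for value in values]
    return categories, rows


categories, rows = one_hot(["CC", "AA", "BB", "AA"])
print(categories)
for row in rows:
    print(row)
```

Each row contains exactly one 1, in the column of the category that the original value belongs to.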
Question Number : 389 Question Id : 640653357410 Question Type : MCQ Is Question
Mandatory : No Calculator : None Response Time : N.A Think Time : N.A Minimum Instruction
Time : 0
Correct Marks : 1
We have a variable X, which can take values AA, BB, or CC. The first 4 values of this variable in a
dataset are CC, AA, BB, AA. This information is represented as shown below.
Options :
6406531184711. pandas.get_dummies
6406531184712. from sklearn.preprocessing import BinaryEncoder
6406531184713. import numpy as np
Note: One-hot encoding is done for categorical variables that are not ordinal. For ordinal
variables, we can use OrdinalEncoder. A label encoder can also be used here.
Correct Marks : 1
k-means is typically influenced by the start values. What option in sklearn.cluster.KMeans helps
reduce the impact?
Options :
6406531184715. verbose
Correct Marks : 1
A Pandas dataframe DF has a column named salary_range which contains the salary details of
10000 employees of a firm binned as medium, high, and very high. You are interested in finding out
the number of employees in each category of salary_range. Which of the following commands will
help you to achieve this goal?
Options :
6406531184719. DF[‘salary_range’].bin_count()
6406531184721. DF$‘salary_range.bin_count()
6406531184722. DF$‘salary_range.value_counts()
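The goal here is a per-category tally. As a rough sketch of that computation using only the standard library, the list below is a hypothetical stand-in for the salary_range column; in pandas itself, the value_counts() method on the column returns the same kind of counts.

```python
from collections import Counter

# Hypothetical stand-in for the salary_range column.
salary_range = ["medium", "high", "medium", "very high", "high", "medium"]

# Count how many employees fall into each category.
counts = Counter(salary_range)
for category, count in counts.most_common():
    print(category, count)
```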
Correct Marks : 1
Scikit-learn has a DecisionTreeClassifier module that is useful in building decision tree classifiers.
Suppose, our dataset is imbalanced in class. Which feature in the DecisionTreeClassifier() will help
us tackle this problem?
Options :
6406531184723. random_state
6406531184724. min_sample_split
6406531184725. class_balance
6406531184726. class_weight
Note: Setting class_weight="balanced" is what actually works for the given issue.
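To see what balancing does, the sketch below computes weights of the form n_samples / (n_classes * class_count), which is the formula the scikit-learn documentation gives for the "balanced" setting. The helper name and the labels are assumptions for illustration, and the code does not call scikit-learn.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weights of the form n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {label: n_samples / (n_classes * count) for label, count in counts.items()}


# Hypothetical imbalanced labels: 8 of class 0, 2 of class 1.
y = [0] * 8 + [1] * 2
weights = balanced_class_weights(y)
print(weights)
```

The rarer class receives the larger weight, so its samples count for more when the tree is fitted.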
Correct Marks : 1
Options :
Correct Marks : 1
We are interested in fitting an ARIMA model to our time series data. Specifically, we are interested
in a moving average model of 0, setting a lag value of 4 for autoregression, and a difference order
of 1. Which of the following gives you such a model?
Options :
6406531184731. ARIMA(..., trend = (4,1,0))
6406531184732. ARIMA(..., order = (4,1,0))
6406531184733. ARIMA(..., order = (0,4,1))
6406531184734. ARIMA(..., trend = (0,4,1))
Note: ARIMA is an acronym for “autoregressive integrated moving average.” It is a model used in
statistics and econometrics to measure events that happen over a period of time. An ARIMA
process (also known as a Box-Jenkins process) adds differencing to an ARMA process. An ARMA(p,q)
process with d-order differencing is called an ARIMA(p,d,q) process. Thus, for example, an
ARIMA(2,1,0) process is an AR(2) process with first-order differencing.
p = AR term
d = integrated term
q = MA term
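The "integrated" term d counts how many times the series is differenced before the AR and MA parts are fitted. The sketch below shows only that differencing step on a made-up series; the difference helper is hypothetical and this is not an ARIMA fit.

```python
def difference(series, d=1):
    """Apply d rounds of first-order differencing to a list of numbers."""
    for _ in range(d):
        series = [later - earlier for earlier, later in zip(series, series[1:])]
    return series


# Made-up series with a linear trend.
y = [3, 5, 7, 9, 11]
print(difference(y, d=1))
print(difference(y, d=2))
```

Each round of differencing shortens the series by one value, and one round is enough to remove a linear trend.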
Question Number : 395 Question Id : 640653357416 Question Type : MCQ Is Question
Mandatory : No Calculator : None Response Time : N.A Think Time : N.A Minimum Instruction
Time : 0
Correct Marks : 1
pycaret is a
Options :
Correct Marks : 1
subjectivity and polarity are two properties returned by the sentiment function of library:
Options :
6406531184739. TextBulb
6406531184740. NLPtext
6406531184741. TextBlob
6406531184742. NLP
Note: See the sentiment analysis lectures of week 5a for TextBlob usage:
from textblob import TextBlob
subjectivity = TextBlob(input_text).sentiment.subjectivity
polarity = TextBlob(input_text).sentiment.polarity
Correct Marks : 1
Correct Marks : 1
Options :
6406531184748. has a negative sentiment
Note: The polarity score ranges from -1 to 1 and tells whether the statement is bad or good:
negative if bad, positive otherwise.
6406531184749. is more of an opinion statement
Correct Marks : 1
You are working on a piece of code that classifies different fruits into its respective groups (citrus,
berries, melons, apples & pears, and tropical & exotic). Which of the following loss functions from
Keras would you pick for the task?
Options :
6406531184751. binary_crossentropy
6406531184754. mean_absolute_error
Question Number : 400 Question Id : 640653357421 Question Type : MCQ Is Question
Mandatory : No Calculator : None Response Time : N.A Think Time : N.A Minimum Instruction
Time : 0
Correct Marks : 1
Options :
6406531184755. builds a decision tree classifier and prints the accuracy of the classifier
Correct Marks : 1
6406531184760. helps reduce matrix space when there are a lot of zero entries in the matrix
6406531184761. helps reduce matrix space when there are a lot of negative entries in the
matrix
6406531184762. makes matrix multiplication more meaningful and powerful
Note: Eg: for the matrix
(5 0 0 0
 0 8 0 0
 0 0 3 0
 0 6 0 0)
the CSR representation is
V = [ 5 8 3 6 ]
COL_INDEX = [ 0 1 2 1 ]
ROW_INDEX = [ 0 1 2 3 4 ]
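The three CSR arrays in the example can be derived mechanically. The following is a minimal pure-Python sketch of that derivation; the to_csr helper is hypothetical and written for illustration. In scipy, a csr_matrix built from the same matrix exposes the corresponding arrays as its data, indices and indptr attributes.

```python
def to_csr(matrix):
    """Return (values, col_index, row_index) for a dense list-of-lists matrix."""
    values, col_index, row_index = [], [], [0]
    for row in matrix:
        for col, entry in enumerate(row):
            if entry != 0:
                values.append(entry)
                col_index.append(col)
        # row_index[i + 1] is the number of non-zeros seen up to the end of row i.
        row_index.append(len(values))
    return values, col_index, row_index


A = [
    [5, 0, 0, 0],
    [0, 8, 0, 0],
    [0, 0, 3, 0],
    [0, 6, 0, 0],
]
V, COL_INDEX, ROW_INDEX = to_csr(A)
print(V, COL_INDEX, ROW_INDEX)
```

Only the four non-zero entries are stored, along with their column positions and one row pointer per row plus a final end marker.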
Question Number : 402 Question Id : 640653357424 Question Type : MCQ Is Question
Mandatory : No Calculator : None Response Time : N.A Think Time : N.A Minimum Instruction
Time : 0
Correct Marks : 1
Options :
6406531184769. create dashboards for small scale projects
Note: It is an online tool for converting data into customizable, informative reports and
dashboards.
6406531184770. Edit photographs and videos
Correct Marks : 1
Which of the following tabs is used to identify API calls in the Inspect element in any browser?
Options :
6406531184771. Network
Note: We can see the API calls in the Network tab of Inspect. You may have learnt this in the
Developer tools lecture, which was optional, or in the web scraping lectures that cover BBC and
IMDB data. It is also taught in App Dev 1.
6406531184772. Elements
6406531184773. Console
6406531184774. Sources
Correct Marks : 1
Options :
6406531184775. Urllib
Note: urllib is a package that collects several modules for working with URLs, such as
urllib.request for opening and reading URLs.
6406531184776. BeautifulSoup
6406531184777. Requests
6406531184778. Pandas
Question Number : 405 Question Id : 640653357427 Question Type : MCQ Is Question
Mandatory : No Calculator : None Response Time : N.A Think Time : N.A Minimum Instruction
Time : 0
Correct Marks : 1
The final output from the BBC Weather Location Service API is in JSON format:
Options :
6406531184779. TRUE
Note: Almost all APIs return JSON, and so does the BBC Weather Location Service API.
6406531184780. FALSE
Correct Marks : 1
Which among the following Excel charts is the most suitable for detecting outliers in the data?
Options :
6406531184783. Box and Whisker chart
Note: Also called a box plot; we learnt in Statistics 1 that it is used to detect outliers in
the data.
6406531184784. Histogram
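A box plot flags points that fall beyond its whiskers. The sketch below applies the common 1.5 * IQR convention to made-up data; the helper name is hypothetical, and the quartile method used by Python's statistics module may differ slightly from the one Excel uses.

```python
import statistics


def iqr_outliers(data):
    """Return values outside the 1.5 * IQR whisker fences."""
    q1, _, q3 = statistics.quantiles(data, n=4)
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [x for x in data if x < lower or x > upper]


# Made-up values with one obvious outlier.
values = [12, 14, 15, 15, 16, 17, 18, 19, 20, 95]
outliers = iqr_outliers(values)
print(outliers)
```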
Correct Marks : 1
Provided below is a snapshot of the dataset which consists of movie reviews and respective labels.
To compute the sentiment scores the Azure Machine Learning add-in requires input and output
values. In the figure provided below the input and output cells need to be populated with
appropriate values to obtain sentiment scores.
Choose the most appropriate option that enables you to predict sentiment scores using the Excel
Azure Machine Learning add-in.
Options :
6406531184786.
Sub-Section Number : 3
Sub-Section Id : 64065351873
Correct Marks : 1
Options :
Sub-Section Number : 4
Sub-Section Id : 64065351874
Sub questions
Correct Marks : 1
Question Label : Multiple Choice Question
We are analyzing how much the number of lecture hours attended by students affects their exam
scores. Which Excel function would you use as a starting point in this analysis?
Options :
6406531184699. STDEV.P()
6406531184700. STDEV.S()
6406531184701. SLOPE()
Note: It gives the slope in the usual straight-line sense: for y = mx + c, it returns m.
6406531184702. EXACT()
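As a rough illustration of the slope calculation, the sketch below computes the least-squares slope m of y = mx + c. The argument order mirrors Excel's SLOPE(known_y's, known_x's), but the function is a stand-in written for this example and the data are made up.

```python
def slope(known_ys, known_xs):
    """Least-squares slope m of the line y = m*x + c."""
    n = len(known_xs)
    mean_x, mean_y = sum(known_xs) / n, sum(known_ys) / n
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(known_xs, known_ys))
    denominator = sum((x - mean_x) ** 2 for x in known_xs)
    return numerator / denominator


# Made-up data: lecture hours attended and exam scores.
hours = [2, 4, 6, 8, 10]
scores = [50, 60, 70, 80, 90]
m = slope(scores, hours)
print(m)
```

For this data the slope is the number of extra marks associated with each additional lecture hour attended.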
Correct Marks : 1
In the previous question, we plan to run a regression analysis after the preliminary analysis. Which
of the following features provide you with the capability to do this?
Options :