0% found this document useful (0 votes)
277 views

TYCS - Data Science MCQ

The document contains a collection of multiple choice questions about topics related to data science and machine learning. The questions cover subjects like languages used in data science, machine learning algorithms and techniques, data types, data warehousing concepts, data mining processes, and more. In total there are 50 questions testing one's knowledge of fundamental data science concepts.

Uploaded by

SNEHAL AHER
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
277 views

TYCS - Data Science MCQ

The document contains a collection of multiple choice questions about topics related to data science and machine learning. The questions cover subjects like languages used in data science, machine learning algorithms and techniques, data types, data warehousing concepts, data mining processes, and more. In total there are 50 questions testing one's knowledge of fundamental data science concepts.

Uploaded by

SNEHAL AHER
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

TYCS - Data Science MCQ

Identify the language which is used in data science?


A. C++
B. Java
C. Python
D. Ruby

A column is a _________ representation of data.


A. Diagonal
B. Vertical
C. Horizontal
D. Top

Machine learning is a subset of which of the following.


A. Artificial Intelligence
B. Deep Learning
C. Reinforcement Learning
D. Data Cleaning

Data science is the process of diverse set of data through


A. Organizing data
B. Processing data
C. Analyzing data
D. All of the above

Structured Query Language is an example of


A. Structured Data
B. Unstructured Data
C. Semi-Structured Data
D. Tabular Data

Decision tree is a which type of machine learning algorithm


A. Semi-supervised Machine learning
B. Unsupervised Machine learning
C. Supervised Machine learning
D. Reinforcement Machine learning

K- means clustering is a type of machine learning algorithm?


A. Semi-supervised Machine learning
B. Unsupervised Machine learning
C. Supervised Machine learning
D. Reinforcement Machine learning

Which of the following are the applications of data science?


A. Risk detection
B. Image recognition
C. Speech recognition
D. All of the above

If you are predicting a value then which type of Machine Learning algorithm will you use?
A. Classification
B. Recommendation
C. Regression
D. Prediction

Data cleaning is defined as -


A. Large collection of data mostly stored in a computer system
B. The removal of noise errors and incorrect input from a database
C. The systematic description of the syntactic structure of a specific database. It
describes the structure of the attributes, the tables and foreign key relationships.
D. None of these

Which of the following is not a data type?


A. Symbolic Data
B. Alphanumeric Data
C. Numeric Data
D. Alphabetic Data

What is the difference between BI (Business intelligence) and Data science?


A. Data science deals with all types of data whereas BI deals with only structured
types of data
B. BI deals with all types of data whereas Data science deals with only structured types
of data
C. BI deals with only structured and unstructured types of data but not semi-structured
whereas Data science deals with only structured types of data
D. Data science deals with only structured and unstructured types of data but not
semi-structured whereas BI deals with only structured types of data

What do you mean by machine learning?


A. ML is a branch of science that deals with data and the processing of data
B. ML is the branch of AI (artificial intelligence) that give machines the power of
what a human can do
C. ML is the branch of AI (artificial intelligence) that only deals with computer programs
to make valuable insight from the data
D. ML is a branch of science that deals with physical machines

Which type of machine learning is defined by using only labeled data to predict some
outcome?
A. Semi-supervised Machine learning
B. Unsupervised Machine learning
C. Supervised Machine learning
D. Reinforcement Machine learning

Which type of machine learning is defined by using only unlabelled data to analyze the
data?
A. Semi-supervised Machine learning
B. Unsupervised Machine learning
C. Supervised Machine learning
D. Reinforcement Machine learning

Which type of machine learning is defined by a combination of labeled data and unlabeled
data to analyze the data?
A. Semi-supervised Machine learning
B. Unsupervised Machine learning
C. Supervised Machine learning
D. Reinforcement Machine learning

Which type of machine learning is feedback-based machine learning?


A. Semi-supervised Machine learning
B. Unsupervised Machine learning
C. Supervised Machine learning
D. Reinforcement Machine learning

Which type of data analysis gives a summary of the raw data set?
A. Descriptive data analysis
B. Diagnostic data analysis
C. Predictive data analysis
D. Prescriptive data analysis

Which type of data analysis focuses on the question "what might happen in the future" and
helps in making predictions about some sort of data?
A. Descriptive data analysis
B. Diagnostic data analysis
C. Predictive data analysis
D. Prescriptive data analysis

Identify the options below that a data warehouse can include.


A. Database Tables
B. Online Data
C. Flat Files
D. All of the above

Where is data warehousing used?


A. Transaction System
B. Logical System
C. Decision Support System
D. Analytical System

Choose the incorrect property of the data warehouse.


A. Data from heterogeneous Sources
B. Subject oriented
C. Time variant
D. Volatile

Who is responsible for running queries and reports against data warehouse tables?
A. Software
B. Hardware
C. End user
D. Middle ware

ETL stands for ____________


A. Efficient, transfer and load
B. Explain transfer and load
C. Extract transfer and load
D. Extract transform and load

What does OLTP stand for?


A. Offline Transaction Processing
B. Online Transaction Processing
C. Outline traffic processing
D. Online Transcription Processing

A goal of data mining includes which of the following?


A. To explain some observed event or condition
B. To confirm that data exists
C. To analyze data for expected relationships
D. To create a new data warehouse

A data warehouse is which of the following?


A. Can be updated by end users.
B. Contains numerous naming conventions and formats.
C. Organized around important subject areas.
D. Contains only current data.

Which of the following is an essential process in which the intelligent methods are applied to
extract data patterns?
A. Warehousing
B. Data Mining
C. Text Mining
D. Data Selection

What are the functions of Data Mining?


A. Association and correctional analysis classification
B. Prediction and characterization
C. Cluster analysis and Evolution analysis
D. All of the above

Which of the following statement is true about the classification?


A. It is a measure of accuracy
B. It is a subdivision of a set
C. It is the task of assigning a classification
D. None of the above

Which of the following statements is correct about data mining?


A. It can be referred to as the procedure of mining knowledge from data
B. Data mining can be defined as the procedure of extracting information from a set of
the data
C. The procedure of data mining also involves several other processes like data
cleaning, data transformation, and data integration
D. All of the above

Which one of the following can be defined as the data object which does not comply with the
general behavior (or the model of available data)?
A. Evaluation Analysis
B. Outlier Analysis
C. Classification
D. Prediction

Which one of the following statements is not correct about the data cleaning?
A. It refers to the process of data cleaning
B. It refers to the transformation of wrong data into correct data
C. It refers to correcting inconsistent data
D. All of the above

The analysis performed to uncover the interesting statistical correlation between associated
-attributes value pairs are known as the _______.
A. Mining of association
B. Mining of correlation
C. Mining of clusters
D. All of the above

Which of the following can be considered as the classification or mapping of a set or class
with some predefined group or classes?
A. Data set
B. Data Characterization
C. Data Sub Structure
D. Data Discrimination

Which one of the following correctly refers to the task of the classification?
A. A measure of the accuracy, of the classification of a concept that is given by a certain
theory
B. The task of assigning a classification to a set of examples
C. A subdivision of a set of examples into a number of classes
D. None of the above

Which of the following also used as the first step in the knowledge discovery process?
A. Data selection
B. Data cleaning
C. Data transformation
D. Data integration

Which of the following refers to the steps of the knowledge discovery process, in which the
several data sources are combined?
A. Data selection
B. Data cleaning
C. Data transformation
D. Data integration

Which one of the following issues must be considered before investing in data mining?
A. Compatibility
B. Functionality
C. Vendor consideration
D. All of the above

In certain cases, it is not clear what kind of pattern need to find, data mining
should_________:
A. Try to perform all possible tasks
B. Perform both predictive and descriptive task
C. It may allow interaction with the user so that he can guide the mining process
D. All of the above

You might also like