Exercise Chapter 2 Data Science

The document contains self-check exercises related to Data Science, including true-false statements and multiple-choice questions. It covers topics such as artificial intelligence, big data characteristics, data types, and the data life cycle. The exercises are designed to assess understanding of key concepts in data science and big data management.

Uploaded by

rajitesfaye034

Prepared by: Zerihun T.

Telegram Emerging Technology

Chapter 2 Data Science

Self-check Exercise II

Part I: True-False Items

1. Artificial Intelligence (AI) and Machine Learning (ML) can serve as tools for building
explanations (models) from big data and enabling prediction.
2. A large part of the data in organizations is unstructured.
3. The goal of most big data systems is to surface insights and connections from large volumes
of heterogeneous data.
4. Data science is defined as a multi-disciplinary field that uses scientific methods.
5. Conceptually, data is unprocessed raw facts, figures, symbols, etc.
6. Big data is represented by the "five Vs".
7. The big data life cycle consists of six major workflows.
8. Ingesting is the process of adding data into a big data system using different tools.
9. A precise definition of "big data" is available for potential users.
10. Data curation is performed by expert curators who are responsible for improving the
accessibility and quality of data.

Part II: Multiple-Choice Items

1. Which one of the following disciplines contributes to the field of Data Science?
A. Artificial Intelligence & Machine Learning
B. Programming
C. Statistics & strong quantitative background
D. Data Warehousing, Data Mining and Modeling
E. All of the above
2. Which of the following is not true about Data Science?
A. Data science extracts knowledge and insights from data
B. It finds useful patterns, connections, and relationships within data
C. It is concerned with building explanations and making predictions based on the data
D. It analyzed
E. None of the above
3. Information is --------
A. Processed data
B. Classified data
C. Data with meaning
D. Unprocessed facts and figures
E. All except D
4. Given that you insert your ATM card into an ATM machine, the input is ---------
A. The pin number that we write into the system
B. Checking of the pin number by the machine
C. The amount of money that you write into the system
D. A and C
E. Counting and giving the amount of money that you required
5. Given the same scenario of ATM transaction stated above, the process is -------
A. The pin number that we write into the system
B. Checking of the pin number by the machine
C. The amount of money that you write into the system
D. Counting and giving the amount of money that you required
E. C and D
6. Given the same scenario of ATM transaction stated above, the output is -------
A. The pin number that we write into the system
B. Checking of the pin number by the machine
C. The amount of money that you write into the system
D. Counting and giving the amount of money that you required
E. B and C
7. ----------- is a data type that conforms to a tabular format with relationships between the
different rows and columns
A. Semi-structured data
B. Structured data
C. Unstructured
D. Meta-data
E. All of the above
8. ------------- is information that either does not have a predefined data model or is not
organized in a predefined manner.
a. Semi-structured data
b. Structured data
c. Unstructured
d. Meta-data
e. All of the above
9. The big data that organizations face today is mostly
a. Semi-structured data
b. Structured data
c. Unstructured
d. Meta-data
e. None of the above
10. --------- is a form of structured data that does not conform to the formal structure of data
models but contains tags or other markers to separate semantic elements.
a. Semi-structured data
b. Structured data
c. Unstructured
d. Meta-data
e. None of the above
11. --------- is data about data that provides additional information about a specific set of data.
a. Semi-structured data
b. Structured data
c. Unstructured
d. Meta-data
e. None of the above
12. Which of the following is true about flexibility of structured, semi-structured and
unstructured data?
A. Structured data is not flexible since it is based on fixed or rigid schema
B. Semi-structured data is more flexible than structured data but less flexible than
unstructured data since the schema can be easily changed
C. Unstructured data is very flexible since there is absence of any schema
D. All of the above
E. None of the above
13. Which of the following big data value chain activities is concerned with exploring,
transforming and modeling data with the goal of highlighting relevant data and synthesizing
and extracting useful hidden information?
A. Data acquisition
B. Data analysis
C. Data curating
D. Data storage
E. Data usage
14. ---------- involves content creation, selection, classification, transformation, validation, and
preservation.
a. Data acquisition
b. Data analysis
c. Data curating
d. Data storage
e. Data usage
15. The persistence and management of data in a scalable way to satisfy the requirement of
fast access to the data is known as ---------
a. Data acquisition
b. Data analysis
c. Data curating
d. Data storage
e. Data usage
16. --------- involves activities that need access to data, its analysis, and the tools needed to
integrate the data analysis within the business activity.
a. Data acquisition
b. Data analysis
c. Data curating
d. Data storage
e. Data usage
17. Which of the following is not true about Big Data?
A. It is a large dataset
B. It refers to computing strategies and technologies that are used to handle large datasets
C. It can be processed or stored with traditional tools or on a single computer
D. A and B
E. None of the above
18. The characteristic of big data that is explained by the massiveness of the information being
processed is --------
A. Velocity
B. Volume
C. Variety
D. None of the above
19. The characteristic of big data signified by the speed at which information moves through the
system is ----
a. Velocity
b. Volume
c. Variety
d. None of the above
20. The big data problem caused by the wide range of sources being processed and their relative
quality is -----------
a. Velocity
b. Volume
c. Variety
d. None of the above
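
The ATM scenario used in Part II walks a transaction through the stages of a data processing cycle: input, processing, and output. The Python sketch below is one possible illustration of those stages, not part of the original exercise. The Account record and the withdraw function are hypothetical names invented for this example, and no real banking API is implied.

```python
from dataclasses import dataclass


@dataclass
class Account:
    """Hypothetical in-memory account record, for illustration only."""
    pin: str
    balance: int


def withdraw(account: Account, entered_pin: str, amount: int) -> str:
    """Walk one ATM withdrawal through input, processing, and output."""
    # Input: the raw values the user supplies (entered_pin and amount).
    # Processing: the machine checks the PIN and the requested amount.
    if entered_pin != account.pin:
        return "rejected: wrong PIN"
    if amount <= 0 or amount > account.balance:
        return "rejected: invalid amount"
    account.balance -= amount
    # Output: the result handed back to the user.
    return f"dispensed {amount}"


account = Account(pin="1234", balance=500)
print(withdraw(account, "1234", 200))  # dispensed 200
print(withdraw(account, "0000", 100))  # rejected: wrong PIN
```

The function arguments stand for the input, the checks and the balance update stand for the processing, and the returned message stands for the output.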