DataOps Methodology
DataOps Methodology
LESSON 2 :
Review Question 1 - Before we can put together a data strategy, we need to have a
good understanding of the data available and how it is used in the organization.
-> True
Review Question 2 - What is a data strategy?
-> An architecture and actionable roadmap along with an action plan.
Review Question 3 - Implementing a data strategy should always result in cost savings
in the year the plan is realized.
-> False
Review Question 4 - Which of the following statements about Data Strategy are correct?
-> All types of data – both structured and unstructured need to be considered
Review Question 5 - Data Governance is a key part of executing a data strategy.
-> True
LESSON 3 :
Review Question 1 - A DataOps team consists of members mostly from IT departments.
-> False
Review Question 2 - Which of the following roles are active team members of any
DataOps team?
-> Chief Data Officer
-> Data Engineer
-> Data Steward
-> Data Scientist
Review Question 3 - Creating and maintain business terms is a major responsibility of
which following role?
-> Data Steward
Review Question 4 - Only Chief Data Officer can update the KPIs for a data sprint.
-> False
Review Question 5 - DataOps relies heavily on the use of automation, so that
communication among team members is not necessary.
-> False
Module 2:
Lesson 1 - Establish Toolchain Review QuestionsReview Question 1 - DataOps
toolchain helps you deliver quality data slowly.
-> False
Review Question 2 - DataOps Toolchain and DevOps are the same thing.
-> False
Review Question 3 - DataOps Toolchain can work without DataOps API(s).
-> False
Review Question 4 - What are the key components of DataOps Toolchain?
-> All of above
Review Question 5 - Who is responsible for creating DataOps Toolchain? (Choose all
that apply)
-> Administrator
-> Data Engineer
Module 3:
Lesson 1 - Discover Review Questions
Review Question 1 - You will need someone on your team with detailed knowledge of
the business processes you’re going to analyze so selected data elements are
appropriate to reaching your objectives.
-> True
Review Question 2 - What should you do if you identify gaps or mismatches in the data
required for the analysis?
-> All of the above
Review Question 3 - You should trace the linage of data elements to be used for
analysis to make sure they come from a trusted source.
-> True
Review Question 4 - What is the primary objective of the Discover phase?
-> Identify and locate the specific data elements required to accomplish an analysis
Review Question 5 - A Data Engineer who thoroughly understands where specific data
resides, including the specific databases and files where each identified data element
resides, should be involved in Data Discovery process.
-> True
Module 5:
Lesson 1 - Self Service Review Questions
Review Question 1 - Self Service of data is only possible when any data movement and
transformation required to join multiple data assets have been performed.
-> False
Review Question 2 - Self Service can use the following governance artefacts to refine a
search in a catalog. (Choose all that apply)
-> Business Terms
-> Tags
Review Question 3 - A data consumer should not be able to access data that has been
identified as sensitive, where there is not a business need to do so.
-> True
Review Question 4 - Which of the following statements about Self Service are correct?
-> Data Protection rules prevent a data consumer from inadvertently seeing data that is
sensitive
-> Creating multiple catalogs can partition data assets by their content and anticipated
audience
Review Question 5 - Data Consumers provide valuable input to data scientists by
clarifying the combination of data assets and how they need to be transformed, prior to
data movement being designed and implemented.
-> True
Module 6:
Review Question 1 - DataOps is a fixed process which should not be changed once
defined.
-> False
Review Question 2 - Improvements to the DataOps process could involve changes to
-> All of the above
Review Question 3 - Reviewing the Data classification phase involves reviewing how
accurate the data mappings to the business terms are.
-> False
Review Question 4 - Reviewing the Establish Baseline Process should include reviewing
how effective the processes are for establishing a baseline for -
-> All of the above
Review Question 5 - KPIs are key in determining the effectiveness of all parts of the
DataOps process.
-> True
Final Exam :
Question 1
What is a data strategy?
-> An architecture and actionable roadmap along with an action plan
Question 2
Which of the following statements about Data Strategy are correct?
-> All types of data – both structured and unstructured need to be considered
Question 3
Which of the following roles are active team members of any DataOps team?
-> Chief Data Officer
-> Data Engineer
-> Data Steward
-> Data Scientist
Question 4
Creating and maintaining business terms is a major responsibility of which following
role?
-> Data Steward
Question 5
Business Priority should be the primary focus when deciding what the DataOps team
should do.
-> True
Question 6
What is a data backlog?
-> A prioritized set of requirements expressed as data tasks
Question 7
A Data Task should be prioritized by considering:
-> All of the above
Question 8
KPIs are used to determine the progress and throughput of a DataOps data sprint.
-> True
Question 9
What are key components of DataOps toolchain?
-> All of above
Question 10
Who is responsible for creating DataOps toolchain? (Choose all that apply)
-> Administrator
-> Data Engineer
Question 11
What is the primary objective of the Discover phase?
-> Identify and locate the specific data elements required to accomplish an analysis
Question 12
Which description best defines taxonomy?
-> Organizing data elements into meaningful structures.
Question 13
Which of the following is the objective of classification?
-> All of the above
Question 14
A data quality framework consists of which of the following 4 phases:
-> Define
-> Remediate
-> Monitor
-> Assess
Question 15
How does data classification affect defining policies?
-> Protection, accessibility and retention
Question 16
What impact does a highly sensitive classification have on a policy definition?
-> Limit access to the data and/or require data masking
Question 17
Self Service can use the following governance artefacts to refine a search in a catalog.
(Choose all that apply)
-> Business Terms
-> Tags
Question 18
Which of the following statements about Self Service are correct?
-> Data Protection rules prevent a data consumer from inadvertently seeing data that is
sensitive
Question 19
Which of the following does not represent a data integration pattern:
-> Data lineage
Question 20
Which of the following is not a Data Movement and Integration Job Design
consideration?
-> Everything should be programmed in Python
Question 21
Data consumers can first start to provide feedback to the current data sprint in the
stakeholder review meeting.
-> False
Question 22
Which of the following could be found in catalogue?
Code
-> Business terms
Question 23
All issues need to be remediated before moving on to the next data sprint.
-> False
Question 24
Improvements to the DataOps process could involve changes to
-> All of the above
Question 25
Reviewing the Establish Baseline Process should include reviewing how effective are
the processes for establishing a baseline for -
-> All of the above