Session 2
Session 2
Data Management
1.2.
1.4.8
1.4.8 Metadata Management
Data Management
1.2.
1.4.8
1.4.8 Metadata Management
Metadata is typically categorized into three types: descriptive (describes the
content of the data), structural or technical (information about the technical
details and systems that store the data), and administrative or operational
(details about the processing and accessing of data)
ISO 11179
Data Management
1.2.
1.4.8
1.4.8 Metadata Management
Data Management
1.2.
Data Management
1.2.
Data quality
The term data quality refers both to the characteristics associated with high quality
data and to the processes used to measure or improve the quality of data
A Data Quality program should focus on the data most critical to the enterprise
and its customers
• Completeness: The proportion of data stored against the potential for 100%.
• Uniqueness: No entity instance (thing) will be recorded more than once based
upon how that thing is identified.
• Timeliness: The degree to which data represent reality from the required point
in time.
• Validity: Data is valid if it conforms to the syntax (format, type, range) of its
definition.
• Accuracy: The degree to which data correctly describes the ‘real world’ object
or event being described.
Data Management
1.2. Data quality
Data Management
1.2. Data quality
Data Management
1.2.
Data Management
1. Data Governance provides direction and oversight for data management
Abstract of by establishing a system of decision rights over data that accounts for the
needs of the enterprise.
Knowledge
Areas 2. Data Architecture defines the blueprint for managing data assets by
aligning with organizational strategy to establish strategic data requirements
and designs to meet these requirements.
10. Metadata includes planning, implementation, and control activities to enable access
to high quality, integrated Metadata, including definitions, models, data flows, and other
information critical to understanding data and the systems through which it is created,
maintained, and accessed.
11. Data Quality includes the planning and implementation of quality management
techniques to measure, assess, and improve the fitness of data for use within an
organization
Data Management
1.2.
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2.
Context diagram
Data Management
1.2. Job Titles of Data Scientists
•Data Scientist •Chief Actuary of GeoSpatial Analytics and Modeling •Director, Business Planning & Analytics
•Data Analyst •Chief Strategy & Analytics Officer •Assistant Professor
•Research & Analytics Director •Database Manager •Machine Learning Engineer
•Business Analyst •Customer Analytics & Pricing •Python Developer
•Project Coordinator •Data Visualization Analyst •Analytics Officer
•Director - Advanced Analytics •Assistant Vice President •Executive Director
•Chief Credit & Analytics Officer •Research Analyst •Director, Big Data Analytics and Segmentation
•Director, Business Intelligence and Analytics •Director of Technology •Data Engineer
•Chief Analytic Officer •Chief Analytics & Algorithms Officer •Database Administrator
•Data Learning Engineer •Data Architect •Strategic Data Analytics Analyst
•Chief Analytic Officer •Statistician •Data and Analytics Manager
•Director of Risk Analytics and Policy •AI Product Manager •Director, Data Warehousing & Analytics
•GIS Analyst •Information Security Analyst •AI Architect
•Data Visualizers •Research Analyst •Data Science Director
•Chief Technology Officer •Statistical Modeling and Analytics •Data Ecologists
•Health Analytics •Principal Big Data Architect •Forensic Data Analytics
•Director Marketing Analytics •Customer Analytics •Data Manager
•Big Data Developer •Web Analytics •Director, Database Marketing & Analytics
•Data Developer •Risk and Business Analytics •Director of Analytics
•Clinical Analytics •Geospatial Data Scientist •Reporting/Analytics
•Big Data Architect •R&D Engineer Data Scientist •Predictive Analytics
•Python Data Developer
Data Management
1.2.
Presentation
Subject:
Introduce the job, its characteristics and required skills,
Present the most important knowledge areas (Based on DMBOK) for this job
Present potential techniques, tools and methods in these knowledge areas
Data Management