Azure Data Fundamentals 1

The document provides an overview of core data concepts. It discusses how data is defined, structured, and stored. It also describes different data workloads like transactional and analytical processing. Key data roles are explored including database administrators, data engineers, and data analysts. Common data analytics techniques are also summarized such as descriptive, diagnostic, predictive, and prescriptive analytics.

Uploaded by

yeolbest8412

Module 1: Explore core data concepts

© Copyright Microsoft Corporation. All rights reserved.


Agenda

• Explore core data concepts
• Explore roles and responsibilities in the world of data
• Explore concepts of data analytics


Lesson 1: Explore core data concepts



Lesson 1 objectives

• Identify how data is defined and stored
• Identify characteristics of relational and non-relational data
• Describe and differentiate data workloads
• Describe and differentiate batch and streaming data


What is data?
Data is a collection of facts, numbers, descriptions, and objects, stored in a structured, semi-structured, or unstructured way.

• Structured: a table (Column 1, Column 2, Column 3, Column 4, …, Column 7). Format: csv, txt, ...
• Semi-structured. Format: json, xml, ...
• Unstructured
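The difference between the structured and semi-structured formats named above can be sketched in a few lines of Python. The sample records, field names, and helper functions are hypothetical and invented only for illustration: the CSV rows all share one fixed set of columns, while the JSON records are free to carry different or nested fields.

```python
import csv
import io
import json

# Hypothetical sample data, invented for illustration.
CSV_TEXT = "CustomerID,CustomerName,CustomerPhone\n1,Ana,555-0100\n2,Binh,555-0101\n"
JSON_TEXT = """
[
  {"CustomerID": 1, "CustomerName": "Ana", "Phones": ["555-0100", "555-0199"]},
  {"CustomerID": 2, "CustomerName": "Binh", "Address": {"City": "Hanoi"}}
]
"""


def load_structured(text):
    """Parse CSV text: every row has the same columns."""
    return list(csv.DictReader(io.StringIO(text)))


def load_semi_structured(text):
    """Parse JSON text: each record may carry different, nested fields."""
    return json.loads(text)


rows = load_structured(CSV_TEXT)
docs = load_semi_structured(JSON_TEXT)

# Structured: one fixed list of column names shared by all rows.
print([list(r.keys()) for r in rows])
# Semi-structured: the set of keys varies from record to record.
print([sorted(d.keys()) for d in docs])
```

Unstructured data such as audio or video has no comparable field layout to parse, which is why it is left out of this sketch.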



Transactional vs analytical data stores

• Online Transactional Processing (OLTP)
• Online Analytical Processing (OLAP)

[Figure: example tables]
Customer: CustomerID, CustomerName, CustomerPhone
Orders: OrderID, CustomerID, OrderDate
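As a rough sketch of the two access patterns, the Customer and Orders tables from the slide can be created in an in-memory SQLite database using Python's standard sqlite3 module. The column types, the sample rows, and the choice of SQLite are assumptions made for illustration; in practice analytical queries usually run against a separate store rather than the transactional database itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.executescript("""
CREATE TABLE Customer (
    CustomerID    INTEGER PRIMARY KEY,
    CustomerName  TEXT NOT NULL,
    CustomerPhone TEXT
);
CREATE TABLE Orders (
    OrderID    INTEGER PRIMARY KEY,
    CustomerID INTEGER NOT NULL REFERENCES Customer(CustomerID),
    OrderDate  TEXT NOT NULL
);
""")

# OLTP-style work: small writes that each touch a few rows.
# The sample rows are hypothetical.
conn.execute("INSERT INTO Customer VALUES (1, 'Ana', '555-0100')")
conn.execute("INSERT INTO Customer VALUES (2, 'Binh', '555-0101')")
conn.executemany(
    "INSERT INTO Orders VALUES (?, ?, ?)",
    [(10, 1, "2024-01-05"), (11, 1, "2024-01-20"), (12, 2, "2024-02-03")],
)
conn.commit()

# OLAP-style work: a read-only query that joins and aggregates many rows.
orders_per_customer = conn.execute("""
    SELECT c.CustomerName, COUNT(o.OrderID) AS OrderCount
    FROM Customer AS c
    JOIN Orders AS o ON o.CustomerID = c.CustomerID
    GROUP BY c.CustomerName
    ORDER BY c.CustomerName
""").fetchall()
print(orders_per_customer)
```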



Transactional workloads

Transactional data is information that tracks the interactions related to an organization's activities.

[Figure: a transaction shown as a debit (-$) and a credit (+$)]
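A money transfer is a common example of a transactional workload: the debit and the credit must be recorded together or not at all. The sketch below assumes that this is what the -$ / +$ figure illustrates. The Account table, the transfer helper, and the balances are hypothetical, and SQLite stands in for whatever transactional store is actually used.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Account ("
    "Name TEXT PRIMARY KEY, "
    "Balance INTEGER NOT NULL CHECK (Balance >= 0))"
)
conn.executemany("INSERT INTO Account VALUES (?, ?)", [("A", 100), ("B", 50)])
conn.commit()


def transfer(conn, source, target, amount):
    """Move amount from source to target; return True on success."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            # +$ : credit the target account
            conn.execute(
                "UPDATE Account SET Balance = Balance + ? WHERE Name = ?",
                (amount, target),
            )
            # -$ : debit the source account (fails the CHECK if overdrawn)
            conn.execute(
                "UPDATE Account SET Balance = Balance - ? WHERE Name = ?",
                (amount, source),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Return current balances as a dict keyed by account name."""
    return dict(conn.execute("SELECT Name, Balance FROM Account ORDER BY Name"))


ok = transfer(conn, "A", "B", 30)       # enough funds: both updates are kept
failed = transfer(conn, "A", "B", 500)  # overdraft: the credit is rolled back too
print(ok, failed, balances(conn))
```

The credit is deliberately applied first so that the failed transfer shows the rollback undoing work that had already been done inside the transaction.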



Analytical Workloads

Analytical workloads are used for data analysis and decision making.

• Summaries

• Trends

• Business information
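A minimal sketch of the first two outputs listed above, a summary and a trend, computed from raw records. The sales figures and function names are hypothetical and exist only to show the shape of an analytical computation.

```python
from collections import defaultdict

# Hypothetical sales records: (month, amount).
sales = [
    ("2024-01", 120), ("2024-01", 80),
    ("2024-02", 150),
    ("2024-03", 90), ("2024-03", 110),
]


def monthly_totals(records):
    """Summary: total amount per month, ordered by month."""
    totals = defaultdict(int)
    for month, amount in records:
        totals[month] += amount
    return dict(sorted(totals.items()))


def month_over_month(totals):
    """Trend: change in total compared with the previous month."""
    months = list(totals)
    return {cur: totals[cur] - totals[prev] for prev, cur in zip(months, months[1:])}


summary = monthly_totals(sales)
trend = month_over_month(summary)
print(summary)
print(trend)
```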



Data Processing
Data processing is the conversion of raw data to meaningful information through a process.

• Batch Processing: data elements are collected into a group. The whole group is then processed at a future time as a batch.
• Stream Processing: each new piece of data is processed when it arrives.

[Figure labels: Daily Batch Job; System Upload]
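The two processing styles can be contrasted with a small Python sketch. The sensor feed, the Celsius-to-Fahrenheit conversion, and the function names are hypothetical stand-ins: the batch function collects the whole group before processing anything, while the streaming function is a generator that handles each reading as it arrives.

```python
def process(reading):
    """Stand-in for real processing: convert Celsius to Fahrenheit."""
    return reading * 9 / 5 + 32


def batch_process(readings):
    """Batch: the whole group is collected first, then processed together."""
    collected = list(readings)
    return [process(r) for r in collected]


def stream_process(readings):
    """Stream: each reading is processed as soon as it arrives."""
    for r in readings:
        yield process(r)


def sensor_feed():
    """Hypothetical sensor that emits readings one at a time."""
    yield from [20, 25, 30]


batch_result = batch_process(sensor_feed())
stream = stream_process(sensor_feed())
first = next(stream)  # available before the remaining readings are consumed
print(batch_result, first)
```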



Lesson 1: Knowledge check
How is data in a structured table organized?
❑ Rows and Columns
❑ Header and Footer
❑ Pages and Paragraphs

Which of the following is an example of unstructured data?
❑ An Employee table with columns Employee ID, Employee Name, and Employee Designation
❑ Audio and Video files
❑ A table within a SQL Server database

Which of the following is an example of a streaming dataset?
❑ Data from sensor feeds
❑ Sales data for the past month
❑ List of employees working for a company


Lesson 2: Explore roles and responsibilities in the world of data



Lesson 2 objectives

• Explore data job roles
• Explore common tasks and tools for data job roles


Roles in data

Database Administrator
• Database Management
• Implements Data Security
• Backups
• User Access
• Monitors performance

Data Engineer
• Data Pipelines and processes
• Data Ingestion storage
• Prepare data for Analytics
• Prepare data for analytical processing

Data Analyst
• Provides insights into the data
• Visual Reporting
• Modeling Data for Analysis
• Combines data for visualization and analysis

Common tools – Database administrator

Azure Data Studio
• Graphical interface for managing on-premises and cloud-based data services
• Runs on Windows, macOS, Linux

SQL Server Management Studio
• Graphical interface for managing on-premises and cloud-based data services
• Runs on Windows
• Comprehensive Database Administration tool

Azure Portal/CLI
• Tools for management and provisioning of Azure Data Services
• Manual management and script automation using Azure Resource Manager or Command Line Interface scripting

Common tools – Data engineering

Azure Synapse Studio
• Azure Portal integrated to manage Azure Synapse
• Data Ingestion (Azure Data Factory)
• Management of Azure Synapse assets (SQL Pools/Spark Pool)

SQL Server Management Studio
• Graphical interface for managing on-premises and cloud-based data services
• Runs on Windows
• Comprehensive Database Administration tool

Azure Portal/CLI
• Tools for management and provisioning of Azure resources
• Manual management and script automation using Azure Resource Manager or Command Line Interface scripting

Common tools – Data analyst

Power BI Desktop
• Data Visualization tool
• Model and Visualize Data
• Management of Azure Synapse assets (SQL Pools/Spark Pool)

Power BI Portal/Power BI Service
• Authoring and management of Power BI reports
• Authoring of Power BI dashboards
• Share Reports/Datasets

Power BI Report Builder
• Data Visualization tool for paginated reports
• Model and Visualize paginated reports

Lesson 2: Knowledge check
Which one of the following tasks is a responsibility of a database administrator?
❑ Backing up and restoring databases
❑ Creating dashboards and reports
❑ Identifying data quality issues

Which of the following tools is a visualization and reporting tool?
❑ SQL Server Management Studio
❑ Power BI
❑ SQL

Which one of the following roles is not a data job role?
❑ Systems Administrator
❑ Data Analyst
❑ Database Administrator

Lesson 3: Explore concepts of data analytics



Lesson 3 objectives

• Learn about data ingestion and processing
• Explore data visualization
• Explore data analytics


The Data Journey

• Data Ingestion: the process of obtaining and importing data for immediate use or storage in a database.
• Data Processing: takes the data in its raw form, cleans it, and converts it into a more meaningful format.
• Data Visualization: queries the data and creates graphical representations of information and data.

[Figure labels: ETL (extract, transform, load); ELT (extract, load, transform)]
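ETL and ELT differ only in where the transform step happens relative to the load. The sketch below uses a plain dictionary as a stand-in for the destination store; the source rows, the cleaning rules, and all function names are hypothetical.

```python
def extract():
    """Hypothetical raw source rows with messy values."""
    return [{"name": " ana ", "amount": "10"}, {"name": "BINH", "amount": "25"}]


def transform(rows):
    """Clean names and convert amounts to integers."""
    return [
        {"name": r["name"].strip().title(), "amount": int(r["amount"])}
        for r in rows
    ]


def etl(store):
    """Extract, transform, then load: only cleaned rows reach the store."""
    store["clean"] = transform(extract())


def elt(store):
    """Extract, load the raw rows, then transform inside the store."""
    store["raw"] = extract()
    store["clean"] = transform(store["raw"])


etl_store, elt_store = {}, {}
etl(etl_store)
elt(elt_store)
print(etl_store)
print(elt_store)
```

Both routes end with the same cleaned rows; the ELT store additionally keeps the raw copy, which is the practical trade-off between the two orderings.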



Explore data analytics

Descriptive, Diagnostic, Predictive, Prescriptive, Cognitive


Five types of data analytics

Descriptive
• What is it? Helps you answer the question "What happened?"
• Example: Analyze the financial statements for one month to figure out what happened.

Diagnostic
• What is it? Helps you answer the question "Why did this happen?"
• Example: Analyze why this month's revenue declined compared to the previous month.

Predictive
• What is it? Helps you answer the question "What might happen in the future?"
• Example: Use historical data to project sales, revenue, and expenses to craft a picture of the future and make decisions.

Prescriptive
• What is it? Helps you answer the question "What should we do about it?"
• Example: Find ways to boost the revenue of the following month.

Cognitive
• What is it? A field of analytics that tries to mimic the human brain by drawing inferences from existing data and patterns, drawing conclusions based on existing knowledge bases, and then inserting this back into the knowledge base for future inferences.
• Example: Netflix might recommend movies based on a user's past preferences.
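The first three analytics types can be illustrated on one tiny dataset. This is only a sketch: the revenue figures are hypothetical, the diagnostic step merely locates the decline that would then need explaining, and the prediction uses a simple least-squares straight line, which is one of many possible forecasting methods and not something the slides prescribe.

```python
# Hypothetical monthly revenue, invented for illustration.
revenue = {1: 100.0, 2: 120.0, 3: 110.0, 4: 130.0}
months = sorted(revenue)
values = [revenue[m] for m in months]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Descriptive: what happened?
total = sum(values)

# Diagnostic (first step): in which months did revenue decline?
declines = [cur for prev, cur in zip(months, months[1:]) if revenue[cur] < revenue[prev]]

# Predictive: what might happen in month 5 if the linear trend continues?
slope, intercept = fit_line(months, values)
forecast = slope * 5 + intercept

print(total, declines, forecast)
```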



Lesson 3: Knowledge check
What is data ingestion?
❑ The process of transforming raw data into models containing meaningful information
❑ Analyzing data for anomalies
❑ Capturing raw data, streamed or batched, from various sources and storing it

Which type of analytics helps answer questions about what has happened in the past?
❑ Descriptive analytics
❑ Prescriptive analytics
❑ Predictive analytics

THANK YOU!
