Data Science Recap
Mark Tabladillo Ph.D.
May 21, 2020
Founder, PASS Data Science Virtual Chapter
2020
Recap of Main News
from Microsoft
Build 2020
Cloud Solution Architect
Microsoft United States
Connect on LinkedIn
Twitter @marktabnet
Topics
▪ Azure Synapse Link
▪ Responsible AI
▪ Project Bonsai & Project Moab
▪ AI Models at Scale
What if I want to run analytics in near real-time on my operational data at scale?
Azure Synapse Link
▪ Microsoft is announcing Azure Synapse Link, a cloud-native implementation of HTAP (hybrid transactional analytical processing), which is an architecture for enabling analytics on live operational data. With Azure Synapse Link, Azure is the first cloud service to deliver on the promises of HTAP, without the costs, complexities and trade-offs associated with implementations on-premises.
▪ Azure Synapse Link is now available with Azure Cosmos DB and will soon be available with Azure SQL, Azure Database for PostgreSQL and Azure Database for MySQL.
Azure Synapse Link:
Building real-time HTAP solutions with Azure
Cosmos DB & Azure Synapse Analytics
https://azure.microsoft.com/en-us/blog/azure-analytics-clarity-in-an-instant/
What is Azure Cosmos DB?
Fast NoSQL database with open APIs for any scale
▪ Azure Cosmos DB is optimized for operational workloads with single-digit millisecond read and write latency
▪ 99.999% high availability, guaranteed throughput and consistency
▪ Turnkey global data replication across all Azure regions
[Diagram labels: Real-time Applications & Services; Azure Cosmos DB]
Running OLTP and OLAP workloads on the same database
▪ If you have large amounts of data, analytical queries will take a long time to run and will be resource intensive
▪ HUGE performance impact on the OLTP workloads
[Diagram labels: Real-time Applications & Services; Azure Cosmos DB; Azure Cosmos DB Spark connector; Reporting & Dashboards]
Separating OLTP & OLAP
Ingest data periodically from Azure Cosmos DB to Data Lake
Manage data formats and storage layer to optimize for analytics
[Diagram labels: User Applications; Azure Cosmos DB; Extract (Pipelines); Data Lake; Transform; Enrich; Orchestrate; Serve; Power BI; Apache Spark for Synapse; Synapse SQL]
Azure Synapse Link: How does it work?
Generate near real-time insights on your operational data
Transactional Store: Row store optimized for transactional operations
Analytical Store: Column store optimized for analytical queries
[Diagram labels: Azure Cosmos DB; Container; Operational Data; Auto-Sync; Cloud-Native HTAP; Azure Synapse Link; Azure Synapse Analytics; SQL; Machine learning; Big data analytics; BI Dashboards]
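The split between a row-oriented transactional store and a column-oriented analytical store, kept current by auto-sync, can be illustrated with a toy model. This is only a sketch of the idea, not how Azure Cosmos DB or Azure Synapse Link is implemented: the class `HtapContainer` and its methods are hypothetical names, and the sync runs inline on each write, whereas a real service would presumably propagate changes asynchronously.

```python
from collections import defaultdict


class HtapContainer:
    """Toy container (hypothetical): a row store for point reads and writes,
    mirrored into a column store that serves analytical scans."""

    def __init__(self):
        # Transactional store: item id -> whole document (row-oriented).
        self.row_store = {}
        # Analytical store: field name -> {item id -> value} (column-oriented).
        self.column_store = defaultdict(dict)

    def upsert(self, item_id, document):
        """Transactional write, followed by an inline sync to the column store."""
        old = self.row_store.get(item_id, {})
        self.row_store[item_id] = dict(document)
        # Drop column entries for fields the new version no longer has.
        for field in old:
            if field not in document:
                self.column_store[field].pop(item_id, None)
        # Write each field of the new version into its own column.
        for field, value in document.items():
            self.column_store[field][item_id] = value

    def point_read(self, item_id):
        """OLTP-style access: fetch one whole document by id."""
        return self.row_store[item_id]

    def column_sum(self, field):
        """OLAP-style access: scan a single column without touching the rows."""
        return sum(self.column_store[field].values())


orders = HtapContainer()
orders.upsert("o1", {"customer": "a", "amount": 20.0})
orders.upsert("o2", {"customer": "b", "amount": 5.5})
orders.upsert("o1", {"customer": "a", "amount": 25.0})  # update is synced
print(orders.column_sum("amount"))  # 30.5
```

The aggregate reads only the `amount` column, so the analytical query never competes with point reads on the row store, which is the separation the slide describes.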
How can we approach responsibility?
[Diagram labels: Business Understanding; Data Acquisition & Understanding; Modeling; Deployment & Ops]
Responsible AI
Responsible AI in Three Areas
Understand: New model interpretability and fairness assessment capabilities enable the development of more accurate and fair models.
Protect: New differential privacy capabilities enable customers to build machine learning models using sensitive data while safeguarding the privacy of individuals. This is a result of the partnership between Microsoft and Harvard’s Institute for Quantitative Social Science, which was announced in September 2019. Additionally, new confidential machine learning capabilities provide a secure and trusted environment for machine learning.
Control: New capabilities for fine-grained traceability, lineage, and access control of data, models and experiments enable organizations to meet strict regulatory requirements. Additionally, new workflow documentation capabilities to enforce accountability in the machine learning process will be made available to customers shortly after the Build conference.
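To make the differential privacy idea in the Protect area concrete, here is a minimal sketch of the textbook Laplace mechanism applied to a counting query. It is an assumption chosen for illustration, not a description of the Microsoft and Harvard tooling, and the function names `laplace_noise` and `private_count` are hypothetical. A count changes by at most 1 when one individual is added or removed (sensitivity 1), so adding Laplace noise with scale 1/epsilon gives epsilon-differential privacy for that single query.

```python
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two independent
    exponential variables that each have mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(records, predicate, epsilon, rng=None):
    """Return a noisy count of records matching `predicate`.

    Sensitivity of a count is 1, so the Laplace scale is 1 / epsilon.
    Smaller epsilon means more noise and stronger privacy.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if rng is None:
        rng = random.Random()
    true_count = sum(1 for record in records if predicate(record))
    return true_count + laplace_noise(1.0 / epsilon, rng)


# Made-up example data: ages of five individuals.
patients = [{"age": age} for age in (25, 41, 67, 70, 33)]
noisy = private_count(patients, lambda p: p["age"] >= 40, epsilon=0.5)
print(f"noisy count of patients aged 40+: {noisy:.2f}")
```

Each released answer consumes privacy budget, so a real system would also track the total epsilon spent across queries; that bookkeeping is left out here.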
Understand
Protect
Control
Project Bonsai Public Preview
Create and optimize intelligence for
industrial control systems with simulations
and machine teaching
Project Moab
Open-source machine teaching robotics
hardware kit
New Technical Demos and Customer Stories
featuring SCG and partner simulations using
Project Bonsai
AI models at scale: Massive, multi-purpose AI models
Infrastructure at scale: The AI Supercomputer
Development at scale: Empowering every developer
▪ Microsoft Turing: Largest AI model ever built (17B parameters)
▪ Changing how AI is developed: from narrow, custom models to multi-purpose, customized, massive models
▪ Turing language: Most powerful model for multi-task natural language processing
▪ The road of generalization: Multi-modality text / images / video
AI models and development at scale
▪ Announcing Open Source frameworks & optimizers for massive model training
▪ Future release of Microsoft Turing language model
AI computing at scale
▪ Announcing one of the top five publicly disclosed supercomputers in the world
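A rough calculation suggests why a 17B-parameter model needs dedicated training optimizers and large-scale infrastructure. The sketch below assumes the commonly cited accounting for mixed-precision Adam training of about 16 bytes of model state per parameter (2 for fp16 weights, 2 for fp16 gradients, and 4 each for fp32 master weights, momentum, and variance). That figure, the function name `model_state_gb`, and the 64-device example are assumptions for illustration, not numbers from the slides, and activation memory is ignored.

```python
def model_state_gb(num_params, bytes_per_param=16, num_devices=1,
                   partition_states=False):
    """Estimate per-device memory (in GB) for weights, gradients, and
    optimizer state under the stated bytes-per-parameter assumption.

    With partition_states=True the model state is split evenly across
    devices instead of being fully replicated on each one.
    """
    total_bytes = num_params * bytes_per_param
    if partition_states:
        total_bytes /= num_devices
    return total_bytes / 1e9


params = 17_000_000_000  # 17B parameters, as quoted on the slide

# Fully replicated model state on every device.
print(model_state_gb(params))  # 272.0

# Same state partitioned evenly across 64 devices (hypothetical cluster size).
print(model_state_gb(params, num_devices=64, partition_states=True))  # 4.25
```

Under these assumptions the replicated state is far larger than the memory of a single accelerator, while an even partition across many devices brings the per-device share down to a few gigabytes.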
Outlook Meeting Insights
Word Document Summary
Bing Q&A
Dynamics 365 Seller Suggestions
• Lowering the barrier for state-of-the-art AI for every developer
• Enabling AI development at scale @ Microsoft
Microsoft Build 2020: Data Science Recap