000 Into Machine Learning
000 Into Machine Learning
Hotmail Bing maps Bing search Kinect launches Skype Translator Azure Machine
launches launches launches launches Learning GA
What does that
What’s the best Which searches motion “mean”? What is that What will
Which email is way to home? are most
junk? person saying? happen next?
relevant?
Why Machine Learning?
What is the probability
of a click on each ad?
What
language?
What is the
intent?
Which ads to
show, and in Machine learning enables nearly every
what order?
value proposition of web search.
Are any of these
pages malicious?
Misspelled?
What pages
Which links are should we index?
most likely to get
clicked? What ad pricing
will optimize
revenue?
Image Analyze
Accent Color: Which border color is the best?
Accent Color: Analyze Image
Accent Color: Windows 10 Store
Accent Color: Windows 10 Store
Text Analytics: User reviews
Positive Negative
Data Science
• Data Science is far too complex
• Cost of accessing/using efficient ML algorithms is high
• Comprehensive knowledge required on different tools/platforms to develop
a complete ML project
• Difficult to put the developed solution into a scalable production stage
Recommendations
What should I do?
Decision support
Predictions
Data What will happen?
Decision Action
Interactive dashboards
Why did it happen?
Value
Transform data into intelligent action
ML
Algorithms
Best of MS
ML Operationalization IT Professional
ML APIs
Marketplace ISVs & Developers
Azure Machine Learning Ecosystem
Get/Prepare
Data
Evaluate
Build/Edit Azure Data
Get Azure Create Model
Experiment Publish Web
Subscription Workspace Results Marketplace
Service
Create/Update
Model
API examples
• Green Score, Wealth Score, Giving Score
• Frequently Bought Together API
• Recommendations API
• Anomaly Detection API
• Lexicon Based Sentiment Analysis
• Forecasting: Exponential Smoothing
• Forecasting: ETS+STL
• Forecasting: AutoRegressive Integrated
Moving Average (ARIMA)
• Binary Classifier API
• Cluster Model API
• Survival Analysis API
• Multivariate Linear Regression API
• Survival Analysis API
• Multivariate Linear Regression API
• Normal Distribution Quantile Calculator
• Binomial Distribution Quantile Calculator
• And more on datamarket.azure.com
Azure Machine Learning Service
Data -> Predictive model -> Operational web API in minutes
Data Clients
ML STUDIO API
Model is now a web
service that is callable
Blobs and Tables
Integrated development
Hadoop (HDInsight) environment for Machine
Relational DB (Azure SQL DB) Learning
… … … … …
1990 50°F 30°F 68°F 95°F
2000 29°F sample
48°F forecast
Weather 70°F 98°F
2010 49°F 27°F 67°F 96°F
2020 ? ? ? ?
Using known data, develop a model to predict unknown data.
Model (Regression)
90°F
1990 50°F 30°F 68°F 95°F
Predict 2020 Summer
Xbox-One
Customer
Income >
$50K
Not Xbox-One
Customer
Age<30 Xbox-One
Customer
Income >
$50K
Days Played > Not Xbox-One
728 Customer
Xbox-One
Customer
EXAMPLE
Model (Classification)
Classify a news article as (politics, sports, technology, health, …)
Tech
Health
Politics
Documents consist of
unstructured text. Machine
Politics
learning typically assumes a
more structured format of
examples
Health
Feature
Politics
Politics
Sports
i.e.
Blood Pressure
Age Height/Weight Hearth Rate
Features Label
Data instance
Feature Vector
Developing a Model
Training data Base
Documents Labels Feature Vectors Model
Tech Adjust
Parameters
Health
Politics
Train
Politics the
Model
Sports
Politics
train
ed m th
Tra od e tru pred
l wit e l icti
in ing h fe ab on
Tech
atures els
da
ta Health
80
% Politics
Politics
Train the Model
Sports
Sports
Steps to Build a Machine Learning Solution
1
Problem
Framing
5
2
Evaluate / Get/Prepare
Track Data
Performance
3.1
3.5 Analysis/
Evaluation Metric
definition
4 3
Deploy Develop 3.4 3.2
Model Model Parameter Feature
Tuning Engineering
3.3
Model
Training
Example use cases
Finance Sales Customer Operations
and risk and marketing and channel and workforce
$$$ Investment modelling Sales lead scoring Product recommendation Smart buildings
Supply chain
Risk management optimization
Machine Learning Algorithms
Algorithms: K-means
Regression
Scenarios: Regression
Stock prices prediction
Sales forecasts
Premiums on insurance based on different factors
Quality control: number of complaints over time based on product
specs, utilization, etc.
Workforce prediction
Workload prediction