Map-Reduce for Machine Learning on Multicore
Abebe Zerihun
Introduction
• Multicore Era: Increasing number of processing cores per chip, but a lack of effective programming frameworks for machine learning on multicore.
• Goal: Develop a general, parallel programming
method to leverage multicore architectures for a
wide range of machine learning algorithms.
• Challenge: Traditional approaches focus on algorithm-specific optimizations, lacking scalability and generality across algorithms.
Map-Reduce
Contribution
• General Framework:
– Algorithms fitting the Statistical Query Model are
expressed in a summation form, enabling easy
parallelization.
– This method ensures exact, non-approximated
implementations of machine learning algorithms.
• Example: instead of directly accessing the dataset $\{(x_i, y_i)\}_{i=1}^{n}$, an algorithm may query for a statistic of the form
  $\frac{1}{n}\sum_{i=1}^{n} f(x_i, y_i)$,
  where $f(x, y)$ is a function of the data.
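As a minimal Python sketch (my own illustration, not from the paper), such a query reduces to partial sums over data chunks; the function f, the chunk count, and the synthetic data are arbitrary choices.

import numpy as np

def f(x, y):
    # Hypothetical query function f(x, y); here the per-example statistic x * y.
    return x * y

def statistical_query(X, Y, num_chunks=4):
    # Estimate (1/n) * sum_i f(x_i, y_i) via per-chunk partial sums.
    partial_sums = [
        sum(f(x, y) for x, y in zip(Xc, Yc))           # partial sum on one chunk
        for Xc, Yc in zip(np.array_split(X, num_chunks),
                          np.array_split(Y, num_chunks))
    ]
    return sum(partial_sums) / len(X)                  # aggregate and normalize

rng = np.random.default_rng(0)
X, Y = rng.normal(size=(500, 3)), rng.integers(0, 2, size=500)
print(statistical_query(X, Y))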
Cont..
• Integration with Map-Reduce:
– Adapts Google's Map-Reduce paradigm for multicore
systems, simplifying programming and achieving linear
speedups.
• Broad Applicability:
– Demonstrates parallelization for diverse algorithms: Logistic Regression, SVM, Neural Networks, PCA, ICA, k-means, EM, Naive Bayes, etc.
Workflow
• Summation Form:
– Reformulates computations as summations over data
subsets, ideal for parallel execution.
• Map-Reduce Framework:
  – Map Phase: Data is split among cores, and partial computations are performed.
  – Reduce Phase: Partial results are aggregated, and the final computation is completed.
• Parallel Gradient Descent:
  – Key algorithms such as Logistic Regression and Neural Networks use batch gradient descent for efficient parallel optimization (see the sketch below).
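A minimal sketch of this workflow (illustrative only, not the authors' code): each mapper emits a partial statistic for its data split, and the reducer aggregates them. Here the statistic is a (count, sum) pair used to compute a mean, and the names map_phase/reduce_phase are my own.

import numpy as np

def map_phase(chunk):
    # Map: compute a partial statistic on one data split (count and sum).
    return len(chunk), chunk.sum(axis=0)

def reduce_phase(partials):
    # Reduce: aggregate the partial statistics and finish the computation (the mean).
    total_count = sum(count for count, _ in partials)
    total_sum = sum(s for _, s in partials)
    return total_sum / total_count

data = np.arange(12.0).reshape(6, 2)        # toy dataset: 6 examples, 2 features
splits = np.array_split(data, 3)            # data is split among 3 "cores"
partials = [map_phase(c) for c in splits]   # map phase (independent per core)
print(reduce_phase(partials))               # reduce phase -> [5. 6.]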
Algorithms Demonstrated
• Locally Weighted Linear Regression (LWLR): parallelizes matrix multiplications and statistics aggregation.
• Logistic Regression: computes gradients and the Hessian in parallel for optimization.
• Neural Networks (Backpropagation): parallelizes error backpropagation and gradient computations for weight updates.
• Principal Component Analysis (PCA): parallelizes covariance matrix and mean computations.
• k-means Clustering: parallelizes distance computations to the centroids and the recomputation of centroids (see the sketch below).
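As an example of the pattern on one of these algorithms, here is a hedged sketch of a single parallel k-means iteration (illustrative; the splitting, variable names, and data are my own assumptions): each core computes distances for its split and emits per-cluster sums and counts, and the reduce step recomputes the centroids.

import numpy as np

def kmeans_map(chunk, centroids):
    # Map: assign each point in this split to its nearest centroid and
    # emit per-cluster (sum of points, count) partial statistics.
    d = np.linalg.norm(chunk[:, None, :] - centroids[None, :, :], axis=2)
    labels = d.argmin(axis=1)
    k = len(centroids)
    sums = np.array([chunk[labels == j].sum(axis=0) for j in range(k)])
    counts = np.array([(labels == j).sum() for j in range(k)])
    return sums, counts

def kmeans_reduce(partials, old_centroids):
    # Reduce: aggregate partial sums/counts and recompute the centroids.
    sums = sum(s for s, _ in partials)
    counts = sum(c for _, c in partials)
    new = old_centroids.copy()
    nonempty = counts > 0
    new[nonempty] = sums[nonempty] / counts[nonempty, None]
    return new

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))
centroids = X[:3].copy()                    # 3 clusters, initialized from the data
for _ in range(10):                         # one map/reduce pair per iteration
    partials = [kmeans_map(c, centroids) for c in np.array_split(X, 4)]
    centroids = kmeans_reduce(partials, centroids)
print(centroids)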
Logistic Regression
• In logistic regression, we aim to optimize the parameters θ of the hypothesis:
  $h_\theta(x) = \frac{1}{1 + e^{-\theta^{T} x}}$
  – to predict probabilities for binary classification.
• The negative log-likelihood gives the log-loss (cost function):
  $J(\theta) = -\sum_{i=1}^{m}\left[y_i \log h_\theta(x_i) + (1 - y_i)\log\big(1 - h_\theta(x_i)\big)\right]$
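A quick sketch of these two formulas in code (assuming the standard sigmoid hypothesis and the unnormalized log-loss written above):

import numpy as np

def hypothesis(theta, X):
    # h_theta(x) = 1 / (1 + exp(-theta^T x)), evaluated for every row of X
    return 1.0 / (1.0 + np.exp(-X @ theta))

def log_loss(theta, X, y):
    # J(theta) = -sum_i [ y_i log h(x_i) + (1 - y_i) log(1 - h(x_i)) ]
    h = hypothesis(theta, X)
    return -np.sum(y * np.log(h) + (1 - y) * np.log(1 - h))

X = np.array([[1.0, 2.0], [1.0, -1.0]])
y = np.array([1.0, 0.0])
print(log_loss(np.zeros(2), X, y))   # 2 * log(2), since h = 0.5 when theta = 0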
Gradient Computation
• The gradient of J(θ) with respect to θ:
  $\nabla_\theta J(\theta) = \sum_{i=1}^{m}\big(h_\theta(x_i) - y_i\big)\, x_i$
• This is already in summation form, since it is a sum over all data points.
• Each term can be computed independently for each data point $(x_i, y_i)$.
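A one-line vectorized form of this summation (a sketch building on the hypothesis above); because each term depends only on one example, the sum can be split across cores:

import numpy as np

def gradient(theta, X, y):
    # sum_i (h_theta(x_i) - y_i) * x_i, computed for all rows at once as X^T (h - y)
    h = 1.0 / (1.0 + np.exp(-X @ theta))
    return X.T @ (h - y)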
Parallelizing Logistic Regression
• Step 1: Partition the data
  – Divide the dataset into P disjoint subsets $D_p$ (one for each processor).
  – Each subset contains $m_p$ examples, with $\sum_{p=1}^{P} m_p = m$.
• Step 2: Compute Partial Gradients Locally
  – On each processor p, compute the partial gradient over its assigned subset:
    $g_p = \sum_{(x_i, y_i) \in D_p}\big(h_\theta(x_i) - y_i\big)\, x_i$
Cont..
• Step 3: Aggregate Gradients
  – Once all partial gradients are computed, sum them to obtain the full gradient:
    $\nabla_\theta J(\theta) = \sum_{p=1}^{P} g_p$
• Step 4: Update Parameters
  – Update θ using gradient descent:
    $\theta := \theta - \alpha\, \nabla_\theta J(\theta)$
  – where α is the learning rate (a runnable sketch of these steps follows).
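A runnable sketch of Steps 1–4, using Python's multiprocessing pool to stand in for the P cores (illustrative only; the data, step size, and iteration count are arbitrary choices):

from multiprocessing import Pool
import numpy as np

def partial_gradient(args):
    # Step 2 (map): g_p = sum over D_p of (h_theta(x_i) - y_i) x_i
    X_p, y_p, theta = args
    h = 1.0 / (1.0 + np.exp(-np.clip(X_p @ theta, -30, 30)))  # clipped sigmoid
    return X_p.T @ (h - y_p)

def parallel_gradient_descent(X, y, P=5, alpha=0.005, iters=100):
    theta = np.zeros(X.shape[1])
    # Step 1: partition the data into P disjoint subsets D_1, ..., D_P
    X_parts, y_parts = np.array_split(X, P), np.array_split(y, P)
    with Pool(P) as pool:
        for _ in range(iters):
            jobs = [(Xp, yp, theta) for Xp, yp in zip(X_parts, y_parts)]
            partials = pool.map(partial_gradient, jobs)  # Step 2 on each core
            grad = np.sum(partials, axis=0)              # Step 3: aggregate (reduce)
            theta = theta - alpha * grad                 # Step 4: gradient update
    return theta

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(500, 3))
    y = (X @ np.array([1.5, -2.0, 0.5]) > 0).astype(float)
    print(parallel_gradient_descent(X, y))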
Example
• Suppose there are 500 training samples, partitioned across 5 cores.
• Let $D_1, D_2, \ldots, D_5$ be the data subsets for each core, where each subset contains 100 samples:
• Core 1: $D_1 = \{(x_1, y_1), \ldots, (x_{100}, y_{100})\}$
• Core 2: $D_2 = \{(x_{101}, y_{101}), \ldots, (x_{200}, y_{200})\}$
• Core 3: $D_3 = \{(x_{201}, y_{201}), \ldots, (x_{300}, y_{300})\}$
• Core 4: $D_4 = \{(x_{301}, y_{301}), \ldots, (x_{400}, y_{400})\}$
• Core 5: $D_5 = \{(x_{401}, y_{401}), \ldots, (x_{500}, y_{500})\}$
• Each core computes the partial gradient for its assigned data subset:
• Core 1: $g_1 = \sum_{i=1}^{100}\big(h_\theta(x_i) - y_i\big)\, x_i$
• Core 2: $g_2 = \sum_{i=101}^{200}\big(h_\theta(x_i) - y_i\big)\, x_i$
• Core 3: $g_3 = \sum_{i=201}^{300}\big(h_\theta(x_i) - y_i\big)\, x_i$
• Core 4: $g_4 = \sum_{i=301}^{400}\big(h_\theta(x_i) - y_i\big)\, x_i$
• Core 5: $g_5 = \sum_{i=401}^{500}\big(h_\theta(x_i) - y_i\big)\, x_i$
Aggregating Gradients
• After all cores compute their partial gradients, they are aggregated to form the total gradient:
  $\nabla_\theta J(\theta) = g_1 + g_2 + g_3 + g_4 + g_5$
• Update:
  $\theta := \theta - \alpha\, \nabla_\theta J(\theta)$
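A quick numeric check of this example (with synthetic data, since the slides do not specify any): summing the five per-core gradients gives exactly the gradient computed over all 500 samples.

import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(500, 3))                  # 500 synthetic training samples
y = rng.integers(0, 2, size=500).astype(float)
theta = rng.normal(size=3)

def grad(Xs, ys):
    h = 1.0 / (1.0 + np.exp(-Xs @ theta))
    return Xs.T @ (h - ys)

# Five per-core partial gradients g_1 ... g_5 over 100-sample subsets
partials = [grad(X[i:i + 100], y[i:i + 100]) for i in range(0, 500, 100)]
print(np.allclose(sum(partials), grad(X, y)))  # True: the aggregation is exact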
Time Complexity
• Logistic Regression (per iteration)
• Single machine / core:
  – $O(mn^2 + n^3)$
• Multicore (P cores):
  – $O\!\big(\tfrac{mn^2}{P} + n^3 + n^2 \log P\big)$
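For intuition, plugging illustrative values (my own, not from the slides) $m = 10^6$, $n = 100$, $P = 16$ into the costs above:

\[
mn^2 + n^3 \approx 10^{10} + 10^{6} \approx 10^{10},
\qquad
\frac{mn^2}{P} + n^3 + n^2 \log_2 P \approx 6.3\times 10^{8} + 10^{6} + 4\times 10^{4},
\]

so the dominant $mn^2$ term is reduced by roughly a factor of P, while the reduce cost $n^2 \log P$ stays negligible.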
Q and A ?