
UNIVERSITY INSTITUTE OF ENGINEERING
COMPUTER SCIENCE ENGINEERING
Bachelor of Engineering (Computer Science & Engineering)
Subject Name: Big Data Analytics
Subject Code: 20CST-471
Distributed machine learning
Mapped with CO5
Prepared By: Er. Ankita Sharma
DISCOVER . LEARN . EMPOWER
Distributed machine learning
• Distributed machine learning refers to the use of multiple computing resources, typically organized in a cluster or other distributed computing environment, to perform machine learning tasks. This approach is essential for handling large-scale datasets, complex models, and computationally intensive operations that cannot be processed efficiently on a single machine. The primary goals of distributed machine learning are to improve scalability, reduce processing time, and make big data workloads tractable.
• Here are the key concepts and aspects of distributed machine learning:
• 1. Distributed Computing Frameworks:
• Apache Spark: A popular open-source distributed computing framework that provides a unified analytics engine for large-scale data processing. Spark includes MLlib, a library for distributed machine learning (see the sketch after this list).
• TensorFlow and PyTorch: Popular deep learning frameworks that can be configured to run in a distributed manner, allowing deep neural networks to be trained across multiple GPUs or servers.
• Dask: A parallel computing library in Python that enables distributed computing for machine learning and other data-intensive tasks.
• Hadoop MapReduce: While not as commonly used for machine learning as Spark, Hadoop MapReduce can also be adapted for distributed machine learning tasks.
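• As a concrete illustration, below is a minimal sketch of distributed training with Spark MLlib; the data path, feature column names, and session setup are illustrative assumptions.

    # Minimal sketch: distributed logistic regression with Spark MLlib.
    # The data path and column names are illustrative assumptions.
    from pyspark.sql import SparkSession
    from pyspark.ml.feature import VectorAssembler
    from pyspark.ml.classification import LogisticRegression

    spark = SparkSession.builder.appName("DistributedML").getOrCreate()

    # Spark automatically partitions the DataFrame across the cluster.
    df = spark.read.csv("hdfs:///data/train.csv", header=True, inferSchema=True)

    # MLlib expects the features assembled into a single vector column.
    assembler = VectorAssembler(inputCols=["f1", "f2", "f3"], outputCol="features")
    train = assembler.transform(df)

    # Training runs as distributed Spark jobs over the partitioned data.
    model = LogisticRegression(featuresCol="features", labelCol="label").fit(train)
    print(model.coefficients)

    spark.stop()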
• 2. Parallelism and Data Distribution:
• Data Parallelism: The dataset is split across multiple nodes or machines, and each machine performs the same computation on its own subset of the data independently (see the sketch after this list).
• Model Parallelism: The components of a single model are distributed across different machines, with each machine responsible for computing specific parts of the model.
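• A minimal sketch of data parallelism with PyTorch's DistributedDataParallel is shown below; the toy model, random data, and launch method are illustrative assumptions.

    # Minimal data-parallelism sketch with PyTorch DistributedDataParallel (DDP).
    # Assumes launch via `torchrun --nproc_per_node=N train.py`, which sets the
    # RANK/WORLD_SIZE environment variables; model and data are toy examples.
    import torch
    import torch.distributed as dist
    from torch.nn.parallel import DistributedDataParallel as DDP

    dist.init_process_group(backend="gloo")  # use "nccl" for multi-GPU training

    model = torch.nn.Linear(10, 1)           # every process holds a full replica
    ddp_model = DDP(model)
    optimizer = torch.optim.SGD(ddp_model.parameters(), lr=0.01)
    loss_fn = torch.nn.MSELoss()

    for step in range(100):
        # Each rank trains on its own shard of the data (random here for brevity);
        # DDP all-reduces the gradients so every replica stays in sync.
        x, y = torch.randn(32, 10), torch.randn(32, 1)
        optimizer.zero_grad()
        loss = loss_fn(ddp_model(x), y)
        loss.backward()
        optimizer.step()

    dist.destroy_process_group()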
• 3. Communication and Synchronization:
• Parameter Server Architectures: In distributed machine learning, parameter servers manage and distribute the model parameters across the cluster. Workers perform computations and update the parameters by communicating with the parameter server (see the sketch after this list).
• Synchronization Strategies: Keeping the different components of the distributed system synchronized is crucial. Strategies include synchronous updates, asynchronous updates, and hybrids of the two.
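• The toy sketch below shows the pull/push cycle of a parameter server in plain Python; real systems run the server and workers on separate machines, and the least-squares gradient on random shards is an illustrative assumption.

    # Toy parameter-server sketch (single process, synchronous updates).
    import numpy as np

    class ParameterServer:
        """Holds the global parameters and applies worker gradients."""
        def __init__(self, dim, lr=0.1):
            self.params = np.zeros(dim)
            self.lr = lr

        def pull(self):
            return self.params.copy()          # workers fetch current parameters

        def push(self, grad):
            self.params -= self.lr * grad      # server applies a worker's update

    def worker_gradient(params, shard):
        # Least-squares gradient on this worker's shard of the data.
        X, y = shard
        return X.T @ (X @ params - y) / len(y)

    server = ParameterServer(dim=3)
    shards = [(np.random.randn(50, 3), np.random.randn(50)) for _ in range(4)]

    # Synchronous rounds: each worker pulls, computes, and pushes in turn.
    for epoch in range(10):
        for shard in shards:
            server.push(worker_gradient(server.pull(), shard))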
• 4. Scaling Algorithms:
• Horizontal Scaling: Increasing the number of machines or nodes in the cluster to handle larger datasets and more complex models (see the sketch after this list).
• Vertical Scaling: Using more powerful hardware or increasing the computational capacity of individual machines.
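• Below is a minimal sketch of horizontal scaling with Dask; LocalCluster is used for illustration, while real deployments attach to a multi-machine scheduler.

    # Horizontal scaling sketch with Dask: adding workers to a running cluster.
    from dask.distributed import Client, LocalCluster
    import dask.array as da

    cluster = LocalCluster(n_workers=2)   # start with two workers
    client = Client(cluster)

    # The array is split into chunks that workers process in parallel.
    x = da.random.random((100_000, 100), chunks=(10_000, 100))
    print(x.mean().compute())

    cluster.scale(4)                      # horizontal scaling: grow to four workers
    print(x.std().compute())              # same code, now spread over more workers

    client.close()
    cluster.close()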
• 5. Fault Tolerance:
• Distributed machine learning systems must be resilient to failures in the cluster. Techniques such as data replication, checkpointing, and fault-tolerant algorithms are employed to handle failures gracefully (see the sketch below).
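• The sketch below shows checkpointing with PyTorch so that training can resume after a failure; the file path and save frequency are illustrative assumptions.

    # Minimal checkpointing sketch for fault tolerance.
    import os
    import torch

    model = torch.nn.Linear(10, 1)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    ckpt_path = "checkpoint.pt"           # illustrative path
    start_epoch = 0

    # On restart, resume from the last checkpoint instead of starting over.
    if os.path.exists(ckpt_path):
        ckpt = torch.load(ckpt_path)
        model.load_state_dict(ckpt["model"])
        optimizer.load_state_dict(ckpt["optimizer"])
        start_epoch = ckpt["epoch"] + 1

    for epoch in range(start_epoch, 100):
        # ... one epoch of (distributed) training would run here ...
        torch.save({"model": model.state_dict(),
                    "optimizer": optimizer.state_dict(),
                    "epoch": epoch}, ckpt_path)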
• Thank you
