0% found this document useful (0 votes)
13 views33 pages

DC Unit 1

The document outlines the vision and mission of the Suman Ramesh Tulsiani Charitable Trust's SRTTC-FOE in Kamshet, focusing on developing competent engineers in Artificial Intelligence and Data Science. It discusses the fundamentals of Distributed Computing, including its characteristics, types, and applications in various fields such as predictive maintenance, fraud detection, and healthcare. Additionally, it highlights the integration of AI and data science in enhancing performance and scalability in distributed systems.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views33 pages

DC Unit 1

The document outlines the vision and mission of the Suman Ramesh Tulsiani Charitable Trust's SRTTC-FOE in Kamshet, focusing on developing competent engineers in Artificial Intelligence and Data Science. It discusses the fundamentals of Distributed Computing, including its characteristics, types, and applications in various fields such as predictive maintenance, fraud detection, and healthcare. Additionally, it highlights the integration of AI and data science in enhancing performance and scalability in distributed systems.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 33

WELCOME

Dis
Distributed Computing [417531]

Dr. R. V. Babar
Head of Dept.
Dept. of Artificial Intelligence and Data Science
SRTTC-FOE, Kamshet

[email protected]
Cell. +91 7588048922/9423558020
Suman Ramesh Tulsiani Charitable Trust’s

SRTTC, FOE, Kamshet

B.E. [Artificial Intelligence and Data Science]


Distributed Computing [417531]

UNIT No.: 1
Introduction to Distributed Computing

SRTTC, FOE, Kamshet


Institute Vision and Mission

▪ Vision: To contribute for progressive development of nation through achieving excellence in


technical education by developing globally competent engineers, transforming students into
responsible citizens, competent professionals and aspiring academicians.

▪ Mission:
▪ To impart knowledge & skill based education in collaboration with industry, academia &
research organization.
▪ To prepare competent engineers with the spirit of entrepreneurship by conducting various
Technical events and MOU.
▪ Prepare engineers to respond to the current and future needs of the industry, higher studies as
well as research.
▪ We aim to instill a sense of social responsibility and leadership qualities in our graduates,
enabling them to make positive contributions to the society at large.

SRTTC, FOE, Kamshet


Department Vision and Mission

▪ Vision:
▪ The Artificial Intelligence and Data Science is dedicated to persistently improve its
educational environment in order to develop rural youth with the strong academic and
technical backgrounds.
▪ Mission:
▪ To encourage students to become dynamic, problem solving individuals who can find
and understand the knowledge needed to be successful in the profession.
▪ Enrich Industry Institute Interaction program to get accustomed with corporate culture.
▪ To develop the students to survive with pioneering technology to meet IT industry
needs and contributing the progress of nation.

SRTTC, FOE, Kamshet


Short Term and Long-Term goals

▪ Short Term Goals:


▪ Improving in student's placement.
▪ Encouraging faculty for Ph. D. / research work.
▪ Emphasizing in Institute – Industry Liaison.
▪ Undertaking UG Industry sponsored projects.
▪ Implementation of project based learning.

▪ Long Term Goals:


▪ Undertaking consultancy activity for internal revenue generation.
▪ Establishing center of excellence in the area of computer science and engineering.

SRTTC, FOE, Kamshet


Program Specific Outcomes (PSO)

▪ PSO1:Professional Skills-The ability to understand, analyze and develop


computer programs in the areas related to algorithms, system software,
multimedia, web design, big data analytics, and networking for efficient
design of computer-based systems of varying complexities.

▪ PSO2:Problem-Solving Skills- The ability to apply standard practices and


strategies in software project development using open-ended programming
environments to deliver a quality product for business success.

▪ PSO3: Successful Career and Entrepreneurship- The ability to employ modern


computer languages, environments, and platforms in creating innovative
career paths to be an entrepreneur, and a zest for higher studies.

SRTTC, FOE, Kamshet


Teaching/Examination Scheme
Course Outcome
Job Opportunities
Role of AI in Distributed Computing
Unit 1: Contents
Basics
Distributed Computing:

Distributed computing refers to the use of multiple interconnected computers or processors that work together to solve a
complex problem or perform a task.

Characteristics of Distributed Computing:

1. Concurrency
2. Fault Tolerance
3. Scalability
4. Interprocess Communication
5. Transparency
6. Heterogeneity
7. Consistency and Replication
8. Load Balancing
9. Security
Issues of Distributed Computing
● Heterogeneity

● Scalability

● Openness

● Transparency

● Concurrency

● Security

● Failure Handling
Types of Distributed System

Peer-to-peer networks Three-Tier


Client-server architecture Middleware

● You can think of middleware as ● There are three


an application that serves two layers in it:
different apps by sitting in application, data,
between them.
and presentation.
● It serves as a foundation for
Most commonly, this
several interoperability
programmes that run on various is utilized in online
OS systems. This service allows or web applications.
for the sharing of data between
users.
3 Tier
Architecture
Distributed System Models:

1. Distributed
Computing System
➔ Cluster Computing
Click to View

➔ Grid Computing
Cluster Computing

Click to View

Grid Computing
Distributed System Models:

2. Distributed Information System

● Distributed transaction processing: It works across different servers using multiple communication models. The
four characteristics that transactions have:
● Atomic: the transaction taking place must be indivisible to the others.
● Consistent: The transaction should be consistent after the transaction has been done.
● Isolated: A transaction must not interfere with another transaction.
● Durable: Once an engaged transaction, the changes are permanent. Transactions are often constructed as
several sub-transactions, jointly forming a nested transaction.
Distributed System Models:

3. Distributed Pervasive System

Click to View
Introduction to AI & Data Science in DC

This combination opens up amazing possibilities across a range of domains, from advancing industry efficiency to taking on
challenging scientific problems. Two quickly developing fields, artificial intelligence (AI) and data science, use sophisticated
computer methods to mine data for insightful information.

Distributing computational tasks

The following are some essential ideas and methods for allocating computing tasks:

1. Workload Division
2. Communication and Synchronization
3. Scheduling and Load Balancing:
4. Orchestration and Fault Tolerance:
Introduction to AI & Data Science in DC
Data Storage and Access:

● Hadoop Distributed File System (HDFS).


Click Here
● AWS and Google Cloud Storage
● NoSQL databases:

Data Processing and Analysis:

● Batch processing: Using frameworks like Hadoop MapReduce or Spark, analyze big datasets offline.
● Stream processing: Using Apache Kafka or Apache Flink, real-time data stream analysis is possible.
● In-memory computing: Although it uses more resources, this method of processing and storing data in RAM allows for
faster analysis.
● Distributed analytics systems: Scalable and effective platforms for analyzing big datasets are offered by programs like
Spark and Google BigQuery.
Introduction to AI & Data Science in DC
Data Management and Quality:

● Data integration: Creating a single, cohesive perspective by combining data from several sources.
● Data cleaning: Fixing mistakes and discrepancies in the data.
● Data governance: Creating guidelines and protocols to guarantee privacy, security, and correctness in data management.
● Data compression: lowering the amount of data stored without sacrificing information.

Tools and Technologies:

● Apache Hadoop is an open-source system for data storage and distributed processing.
● Apache Spark: Batch and stream processing unified analytics engine.
● Apache Kafka: Real-time data processing via distributed streaming.
● Google BigQuery: A cloud-based data warehouse designed for extensive analysis of data.
● Amazon Redshift: An analytics and data warehousing cloud-based data warehouse.
Introduction to AI & Data Science in DC
● Understanding Parallel Processing:
● Numerous processors or cores: Many modern computers have numerous processing cores that can handle multiple tasks
at once.
● Distributed systems: Even more parallelization is possible when the processing capacity of several computers is
combined over a network.
● Algorithms and code optimization: Parallelization is a natural fit for some algorithms but not for others. For the best use
of processor cores, proper code optimization is essential.

Benefits of Parallel Processing:

● Quicker execution: Workloads are split up and handled separately, which drastically cuts down on completion times.
● Scalability: Performance is further enhanced by adding more processing power (cores or machines).
● Real-time capabilities: For applications that move quickly, parallelization allows for real-time analysis and response.
● Resource optimization: Tasks are divided among several cores or computers to make efficient use of the resources that
are available.
Introduction to AI & Data Science in DC
Strategies for Leveraging Parallel Processing:

● Determining which tasks may be parallelized: Some jobs cannot be parallelized because of dependencies or constraints
for sequential execution. Examine your process to find qualified applicants.

● Selecting the appropriate libraries and tools: Parallel programming and task distribution features are provided by
frameworks such as CUDA(Compute Unified Device Architecture), MPI, and OpenMP. Your unique needs and the design of
your system will determine which tool is best for you.

● Tuning performance and optimization: You can overcome potential bottlenecks and greatly increase parallelization
efficiency by fine-tuning your code and algorithms.
Application of Integrating AI & DS in DS

Predictive Maintenance: (Process Plant)

The most cutting-edge method for managing maintenance in process plants is called predictive maintenance,
or PdM.

Predictive maintenance differs from other types of maintenance in many ways. Let’s start by looking at
various different types of maintenance, such as:

● Reactive maintenance, or run-to-failure


Clickhere
● Preventive maintenance
● Prescriptive maintenance
● Predictive maintenance (PdM), or condition monitoring
Application of Integrating AI & DS in DS
Predictive Maintenance: (Process Plant)

The most cutting-edge method for managing maintenance in process plants is called predictive maintenance,
or PdM.

Predictive maintenance differs from other types of maintenance in many ways. Let’s start by looking at
various different types of maintenance, such as:

● Reactive maintenance, or run-to-failure


● Preventive maintenance
● Prescriptive maintenance
● Predictive maintenance (PdM), or condition monitoring

SAMGUARD Tool
Application of Integrating AI & DS in DS
Fraud Detection

Another fascinating area where the combination of data science and artificial intelligence might unleash enormous potential in
distributed systems is fraud detection. Now let's explore some particular use cases and applications:

1. Scalable Anomaly Detection:

2. Distributed Graph-based Fraud Detection:

3. Adaptive Fraud Scoring and Risk Assessment:

4. Collaborative Threat Intelligence Sharing:

5. Edge-based Fraud Detection:


Application of Integrating AI & DS in DS
Intelligent Transportation Systems (ITS) View

Another excellent illustration of how AI and data science excel in distributed systems is found in intelligent transportation
systems (ITS). By combining several technologies, they seek to increase the sustainability, safety, and efficiency of transportation
networks:

● 1. Real-time Traffic Management and Congestion Control


● 2. Connected and Autonomous Vehicles (CAVs)
● 3. Public Transportation Optimization
● 4. Predictive Maintenance for Transportation Infrastructure
● 5. Multimodal Transportation Planning and Integration
Application of Integrating AI & DS in DS
Supply Chain optimization

● Supply chain optimization is an extremely fascinating field where data science and artificial intelligence are applied to
dispersed systems! Let's investigate a few particular application domains.

1. Demand Forecasting and Inventory Management


2. Logistics and Transportation Optimization
3. Risk Management and Proactive Planning
4. Smart Contracts and Collaborative Optimization:
5. Predictive Maintenance for Machinery and Assets
Application of Integrating AI & DS in DS
Energy Management

Another exciting area where AI and data science combine in distributed systems to produce game-changing solutions is energy
management. Here are a few crucial areas in which they excel:

1. Demand Forecasting and Grid Optimization

2. Renewable Energy Integration and Forecasting:

3. Smart Grids and Distributed Resource Management:

4. Energy Efficiency and Smart Building Controls:


Application of Integrating AI & DS in DS
Healthcare and Medical diagnostics

In this extremely important subject, AI and data science are transforming methods for diagnosing diseases and providing
medical care. Let's examine a few crucial areas where they are having a big influence:

1. Medical Imaging and Diagnosis:


2. Clinical Decision Support and Risk Prediction:
3. Personalized Medicine and Precision Healthcare:
4. Remote Patient Monitoring and Telemedicine:
5. Drug Safety and Pharmacovigilance:
Application of Integrating AI & DS in DS
Customer Behavior Analysis and Natural Language Processing (NLP):

NLP can be used to analyze consumer behavior in the following ways:

1. Social Media Listening and Sentiment Analysis:


2. Customer Feedback Analysis:
3. Chatbot Conversations and Personalized Recommendations:
4. Predictive Analytics and Customer Churn:
5. Voice of the Customer (VOC) Analysis
Application of Integrating AI & DS in DS
Case Study: Enhancing Scalability and Performance in E-Commerce through Distributed Computing

1. Abstract:
2. Introduction:
3. Problem Statement:
4. Solution:
5. Implementation:
6. Results
7. Conclusion
3 Department of AI & DS Engineering | SRTTC-

You might also like