0% found this document useful (0 votes)

12 views55 pages

Chap2 ComputingTrends

Chapter 2 of DS 444 discusses computing trends relevant to Big Data, covering various computing technologies such as high-performance computing, grid computing, and cloud computing. It highlights the challenges and objectives in managing big data workflows, the importance of computing power in scientific applications, and the ongoing research in data-intensive computing. The chapter also outlines the evolution of computing paradigms and the significance of energy efficiency, data management, and algorithm development in achieving exascale computing capabilities.

Uploaded by

Pavan Frustum

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views55 pages

Chap2 ComputingTrends

Uploaded by

Pavan Frustum

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 55

DS 444: Introduction to Big Data

Chapter 2. Computing Trends for Big Data

Yijie Zhang

New Jersey Institute of Technology

[email protected]

1
Outline
• Introduction
• Challenges & Objectives
• Computing Technologies
− High-performance computing
• Supercomputing and Cluster Computing
− Grid computing
− Computing continuum: from edge to fog to cloud
− Mobile computing
• Ongoing Research in Our Group
− Data-intensive computing
− High-performance networking

2
3
Office

Supercomputer

4
Big-data Applications
• BIG DATA: rapidly increase from T to P, to E, to Z, to Y, and
beyond…
− Science
• Simulation
− Astrophysics, climate modeling, combustion research, etc.
• Experimental
− Spallation Neutron Source, Large Hadron Collider, microarray, genome sequencing,
protein folding, etc.
• Observational
− Large-scale sensor networks, astronomical imaging/radio devices (Dark Energy
Camera, Five-hundred-meter Aperture Spherical Telescope – FAST), etc.
− Business
• Financial transactions
− Wal-Mart, NY stock trading center, Amazon, Alibaba
− Social media
• YouTube, Facebook, Twitter, Weblogs, TikTok, WeChat
No matter which type of data is considered, we need
a high-performance end-to-end computing solution
to support data generation, storage, transfer, processing, and analysis!
5
Big-data Workflows
• Require massively distributed resources
− Hardware
• Computing facilities, storage systems, special rendering engines, display devices
(tiled display, powerwall, etc.), network infrastructures, etc.
− Software
• Domain-specific data analytics/processing tools, programs, etc.
− Data
• Real-time, archival

• Feature different complexities

− Simple case: linear pipeline (a special case of DAG)
− Complex case: DAG-structured graph
• Different application types have different performance
requirements
− Interactive: minimize total end-to-end delay for fast response
− Streaming: maximize frame rate to achieve smooth data flow

6
Computing Paradigms: an Overview
• Client–Server Model
− Client–server computing refers broadly to any distributed application that distinguishes between service providers (servers)
and service requesters (clients)

• High-performance Computing (Supercomputing, Cluster Computing)

− Powerful computers: supercomputer, PC cluster
− Used mainly by large organizations for critical applications, typically bulk data processing such as scientific computing,
enterprise resource planning, and financial transaction processing
− Programming models: MPI, OpenMP, CUDA, MapReduce/Hadoop, Spark

• Grid Computing
− A form of distributed computing and parallel computing, whereby a “super and virtual computer” is composed of a cluster of
networked, loosely coupled computers acting in concert to perform very large tasks

• Cloud (Utility) Computing

− The packaging of computing resources, such as computation and storage, as a metered service similar to a traditional public
utility, such as electricity

• Service-Oriented Computing
− Cloud computing provides services related to computing while, in a reciprocal manner, service-oriented computing consists of
the computing techniques that operate on software-as-a-service

• Edge Computing
− A distributed computing paradigm that brings computation and data storage closer to the sources of data to improve response
times and save bandwidth
− Internet of things (IoT) is an example of edge computing

• Peer-to-peer (P2P) Computing

− Distributed architecture without the need for central coordination, with participants being at the same time both suppliers and
consumers of resources (in contrast to the traditional client–server model)

• Mobile Computing
− Computing on the go!

7
High-performance Computing
(Supercomputing, Cluster Computing)

8
Supercomputing for Scientific Applications
Astrophysics
Computational biology

Nanoscience

Climate research

Neutron sciences

Flow dynamics

Fusion simulation
Computational materials

9
Why do we care about computing power?
Computer Security: Exhaustive Key Search
• Two types of security
− Computational security
− Unconditional security
• Two types of encryption methods
− Conventional (a.k.a. single-key, secret-key, symmetric): DES/DEA
− Public key-based (a.k.a. asymmetric): RSA, D-H
• Either key could be used for encryption, but only the other key can be used for decryption
• Attack on computational security
− Always possible to simply try every key
− Most basic attack, proportional to key size
− Assume to either know / recognize plaintext
Key Size (bits) Number of Alternative Time required at Time required at
Keys 1 decryption/µs 106 decryptions/µs
32 232 = 4.3 × 109 231 µs = 35.8 minutes 2.15 milliseconds
56 256 = 7.2 × 1016 255 µs = 1142 years 10.01 hours
128 2128 = 3.4 × 1038 2127 µs = 5.4  1024 years 5.4  1018 years

168 2168 = 3.7 × 1050 2167 µs = 5.9  1036 years 5.9  1030 years

26 letters 26! = 4 × 1026 2  1026 µs = 6.4  1012 years 6.4  106 years
(permutation)
10
11
Exascale Race/Technologies
Projected Exascale Dates and Suppliers

U.S. EU
§ Sustained ES*: 2022-2023 § PEAK ES: 2023-2024
§ Peak ES: 2021 § Pre-ES: 2021-2022
§ Vendors: U.S. § Vendors: U.S., Europe
§ Processors: U.S. (some ARM?) § Processors: Likely ARM
§ Initiatives: NSCI/ECP § Initiatives: EuroHPC
§ Cost: $600M per system, plus § Cost: $300-$350M per system,
heavy R&D investments plus heavy R&D investments

China Japan
§ Sustained ES*: 2021-2022 § Sustained ES*: ~2022
§ Peak ES: 2020 § Peak ES: Likely as a AI/ML/DL system
§ Vendors: Chinese (multiple sites) § Vendors: Japanese
§ Processors: Chinese (plus U.S.?) § Processors: Japanese
§ 13th 5-Year Plan § Cost: $800M-$1B, this includes both 1
§ Cost: $350-$500M per system, system and the R&D costs, will also do
plus heavy R&D many smaller size systems
* 1 exaflops on a 64-bit real application

© Hyperion Research 2018 14 14

12
13
14
Top 500 June 2024 Release

15
16
17
Top 10 Challenges to Exascale
3 Hardware, 4 Software, 3 Algorithms/Math Related
• Energy efficiency: • Data management:
• Creating more energy efficient circuit, • Creating data management software that
power, and cooling technologies. can handle the volume, velocity and
diversity of data that is anticipated.
• Interconnect technology:
• Increasing the performance and energy • Scientific productivity:
efficiency of data movement. • Increasing the productivity of
computational scientists with new software
• Memory Technology: engineering tools and environments.
Integrating advanced memory
•
technologies to improve both capacity • Exascale Algorithms:
and bandwidth. • Reformulating science problems and
refactoring their solution algorithms for
• Scalable System exascale systems.
Software: • Algorithms for discovery,
Developing scalable system software
•
that is power and resilience aware. design, and decision:
• Facilitating mathematical optimization and
• Programming systems: uncertainty quantification for exascale
• Inventing new programming discovery, design, and decision making.
environments that express massive
parallelism, data locality, and resilience • Resilience and correctness:
• Ensuring correct scientific computation in
face of faults, reproducibility, and
algorithm verification challenges. 13

18
19
Computing power continues to increase over time

• Frontier: first ever reaching exascale supercomputing!

More computing power

More complex models More control parameters More simulation runs

Colossal amounts of scientific datasets!

20
Terascale Supernova Initiative (TSI)
• Collaborative project
− Supernova explosion
• TSI simulation
− 1 terabyte a day with a
small portion of
parameters
− From TSI to PSI to ESI
• Transfer to remote sites
− Interactive distributed
visualization
− Collaborative data
analysis
− Computation monitoring
− Computation steering

Visualization channel

Visualization control channel

Computation steering channel

Client Supercomputer or Cluster

21
High Performance Computing
Supercomputing Cluster Computing
(MPI, OpenMP) (MapReduce, Spark)
HPC HPDA
Data Data Data Data
Numerically Data
In Out In Out
Intensive Intensive

Input Function Solution Input + Solution

19
simulation learning
inference

22
Comparison of Data Analytics and Computing Ecosystems

Java, Python, Scala

Spark

23
HPC Architectures

▪ Distributed memory multiprocessors

▪ MPP: Massively Parallel Processing
▪ Tightly integrated
▪ Constellations
▪ Clusters of supercomputers
24
How are processors connected?

25
Type of Parallelization
Trivial:
• Each CPU does a part of
the work independently.

Nontrivial:
• Each CPU relies on its
neighbor CPUs to complete
the assigned work.

26
Performance Metrics
• Execution time
• Time when the last
processor finishes its work
• Speedup
• (time on 1CPU) / (time on
P CPUs)
• Efficiency
• Speedup / P

27
28
Grid Computing

29
Grid Computing
• Who needs grid computing?
− Particular software capabilities
• Modelling, simulation, etc.
− High hardware/computing demands
• Processing, storage, etc.
− Large network bandwidth
• Circuit provisioning to support large data transfer
• Problems, which are hard (or impossible) to solve at a
single site, can be solved with the right kind of
parallelization and distribution of the tasks involved.
• There are two primary types of grids
− Computational grids
• Open Science Grid (OSG)
• Worldwide LHC Computing Grid (WLCG)
− Data grids
• Earth System Grid (ESG)

30
Requirements
• Computational grids
− Manage a variety of computing resources
− Select computing resources capable of running a user’s job
− Predict loads on grid resources
− Decide about resource availability
− Dynamic resource configuration and provisioning
• Data grids
− Provide data virtualization service
− Support flexible data access, filtering, and transfer mechanisms
− Provide security and privacy mechanisms
• Grid computing environments are constructed upon
three foundations
− Coordinated resources
− Open standard protocols and frameworks
− Non-trivial QoS

31
Computing Continuum:
from Edge to Fog to Cloud

32
Cloud Computing
• What is cloud computing?
− The phrase originated from the cloud symbol used to symbolize the
Internet
− A model for enabling convenient, on-demand network access to a
shared pool of configurable computing resources (e.g., networks,
servers, storage, applications, and services) that can be rapidly
provisioned and released with minimal management effort or service
provider interaction
− Provide computation, software, data access, and storage services that
do not require end-user knowledge of the physical location and
configuration of the system that delivers the services
• Cloud architecture
− Involve multiple cloud components communicating with each other
over application programming interfaces, usually web services and 3-
tier architecture
• Front end: seen by the user, such as a web browser
• Back end: the cloud itself comprising computers, servers, data storage devices, etc.

33
Five Layers in Cloud Computing
Front and back ends:
• Client
− A cloud client consists of computer hardware and/or computer software that relies on
cloud computing for application delivery, or that is specifically designed for delivery of
cloud services
• Server
− The servers layer consists of computer hardware and/or computer software products that
are specifically designed for the delivery of cloud services, including multi-core
processors, cloud-specific operating systems and combined offerings

Three types of cloud computing services

• Application
− Cloud application services or "Software as a Service (SaaS)" deliver software as a service
over the Internet, eliminating the need to install and run the application on the customer's
own computers and simplifying maintenance and support
• Platform
− Cloud platform services or "Platform as a Service (PaaS)" deliver a computing platform
and/or solution stack as a service, often consuming cloud infrastructure and sustaining
cloud applications
• Infrastructure
− Cloud infrastructure services, also known as "Infrastructure as a Service (IaaS)", delivers
computer infrastructure – typically a platform virtualization environment – as a service

34
Real-life Cloud Computing Environments
• Microsoft Windows Azure
• Google Gov Cloud
• Amazon EC2
• Alibaba Cloud
• Baidu Cloud
• Eucalyptus (first open-source platform for private clouds, 2008)
• A software platform for the implementation of private cloud computing on
computer clusters
• Many others

35
Managing Amazon EC2 instances

• AWS management console

− Web-based, most powerful
− http://aws.amazon.com/console/
 Command line tools
− http://aws.amazon.com/developertools/351?_encoding=
UTF8&jiveRedirect=1
• Third-party UI tools
− Example: ElasticFox browser add on

36
Login AWS management console

37
Select AMI to create instance(s)

Select instance type: http://aws.amazon.com/ec2/instance-types/

38
Assign instance to security group(s)

• A security group defines firewall rules for

instances
• At launch time, instance can be assigned to
multiple groups
− Default group doesn’t allow any network traffic
• Once an instance is running, it can't change to
which security group(s) it belongs
• Can modify rules for a group at any time
• New rules automatically enforced for all running
instances and instances launched in the future
39
Illustration of security group

40
Launch instance

41
Instance IP addresses

• Public IP and DNS

− Public addresses are reachable from the Internet
• Private IP and DNS
− Private addresses are only reachable from within the
Amazon EC2 network
• Elastic IPs
− Static IP addresses associated with account, not
specific instances
− If instance using elastic IP fails, this address can be
quickly remapped to another instance
• DNS propagation may take a long time if remapping the
name to another IP address
42
Checking instance IP/DNS

43
Regions and
availability zones

• Regions are located in separate geographic areas (US:

Virginia & California, Ireland, Singapore, etc.)
− Each Region is completely isolated
− Failure independence and stability
• Availability Zones are distinct locations within a Region
− Isolated, but connected through low-latency links
− Failure resilience
• Current pricing: http://aws.amazon.com/ec2/pricing/
44
Connecting to instances
• Windows instance
− Connect from your browser using the Java SSH
Client
− Connect using Remote Desktop
• Linux Instance
− Connect from your browser using the Java SSH
Client
− Putty.exe/putty key generator
− SSH command on linux machine
• ssh -i key.pem ec2-user@ec2-23-20-83-
139.compute-1.amazonaws.com

45
Connect to Linux instance

46
Connect to Windows instance: get
administrator password

47
Connect to Windows instance

• Type in public DNS name of

instance
• Use the retrieved
administrator password

48
IAM – Identity Access Management

• “AWS Identity and Access Management (IAM)

helps to securely control access to AWS
resources

• IAM is used to control who can use your AWS

resources (authentication) and what resources
they can use and in what ways (authorization).”

49
Mobile and Ubiquitous Computing

50
Mobile and Ubiquitous Computing

51
Mobile and Ubiquitous Computing
Internet of Things (IoT)

GPRS: General Packet Radio Service

WTP: Wireless Transport Protocol
WAP: Wireless Application Protocol with 2 components: WML and WMLScript
WML: Wireless Markup Language
cHTML: Compact HTML
HDML: Handheld Device Markup Language
52
Wireless Sensor Networks
• Pervasive applications
• Agricultural
• Military
• Industrial

53
Ongoing Research in Our Big Data Group
• Data-intensive computing
− Big data ecosystem
− AI and ML
− Scientific Workflow optimization
• Mapping, scheduling, modeling
• High-performance networking
− Bandwidth scheduling
− Transport control
− Control plane design
• Distributed sensor networks
− Deployment, routing, fusion
• Cyber security
− Monitoring, game theory
• Visualization and image processing
54
Thanks！☺
Questions ?

Basics Computer Architecture by Pooyan Jamshidi 1731311297
No ratings yet
Basics Computer Architecture by Pooyan Jamshidi 1731311297
266 pages
Cloud Computing Unit-1
100% (1)
Cloud Computing Unit-1
88 pages
Unit 1 - Computing Paradigms
No ratings yet
Unit 1 - Computing Paradigms
31 pages
Unit - 1 Systems Modelling, Clustering and Virtualization: 1. Scalable Computing Over The Internet
No ratings yet
Unit - 1 Systems Modelling, Clustering and Virtualization: 1. Scalable Computing Over The Internet
28 pages
Parallel Computing
No ratings yet
Parallel Computing
57 pages
Module - 01 CC (BCS601)
No ratings yet
Module - 01 CC (BCS601)
47 pages
Csc4306 Net-Centric Computing
100% (1)
Csc4306 Net-Centric Computing
5 pages
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
34 pages
Unit-1 (Cloud Computing) 1. (Accessible) Scalable Computing Over The Internet
100% (1)
Unit-1 (Cloud Computing) 1. (Accessible) Scalable Computing Over The Internet
17 pages
Introduction To High-Performance Computing
No ratings yet
Introduction To High-Performance Computing
13 pages
Visit:: Join Telegram To Get Instant Updates: Contact: MAIL: Instagram: Instagram: Whatsapp Share
No ratings yet
Visit:: Join Telegram To Get Instant Updates: Contact: MAIL: Instagram: Instagram: Whatsapp Share
134 pages
VTU Exam Question Paper With Solution of 18CS72 Big Data and Analytics Feb-2022-Dr. v. Vijayalakshmi
No ratings yet
VTU Exam Question Paper With Solution of 18CS72 Big Data and Analytics Feb-2022-Dr. v. Vijayalakshmi
25 pages
Wind On Sheds PDF
100% (1)
Wind On Sheds PDF
12 pages
Spark Introduction
No ratings yet
Spark Introduction
90 pages
Lecture Notes: (R15A0529) B.Tech Iv Year - I Sem (R15) (2019 - 20)
No ratings yet
Lecture Notes: (R15A0529) B.Tech Iv Year - I Sem (R15) (2019 - 20)
223 pages
Module-1 Notes
No ratings yet
Module-1 Notes
46 pages
4.big Data Platforms
No ratings yet
4.big Data Platforms
49 pages
Visvesvaraya Technological University (VTU) : Created by
No ratings yet
Visvesvaraya Technological University (VTU) : Created by
47 pages
(Viral) Kamal Kaur Viral Video Original Link
No ratings yet
(Viral) Kamal Kaur Viral Video Original Link
5 pages
1 Introduction
No ratings yet
1 Introduction
48 pages
Scalable Computing Over The Internet
No ratings yet
Scalable Computing Over The Internet
41 pages
CLOUD COMPUTING Unit 1
No ratings yet
CLOUD COMPUTING Unit 1
51 pages
Lecture Week - 1 Introduction 1 - SP-24
No ratings yet
Lecture Week - 1 Introduction 1 - SP-24
51 pages
HPC Tools and Technologies For Web Programming
No ratings yet
HPC Tools and Technologies For Web Programming
33 pages
BCS601 Module 01
No ratings yet
BCS601 Module 01
36 pages
Parallel Computing An Introduction
No ratings yet
Parallel Computing An Introduction
40 pages
CUDA Algorithms For JAVA
No ratings yet
CUDA Algorithms For JAVA
45 pages
U2 Direct Shear Test & Unconfined Compression Test
88% (8)
U2 Direct Shear Test & Unconfined Compression Test
34 pages
4.big Data Platforms
No ratings yet
4.big Data Platforms
37 pages
Lecture 12 Exascale, Quantum Computing, Review
No ratings yet
Lecture 12 Exascale, Quantum Computing, Review
38 pages
4.big Data Platforms
No ratings yet
4.big Data Platforms
40 pages
Lecture 1 - Introduction To Cloud Computing
No ratings yet
Lecture 1 - Introduction To Cloud Computing
32 pages
Cloud COMPUTING Module 4
No ratings yet
Cloud COMPUTING Module 4
50 pages
Cloud Computing - Unit-1
No ratings yet
Cloud Computing - Unit-1
54 pages
Water Penetration of Metal Roof Panel Systems by Static Water Pressure Head
No ratings yet
Water Penetration of Metal Roof Panel Systems by Static Water Pressure Head
4 pages
Module1 Part1
No ratings yet
Module1 Part1
26 pages
A Comparative Survey of Big Data Computing and HPC
No ratings yet
A Comparative Survey of Big Data Computing and HPC
38 pages
Lec1 and 2
No ratings yet
Lec1 and 2
52 pages
HPC Week1 Samp
No ratings yet
HPC Week1 Samp
23 pages
Recent Computing Trends - Paradigm Shifts
No ratings yet
Recent Computing Trends - Paradigm Shifts
24 pages
Lecture 9
No ratings yet
Lecture 9
72 pages
Grid and Cloud Computing
No ratings yet
Grid and Cloud Computing
46 pages
Scalability: Scalable Computing Over The Internet - Simplified
No ratings yet
Scalability: Scalable Computing Over The Internet - Simplified
17 pages
Lecture 1 - Overview of Distributed Computing
No ratings yet
Lecture 1 - Overview of Distributed Computing
71 pages
ModernComputingVision1 s2.0 S2772503024000021 Main
No ratings yet
ModernComputingVision1 s2.0 S2772503024000021 Main
38 pages
Lecture 1 COAL
No ratings yet
Lecture 1 COAL
13 pages
Lec1 Introduction
No ratings yet
Lec1 Introduction
23 pages
CC 1
No ratings yet
CC 1
11 pages
Sci & Tech Booklet B
No ratings yet
Sci & Tech Booklet B
16 pages
CAQA5e ch1
No ratings yet
CAQA5e ch1
42 pages
2219-Article Text-15412-2-10-20230802
No ratings yet
2219-Article Text-15412-2-10-20230802
12 pages
Big Data in Research and Education
No ratings yet
Big Data in Research and Education
70 pages
Age of Exascale
No ratings yet
Age of Exascale
46 pages
UNIT I Notes
No ratings yet
UNIT I Notes
5 pages
Big Data
No ratings yet
Big Data
29 pages
No Exaflops For You
No ratings yet
No Exaflops For You
61 pages
Computer Architecture: Challenges and Opportunities For The Next Decade
No ratings yet
Computer Architecture: Challenges and Opportunities For The Next Decade
13 pages
Chapter1 PDF
No ratings yet
Chapter1 PDF
30 pages
Theories in Nursing Informatics
No ratings yet
Theories in Nursing Informatics
31 pages
Closed-Loop Control of DC Drives With Controlled Rectifier
0% (1)
Closed-Loop Control of DC Drives With Controlled Rectifier
40 pages
Cloud Compting Prince
No ratings yet
Cloud Compting Prince
3 pages
Cloud Computing Rupai
No ratings yet
Cloud Computing Rupai
3 pages
Drain PCC 136 BOQ With Rates
No ratings yet
Drain PCC 136 BOQ With Rates
29 pages
Advanced Technologies in Computing
No ratings yet
Advanced Technologies in Computing
2 pages
Lecture III Notes
No ratings yet
Lecture III Notes
3 pages
Group 4 Travel Device
No ratings yet
Group 4 Travel Device
8 pages
Chapter 4 Flexural Design - (Part 3)
No ratings yet
Chapter 4 Flexural Design - (Part 3)
37 pages
Chemical Catalog
No ratings yet
Chemical Catalog
58 pages
Installation Information Emg Models: Passive / Passive: Output
No ratings yet
Installation Information Emg Models: Passive / Passive: Output
1 page
"Rectbeam" - Rectangular Concrete Beam Analysis/Design: Program Description
No ratings yet
"Rectbeam" - Rectangular Concrete Beam Analysis/Design: Program Description
46 pages
Amit Yadav Project
No ratings yet
Amit Yadav Project
49 pages
AMV in Pharma
No ratings yet
AMV in Pharma
13 pages
Ecr RS
No ratings yet
Ecr RS
11 pages
English Notebook Face Sheet-21.04.25
No ratings yet
English Notebook Face Sheet-21.04.25
3 pages
Grade 2, Module 2 (45 Pages)
No ratings yet
Grade 2, Module 2 (45 Pages)
45 pages
Mendel and Heredity Worksheet
No ratings yet
Mendel and Heredity Worksheet
11 pages
Koala - Wikipedia
No ratings yet
Koala - Wikipedia
24 pages
Lecture 15 - Summing Up of Part-1 (Policy) & Introduction To Housing Planning
No ratings yet
Lecture 15 - Summing Up of Part-1 (Policy) & Introduction To Housing Planning
17 pages
PH Formative Assessment Training - Final Report - FINAL
No ratings yet
PH Formative Assessment Training - Final Report - FINAL
50 pages
History Compass - 2022 - Milne Smith - Gender and Madness in Nineteenth Century Britain
No ratings yet
History Compass - 2022 - Milne Smith - Gender and Madness in Nineteenth Century Britain
10 pages
Sentence and Reading Comprehension
No ratings yet
Sentence and Reading Comprehension
4 pages
MY Resume
No ratings yet
MY Resume
1 page
Unique Aspects of Accounting - Non-Profit and Healthcare Organizations
No ratings yet
Unique Aspects of Accounting - Non-Profit and Healthcare Organizations
28 pages
Introduction To Integrated Library Systems: Lesson 5. How Do You Implement An Integrated Library System?
No ratings yet
Introduction To Integrated Library Systems: Lesson 5. How Do You Implement An Integrated Library System?
19 pages
Theory of Literature
No ratings yet
Theory of Literature
5 pages
Course Catalogue and Timetable 1st-2nd Semester - Dept of Engineering Enzo Ferrari - AA2023-2024
No ratings yet
Course Catalogue and Timetable 1st-2nd Semester - Dept of Engineering Enzo Ferrari - AA2023-2024
2 pages
Dual, Low Noise, High Performance Uncompensated Operational Amplifier
No ratings yet
Dual, Low Noise, High Performance Uncompensated Operational Amplifier
5 pages
Cove R Lin e
No ratings yet
Cove R Lin e
17 pages
Study Guide Cisco 300-915 DEVIOT Developing Solutions using Cisco IoT and Edge Platforms Exam
From Everand
Study Guide Cisco 300-915 DEVIOT Developing Solutions using Cisco IoT and Edge Platforms Exam
Anand Vemula
No ratings yet
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)

Chap2 ComputingTrends

Uploaded by

Chap2 ComputingTrends

Uploaded by

DS 444: Introduction to Big Data

Chapter 2. Computing Trends for Big Data

New Jersey Institute of Technology

• Feature different complexities

• High-performance Computing (Supercomputing, Cluster Computing)

• Cloud (Utility) Computing

• Peer-to-peer (P2P) Computing

© Hyperion Research 2018 14 14

• Frontier: first ever reaching exascale supercomputing!

More computing power

More complex models More control parameters More simulation runs

Colossal amounts of scientific datasets!

Visualization control channel

Client Supercomputer or Cluster

Input Function Solution Input + Solution

Java, Python, Scala

▪ Distributed memory multiprocessors

Three types of cloud computing services

• AWS management console

Select instance type: http://aws.amazon.com/ec2/instance-types/

• A security group defines firewall rules for

• Public IP and DNS

• Regions are located in separate geographic areas (US:

• Type in public DNS name of

• “AWS Identity and Access Management (IAM)

• IAM is used to control who can use your AWS

GPRS: General Packet Radio Service

You might also like