
Distributed Symmetric Multi-Processing (DSMP)

A New Paradigm for High Performance Computing

Symmetric Computing
Venture Development Center
University of Massachusetts
100 Morrissey Boulevard
Boston, MA 02125
USA
Introduction
Today, the de facto standard for high performance computing (HPC) is the distributed-memory cluster programmed with the Message Passing Interface (MPI). However, to achieve the performance that HPC clusters promise, applications must be tailored to the architecture. Over time, a library of cluster-ready HPC applications has been developed, but many important end-user applications remain unaddressed. Problems with big data sets are awkward at best for the MPI model, which provides no shared memory for storing large data structures. Even after a successful port, many programs suffer poor performance due to the MPI hierarchy and message latency, and/or reliance on file systems as a working global memory. Entire fields of scientific endeavor that could benefit from high performance computing have not, due primarily to the programming complexity of HPC clusters.
Most scientists, researchers, engineers and analysts want to focus on their specialty, get their computations executed quickly, and avoid becoming entangled in the programming complexities that supercomputing clusters demand. They typically develop their work on Symmetric Multi-Processing (SMP) workstations, which align more closely with their programming skills, and need supercomputers only for their most demanding applications and data sets. They would rather not re-write their applications for a supercomputing cluster and would prefer to use SMP supercomputers. However, SMP supercomputers have been out of reach economically for many: they have been too expensive due to their reliance on costly proprietary hardware and proprietary interconnects, and they have had limited scalability.

What is needed is the ability to make computing clusters function like large SMP machines. Two approaches have attempted to achieve this goal. The first mimicked the architecture of mainframe SMP machines by adding custom boards to each node in a cluster to enforce cache-line coherence and shared memory across all nodes. The second capitalized on virtualization techniques, building a hypervisor that ran on each node to create a virtual SMP machine. The performance of both approaches was somewhat disappointing.
Symmetric Computing’s patented Distributed Symmetric Multi-Processing (DSMP) takes a different approach. By recognizing the limitations of the mainframe cache-line coherency model and implementing our algorithms as extensions to the Linux kernel, we are able to deliver the performance of mainframe supercomputers at the cost of computing clusters.
Our model maintains the programming simplicity of SMP. The DSMP technology can
potentially bring the benefits of supercomputing to heretofore unreached fields of
research, development and analysis. Furthermore, this technology has the potential to
replace traditional mainframes in many HPC and enterprise applications.

Limitations of MPI Supercomputing Clusters
Computations assigned to an MPI cluster must be carefully structured to accommodate the limitations of the individual server nodes that make up the cluster. In many cases, highly skilled programmers are needed to modify code to accommodate the cluster's hierarchy, per-node memory limitations and messaging scheme. Once the data sets and program are ready to run, they must first be propagated onto every node within the cluster, usually by means of a cluster file system such as Lustre. Only then can actual work begin.
Besides complexity, there are entire classes of applications and data sets that are inappropriate for MPI clusters. Many high performance computing applications (e.g., genomic sequencing, coupled engineering models) involve large data sets and require large shared memory (≥ 512 GB of RAM). Addressing these problems is awkward using the MPI-1 model, which has no shared-memory concept, and MPI-2 offers only a limited distributed shared-memory concept with a significant latency penalty. Hence a significant restructuring of the application and its associated data sets is required in order to use MPI.
Data sets in many fields (e.g., bioinformatics and life sciences, Computer-Aided Engineering (CAE), energy, earth sciences, financial analysis) are becoming too large and too computationally intensive for single commodity SMP servers. In many cases, it is impractical and inefficient to rewrite the application to use an MPI cluster. The alternatives are to:
• Restructure the problem to fit within the memory limitations of the nodes and suffer the resulting inefficiencies, or
• Wait for and purchase time on a university or national laboratory SMP supercomputer.
Each of these options has its own drawbacks, ranging from latency and performance penalties to lengthy queues for time on a government (e.g., NSF, DoE) supercomputer. What HPC users really want is unencumbered access to an affordable, large shared-memory SMP supercomputer.

Distributed Symmetric Multi-Processing (DSMP)


Symmetric Computing's DSMP architecture provides affordable SMP supercomputing. It enables Distributed Shared Memory (DSM), or a Distributed Global Address Space (DGAS), across an Infiniband-connected cluster of homogeneous Symmetric Multi-Processing (SMP) nodes. The cluster is converted into a DSMP supercomputer that can service very large data sets, or run MPI applications on shared memory with increased efficiency and throughput. DSMP provides an alternative to MPI for a wide range of memory-intensive applications because it can economically service a wider class of problems with greater efficiency.
DSMP creates a large, shared-memory software architecture at the operating-system level. It supports distributed multi-threading and synchronization across all processor cores on all of the nodes using the standard POSIX thread model (Pthreads). From the programmer's perspective, there is a single software image and one Linux operating system for a DSMP cluster. Since DSMP runs on clusters built with industry-standard servers, it delivers large shared-memory, many-core SMP mainframe computing with both economy and performance.
The key features of the DSMP architecture are:
1. A transactional distributed shared-memory architecture
2. A kernel-based RDMA inter-node communication driver using Infiniband
3. An application-driven, memory-page coherency scheme
4. A new kernel-based distributed POSIX threads implementation supporting thread localization
5. Support for process execution and System V IPC across all nodes
6. Distributed buffered file I/O
7. A single system image via the head node

How DSMP Works


DSMP is implemented as a Linux kernel enhancement. The DSMP Linux kernel is installed on every node in the Infiniband-connected cluster. These kernels coordinate their activities, effectively operating as a single cluster operating system. One node is designated the head node and the others worker nodes. An application program begins execution on the head node, but it can allocate global memory from a combined pool drawn from multiple nodes and launch execution threads on multiple worker nodes, all referencing the same global address space. Currently, DSMP supports up to 16 nodes that host global memory; additional compute-only worker nodes, which can access but not host global memory, are also supported. The generalized DSMP architecture is shown in the following diagram: nodes with a green-shaded global memory region host global memory, whereas nodes with a white-shaded global memory region are compute-only and access global memory remotely.

Transactional Distributed Shared-Memory System¹
The centerpiece of the DSMP architecture is its transactional distributed shared-memory architecture, which is based on a two-tier memory organization. DSMP divides physical memory into two partitions: local working memory and global shared memory. The global partitions on each node are combined to form a single global shared memory that is linearly addressable by all nodes in a consistent manner. The global memory maintains, at a fixed address, a reference copy of each 4096-byte memory page used by a program running on the system. The local memory contains a subset of the total memory pages used by the running program. Memory pages are copied via hardware-based demand paging from global memory to local memory when needed by the executing program. Any changes made to local memory pages are written back to the global memory. At the same time, a page-invalidation message is sent to all nodes to force any node holding a copy of the page to update that copy. If a node receives a page-invalidation request for a page that it has locally modified, a page-invalidation fault is generated.
DSMP sets the size of local memory at boot time, typically 64 GB. When there is a page fault in local memory, the DSMP kernel finds an appropriate not-recently-used (NRU) 4096-byte page in local memory and swaps in the missing global memory page. The large local memory (cache) provides all the performance benefits (STREAMS, RandomAccess and Linpack) of local memory in a legacy SMP server, with the ability to service a page fault from the large globally shared memory in less than 5 microseconds. Not only is this architecture unique and extremely powerful, it can scale to hundreds of nodes with no appreciable loss in performance.

¹ The use of the word “transactional” here is drawn from the standard programming model used in database development, where the programmer is responsible for managing record locks in order to ensure data coherency.
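To make the two-tier, page-granular layout concrete, the short C program below computes, for a given global address, its 4096-byte page and the node assumed to host the page's reference copy. The base address, per-node capacity and block-linear layout are illustrative assumptions only; the whitepaper states merely that global memory is page-granular and linearly addressable across all nodes.

/* Conceptual sketch of DSMP's two-tier page model (illustrative only).
 * The 4096-byte page size follows the whitepaper; the base address,
 * per-node capacity and block-linear layout are assumptions. */
#include <inttypes.h>
#include <stdio.h>

#define PAGE_SIZE        4096ULL
#define GLOBAL_BASE      0x100000000000ULL   /* assumed start of the global address space   */
#define GLOBAL_PER_NODE  (256ULL << 30)      /* assumed 256 GB of global memory per host    */

/* Identify the page containing an address and the node hosting its reference copy. */
static uint64_t page_of(uint64_t addr)   { return addr & ~(PAGE_SIZE - 1); }
static unsigned home_node(uint64_t addr) { return (unsigned)((addr - GLOBAL_BASE) / GLOBAL_PER_NODE); }

int main(void)
{
    uint64_t addr = GLOBAL_BASE + (300ULL << 30) + 123;   /* an arbitrary global address */

    printf("address   : 0x%" PRIx64 "\n", addr);
    printf("4 KB page : 0x%" PRIx64 "\n", page_of(addr));
    printf("home node : %u\n", home_node(addr));

    /* On a local-memory page fault the DSMP kernel, per the whitepaper, picks a
     * not-recently-used local page, writes it back to its reference copy if it
     * was modified, and pulls the missing page from its home node via RDMA in
     * under 5 microseconds. */
    return 0;
}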
Kernel-Based RDMA Infiniband Driver: DSMP is made possible by the advent of a low-latency, commercial off-the-shelf network fabric. Today, Infiniband is the fabric of choice for most supercomputing clusters due to its low latency and high bandwidth. In order to squeeze every last nanosecond of performance out of the fabric, DSMP bypasses the Linux Infiniband protocol stack with its own low-level driver. The DSMP kernel-based Infiniband driver leverages the native RDMA capabilities of the Infiniband host channel adapter (HCA), allowing the HCA to service and move memory-page requests without processor intervention. RDMA thus eliminates the overhead of message construction and deconstruction, reducing system-wide latency.
Application-Driven, Memory-Page Coherency Scheme: All proprietary shared-memory mainframe computers maintain memory consistency and/or coherency via a hardware extension of the host processor's cache-line coherency scheme. DSMP, which utilizes local and global memory resources, takes a different approach. Coherency within the local memory of each individual SMP server is maintained by the x86-64 Memory Management Unit (MMU) on a cache-line basis. Memory consistency between a page in global memory and all copies of that page in local memory is maintained by the DSMP Linux kernel. This is further supported by a set of system calls that perform memory-page lock operations similar to those used in database transactions:
• Allow a global memory page to be locked for exclusive access by a node
• Release the lock
• Force a local memory page to be immediately updated to global memory
This Symmetric Computing API allows programs with multiple execution threads, possibly
running on multiple nodes, to maintain global memory page consistency across all nodes.
This API, combined with a few simple, intuitive programming rules, makes porting an application to a multi-node DSMP platform straightforward and manageable. Those rules are as follows:
• Be aware that memory pages are swapped into and out of local memory (cache) from global memory in 4096-byte units.
• Since the granularity of a DSMP global memory lock is a 4096-byte page, do not map data structures that need to be locked independently onto the same memory page. A new malloc( ) option is provided to force alignment on a 4096-byte boundary when your application requires it, ensuring that data structures accessed independently by the program reside on separate pages.
• Identify the cause of any page-invalidation faults, and add locks to handle access to the affected pages by multiple threads.

• If a data structure can be accessed and modified by multiple threads, the three new system calls can be used to maintain memory consistency (see the sketch after this list):
1. msync( ) forces immediate synchronization of a data structure with its reference copy in global memory.
2. mlock( ) prevents any other process thread from accessing and subsequently modifying the noted data structure. mlock( ) also invalidates all other copies of the data structure (memory pages) within the computing system. If a process thread on another node accesses a memory page associated with a locked data structure, its execution is blocked until the structure (memory page) is released.
3. munlock( ) unlocks a previously locked data structure.
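The following C sketch shows how these rules might be applied to a small shared structure that threads, possibly on different nodes, update. It is a minimal sketch under stated assumptions: the whitepaper names msync( ), mlock( ) and munlock( ) but does not give their DSMP prototypes, so the standard POSIX-style prototypes from <sys/mman.h> are used, and posix_memalign( ) stands in for the unspecified new malloc( ) alignment option.

/* Hedged sketch: applying the DSMP page-consistency rules to a shared
 * counter structure. The exclusive-lock and write-back semantics of
 * mlock()/msync()/munlock() are those described in the whitepaper; the
 * prototypes and the page-aligned allocation are assumptions. */
#include <stdlib.h>
#include <string.h>
#include <sys/mman.h>            /* msync(), mlock(), munlock() prototypes */

#define PAGE_SIZE 4096

struct shared_stats {            /* updated by threads on several nodes */
    long   samples;
    double sum;
};

static struct shared_stats *stats;

static void init_shared_stats(void)
{
    /* Rule: structures that are locked independently must not share a page,
     * so place this structure alone on its own 4096-byte page. */
    if (posix_memalign((void **)&stats, PAGE_SIZE, PAGE_SIZE) != 0)
        abort();
    memset(stats, 0, PAGE_SIZE);
}

static void record_sample(double x)
{
    /* Take exclusive, page-granular ownership; per the whitepaper, threads
     * on other nodes that touch this page block until it is released. */
    mlock(stats, sizeof *stats);

    stats->samples += 1;
    stats->sum     += x;

    /* Push the modified page back to its reference copy in global memory,
     * then release the lock so other nodes can proceed. */
    msync(stats, sizeof *stats, MS_SYNC);
    munlock(stats, sizeof *stats);
}

int main(void)
{
    init_shared_stats();
    record_sample(1.5);   /* in a real run, many threads on many nodes call this */
    return 0;
}

Note that on an ordinary Linux system these three calls have different semantics (mlock( ) pins pages in RAM and msync( ) applies to mmap( )ed regions), so the fragment is meaningful only under the DSMP kernel described here.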
Distributed POSIX Threads: The standard approach to parallelizing shared-memory C/C++ or Fortran programs is OpenMP and the POSIX thread library (Pthreads). The DSMP implementation of Pthreads resides in the kernel and operates transparently across all the nodes in the system. In addition, support is provided for localizing a thread on a particular node. Mutexes and other synchronization primitives function across all nodes.
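Because the thread interface is the standard one, ordinary Pthreads code such as the short example below is stated to run unchanged across nodes under DSMP; the interface for localizing a thread on a particular node is not given in the whitepaper and is therefore not shown.

/* Standard Pthreads code; under DSMP the same calls are stated to work
 * transparently across nodes, with the mutex functioning cluster-wide. */
#include <pthread.h>
#include <stdio.h>

#define NTHREADS 8

static long counter;
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

static void *worker(void *arg)
{
    (void)arg;
    for (int i = 0; i < 100000; i++) {
        pthread_mutex_lock(&lock);    /* synchronization across all nodes */
        counter++;
        pthread_mutex_unlock(&lock);
    }
    return NULL;
}

int main(void)
{
    pthread_t tid[NTHREADS];

    for (int i = 0; i < NTHREADS; i++)
        pthread_create(&tid[i], NULL, worker, NULL);
    for (int i = 0; i < NTHREADS; i++)
        pthread_join(tid[i], NULL);

    printf("counter = %ld\n", counter);
    return 0;
}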
Distributed Process Execution and System V IPC: DSMP supports launching standard Linux processes on any selected node from the head node. In addition, System V IPC features such as shared-memory segments are supported across all nodes.
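As an illustration, the standard System V shared-memory calls below create and attach a segment; under DSMP such a segment is stated to be visible on all nodes. The key and segment size are arbitrary values chosen for the example.

/* Standard System V shared-memory calls; the whitepaper states such
 * segments are supported across all DSMP nodes. */
#include <stdio.h>
#include <sys/ipc.h>
#include <sys/shm.h>

int main(void)
{
    /* Create (or attach to) a 1 MB segment under an arbitrary well-known key. */
    int id = shmget((key_t)0x5157, 1 << 20, IPC_CREAT | 0600);
    if (id < 0) { perror("shmget"); return 1; }

    char *buf = shmat(id, NULL, 0);        /* map the segment into this process */
    if (buf == (void *) -1) { perror("shmat"); return 1; }

    buf[0] = 'x';                          /* under DSMP, visible cluster-wide  */

    shmdt(buf);                            /* detach; segment persists until removed */
    return 0;
}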
Distributed Buffered File I/O: DSMP supports a distributed file I/O feature whereby a process on the head node can open a file with a distributed file descriptor that allows buffered file reads and writes by threads or processes running on any node in the system.
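The sketch below uses ordinary POSIX calls to share one descriptor among threads that read different regions of a file; per the whitepaper, the same pattern works when those threads run on different nodes. How a distributed file descriptor is requested is not specified, so a plain open( ) and a hypothetical file path are used here.

/* Standard POSIX file I/O from multiple threads sharing one descriptor.
 * The whitepaper states this also works when the threads run on different
 * DSMP nodes; the file path below is hypothetical. */
#include <fcntl.h>
#include <pthread.h>
#include <stdio.h>
#include <unistd.h>

#define CHUNK (1 << 20)

static int fd;                             /* descriptor opened on the head node */

static void *reader(void *arg)
{
    char buf[4096];
    off_t off = (long)arg * (off_t)CHUNK;  /* each thread reads its own region  */
    ssize_t n = pread(fd, buf, sizeof buf, off);
    printf("thread %ld read %zd bytes at offset %lld\n",
           (long)arg, n, (long long)off);
    return NULL;
}

int main(void)
{
    pthread_t t[2];

    fd = open("/tmp/input.dat", O_RDONLY); /* hypothetical input file */
    if (fd < 0) { perror("open"); return 1; }

    for (long i = 0; i < 2; i++)
        pthread_create(&t[i], NULL, reader, (void *)i);
    for (int i = 0; i < 2; i++)
        pthread_join(t[i], NULL);

    close(fd);
    return 0;
}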

DSMP Price-Performance
The table below compares MPI Clusters, SMP Mainframes and DSMP-enabled clusters:

                       MPI Cluster    SMP Mainframe    DSMP Cluster
Proprietary Hardware   No             Yes              No
Affordability          $              $$$              $
Shared Memory          No             Yes              Yes
Single Software Image  No             Yes              Yes
IPMI                   Yes            Yes              Yes
RAS                    Yes            Yes              Yes
Scalable               Yes            No               Yes

Performance of technical computing applications is largely a function of two metrics:
1. Processor performance (computational throughput), and
2. Global memory read/write performance (particularly random access).
Currently, DSMP cannot match mainframe performance in either of these metrics. However, because industry-standard processors and interconnects are developed at a faster pace, both processor performance and global memory access performance are improving rapidly. The performance of the latest generation of industry-standard processors already rivals that of proprietary mainframe processors. Over time, the performance gap between a DSMP cluster and a proprietary SMP mainframe computer of equivalent processor/memory density will continue to narrow and eventually disappear. Given that the cost of the cluster is one-tenth that of an equivalent mainframe, it already excels dramatically in price/performance.
Today, Symmetric Computing offers its direct-connect family of departmental supercomputers. These are small DSMP clusters directly connected via Infiniband, without the need for a switch. They are available in two-, three-, four- and five-node configurations, with shared memory capacities of 2 to 5 TB and 128 to 320 processor cores. The following diagram shows a Trio Departmental Supercomputer with 192 processor cores and 3 TB of RAM. It is built from three homogeneous 4P (four processor socket) servers, each with 64 cores (using 16-core AMD Opteron™ 6380 series processors) and 1 TB of physical memory per node.

Looking forward, Symmetric Computing plans to introduce a multi-node, Infiniband switch-based system delivering up to 2048 cores and 32 TB of RAM in a single 42U rack. In addition, we are working with our partners to deliver turnkey platforms optimized for application-specific missions.

About Symmetric Computing


Symmetric Computing is a Boston-based software company with offices at the Venture Development Center on the campus of the University of Massachusetts. We design software that accelerates the use and application of shared-memory computing systems in bioinformatics and life sciences, computer-aided engineering, energy, earth sciences, financial analysis and related fields. Symmetric Computing is dedicated to delivering standards-based, customer-focused technical computing solutions for users ranging from universities to enterprises.

For more information, please visit www.SymmetricComputing.com
