0% found this document useful (0 votes)

62 views22 pages

M Via

M-VIA and MVICH are modular implementations of VIA and MPICH for message passing over high-performance networks. M-VIA provides a reference VIA implementation for Linux that supports emulated and hardware drivers. MVICH implements the MPICH message passing interface using M-VIA to enable low-latency communication. Future plans include improved performance, multi-threading support, and integrating with process management systems.

Uploaded by

api-27351105

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views22 pages

M Via

Uploaded by

api-27351105

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

M-VIA and MVICH:

Status and Future Plans

Michael Welcome
Paul Hargrove
Lawrence Berkeley National Lab
[email protected]
[email protected]
Overview

• VIA: Virtual Interface Architecture

• M-VIA: Modular VIA for Linux

• MVICH: Implementation of MPICH ADI-2 for VIA

Why VIA – Fast IPC for Clusters
• Reduce message passing overhead incurred by traditional
protocol stacks (TCP).
• Thin software layer
• OS Bypass
• Asynchronous processing by intelligent NIC
• Industry Driven Standard – promoted by Intel, Compaq and Microsoft
• VI Developers Forum (VIDF) – Stewardship of current standard.
• API for application programmers (VIPL)
• Defined Application-Hardware Interaction
• Commodity Networking Hardware
• Giganet C-lan
• Compaq Servernet II
• Myricom – VIA over GM (alpha testing)
VI Architecture
• Standardized API (VIPL)
• VI – Communication Endpoint
Application • Send and Recv Queue
• Doorbell
VI Provider Library • Descriptor-mediated interaction
with NIC
• VI NIC
VI Kernel • Send/Recv or RDMA – No CPU
VI VI interaction
Agent • Gather-Scatter of data segments
• Requires “pinned” data regions
• VI Kernel Agent – Device Driver
VI Hardware • Open/Close NIC
• Connection Setup
• Memory registration
VIA – RDMA Operation
Node A Node B
User Address Space User Address Space
Pinned Memory Pinned Memory

RDMA
Descriptor

VI NIC VI NIC

Network
M-VIA: Modular VIA for Linux
• Goals:
• Reference implementation
• Emulated drivers for non-VIA aware NICs
• Open Source – BSD style license
• Compaq Servernet II
• SysKonnect
• Portable to other Architectures
• Thread safe
• Kernel recompilation not required – Loadable Modules
• Co-exist with traditional network protocols
• Promote rapid development of new drivers
M-VIA Architecture
• VIPL implementation is driver
independent – IOCTL and Fast Trap

VI Provider Library • Kernel Agent is further abstracted:

• Device Independent Kernel Agent
M-VIA Kernel Agent • Connection Manager
• Protection Tag Manager
M-VIA Device Driver • Registered Memory Manager
• Error Queue Manager
Hardware NIC • Device Dependent Drivers (Emulated)
• Registers with Kernel Agent
• May replace kernel agent managers
• Device Classes allow similar drivers
to share code
M-VIA: Status
• M-VIA 1.0 Released September 1999
• Implements all VIA functionality except peer-to-peer
• Passes Intel conformance test suite (100K lines of code)
• Emulated drivers supported
• Loop-back – Fast on-node process to process copy
• Fast Ethernet:
• Tulip chipset
• Intel EE Pro/100
• Gigabit Ethernet:
• Packet Engines G-NIC I and II
• Main developer quit 12/99 – replaced 7/00
• 9/00 Group leader left for DOT.COM
M-VIA 1.1 and 1.X
• M-VIA 1.1 – Before 1/01
• Bug Fixes – Memory Leak
• Peer-to-peer connections implemented (VIA option).
• More Drivers
• 3-COM Fast Ethernet
• Intel Gigabit Ethernet
• M-VIA 1.X – (date unknown)
• Linux 2.4 Kernel support
• VIPL 1.1 support (when available)
• Reliable Delivery
• IA64 support (when available)
M-VIA 2
• The real target for M-VIA is VIA-aware networks
• M-VIA 1 design based on pre-release 0.9X VIA spec
• VIA 1.0 Specification relaxed specified interaction between VIPL
and NIC

• M-VIA 2 has better support for VIA NICs

• Introduces modularity at user level.

• Improved internal interfaces based on phase 1 feedback

M-VIA 2 Plugins

Application

M-VIPL M-VIPL Plugin

M-KA
M-KA Plugin

VI Hardware
M-VIA 2 Features
• Plugins are dynamically loaded
• User-level and kernel level loading
• Multiple simultaneous plugins possible
• Applications do not need to be recompiled or relinked to
use different NICs, even though applications interact
directly with the NIC
• Installation easier
• Software distribution much easier
• Example: Single application could use multiple devices:
• M-VIA loop-back for communication within node
• hardware NIC(s) for communication with other nodes.
• Status: in design stage, some code written.
MVICH
MPI
(mpich) • MVICH is an implementation of the
MPICH ADI-2 for VIA
• Implements point-to-point message passing
ADI • From-scratch implementation of ADI-2
• No channels, no chameleon
• Multi-device support stripped out
• To Build:
channel mvich
• Put MVICH source tree in MPICH/mpid/via
• Configure with your VIA device
TCP • Build

“devices”
MVICH Implementation
• N-1 VI’s created on each node, one for each node-to-node
communication channel.
• Buffering
• “vbufs” are VIA memory-registered MPI-managed buffers
• Contain control info and, in certain cases, message data
• Flow control – VIA recv must be posted before send.
• Credit scheme implements accounting system for pre-posted recvs.
• Initially, each node pre-posts M recv vbufs on each VI, senders
given M credits on each VI.
• Sender decrements credit on send.
• Receiver posts another vbuf after recv, “refresh” credits are
piggybacked on rendezvous acknowledgements.
• Credit scheme throttles sender
Protocols

• Short/Eager
• Send data in one or more packets through vbufs
• Requires buffering on receiver
• “R3” Rendezvous
• Standard 3-way rendezvous through vbufs with pipelining
• “Rput” rendezvous
• Zero copy RDMA write from sender to receiver
• “Rget” rendezvous
• Zero copy RDMA read by receiver
• Rput and Rget revert to R3 if either side fails to register the
user’s data area.
Dynamic memory registration

• Memory registration (pinning) is relatively expensive.

• In many MPI applications, same buffers are reused, even if
persistent send/recv not used
• Basic scheme
• Register on first use. Cache info.
• Quickly (<< 1us) detect registration on subsequent use
• Fall back to rendezvous through vbufs if necessary
• Unregister after disuse (LRU)
• MUST insure virtual-to-physical mapping has not changed
between uses. Mvich uses mallopt options.
Process manager interface

• Drawback of MPICH/ch_p4 and LAM: communication

and process management are closely tied.
• A number of projects are developing low-overhead process
startup/management systems.
• Need to be able to plug together different process
managers with communication systems.
• MVICH has a simple interface that lets it interact with any
reasonable process manager.
Performance
• Current version is untuned and has known inefficiencies.

• Loopback: VIA latency 3.9us; MVICH latency 9us

• Giganet: VIA latency 8us; MVICH latency 14.5 us

• Giganet: VIA BW 90 MB/s; MVICH BW 71 MB/s

Note: this is the “slow” protocol with two copies.
Status
• Alpha release 1.0a6.1 release Sept 2000.
• Implemented
• All basic infrastructure, including dynamic registration,
• Eager, R3, Rput and Rget
• Runs on M-VIA, Giganet VIA and Servernet II M-VIA
• Passes all but 5 MPICH conformance tests
• Many bugs fixed since 1.0a5 release
• MPI_CANCEL for SEND is not implemented (3 failures)
• Aborts, rather than returning control to error handler in some cases.
(2 failures – truncation tests)
• Correctly runs NAS benchmarks.
MVICH – Future Plans
• 1.0 release targeted for Jan 2001
• Add tuning hooks – settable at run time
• Add statistics package to trace behavior
• Run “real” application, eg. adaptive mesh CFD.

• MVICH-2 (no estimated date)

• Multi-threaded implementation
• Support for unreliable networks
• Asynchronous communication
Production/Commercial MPI
• MPI Software Technologies makes MPI for VIA on NT
and Linux (www.mpi-softtech.com)

• Some advantages:
• Support
• Base implementation targeted for VIA (MPICH is “general”).
• Thread safe
• Full asynchronous communication
• Datatype optimizations
For more info...

• http://www.nersc.gov/research/ftg/via

• [email protected] (LBL developers)

• [email protected]

Message Passing Interface (MPI)
No ratings yet
Message Passing Interface (MPI)
14 pages
Blue Book PDF
No ratings yet
Blue Book PDF
187 pages
Product Brochure: Star Charge International BD
No ratings yet
Product Brochure: Star Charge International BD
6 pages
Openvswitch en
No ratings yet
Openvswitch en
27 pages
Network Basics: Sanjeevi M, AIX Consultant
No ratings yet
Network Basics: Sanjeevi M, AIX Consultant
32 pages
Open MPI: Goals, Concept, and Design of A Next Generation MPI Implementation
No ratings yet
Open MPI: Goals, Concept, and Design of A Next Generation MPI Implementation
9 pages
QOS Openvswitch - en
No ratings yet
QOS Openvswitch - en
27 pages
System P Virtual Network
No ratings yet
System P Virtual Network
32 pages
OpenMPI Houston 04 05
No ratings yet
OpenMPI Houston 04 05
20 pages
VIAP
No ratings yet
VIAP
7 pages
Class03 - MPI, Part 1, Intermediate PDF
No ratings yet
Class03 - MPI, Part 1, Intermediate PDF
83 pages
Intel X550-T2 10G 雙埠RJ45 伺服器網路卡 (Bulk) Datashee X550T2BLK
No ratings yet
Intel X550-T2 10G 雙埠RJ45 伺服器網路卡 (Bulk) Datashee X550T2BLK
4 pages
Project Via Flowcontrol
No ratings yet
Project Via Flowcontrol
46 pages
Libfabric Old Mpi Presentations 2014-01-14
No ratings yet
Libfabric Old Mpi Presentations 2014-01-14
20 pages
M.Sc. IT Semester III Virtualization QUESTION BANK 2014 - 2015 Unit 1
No ratings yet
M.Sc. IT Semester III Virtualization QUESTION BANK 2014 - 2015 Unit 1
5 pages
Computing LLNL Gov
No ratings yet
Computing LLNL Gov
42 pages
ATPESC 2019 Track-2 1-7-30 830am Guo-Raffenetti-Thakur-MPI For Scalable Computing
No ratings yet
ATPESC 2019 Track-2 1-7-30 830am Guo-Raffenetti-Thakur-MPI For Scalable Computing
199 pages
Lab Mpi
No ratings yet
Lab Mpi
29 pages
BIg Data Anslysi
No ratings yet
BIg Data Anslysi
57 pages
Virtualization Technology Trends: Intel Corporation
No ratings yet
Virtualization Technology Trends: Intel Corporation
35 pages
M 08avb
No ratings yet
M 08avb
30 pages
Message-Passing Multicomputer
No ratings yet
Message-Passing Multicomputer
57 pages
A High Performance MPI For Parallel and Distributed Computing
No ratings yet
A High Performance MPI For Parallel and Distributed Computing
4 pages
Mpi
No ratings yet
Mpi
17 pages
Lab Mpi
No ratings yet
Lab Mpi
32 pages
Elts
No ratings yet
Elts
45 pages
Mpi Java1995
No ratings yet
Mpi Java1995
105 pages
Message Passing and MPI: John Mellor-Crummey
No ratings yet
Message Passing and MPI: John Mellor-Crummey
78 pages
Poster Companion Reference Hyper V Stora
No ratings yet
Poster Companion Reference Hyper V Stora
12 pages
MiniTool Partition Wizard Crack 12 Key Download Free 2025
No ratings yet
MiniTool Partition Wizard Crack 12 Key Download Free 2025
29 pages
CS6456: Graduate Operating Systems: Bradjc@virginia - Edu
No ratings yet
CS6456: Graduate Operating Systems: Bradjc@virginia - Edu
45 pages
SR-IOV Networking in Kubernetes: The Complete Guide for Developers and Engineers
From Everand
SR-IOV Networking in Kubernetes: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Cs-3006 6 Mpi Basics 2
No ratings yet
Cs-3006 6 Mpi Basics 2
52 pages
Message Passing Interface (MPI) : Author: Blaise Barney, Lawrence Livermore National Laboratory
No ratings yet
Message Passing Interface (MPI) : Author: Blaise Barney, Lawrence Livermore National Laboratory
41 pages
Using MPI
No ratings yet
Using MPI
385 pages
Final Parallrl
No ratings yet
Final Parallrl
4 pages
Intel Mic
No ratings yet
Intel Mic
38 pages
Message Passing Interface (MPI) : EC3500: Introduction To Parallel Computing
100% (1)
Message Passing Interface (MPI) : EC3500: Introduction To Parallel Computing
40 pages
02 Message Passing Interface Tutorial
No ratings yet
02 Message Passing Interface Tutorial
34 pages
Multiprotocol Automatic Fault Tolerantmpich V Project A Multiprotocol Automatic Fault Tolerant Mpi
No ratings yet
Multiprotocol Automatic Fault Tolerantmpich V Project A Multiprotocol Automatic Fault Tolerant Mpi
15 pages
CS-3006 - 5 - MPI Basics
No ratings yet
CS-3006 - 5 - MPI Basics
53 pages
Message Passing Interface - Point To Point
No ratings yet
Message Passing Interface - Point To Point
3 pages
Advanced Message Passing Interface (MPI) : Bruno C. Mundim
No ratings yet
Advanced Message Passing Interface (MPI) : Bruno C. Mundim
86 pages
M 09avb
No ratings yet
M 09avb
33 pages
WS-013 Azure Stack HCI
No ratings yet
WS-013 Azure Stack HCI
111 pages
High Performance Computing: Matthew Jacob Indian Institute of Science
No ratings yet
High Performance Computing: Matthew Jacob Indian Institute of Science
25 pages
HPC Lecture40
No ratings yet
HPC Lecture40
25 pages
PA
No ratings yet
PA
87 pages
An Introduction To MPI: Parallel Programming With The Message Passing Interface
No ratings yet
An Introduction To MPI: Parallel Programming With The Message Passing Interface
48 pages
Session 27 and 28
No ratings yet
Session 27 and 28
25 pages
Ms. V. Uma Maheswari, Assistant Lecturer, Department of Information Technology, National Institute of Technology, Surathkal
No ratings yet
Ms. V. Uma Maheswari, Assistant Lecturer, Department of Information Technology, National Institute of Technology, Surathkal
91 pages
MPI4 Py
No ratings yet
MPI4 Py
28 pages
Networking For Virtualization: Deep Dive
No ratings yet
Networking For Virtualization: Deep Dive
7 pages
Intro To MPI: Hpc-Support@duke - Edu
No ratings yet
Intro To MPI: Hpc-Support@duke - Edu
56 pages
Week 10
No ratings yet
Week 10
52 pages
04 cmsc416 Mpi
No ratings yet
04 cmsc416 Mpi
31 pages
Juniper Networks VMX Overview
No ratings yet
Juniper Networks VMX Overview
16 pages
KVM Virtualization Essentials: Definitive Reference for Developers and Engineers
From Everand
KVM Virtualization Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Study Guide Cisco 300-540 SPCNI Designing and Implementing Cisco Service Provider Cloud Network Infrastructure
From Everand
Study Guide Cisco 300-540 SPCNI Designing and Implementing Cisco Service Provider Cloud Network Infrastructure
Anand Vemula
No ratings yet
Mastering KVM Virtualization
From Everand
Mastering KVM Virtualization
Humble Devassy Chirammal
5/5 (1)
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
Franco Mario
No ratings yet
IBM - NFS in AIX
No ratings yet
IBM - NFS in AIX
334 pages
Parallel Programming in OpenMP
No ratings yet
Parallel Programming in OpenMP
245 pages
IBM - Developing and Porting C On AIX
No ratings yet
IBM - Developing and Porting C On AIX
546 pages
Fdi 2008 Lecture8
No ratings yet
Fdi 2008 Lecture8
34 pages
Fdi 2008 Lecture4
No ratings yet
Fdi 2008 Lecture4
38 pages
Myrinet Express (MX) : A High Performance, Low Level, Message Passing Interface For Myrinet Version 1.1 January 01, 2006
No ratings yet
Myrinet Express (MX) : A High Performance, Low Level, Message Passing Interface For Myrinet Version 1.1 January 01, 2006
54 pages
Hpcclustertools Superg
No ratings yet
Hpcclustertools Superg
7 pages
IBM - AIX 5L Porting Guide
No ratings yet
IBM - AIX 5L Porting Guide
646 pages
Fdi 2008 Lecture7
No ratings yet
Fdi 2008 Lecture7
41 pages
Fdi 2008 Lecture3
No ratings yet
Fdi 2008 Lecture3
36 pages
DEISA Training October06 Technical 02 Communication
No ratings yet
DEISA Training October06 Technical 02 Communication
38 pages
Fdi 2008 Lecture6
100% (1)
Fdi 2008 Lecture6
39 pages
Compile-Time Stack Requirements Analysis With GCC
100% (2)
Compile-Time Stack Requirements Analysis With GCC
13 pages
HP Mpi
No ratings yet
HP Mpi
199 pages
Parallel Algorithms Underlying MPI Implementations
No ratings yet
Parallel Algorithms Underlying MPI Implementations
55 pages
VIA Evaluation
No ratings yet
VIA Evaluation
8 pages
System Design With SystemC
No ratings yet
System Design With SystemC
236 pages
Intel Vidf Via
No ratings yet
Intel Vidf Via
100 pages
Symmetric Multiprocessing
No ratings yet
Symmetric Multiprocessing
1 page
Project - ParallelComputing BSR v2
No ratings yet
Project - ParallelComputing BSR v2
40 pages
2.3. Implement Basic Connectivity
No ratings yet
2.3. Implement Basic Connectivity
4 pages
Chapter 10
No ratings yet
Chapter 10
29 pages
Faculty of Technology: Web Based Graduate Exit Examination System For Debretabor University Project Report
100% (1)
Faculty of Technology: Web Based Graduate Exit Examination System For Debretabor University Project Report
56 pages
EchoLife HG520s Home Gateway Quick Start
No ratings yet
EchoLife HG520s Home Gateway Quick Start
14 pages
Vsphere 7.0 Configuration - Maximums
No ratings yet
Vsphere 7.0 Configuration - Maximums
14 pages
L.I.T.E. Chapter 3 The Internet and The World Wide Web
No ratings yet
L.I.T.E. Chapter 3 The Internet and The World Wide Web
65 pages
Hytera DS-6211 MSO Hardware Description V04 - Eng
100% (1)
Hytera DS-6211 MSO Hardware Description V04 - Eng
27 pages
2.basic Networking Commands
No ratings yet
2.basic Networking Commands
5 pages
Metaswitch White Paper For NFV & SDN
No ratings yet
Metaswitch White Paper For NFV & SDN
19 pages
17computer Network 2015
No ratings yet
17computer Network 2015
17 pages
Compaq Proliant Ml350 g2
No ratings yet
Compaq Proliant Ml350 g2
47 pages
Mpi-Netsim: A Network Simulation Module For Mpi: 2009 15Th International Conference On Parallel and Distributed Systems
No ratings yet
Mpi-Netsim: A Network Simulation Module For Mpi: 2009 15Th International Conference On Parallel and Distributed Systems
8 pages
AvidThink Converge Network Digest Next Gen Infrastructure Acceleration SmartNICs Report 2020 Rev C
No ratings yet
AvidThink Converge Network Digest Next Gen Infrastructure Acceleration SmartNICs Report 2020 Rev C
18 pages
Computer Networks and MySQL - Notes
No ratings yet
Computer Networks and MySQL - Notes
16 pages
EC500 User Manual V2
No ratings yet
EC500 User Manual V2
24 pages
CCNA Routing and Switching
No ratings yet
CCNA Routing and Switching
8 pages
Network Monitoring Using Wireshark
No ratings yet
Network Monitoring Using Wireshark
16 pages
2018 Hysweep Interfacing PDF
No ratings yet
2018 Hysweep Interfacing PDF
93 pages
3a. Device Connectivity - MicroLogix
No ratings yet
3a. Device Connectivity - MicroLogix
6 pages
NKN Brochure
No ratings yet
NKN Brochure
6 pages
pcs7 Readme - en US PDF
No ratings yet
pcs7 Readme - en US PDF
88 pages
NC Monitor Instruction Manual - M700
No ratings yet
NC Monitor Instruction Manual - M700
43 pages
Communication Device: Dial-Up Modems
No ratings yet
Communication Device: Dial-Up Modems
9 pages
Configuring and Implementing Automatic External
No ratings yet
Configuring and Implementing Automatic External
9 pages
Ricoh IM C3000 IM C3500 IM C4500 IM C6000: Full Colour Multi Function Printer
No ratings yet
Ricoh IM C3000 IM C3500 IM C4500 IM C6000: Full Colour Multi Function Printer
4 pages
CN-UNIT-1 - Part-1
No ratings yet
CN-UNIT-1 - Part-1
101 pages
Qualys Virtual Scanner Appliance User Guide
No ratings yet
Qualys Virtual Scanner Appliance User Guide
19 pages
Redes Inalámbricas: Router Inalámbricos Serie N (Atheros) - 300 Mbps
No ratings yet
Redes Inalámbricas: Router Inalámbricos Serie N (Atheros) - 300 Mbps
10 pages

M Via

Uploaded by

M Via

Uploaded by

M-VIA and MVICH:

Status and Future Plans

• VIA: Virtual Interface Architecture

• M-VIA: Modular VIA for Linux

• MVICH: Implementation of MPICH ADI-2 for VIA

VI Provider Library • Kernel Agent is further abstracted:

• M-VIA 2 has better support for VIA NICs

• Introduces modularity at user level.

• Improved internal interfaces based on phase 1 feedback

M-VIPL M-VIPL Plugin

• Memory registration (pinning) is relatively expensive.

• Drawback of MPICH/ch_p4 and LAM: communication

• Loopback: VIA latency 3.9us; MVICH latency 9us

• Giganet: VIA BW 90 MB/s; MVICH BW 71 MB/s

• MVICH-2 (no estimated date)

• [email protected] (LBL developers)

You might also like