BG External Presentation January 2002
Project Update
January 2002
The Blue Gene Project
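[Figure: Peak Speed (flops), log scale from 1E+14 to 1E+16. Systems plotted: ASCI Red, ASCI Blue Pacific, ASCI White, Blue Gene/L, Blue Gene/P.]
[Figure: Performance by machine and processor technology. QCDSP, 0.6 TFLOP; ASCI-Blue (PPC 604), 3.3 TFLOP; ASCI-White (PPC 630), 10 TFLOP; QCDOC (CMOS7SF), 20 TFLOP; ASCI-Q, 30 TFLOP; BG/L (CU-11), 180 TFLOP.]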
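[Figure: Blue Gene/L packaging hierarchy. Peak speed is quoted as a pair of values at each level.]
Chip (2 processors; two 440 cores, EDRAM, I/O): 2.8/5.6 GF/s, 4 MB
Board (8 chips, 2x2x2): 22.4/44.8 GF/s, 2.08 GB
Rack (128 boards, 8x8x16): 2.9/5.7 TF/s, 266 GB
Full system: 180/360 TF/s, 16 TB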
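The peak-speed figures at each packaging level follow from multiplying up the per-chip value. A minimal Python sketch of that arithmetic is given here; the assignment of each figure to a level is inferred from the slide layout, and the rack count is derived from the 65536-node total rather than stated on this slide.

```python
# Sanity check of the packaging arithmetic (a sketch, not the authors' method).
# Assumption: 2.8/5.6 GF/s is the per-chip pair of peak values.
CHIP_GFLOPS = (2.8, 5.6)
CHIPS_PER_BOARD = 8        # 2x2x2
BOARDS_PER_RACK = 128      # 8x8x16 chips per rack
NODES_TOTAL = 65536        # from the networks slide


def scale(pair, factor):
    """Multiply both peak values by the number of units at the next level."""
    return tuple(x * factor for x in pair)


board_gflops = scale(CHIP_GFLOPS, CHIPS_PER_BOARD)
rack_gflops = scale(board_gflops, BOARDS_PER_RACK)
racks = NODES_TOTAL // (CHIPS_PER_BOARD * BOARDS_PER_RACK)
system_gflops = scale(rack_gflops, racks)

print("board  GF/s:", board_gflops)
print("rack   TF/s:", tuple(x / 1000 for x in rack_gflops))
print("racks      :", racks)
print("system TF/s:", tuple(x / 1000 for x in system_gflops))
```

The system total computed this way comes out slightly above the quoted 180/360 TF/s, which suggests the slide rounds down.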
Blue Gene/L - The Networks
65536 nodes interconnected with three integrated networks
3 Dimensional Torus
    Virtual cut-through hardware routing to maximize efficiency
    2.8 Gb/s on all 12 node links (total of 4.2 GB/s per node)
    Communication backbone
    134 TB/s total torus interconnect bandwidth
Global Tree
    One-to-all or all-to-all broadcast functionality
    Arithmetic operations implemented in tree
    ~1.4 GB/s of bandwidth from any node to all other nodes
    Latency of tree traversal less than 1 µs
Ethernet
    Incorporated into every node ASIC
    Disk I/O
    Host control, booting and diagnostics
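The torus bandwidth figures can be cross-checked with a few lines of arithmetic. The Python sketch below rests on several assumptions not stated on the slide: the 12 links per node are read as 6 neighbors times 2 directions, each unidirectional link is counted once in the system total, binary prefixes are used (1 TB = 1024 GB), and the 64x32x32 torus shape is hypothetical. Under those assumptions the result lands close to the quoted 134 TB/s.

```python
NODES = 65536
LINK_GBIT = 2.8        # Gb/s per link, from the slide
LINKS_PER_NODE = 12    # assumed: 6 neighbors x 2 directions

# Per-node bandwidth in GB/s (8 bits per byte): about 4.2
per_node_gbyte = LINK_GBIT * LINKS_PER_NODE / 8

# Each unidirectional link has a sender and a receiver, so summing over
# nodes counts it twice; divide by 2 (assumption).
total_gbyte = NODES * per_node_gbyte / 2
total_tbyte = total_gbyte / 1024   # assumed binary prefix


def torus_neighbors(coord, shape):
    """Return the six nearest neighbors of coord in a 3D torus (wraparound)."""
    neighbors = []
    for axis in range(3):
        for step in (-1, 1):
            n = list(coord)
            n[axis] = (n[axis] + step) % shape[axis]
            neighbors.append(tuple(n))
    return neighbors


SHAPE = (64, 32, 32)   # hypothetical shape with 65536 nodes

print("per node GB/s:", round(per_node_gbyte, 2))
print("total TB/s   :", round(total_tbyte, 1))
print("neighbors of (0, 0, 0):", torus_neighbors((0, 0, 0), SHAPE))
```

The wraparound in `torus_neighbors` is what distinguishes a torus from a plain mesh: a node on a face of the 3D grid still has six neighbors.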
Blue Gene/L System Software
[Figure: system software architecture. 64K compute nodes, each running an application over MPI, math libraries, C/C++ and F95, and kernel services, connected by 1 Gb/s Ethernet to a host with a system console.]
Blue Gene Science
[Figure: time steps/month (log scale, 1.00E+6 to 1.00E+13) versus System Size (atoms), 1000 to 100000. Series: 1 rack Power3 ('01); 512 node BG/L partition (2H03); 40*512 node BG/L partition (4Q04); 1,000,000 GFLOP/second (2H06).]
Data Volumes (assuming every time-step written out)
[Figure: data volume in bytes/month (log scale, 1.000E+12 to 1.000E+18) versus System Size (atoms), 1E+3 to 1E+5. Series: 1 rack Power3; 512 node BG/L; 40*512 node BG/L; 1,000,000 GFLOP/s.]
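If every time step is written out, the monthly data volume is the product of time steps per month, atom count, and bytes stored per atom. The Python sketch below illustrates that relationship only; the 24 bytes per atom (three 8-byte coordinates) and the sample inputs are assumptions for illustration, not values taken from the charts.

```python
# Assumption: each atom contributes x, y, z as 8-byte doubles per time step.
BYTES_PER_ATOM = 24


def bytes_per_month(steps_per_month, atoms, bytes_per_atom=BYTES_PER_ATOM):
    """Data written per month when every time step is saved."""
    return steps_per_month * atoms * bytes_per_atom


# Hypothetical inputs: 1E+10 time steps/month on a 10,000-atom system.
example = bytes_per_month(10**10, 10_000)
print(f"{example:.3E} bytes/month")
```

Storing velocities or other per-atom state would raise `bytes_per_atom` and scale the result proportionally.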
External Scientific Interactions