0% found this document useful (0 votes)

13 views23 pages

HPC Nbody

Uploaded by

Rajul

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views23 pages

HPC Nbody

Uploaded by

Rajul

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

N-body Problems

Victor Eijkhout

Fall 2022
Summing forces

2
Particle interactions

for each particle i

for each particle j
let r̄ij be the vector between i and j;
then the force on i because of j is
mm
fij = −r̄ij |ri | j
ij
(where mi , mj are the masses or charges) and
fji = −fij .
Sum forces and move particle over ∆t

3
Complexity reduction

• Naive all-pairs algorithm: O (N 2 )

• Clever algorithms: O (N log N ), sometimes even O (N )
• Octtree algorithm: Barnes-Hut

4
Octtree

5
Dynamic octree creation
Procedure Quad_Tree_Build
Quad_Tree = {empty}
for j = 1 to N // loop over all N particles
Quad_Tree_Insert(j, root) // insert particle j in QuadTree
endfor
Traverse the Quad_Tree eliminating empty leaves

Procedure Quad_Tree_Insert(j, n) // Try to insert particle j at node n in

if n an internal node // n has 4 children
determine which child c of node n contains particle j
Quad_Tree_Insert(j, c)
else if n contains 1 particle // n is a leaf
add n’s 4 children to the Quad_Tree
move the particle already in n into the child containing it
let c be the child of n containing j
Quad_Tree_Insert(j, c)
else // n empty
store particle j in node n
end

6
Octree algorithm

• Consider cells on the top level

• if distance/diameter ratio small enough, take center of mass
• otherwise consider children cells

7
8
Masses calculation
// Compute the CM = Center of Mass and TM = Total Mass of all the particle
( TM, CM ) = Compute_Mass( root )

function ( TM, CM ) = Compute_Mass( n )

if n contains 1 particle
store (TM, CM) at n
return (TM, CM)
else // post order traversal
// process parent after all children
for all children c(j) of n
( TM(j), CM(j) ) = Compute_Mass( c(j) )
// total mass is the sum
TM = sum over children j of n: TM(j)
// center of mass is weighted sum
CM = sum over children j of n: TM(j)*CM(j) / TM
store ( TM, CM ) at n
return ( TM, CM )

9
Force evaluation
// for each particle, compute the force on it by tree traversal
for k = 1 to N
f(k) = TreeForce( k, root )
// compute force on particle k due to all particles inside root

function f = TreeForce( k, n )
// compute force on particle k due to all particles inside node n
f = 0
if n contains one particle // evaluate directly
f = force computed using formula on last slide
else
r = distance from particle k to CM of particles in n
D = size of n
if D/r < theta // ok to approximate by CM and TM
compute f
else // need to look inside node
for all children c of n
f = f + TreeForce ( k, c )

10
Complexity

• Each cell considers ‘rings’ of equi-distant cells

• but at doubling distance
• c log N cells to consider for each particle
• N log N overall

11
Computational aspects

• After position update, particles can move to next box: load

redistribution
• Naive octree algorithm is formulated for shared memory
• Distributed memory by using inspector-executor paradigm

12
Step 1: force by a particle

for level ℓ from one above the finest to the coarsest:

for each cell c on level ℓ
(ℓ) (ℓ+1)
let gc be the combination of the gi for all children i of c

13
14
Step 2: force on a particle

for level ℓ from one below the coarses to the finest:

for each cell c on level ℓ:
(ℓ)
let fc be the sum of
(ℓ−1)
1. the force fp on the parent p of c, and
(ℓ)
2. the sums gi for all i on level ℓ that
satisfy the cell opening criterium

15
16
• Center of mass calculation and force prolongation are local
• Force from neighbouring cells is a neighbour communication
• Neighbour communication can start before up/down tree
calculation is finished: latency hiding

17
All-pairs methods

• Traditional algorithm: distribute particles, for each particle gather

and update compute
• Problem: allgather has O (N )β cost
• does not go down with P, so does not scale weakly

18
1.5D calculation

√ √
• Better algorithm: use P × P processor grid,
√
• Divide particles in bins of N / P
• Processor (i , j ) computes interaction of boxes i and j:

19
20
21
22
√ √
• Better algorithm: use P × P processor grid,
√
• Divide particles in boxes of M = N / P
• Processor (i , j ) computes interaction of boxes i and j:
• this requires broadcast (for duplication) and reduction (for
summing) in processor rows and columns
√
• Bandwidth cost βN / P which is M: scalable.

Techniques of Value Analysis and Engineering by Lawrence D Miles
84% (38)
Techniques of Value Analysis and Engineering by Lawrence D Miles
383 pages
Game Physics Pearls
100% (6)
Game Physics Pearls
338 pages
Coumans Erwin Physics For Game
No ratings yet
Coumans Erwin Physics For Game
91 pages
N-Body Simulation in Cosmology: Thesis To Be Submitted in Partial Fulfillment of The Requirements For The Degree
No ratings yet
N-Body Simulation in Cosmology: Thesis To Be Submitted in Partial Fulfillment of The Requirements For The Degree
52 pages
Mpmcourse
No ratings yet
Mpmcourse
52 pages
Cooling Water Treatment Advanced Training Course Cooling Water Treatment ... (Pdfdrive)
100% (3)
Cooling Water Treatment Advanced Training Course Cooling Water Treatment ... (Pdfdrive)
266 pages
Particles and Point Like Objects
No ratings yet
Particles and Point Like Objects
92 pages
GV Ss21 03 Force Directed Algorithms o Print
No ratings yet
GV Ss21 03 Force Directed Algorithms o Print
23 pages
Slidesh
No ratings yet
Slidesh
39 pages
000 Getstartedrpi Digital
100% (2)
000 Getstartedrpi Digital
116 pages
Decomposing A Problem For Parallel Execution - Pablo Halpern - CppCon 2014
No ratings yet
Decomposing A Problem For Parallel Execution - Pablo Halpern - CppCon 2014
48 pages
Rigid Body Simulation Pixar
No ratings yet
Rigid Body Simulation Pixar
69 pages
Balanced Cantilever Bridge Design Considering Seismic Analysis Manual
50% (2)
Balanced Cantilever Bridge Design Considering Seismic Analysis Manual
31 pages
10.1201 b11324 Previewpdf PDF
No ratings yet
10.1201 b11324 Previewpdf PDF
180 pages
Slides (Autosaved)
No ratings yet
Slides (Autosaved)
12 pages
Material Point Method and It's Evolution Over Years
No ratings yet
Material Point Method and It's Evolution Over Years
29 pages
DFS3
No ratings yet
DFS3
23 pages
MAP551 Projet Nbody
No ratings yet
MAP551 Projet Nbody
4 pages
Adam P08
No ratings yet
Adam P08
36 pages
Cornerstone: Octree Construction Algorithms For Scalable Particle Simulations
No ratings yet
Cornerstone: Octree Construction Algorithms For Scalable Particle Simulations
10 pages
A Fast Algorithm For Particle Simulations Greengard Rokhlin
No ratings yet
A Fast Algorithm For Particle Simulations Greengard Rokhlin
13 pages
Graph Neural Networks in Particle Physics
No ratings yet
Graph Neural Networks in Particle Physics
18 pages
Molecular Dynamics
No ratings yet
Molecular Dynamics
22 pages
Trees
No ratings yet
Trees
67 pages
pc12 Nbody
No ratings yet
pc12 Nbody
63 pages
1 The SPH Equations: I I I I
No ratings yet
1 The SPH Equations: I I I I
14 pages
How Do Physics Engines Work
No ratings yet
How Do Physics Engines Work
48 pages
Numerical Methods in Finance. Part A. (2010-2011)
No ratings yet
Numerical Methods in Finance. Part A. (2010-2011)
23 pages
Gravitational Dynamics Heidelberg
No ratings yet
Gravitational Dynamics Heidelberg
3 pages
Concept Map
100% (1)
Concept Map
1 page
Graph Partitioning Algorithms: CME342 - Parallel Methods in Numerical Analysis
No ratings yet
Graph Partitioning Algorithms: CME342 - Parallel Methods in Numerical Analysis
47 pages
Physically Based Modeling: Rigid Body Simulation
No ratings yet
Physically Based Modeling: Rigid Body Simulation
69 pages
ANew Methodfor Computationof Long Ranged
No ratings yet
ANew Methodfor Computationof Long Ranged
12 pages
Corrosion and Corrosion Testing: Standard Terminology Relating To
No ratings yet
Corrosion and Corrosion Testing: Standard Terminology Relating To
5 pages
Project Group4
No ratings yet
Project Group4
10 pages
Graphics
No ratings yet
Graphics
8 pages
DEM Modeling: Lecture 11 Coarse Contact Detection: C. Wassgren, Purdue University 1
No ratings yet
DEM Modeling: Lecture 11 Coarse Contact Detection: C. Wassgren, Purdue University 1
26 pages
CS 294-73 Software Engineering For Scientific Computing Lecture 12: Particle Methods Homework 3
No ratings yet
CS 294-73 Software Engineering For Scientific Computing Lecture 12: Particle Methods Homework 3
32 pages
Real Time Physics Class Notes
No ratings yet
Real Time Physics Class Notes
57 pages
Information Bulletin For PHD Admission AUTUMN 2025
No ratings yet
Information Bulletin For PHD Admission AUTUMN 2025
26 pages
Matlab Code For Truss Problem, Generalised Program
80% (5)
Matlab Code For Truss Problem, Generalised Program
2 pages
Rigging Safety
No ratings yet
Rigging Safety
190 pages
Upload 01
No ratings yet
Upload 01
5 pages
Turnigy Accucell-8150, Cargador / Balanceador / Descargador, Manual English
100% (2)
Turnigy Accucell-8150, Cargador / Balanceador / Descargador, Manual English
23 pages
HV2 650V Datasheet
No ratings yet
HV2 650V Datasheet
6 pages
N Bodyreport
No ratings yet
N Bodyreport
9 pages
Unit5 RMD PDF
No ratings yet
Unit5 RMD PDF
27 pages
HPC Parallel
No ratings yet
HPC Parallel
122 pages
CHARMM-G, A GPU Based MD Simulation Code With PME and Reaction Force Field For Studying Large Membrane Regions
No ratings yet
CHARMM-G, A GPU Based MD Simulation Code With PME and Reaction Force Field For Studying Large Membrane Regions
32 pages
DEMO Farm Tools
No ratings yet
DEMO Farm Tools
8 pages
Eurotile Pricelist2015 3
No ratings yet
Eurotile Pricelist2015 3
147 pages
Particle Simulator
No ratings yet
Particle Simulator
26 pages
Graph Algorithm
No ratings yet
Graph Algorithm
10 pages
Project 6 2017 PDF
No ratings yet
Project 6 2017 PDF
4 pages
DEM Lecture0224
No ratings yet
DEM Lecture0224
53 pages
Football Teams Managmenet
No ratings yet
Football Teams Managmenet
32 pages
Multi-Core Strategies For Particle Me: John R. Williams, David Holmes and Peter Tilke
No ratings yet
Multi-Core Strategies For Particle Me: John R. Williams, David Holmes and Peter Tilke
7 pages
MCWP 4-6 (1996) - MAGTF Supply Ops
No ratings yet
MCWP 4-6 (1996) - MAGTF Supply Ops
68 pages
Strunk Oliver Stop My Constraints From Blowing Up
No ratings yet
Strunk Oliver Stop My Constraints From Blowing Up
51 pages
Untitled Document
No ratings yet
Untitled Document
2 pages
Graphs Spanning
No ratings yet
Graphs Spanning
51 pages
HPC Cmakeshort
No ratings yet
HPC Cmakeshort
11 pages
Florence Bertails - Linear Time Super-Helices
No ratings yet
Florence Bertails - Linear Time Super-Helices
10 pages
The Barnes-Hut Algorithm
No ratings yet
The Barnes-Hut Algorithm
8 pages
HPC Cmake
No ratings yet
HPC Cmake
76 pages
Physics: in Computer Graphics
No ratings yet
Physics: in Computer Graphics
26 pages
HPC Iterative
No ratings yet
HPC Iterative
106 pages
Using Python To Calculate Gravitational Force: Zachary Hafen
No ratings yet
Using Python To Calculate Gravitational Force: Zachary Hafen
3 pages
PixelJunk Shooter Fluid Sim and Rendering
No ratings yet
PixelJunk Shooter Fluid Sim and Rendering
60 pages
Flume User Guide
No ratings yet
Flume User Guide
48 pages
10624-Master Index For Pipe Supports
No ratings yet
10624-Master Index For Pipe Supports
36 pages
Brooding in Poultry
No ratings yet
Brooding in Poultry
32 pages
HPC Architecture
No ratings yet
HPC Architecture
86 pages
HPC Unix
No ratings yet
HPC Unix
46 pages
Introduction to computational plasma physics: 雷奕安 62755208，[email protected]
No ratings yet
Introduction to computational plasma physics: 雷奕安 62755208，[email protected]
45 pages
Biological Colision
No ratings yet
Biological Colision
4 pages
Internal Force Density
No ratings yet
Internal Force Density
4 pages
Multiple Choice. Choose Multiple Answers Question Bank
No ratings yet
Multiple Choice. Choose Multiple Answers Question Bank
15 pages
Topic 2: Molecular Dynamics of Lennard-Jones System (Continued)
No ratings yet
Topic 2: Molecular Dynamics of Lennard-Jones System (Continued)
3 pages
HPC Arithmetic
No ratings yet
HPC Arithmetic
62 pages
HPC Scaling
No ratings yet
HPC Scaling
56 pages
Batchlayout: A Batch-Parallel Force-Directed Graph Layout Algorithm in Shared Memory
No ratings yet
Batchlayout: A Batch-Parallel Force-Directed Graph Layout Algorithm in Shared Memory
10 pages
HPC Linear
No ratings yet
HPC Linear
52 pages
Section 3 - Group Assignment
No ratings yet
Section 3 - Group Assignment
7 pages
WBI11 01 Que 20190108
100% (1)
WBI11 01 Que 20190108
32 pages
Artificial Intelligence & Decision Support Systems
No ratings yet
Artificial Intelligence & Decision Support Systems
5 pages
HPC Debug
No ratings yet
HPC Debug
38 pages
HPC Programming
No ratings yet
HPC Programming
33 pages
TP343
No ratings yet
TP343
7 pages
Multimetru Auto, Model ADD51
No ratings yet
Multimetru Auto, Model ADD51
2 pages
Physically Based Modeling: Particle System Dynamics
No ratings yet
Physically Based Modeling: Particle System Dynamics
13 pages
Gap Filling Activities With Without Clues Tag Questions Pu Siam Ibn Bashar Al Saud
No ratings yet
Gap Filling Activities With Without Clues Tag Questions Pu Siam Ibn Bashar Al Saud
65 pages
Iev01582 Pi-2893
No ratings yet
Iev01582 Pi-2893
3 pages
Seaport
No ratings yet
Seaport
22 pages
TKC41005
No ratings yet
TKC41005
2 pages
HPC Graph
No ratings yet
HPC Graph
22 pages
Network Time Protocol (NTP) General Overview: David L. Mills University of Delaware
No ratings yet
Network Time Protocol (NTP) General Overview: David L. Mills University of Delaware
22 pages
Particle System Dynamcis Cs Cornell Edu
No ratings yet
Particle System Dynamcis Cs Cornell Edu
13 pages
Linear Models: Stability and Redundancy: 2.1 Singular Value Decomposition
No ratings yet
Linear Models: Stability and Redundancy: 2.1 Singular Value Decomposition
24 pages
Chapter 2 Memory Organization
No ratings yet
Chapter 2 Memory Organization
23 pages
HPC Intro
No ratings yet
HPC Intro
16 pages
HPC Performance
No ratings yet
HPC Performance
13 pages
09 MSDS Wax Dispersant
No ratings yet
09 MSDS Wax Dispersant
8 pages
HPC Pkgconfig
No ratings yet
HPC Pkgconfig
12 pages
HPC Git
No ratings yet
HPC Git
12 pages
Lec2 17
No ratings yet
Lec2 17
27 pages
Paperbangkok
No ratings yet
Paperbangkok
17 pages
Gambling, Random Walks and The Central Limit Theorem: 3.1 Random Variables and Laws of Large Num-Bers
No ratings yet
Gambling, Random Walks and The Central Limit Theorem: 3.1 Random Variables and Laws of Large Num-Bers
59 pages
Asset-V1 HKUx+HKU 08x+1T2030+type@asset+block@Introduction To FinTech Course Syllabus 05142018
No ratings yet
Asset-V1 HKUx+HKU 08x+1T2030+type@asset+block@Introduction To FinTech Course Syllabus 05142018
2 pages
Family Miles 23.9.2023
No ratings yet
Family Miles 23.9.2023
5 pages
Lec1 17
No ratings yet
Lec1 17
39 pages
Long-Range Dependency Effects in Network Timekeeping: David L. Mills University of Delaware
No ratings yet
Long-Range Dependency Effects in Network Timekeeping: David L. Mills University of Delaware
33 pages
IENG300 Assignment - 1 Solution
No ratings yet
IENG300 Assignment - 1 Solution
6 pages
Lec4 17
No ratings yet
Lec4 17
22 pages
4 Indus Dancing Girls Represent Mohini S
No ratings yet
4 Indus Dancing Girls Represent Mohini S
20 pages
Elective I (Math)
No ratings yet
Elective I (Math)
2 pages
DS-M5504HM-T Series Mobile DVR: Main Features
No ratings yet
DS-M5504HM-T Series Mobile DVR: Main Features
4 pages
Equity Structured Products Accumulator/ Decumulator
No ratings yet
Equity Structured Products Accumulator/ Decumulator
5 pages
0.1 Installation of R Packages
No ratings yet
0.1 Installation of R Packages
10 pages

HPC Nbody

Uploaded by

HPC Nbody

Uploaded by

N-body Problems

for each particle i

• Naive all-pairs algorithm: O (N 2 )

Procedure Quad_Tree_Insert(j, n) // Try to insert particle j at node n in

• Consider cells on the top level

function ( TM, CM ) = Compute_Mass( n )

• Each cell considers ‘rings’ of equi-distant cells

• After position update, particles can move to next box: load

for level ℓ from one above the finest to the coarsest:

for level ℓ from one below the coarses to the finest:

• Traditional algorithm: distribute particles, for each particle gather

You might also like