Citation
Salmon, John K. (1991) Parallel hierarchical Nbody methods. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/3J5VVR08. https://resolver.caltech.edu/CaltechTHESIS:04112011113813016
Abstract
Recent algorithmic advances utilizing hierarchical data structures have resulted in a dramatic reduction in the time required for computer simulation of Nbody systems with longrange interactions. Computations which required O(N^2) operations can now be done in O(N log N) or O(N). We review these tree methods and find that they may be distinguished based on a few simple features. The BarnesHut (BH) algorithm has received a great deal of attention, and is the subject of the remainder of the dissertation. We present a generalization of the BH tree and analyze the statistical properties of such trees in detail. We also consider the expected number of operations entailed by an execution of the BH algorithm. We find an optimal value for m, the maximum number of bodies in a terminal cell, and confirm that the number of operations is O(N log N), even if the distribution of bodies is not uniform. The mathematical basis of all hierarchical methods is the multipole approximation. We discuss multipole approximations, for the case of arbitrary, spherically symmetric, and Newtonian Green's functions. We describe methods for computing multipoles and evaluating multipole approximations in each of these cases, emphasizing the tradeoff between generality and algorithmic complexity. Nbody simulations in computational astrophysics can require 10^6 or even more bodies. Algorithmic advances are not sufficient, in and of themselves, to make computations of this size feasible. Parallel computation offers, a priori, the necessary computational power in terms of speed and memory. We show how the BH algorithm can be adapted to execute in parallel. We use orthogonal recursive bisection to partition space. The logical communication structure that emerges is that of a hypercube. A local version of the BH tree is constructed in each processor by iteratively exchanging data along each edge of the logical hypercube. We obtain speedups in excess of 380 on a 512 processor system for simulations of galaxy mergers with 180000 bodies. We analyze the performance of the parallel version of the algorithm and find that the overhead is due primarily to interprocessor synchronization delays and redundant computation. Communication is not a significant factor.
Item Type:  Thesis (Dissertation (Ph.D.)) 

Subject Keywords:  Physics 
Degree Grantor:  California Institute of Technology 
Division:  Physics, Mathematics and Astronomy 
Major Option:  Physics 
Thesis Availability:  Public (worldwide access) 
Research Advisor(s): 

Group:  Astronomy Department 
Thesis Committee: 

Defense Date:  6 December 1990 
Record Number:  CaltechTHESIS:04112011113813016 
Persistent URL:  https://resolver.caltech.edu/CaltechTHESIS:04112011113813016 
DOI:  10.7907/3J5VVR08 
Default Usage Policy:  No commercial reproduction, distribution, display or performance rights in this work are provided. 
ID Code:  6291 
Collection:  CaltechTHESIS 
Deposited By:  Tony Diaz 
Deposited On:  11 Apr 2011 22:30 
Last Modified:  10 Mar 2020 23:11 
Thesis Files

PDF
 Final Version
See Usage Policy. 10MB 
Repository Staff Only: item control page