Numerical Algorithms for Modern Parallel Computer Architectures
Martin Schultz
Parallel computers have started to revolutionize scientific computation. The articles in this volume cover the applied mathematics, computer science, and application aspects of parallel scientific computing. They discuss major advances in multiprocessor architectures, parallel algorithm development and analysis, and parallel systems and programming languages. Optimizing the application of massively parallel architectures to real-world problems will provide the impetus for developing entirely new approaches to those problems.
Contents
A Look at the Evolution of Mathematical Software for Dense
A Review of Automatic Load Balancing
Adaptive Methods for Hyperbolic Problems
An Overview
Problem Decomposition and Communication Tradeoffs
Domain Decomposition Preconditioners for Elliptic Problems
Block Algorithms for Parallel Machines
A Block QR Factorization Scheme for Loosely Coupled
Other editions
Numerical Algorithms for Modern Parallel Computer Architectures, Martin Schultz, 2012
Numerical Algorithms for Modern Parallel Computer Architectures, Martin Schultz, 1988
Common terms and phrases
application assigned backsubstitution bandwidth binary tree bitonic sort block column Boolean cube Butterfly cache Cholesky factor complexity computation graph Computer Science concurrent configuration cyclic reduction data dependencies defined diagonal dimensional domain decomposition dynamic efficient embedding ensemble architectures equations execution factorisation Figure floating point forward elimination Gaussian elimination given granularity Gray code hypercube IEEE implementation iteration Lennart Johnsson LINPACK load balancing machine mapping matrix matrix multiplication mesh message passing method Multimax multiple multiprocessor neural network nodes number of processors orthogonal parallel algorithms parallel computation partitioning performance phase pivot row point template preconditioned preconditioners private memories problem Proc recursions reduce scheduling Schur shared bulk memory shared memory simulated annealing solution solve solvers sparse speedup step storage subdomains subgraph switch systolic arrays tasks techniques Toeplitz matrix triangular system tridiagonal systems update upper triangular vector VLSI wavefront window