next up previous
Next: Additional benchmark information: Up: A Tool for Cluster Previous: In Case of Problems

Subsections

NASA Benchmark Suite (details)

The NAS suite will probably produce the most problems for end users. The main script run_suite is designed to ``wrap'' around and hide the various issues with the different MPI's and compilers. While run_suite does an adequate job, it certainly can not predict the potential software environments on a cluster. The NAS suite can be run from the command line. The following options are required. You also need to list the machine names in npb/cluster/machines file (one per line). This file is used by MPI to start your programs. In the npb directory, run

# run_suite -h

Usage: ./run_suite <OPTIONS>

Options:

run_suite,v 0.3 2/12/2002

Usage: ./run_suite <OPTIONS>

Options:

-v verbose output from make stage (default=make.log)

-c <compiler> compiler (gnu/pgi/intel)

-n <processors> number of processors

-t <test> test size (A,B,C,S)

-m <mpi> mpi version(lam,mpich,mpipro,dummy)

-o only build programs

-h show this help

To run on a single CPU use: '-c gnu -n 1 -t S -m dummy'

If you have problems producing the binary files, consult the make.log file for a complete listing for the make process. Currently the test suite has been tested on Paralogic clusters using GNU, Portland Group, and Intel compilers. It has been tested and works with LAM-MPI, MPICH, and MPI/PRO.

Description of the NAS tests.

BT
is a simulated CFD application that uses an implicit algorithm to solve 3dimensional (3D) compressible NavierStokes equations. The finite differences solution to the problem is based on an Alternating Direction Implicit (ADI) approximate factorization that decouples the x, y, and z dimensions. The resulting systems are BlockTridiagonal of 5x5 blocks and are solved sequentially along each dimension.
SP
is a simulated CFD application that has a similar structure to BT. The finite differences solution to the problem is based on a BeamWarming approximate factorization that decouples the x, y, and z dimensions. The resulting system has scalar Pentadiagonal bands of linear equations that are solved sequentially along each dimension.
LU
is a simulated CFD application that uses symmetric successive overrelaxation (SSOR) method to solve a seven block diagonal system resulting from finite difference discretization of the NavierStokes equations in 3D by splitting to into block Lower and Upper triangular systems.
FT
contains the computational kernel of a 3D fast Fourier Transform (FFT)based spectral method. FT performs three one dimensional (1D) FFT's, one for each dimension.
MG
uses a Vcycle MultiGrid method to compute the solution of the 3D scalar Poisson equation. The algorithm works continuously on a set of grids that are made between coarse and fine. It tests both short and long distance data movement.
CG
uses a Conjugate Gradient method to compute an approximation to the smallest eigenvalue of a large, sparse, unstructured matrix. This kernel tests unstructured grid computations and communications by using a matrix with randomly generated locations of entries.
EP
is an Embarrassingly Parallel benchmark. It generates pairs of Gaussian random deviates according to a specific scheme. The goal is to establish the reference point for peak performance of a given platform.


next up previous
Next: Additional benchmark information: Up: A Tool for Cluster Previous: In Case of Problems
Douglas Eadline 2003-03-17