Elsevier Science Home
Computer Physics Communications Program Library
Full text online from Science Direct
Programs in Physics & Physical Chemistry
CPC Home

[Licence| Download | New Version Template] aehu_v1_1.tar.gz(1044 Kbytes)
Manuscript Title: A Parallel Solver for Huge Dense Linear Systems
Authors: J.M. Badia, J.L. Movilla, J.I. Climente, M. Castillo, M. Marqués, R. Mayo, E.S. Quintana-Ortí, J. Planelles
Program title: Huge Dense System Solver (HDSS)
Catalogue identifier: AEHU_v1_1
Distribution format: tar.gz
Journal reference: Comput. Phys. Commun. 182(2011)2441
Programming language: Fortran90, C.
Computer: Parallel architectures: multiprocessors, computer clusters.
Operating system: Linux/Unix.
Has the code been vectorised or parallelized?: Yes, includes MPI directives.
RAM: Tested for up to 190GB
Keywords: LU decomposition, Out-of-core, Parallel computing.
Classification: 6.5.

External routines: MPI (http://www.mpi-forum.org/), BLAS (http://www.netlib.org/blas/), PLAPACK (http://www.cs.utexas.edu/~plapack/), POOCLAPACK (ftp://ftp.cs.utexas.edu/pub/rvdg/PLAPACK/pooclapack.ps)
(Code for PLAPACK and POOCLAPACK is included in the distribution).

Does the new version supersede the previous version?: Yes

Nature of problem:
Huge scale dense systems of linear equations, Ax = B, beyond standard LAPACK capabilities.

Solution method:
The linear systems are solved by means of parallelized routines based on the LU factorization, using efficient secondary storage algorithms when the available main memory is insufficient.

Reasons for new version:
In many applications we need to guarantee a high accuracy in the solution of very large linear systems and we can do it by using double-precision arithmetic.

Summary of revisions:
Version 1.1
  • Can be used to solve linear systems using double-precision arithmetic.
  • New version of the initialization routine. The user can choose the kind of arithmetic and the values of several parameters of the environment.

Running time:
About 5 hours to solve a system with more than 200,000 equations and more than 10,000 right-hand side vectors using double-precision arithmetic on an eight-node commodity cluster with a total of 64 Intel cores.