DOI: 10.1145/509593.509621

Parallel threshold-based ILU factorization

Published: 15 November 1997

Abstract

The sparse linear systems arising in finite element applications are commonly solved using iterative methods. In particular, as problem size grows, the computational and memory requirements render in-core direct solution methods unusable, leaving iterative methods as the only viable alternative for solving these problems in core.

The major computational kernels of an iterative method are (i) computation of the preconditioner, (ii) multiplication of a sparse matrix with a vector, and (iii) application of the preconditioner. Threshold-based incomplete LU factorizations have been found to be quite effective in preconditioning iterative system solvers [14]. However, because these factorizations allow the fill elements to be created dynamically, their parallel formulations have not been well understood, and they have been considered unsuitable for distributed-memory parallel computers [13]. Furthermore, solution of the resulting sparse triangular systems (which is required for the application of the preconditioner) is generally more difficult to parallelize than the multiplication of a sparse matrix with a vector.

In this paper we show that highly parallel graph partitioning algorithms, in conjunction with parallel algorithms for computing maximal independent sets, can be used to develop scalable parallel formulations of incomplete factorization algorithms. We present a highly parallel formulation of the ILUT factorization algorithm [14] for distributed-memory parallel computers. This algorithm uses our parallel multilevel k-way graph partitioning algorithm [6,8] in conjunction with a parallel maximal independent subset algorithm to parallelize both the factorization and the solution of the resulting triangular factors. We also present a modified ILUT factorization algorithm (ILUT*) that requires less time and is more scalable than ILUT.
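
To illustrate what a maximal independent set computation looks like, here is a minimal Python sketch in the style of Luby's randomized algorithm [10]. It is a sequential simulation of the parallel rounds, not the authors' implementation; the function name `luby_mis`, the adjacency-dictionary input, and the `seed` parameter are assumptions made for illustration. The abstract does not say how the paper selects or uses these sets. The sketch only shows the property that makes them attractive: no two chosen vertices share an edge.

```python
import random


def luby_mis(adj, seed=0):
    """Return a maximal independent set of an undirected graph.

    adj maps each vertex (an int) to the set of its neighbours. The graph
    is assumed symmetric with no self-loops. Each pass of the while loop
    simulates one parallel round (hypothetical helper, for illustration).
    """
    rng = random.Random(seed)
    remaining = set(adj)
    independent = set()
    while remaining:
        # Every remaining vertex draws a random key; the vertex id breaks ties.
        key = {v: (rng.random(), v) for v in remaining}
        # A vertex wins the round if its key is smaller than the key of
        # every neighbour that is still in play.
        winners = {
            v
            for v in remaining
            if all(key[v] < key[u] for u in adj[v] if u in remaining)
        }
        independent |= winners
        # Winners and their neighbours leave the graph before the next round.
        removed = set(winners)
        for v in winners:
            removed |= adj[v]
        remaining -= removed
    return independent


# Usage: a path graph 0 - 1 - 2 - 3 - 4.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
mis = luby_mis(path)
```

Each round removes at least the vertex holding the globally smallest key, so the loop terminates. The result is independent because two adjacent vertices cannot both hold the smaller key, and maximal because a vertex is discarded only when it wins or one of its neighbours does.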
Our experiments on a Cray T3D show that our parallel ILUT* algorithm achieves a high degree of concurrency and that, when used as a preconditioner, it is comparable in quality to the unmodified ILUT algorithm. Furthermore, our experiments using the GMRES iterative solver show that the time spent computing the factorization with the ILUT* algorithm is usually much less than the time required to solve the systems.
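
For readers unfamiliar with the dual-threshold rule that gives ILUT [14] its name, the following Python sketch shows the two dropping criteria applied to a single working row: entries smaller than a relative tolerance are discarded, and then at most a fixed number of the largest entries are kept on each side of the diagonal. This is a simplified illustration under stated assumptions, not the paper's parallel algorithm and not ILUT*. The function name `drop_row`, the dictionary row representation, and the parameter names `tau` and `p` are chosen here for illustration, and both rules are applied to an already-eliminated row rather than interleaved with the elimination.

```python
def drop_row(row, i, tau, p, norm):
    """Apply a dual-threshold dropping rule to working row i.

    row  : dict mapping column index -> value (hypothetical representation)
    i    : index of the row, so row[i] is the diagonal entry
    tau  : relative drop tolerance
    p    : maximum number of entries kept on each side of the diagonal
    norm : 2-norm of the original (unfactored) row i
    """
    cutoff = tau * norm

    # First threshold: discard entries that are small relative to the row norm.
    lower = [(j, v) for j, v in row.items() if j < i and abs(v) >= cutoff]
    upper = [(j, v) for j, v in row.items() if j > i and abs(v) >= cutoff]

    # Second threshold: keep only the p largest survivors in each part.
    def largest(entries):
        return sorted(entries, key=lambda e: abs(e[1]), reverse=True)[:p]

    kept = dict(largest(lower) + largest(upper))

    # The diagonal entry is always retained.
    if i in row:
        kept[i] = row[i]
    return kept


# Usage: row 2 of a small system, keeping one entry per side.
row = {0: 0.001, 1: -2.0, 2: 4.0, 3: 0.5, 4: -0.3, 5: 1e-4}
kept = drop_row(row, i=2, tau=0.01, p=1, norm=5.0)
```

Because which entries survive depends on the values computed during elimination, the sparsity pattern of the factors is not known in advance, which is the dynamic fill behaviour the abstract identifies as the obstacle to parallelization.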

References

[1] Owe Axelsson. Iterative Solution Methods. Cambridge University Press, New York, NY, 1994.
[2] E. F. D'Azevedo, P. A. Forsyth, and W. P. Tang. Towards a cost effective ILU preconditioner with high level fill. BIT, 1992.
[3] K. A. Gallivan, A. Sameh, and Z. Zlatev. A parallel hybrid sparse linear system solver. Computing Systems in Engineering, 1:183-195, June 1990.
[4] Anshul Gupta, George Karypis, and Vipin Kumar. Highly scalable parallel algorithms for sparse matrix factorization. Technical Report 94-63, Department of Computer Science, University of Minnesota, Minneapolis, MN, 1994. To appear in IEEE Transactions on Parallel and Distributed Computing. Available on WWW at URL http://www.cs.umn.edu/~karypis.
[5] Mark T. Jones and Paul E. Plassmann. Scalable iterative solution of sparse linear systems. Parallel Computing, 20:753-773, 1994.
[6] G. Karypis and V. Kumar. Parallel multilevel k-way partitioning scheme for irregular graphs. Technical Report TR 96-036, Department of Computer Science, University of Minnesota, 1996. Also available on WWW at URL http://www.cs.umn.edu/~karypis. A short version appears in Supercomputing 96.
[7] G. Karypis and V. Kumar. Parallel threshold-based ILU factorization. Technical Report TR 96-061, Department of Computer Science, University of Minnesota, 1996. Also available on WWW at URL http://www.cs.umn.edu/~karypis.
[8] George Karypis and Vipin Kumar. A coarse-grain parallel multilevel k-way partitioning algorithm. In Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, 1997.
[9] R. N. Klepfer, C. R. Johnson, and R. S. MacLeod. The effects of inhomogeneities and anisotropies on electrocardiographic fields: A three-dimensional finite element study. In IEEE Engineering in Medicine and Biology Society, 17th Annual International Conference, 1995.
[10] Michael Luby. A simple parallel algorithm for the maximal independent set problem. SIAM Journal on Computing, 15(4):1036-1053, 1986.
[11] S. Ma and Y. Saad. Distributed ILU(0) and SOR preconditioners for unstructured sparse linear systems. Technical Report 94-027, Army High Performance Computing Research Center, University of Minnesota, Minneapolis, MN, 1994.
[12] J. A. Meijerink and H. A. van der Vorst. An iterative solution method for linear systems of which the coefficient matrix is a symmetric M-matrix. Math. Comp., 31(137):148-162, 1977.
[13] Y. Saad. ILUM: A parallel multielimination ILU preconditioner for general sparse matrices. Technical Report CS-92-241, Department of Computer Science, University of Minnesota, 1992.
[14] Y. Saad. ILUT: a dual threshold incomplete LU factorization. Numerical Linear Algebra with Applications, 1:387-402, 1994.
[15] Y. Saad and M. H. Schultz. GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM Journal on Scientific and Statistical Computing, 7:856-869, 1986.
[16] Yousef Saad. Iterative Methods for Sparse Linear Systems. PWS Publishing, Boston, MA, 1996.
[17] J. W. Watts-III. A conjugate gradient truncated direct method for the iterative solution of the reservoir pressure equation. Society of Petroleum Engineers Journal, 21:345-353, 1981.
[18] D. P. Young, R. G. Melvin, F. T. Johnson, J. E. Bussoletti, L. B. Wigton, and S. S. Samant. Application of sparse matrix solvers as effective preconditioners. SIAM Journal on Scientific and Statistical Computing, 10:1186-1199, 1989.


Published In

SC '97: Proceedings of the 1997 ACM/IEEE conference on Supercomputing
November 1997
921 pages
ISBN:0897919858
DOI:10.1145/509593

Publisher

Association for Computing Machinery

New York, NY, United States

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%
