Sparse Cholesky Factorization on a Multiprocessor

Zmijewski, Earl

Sparse Cholesky Factorization on a Multiprocessor

dc.contributor.author	Zmijewski, Earl	en_US
dc.date.accessioned	2007-04-23T17:21:20Z
dc.date.available	2007-04-23T17:21:20Z
dc.date.issued	1987-08	en_US
dc.description.abstract	Systems of linear equations of the form $Ax = b,$ where $A$ is a large sparse symmetric positive definite matrix, arise frequently in science and engineering. The sequential computation of the solution vector $x$ is well understood and many algorithms for this problem employ the following steps. First, try to reorder the rows and columns of $A$ so that its Cholesky factor $L$ is sparse. Next, determine the structure of $L$ by symbolically factoring $A$ and allocate storage for $L$. Finally, numerically factor $A$ and then compute $x$ by solving the triangular systems $Ly=b$ and $L^{T}x=y$. In this thesis, we present parallel algorithms for the different steps of this computation. We design our algorithms for message-passing multiprocessors. The algorithms limit communication overhead and can solve problems that are too large to reside in the memory of any single processor. We provide numerical results based upon an implementation on an Intel hypercube. We begin by presenting a parallel column-oriented sparse numeric Cholesky factorization algorithm. Then, viewing $A$ as a graph, we develop a parallel graph partitioning algorithm that we use to order the columns of $A$ and partition them among the processors. In addition to producing a sparse $L$, the resulting ordering and partitioning allows for parallelism and reduces communication overhead during the remaining phases of the computation. The parallel graph partitioning algorithm is based on the sequential Kernighan-Lin algorithm for finding small edge separators. Since the computation of a particular column of $L$ may depend on columns stored on several processors, the processors cannot operate independently. The elimination forest of $A$ captures these dependencies and allows for efficient numeric factorization. We provide a parallel algorithm for computing the forest and prove its correctness. We also develop a parallel row-oriented symbolic factorization algorithm that uses the elimination forest. Finally, we describe fast parallel forward and backward triangular solve algorithms. These algorithms solve for the components of $x$ requiring information from other processors by using a variant of Li and Coleman's dense triangular solve algorithms.	en_US
dc.format.extent	6596284 bytes
dc.format.extent	1358166 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	application/postscript
dc.identifier.citation	http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR87-856	en_US
dc.identifier.uri	https://hdl.handle.net/1813/6696
dc.language.iso	en_US	en_US
dc.publisher	Cornell University	en_US
dc.subject	computer science	en_US
dc.subject	technical report	en_US
dc.title	Sparse Cholesky Factorization on a Multiprocessor	en_US
dc.type	technical report	en_US

Files

Original bundle

Now showing 1 - 2 of 2

Name:: 87-856.pdf
Size:: 6.29 MB
Format:: Adobe Portable Document Format

Download

Name:: 87-856.ps
Size:: 1.3 MB
Format:: Postscript Files

Download

Collections

Computer Science Technical Reports