Fast and efficient linear programming and linear least-squares computations


Journal Article

We present a new parallel algorithm for computing a least-squares solution to a sparse overdetermined system of linear equations Ax = b such that the m × n matrix A is sparse and the graph, G = (V, E), of the matrix H= I AT A O has an s(m + n)-separator family, i.e. either |V| < n0 for a fixed constant n0 or, by deleting a separator subset S of vertices of size ≤s(m + n), G can be partitioned into two disconnected subgraphs having vertex sets V1, V2 of size ≤ 2 3 (m + n), and each of the two resulting subgraphs induced by the vertex sets S U ́ Vi, i = 1, 2, can be recursively s(|S U ́ Vi|)-separated in a similar way. Our algorithm uses O(log (m + n) log2 s (m + n)) steps and ≤s3(m + n) processors; it relies on our recent parallel algorithm for solving sparse linear systems and has several immediate applications of interest, in particular to mathematical programming, to sparse nonsymmetric systems of linear equations and to the path algebra computations. We most closely examine the impact on the linear programming problem (LPP) which requires maximizing cTy subject to ATy ≤ b, y ≥ 0, where A is an m × n matrix. Hereafter it is assumed that m ≥ n. The recent algorithm by Karmarkar gives the best-known upper estimate [O (m3.5 L) arithmetic operations, where L is the input size] for the cost of the solution of this problem in the worst case. We prove an asymptotic improvement of that result in the case where the graph of the associated matrix H has an s (m + n)-separator family; then our algorithm can be implemented using O (mL log m log2 s (m + n)) parallel arithmetic steps, s3 (m + n) processors and a total of O (mLs3 (m + n) log m log2 s (m + n)) arithmetic operations. In many cases of practical importance this is a considerable improvement on the known estimates: for example, s (m + n) = √8 (m + n) if G is planar [as occurs in many operations research applications; for instance, in the problem of computing the maximum multicommodity flow with a bounded number of commodities in a network having an s (m + n)-separator family], so that the processor bound is only 8 √8 (m + n)1.5 and the total number of arithmetic steps is O (m2.5 L). Similarly, Karmakar's algorithm and the known algorithms for the solution of overdetermined linear systems are accelerated in the case of dense input matrices via our recent parallel algorithms for the inversion of dense k × k matrices using O (log2 k) steps and k3 processors. Combined with a modification of Karmarkar's algorithm, this implies solution of the LPP using O (Lm log2 m) steps and m2.5 processors. The stated results promise some important practical applications. Theoretically, the above processor bounds can be reduced for dense matrix inversion to o (k2.5) and for the LPP to o (m2.165) in the dense case and to o (s2.5 (m + n)) in the sparse case (preserving the same number of parallel steps); this also decreases the known sequential time bound for the LPP by a factor of m0.335, i.e. to O (Lm3.165). © 1986.

Full Text

Duke Authors

Cited Authors

  • Pan, V; Reif, J

Published Date

  • January 1, 1986

Published In

Volume / Issue

  • 12 / 12 PART A

Start / End Page

  • 1217 - 1227

International Standard Serial Number (ISSN)

  • 0898-1221

Digital Object Identifier (DOI)

  • 10.1016/0898-1221(86)90006-4

Citation Source

  • Scopus