COALA

Publications


Refereed journals (published and in revision)

J. Demmel, L. Grigori, M. Gu, and H. Xiang
Communication avoiding rank revealing QR factorization with column pivoting, LAWN 276, in minor revision, SIAM J. Matrix Anal. & Appl., 2013.

A. Khabou, J. Demmel, L. Grigori, and M. Gu
LU factorization with panel rank revealing pivoting and its communication avoiding version, SIAM J. Matrix Anal. & Appl., Vol. 34, No. 3, pp. 1401-1429, 2013, preliminary version published as LAWN 263.

J. Demmel, L. Grigori, M. F. Hoemmen, and J. Langou,
Communication-optimal parallel and sequential QR and LU factorizations,
SIAM Journal on Scientific Computing, Vol. 34, No. 1, 2012 (also available as arXiv:0808.2664v1), short version of UCB-EECS-2008-89 and LAWN 204, available since 2008.

L. Grigori, J. Demmel, and H. Xiang, CALU: a communication optimal LU factorization algorithm, SIAM Journal on Matrix Analysis and Applications, Vol. 32, pp. 1317-1350, 2011, preliminary version published as LAWN 226.

G. Ballard, J. Demmel, O. Holtz, and O. Schwartz, Minimizing Communication in Numerical Linear Algebra, SIAM Journal on Matrix Analysis and Applications, 2011, UCB-EECS-2009-62.

S. Tomov, J. Dongarra, and M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems, Parallel Computing, Vol. 36, No. 5-6, pp. 232-240, 2010.

Conference proceedings

G. Ballard, J. Demmel, L. Grigori, M. Jacquelin, H. D. Nguyen and E. Solomonik
Reconstructing Householder Vectors from Tall-Skinny QR, Proceedings of IEEE International Parallel & Distributed Processing Symposium, IPDPS 2014.

G. Ballard, A. Buluc, J. Demmel, L. Grigori, B. Lipshitz, O. Schwartz and S. Toledo,
Communication Optimal Parallel Multiplication of Sparse Random Matrices, Proceedings of the ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2013.

L. Grigori, P.-Y. David, J. Demmel, and S. Peyronnet, Brief announcement: Lower bounds on communication for sparse Cholesky factorization of a model problem,
ACM SPAA 2010 (3 pages).

S. Donfack, L. Grigori, and A. Kumar Gupta, Adapting communication-avoiding LU and QR factorizations to multicore architectures,
Proceedings of IEEE International Parallel & Distributed Processing Symposium IPDPS, April 2010.

M. Mohiyuddin, M. Hoemmen, J. Demmel, and K. Yelick, Minimizing Communication in Sparse Matrix Solvers,
Proceedings of SC09, November 2009.

G. Ballard, J. Demmel, O. Holtz, and O. Schwartz, Communication-Optimal Parallel and Sequential Cholesky Decomposition,
Proceedings of Symposium on Parallelism in Algorithms and Architectures (SPAA 2009), August 2009.

L. Grigori, J. Demmel, and H. Xiang, Communication avoiding Gaussian elimination,
Proceedings of the IEEE/ACM SuperComputing SC08 Conference, November 2008. Also INRIA TR 6523.

Technical reports

L. Grigori and S. Moufawad,
Communication avoiding ILU0 preconditioner, INRIA TR 8266.

J. Demmel, L. Grigori, M. F. Hoemmen, and J. Langou, Communication-optimal parallel and sequential QR and LU factorizations: Theory and Practice,
UCB-EECS-2008-89 and LAWN 204 (tech report not submitted elsewhere)
