Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply

Demmel, James W.; Yelick, Katherine A.; Vuduc, Richard W.; Nishtala, Rajesh

Description

We consider the problem of building high-performance implementations of sparse matrix-vector multiply (SpM x V), or y = y + A * x, which is an important and ubiquitous computational kernel. Prior work indicates that cache blocking of SpM x V is extremely important for some matrix and machine combinations, with speedups as high as 3x. In this paper we present a new, more compact data structure for cache blocking for SpM x V and look at the general question of when and why performance improves. Cache blocking appears to be most effective when, simultaneously, 1) the vector x does not fit in cache, 2) the vector y fits in cache, 3) the nonzeros are distributed throughout the matrix, and 4) the nonzero density is sufficiently high. In particular, we find that cache blocking does not help with band matrices, no matter how large x and y are, since the matrix structure already lends itself to the optimal access pattern.
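To make the kernel concrete, the following is a minimal sketch of y = y + A * x in compressed sparse row (CSR) form, together with a simple column-panel variant of cache blocking. It is not the compact data structure presented in the report; the function names, the panel width `block_cols`, and the use of a dense list-of-lists as input are all assumptions chosen for illustration. Each panel is stored as its own CSR matrix, so one pass over a panel touches only a contiguous slice of x.

```python
from typing import List, Tuple

# Hypothetical CSR layout for this sketch: (row_ptr, col_idx, values).
CSR = Tuple[List[int], List[int], List[float]]


def dense_to_csr(rows: List[List[float]], col_lo: int, col_hi: int) -> CSR:
    """Build CSR arrays for columns [col_lo, col_hi) of a dense matrix.

    Column indices stay global, so x can be indexed directly.
    """
    row_ptr, col_idx, values = [0], [], []
    for row in rows:
        for j in range(col_lo, col_hi):
            if row[j] != 0.0:
                col_idx.append(j)
                values.append(row[j])
        row_ptr.append(len(values))
    return row_ptr, col_idx, values


def spmv_csr(csr: CSR, x: List[float], y: List[float]) -> None:
    """Compute y = y + A * x in place for one CSR matrix (or one panel)."""
    row_ptr, col_idx, values = csr
    for i in range(len(row_ptr) - 1):
        acc = y[i]
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y[i] = acc


def cache_block_columns(rows: List[List[float]], n_cols: int,
                        block_cols: int) -> List[CSR]:
    """Split A into column panels of width block_cols, one CSR per panel."""
    return [dense_to_csr(rows, lo, min(lo + block_cols, n_cols))
            for lo in range(0, n_cols, block_cols)]


def spmv_cache_blocked(blocks: List[CSR], x: List[float],
                       y: List[float]) -> None:
    """Apply the panels one after another; each reads only its slice of x."""
    for csr in blocks:
        spmv_csr(csr, x, y)


A = [[1.0, 0.0, 2.0, 0.0],
     [0.0, 3.0, 0.0, 4.0],
     [5.0, 0.0, 0.0, 6.0]]
x = [1.0, 2.0, 3.0, 4.0]

y_ref = [0.0] * 3
spmv_csr(dense_to_csr(A, 0, 4), x, y_ref)

y_blk = [0.0] * 3
spmv_cache_blocked(cache_block_columns(A, 4, 2), x, y_blk)
```

Both calls produce the same y. The blocked version trades one pass over y per panel for accesses to x that stay within a panel-sized window, which is consistent with the conditions listed above: it can only pay off when x is too large for cache while y is not.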

Prior work on performance modeling assumed that the matrices were small enough that x and y fit in the cache. However, when this is not the case, the optimal block sizes picked by these models may perform poorly, motivating us to update these performance models. In contrast, the optimum block sizes predicted by the new performance models generally match the measured optimum block sizes, and therefore the models can be used as a basis for a heuristic to pick the block size.
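The assumption made by the earlier models, that x and y fit in cache, can be checked with simple arithmetic. The sketch below is only a rough illustration and is not one of the report's performance models: the function name, the 8-byte element size (double precision), and the cache capacity passed in are assumptions, and the check ignores the matrix data that streams through the cache alongside the vectors.

```python
def vectors_fit_in_cache(n_rows: int, n_cols: int, cache_bytes: int,
                         elem_bytes: int = 8) -> dict:
    """Crude check of whether the source and destination vectors fit in cache.

    x has one entry per matrix column and y one entry per matrix row.
    elem_bytes=8 assumes double-precision values (an assumption of this sketch).
    """
    x_bytes = n_cols * elem_bytes
    y_bytes = n_rows * elem_bytes
    return {
        "x_fits": x_bytes <= cache_bytes,
        "y_fits": y_bytes <= cache_bytes,
        "both_fit": x_bytes + y_bytes <= cache_bytes,
    }


# Hypothetical example: a short, very wide matrix and a 2 MiB cache.
result = vectors_fit_in_cache(n_rows=1000, n_cols=1_000_000,
                              cache_bytes=2 * 1024 * 1024)
```

In this hypothetical case x needs about 8 MB and does not fit, while y needs 8 KB and does, which is the regime where the description suggests cache blocking is worth considering, provided the nonzero structure and density conditions also hold.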

We conclude with architectural suggestions that would make processor and memory systems more amenable to SpM x V.

Details

Title

Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply

Creator

Demmel, James W., Author
Computer Science Division, Publisher
Yelick, Katherine A., Author
Vuduc, Richard W., Author
Nishtala, Rajesh, Author

Published

2004

Full Collection Name

Electrical Engineering & Computer Sciences Technical Reports

Other Identifiers

CSD-04-1335

Type

Text

Format

technical reports

Extent

71 p

Archive

The Engineering Library

Usage Statement

Researchers may make free and open use of the UC Berkeley Library’s digitized public domain materials. However, some materials in our online collections may be protected by U.S. copyright law (Title 17, U.S.C.). Use or reproduction of materials protected by copyright beyond that allowed by fair use (Title 17, U.S.C. § 107) requires permission from the copyright owners. The use or reproduction of some materials may also be restricted by terms of University of California gift or purchase agreements, privacy and publicity rights, or trademark law. Responsibility for determining rights status and permissibility of any use or reproduction rests exclusively with the researcher. To learn more or make inquiries, please see our permissions policies (https://www.lib.berkeley.edu/about/permissions-policies).

Collection

EECS Technical Reports
