Gamma @ ASPLOS 2021

Table of Contents

Paper link:


  • SpGeMM is bottlenecked by memory traffic and data movement.
    • Accelerators must minimize data movement.
  • SpGeMM has rich algorithmic diversity.
    • Wide range of dataflows with different tradeoffs.
  • Gustavson's algorithm is more efficient than inner/outer product.
    • more irregular reuse across data structures, requires specialized storage organization.

Gustavson's algorithm

The difference between inner product/outer product and Gustavson's algorithm is the order of loops.


Author: expye(Zihao Ye)


Date: 2022-07-31 Sun 00:00

Last modified: 2022-12-27 Tue 07:18

Licensed under CC BY-NC 4.0