Mohanty and a multi-institution team demonstrated a fast, explicit approach to constructing lossless vertex expansion networks.
Abstract: On multicore architectures, the ratio of peak memory bandwidth to peak floating-point performance (byte:flop ratio) is decreasing as core counts increase, further limiting the performance of ...
Abstract: Sparse matrix-vector multiplication (SpMV) is a fundamental operation in machine learning, scientific computing, and graph algorithms. In this paper, we investigate the space, time, and ...