Sparse Matrix Operations

MiniMax M2.5 Uses 10B Active Parameters per Token, Aiming for Cheaper Always-On Agents

MiniMax M2.5 hits about 80% on Sweetbench and runs near 100 tokens per second, helping teams deploy faster models on tighter budgets.

IEEE

ScanNow: A Scan Window-Based Sparse Matrix Multiplication Accelerator Design

Sparse matrix-matrix multiplication (SpMM) is a crucial kernel in various applications, including sparse deep neural networks [1]–[6], graph analytics [7], triangle counting [8], and linear algebra ...

IEEE

Hardware-Efficient Optical Matrix Processor via Low-Rank Approximation

Abstract: We propose a hardware-efficient optical matrix processor based on low-rank approximation, utilizing narrowband filters of microring resonators (MRRs) and broadband Mach-Zehnder ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

MiniMax M2.5 Uses 10B Active Parameters per Token, Aiming for Cheaper Always-On Agents

ScanNow: A Scan Window-Based Sparse Matrix Multiplication Accelerator Design

Hardware-Efficient Optical Matrix Processor via Low-Rank Approximation

Trending now