DeepSeek unveils V3.2 AI models matching GPT-5 and Google Gemini 3.0 Pro performance at fraction of the cost, introducing breakthrough sparse attention and reasoning-with-tools capabilities in ...
Abstract: Distributed computations, such as distributed matrix multiplication, can be vulnerable to significant security issues, notably Byzantine attacks. These attacks may target either worker nodes ...
This project focuses on lossless compression techniques optimizing space, time, and energy for multiplications between binary (or ternary) matrix formats and real-valued vectors.
Abstract: Structured sparsity has been proposed as an efficient way to prune the complexity of Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. Accelerating ...