turboquant-py implements the TurboQuant and QJL vector quantization algorithms from Google Research (ICLR 2026 / AISTATS 2026). It compresses high-dimensional floating-point vectors to 1-4 bits per ...
Abstract: Efficient implementation of Depthwise Separable Convolution (DSC) inference on FPGAs remains a significant challenge. Conventional accelerators either suffer from suboptimal resource ...