This repo aims at providing a collection of efficient Triton-based implementations for state-of-the-art linear attention models. All implementations are written purely in PyTorch and Triton, making ...
Students had to compile a portfolio and discuss it in an interview to demonstrate their understanding and ability to apply the knowledge.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results