Ahan Gupta, Yueming Yuan, Devansh Jain, Yuhao Ge, David Aponte, Yanqi Zhou, Charith Mendis. SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention. OOPSLA 2025.
Ahan Gupta, Yueming Yuan, Yanqi Zhou, Charith Mendis. FLuRKA: Fast and accurate unified Low-Rank & Kernel Attention.