2026 arXiv TriAxialKV: Toward Extreme Low-Precision KV-Cache Quantization for Agentic Inference Tasks Hanzhang Shen, Haoran Wu, Yiren Zhao, and Robert Mullins* arXiv preprint arXiv:2605.17170, 2026 arXiv PDF 2025 arXiv ConCuR: Conciseness Makes State-of-the-Art Kernel Generation Lingcheng Kong, Jiateng Wei, Hanzhang Shen, and Huan Wang* arXiv preprint arXiv:2510.07356, 2025 arXiv PDF Website