ATiM ISCA 2025

ATiM Autotuning Tensor Programs for Processing-in-DRAM

Posted by Treaseven on July 8, 2025

Motivation

UPMEM现在软件栈只提供有限高级抽象的低级编程模型,要求大量开发和调优支持 DPU间和DPU内有大量与性能相关的巨大参数搜索空间 UPMEM由于未优化的分支导致其低利用率

Reference

ATiM: Autotuning Tensor Programs for Processing-in-DRAM