Conference paper
An architecture for near-data processing systems
Erik Vermij, Christoph Hagleitner, et al.
CF 2016
This paper presents a tensor-based algorithm that leverages a hardware accelerator for inferencing decision-tree-based machine learning models. The algorithm has been integrated in a public software library and is demonstrated on an IBM z16 server, using the Telum processor with the Integrated Accelerator for AI. We describe the architecture and implementation of the algorithm and present experimental results that demonstrate its superior runtime performance compared with popular CPU-based machine learning inference implementations.
Erik Vermij, Christoph Hagleitner, et al.
CF 2016
Olivier Maher, N. Harnack, et al.
DRC 2023
Sidney Tsai, Pritish Narayanan, et al.
ISCAS 2023
Max Bloomfield, Amogh Wasti, et al.
ITherm 2025