Architecture and performance of Devito, a system for automated stencil computation

Title: Architecture and performance of Devito, a system for automated stencil computation
Publication Type: Submitted
Year of Publication: 2018
Authors: Fabio Luporini, Michael Lange, Mathias Louboutin, Navjot Kukreja, Jan Huckelheim, Charles Yount, Philipp A. Witte, Paul H. J. Kelly, Gerard J. Gorman, Felix J. Herrmann
Keywords: compiler, finite difference method, performance optimization, private, stencil, structured grid, symbolic processing

Stencil computations are a key part of many high-performance computing applications, such as image processing, convolutional neural networks, and finite-difference solvers for partial differential equations. Devito is a framework that generates highly optimized code from symbolic equations expressed in Python, specialized in, but not limited to, affine (stencil) codes. The lowering process, from mathematical equations down to C++ code, is performed by the Devito compiler through a series of intermediate representations. Several performance optimizations are introduced, including advanced common sub-expression elimination, tiling, and parallelization. Some of these are obtained through well-established stencil optimizers integrated in the back-end of the Devito compiler. The architecture of the Devito compiler, as well as the performance optimizations that are applied when generating code, are presented. The effectiveness of these optimizations is demonstrated using operators drawn from seismic imaging applications.
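For readers unfamiliar with the terms "stencil" and "tiling" used in the abstract, the following is a minimal sketch in plain Python, not Devito code and not output of the Devito compiler. It assumes a 5-point Laplacian stencil for one explicit time step of the 2D heat equation; the function names (`step_naive`, `step_tiled`, `make_grid`), the coefficient `alpha`, and the tile size are hypothetical and chosen only for illustration. The same update is written once as a plain loop nest and once as a tiled (loop-blocked) loop nest, to show that tiling reorders the iteration space without changing the result.

```python
# Illustrative only: a hand-written 5-point stencil, not Devito-generated code.
# All names and parameters below are assumptions made for this sketch.

def make_grid(n):
    """Return an n x n grid of zeros with a single hot point in the centre."""
    u = [[0.0] * n for _ in range(n)]
    u[n // 2][n // 2] = 1.0
    return u


def update_point(u, i, j, alpha):
    """Apply the 5-point stencil at interior point (i, j)."""
    lap = u[i - 1][j] + u[i + 1][j] + u[i][j - 1] + u[i][j + 1] - 4.0 * u[i][j]
    return u[i][j] + alpha * lap


def step_naive(u, alpha):
    """One explicit diffusion step, sweeping the interior row by row."""
    ny, nx = len(u), len(u[0])
    out = [row[:] for row in u]  # boundary values are carried over unchanged
    for i in range(1, ny - 1):
        for j in range(1, nx - 1):
            out[i][j] = update_point(u, i, j, alpha)
    return out


def step_tiled(u, alpha, tile=4):
    """Same update, with the interior split into tile x tile blocks."""
    ny, nx = len(u), len(u[0])
    out = [row[:] for row in u]
    for ii in range(1, ny - 1, tile):          # loop over tiles
        for jj in range(1, nx - 1, tile):
            for i in range(ii, min(ii + tile, ny - 1)):      # loop within a tile
                for j in range(jj, min(jj + tile, nx - 1)):
                    out[i][j] = update_point(u, i, j, alpha)
    return out


if __name__ == "__main__":
    grid = make_grid(9)
    # Both traversal orders read only from `grid`, so they must agree exactly.
    print(step_naive(grid, 0.1) == step_tiled(grid, 0.1, tile=4))
```

Each output point depends only on the input grid, so the points may be visited in any order. That independence is what makes loop transformations such as tiling and parallelization legal for this class of computation; a real code generator would additionally choose tile sizes to suit the cache hierarchy.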


Submitted to SIAM Journal on Scientific Computing on July 9, 2018.

Citation Key: luporini2018aap