Gheorghe-Teodor Bercea  Gheorghe-Teodor Bercea photo       

contact information

Compiler Development and Optimization
Thomas J. Watson Research Center, Yorktown Heights, NY USA

links



2016

A structure-exploiting numbering algorithm for finite elements on extruded meshes, and its performance evaluation in Firedrake
Bercea, Gheorghe-Teodor and McRae, Andrew TT and Ham, David A and Mitchell, Lawrence and Rathgeber, Florian and Nardi, Luigi and Luporini, Fabio and Kelly, Paul HJ
arXiv preprint arXiv:1604.05937, 2016
Abstract

A numbering algorithm for finite element on extruded meshes which avoids the unstructured mesh penalty
Bercea, Gheorghe-Teodor and McRae, Andrew TT and Ham, David A and Mitchell, Lawrence and Rathgeber, Florian and Nardi, Luigi and Luporini, Fabio and Kelly, Paul HJ
preprint, arxiv. org/abs/1604.05937 [cs. MS], 2016
Abstract

Firedrake: automating the finite element method by composing abstractions
Rathgeber, Florian and Ham, David A and Mitchell, Lawrence and Lange, Michael and Luporini, Fabio and McRae, Andrew TT and Bercea, Gheorghe-Teodor and Markall, Graham R and Kelly, Paul HJ
ACM Transactions on Mathematical Software (TOMS) 43(3), 24, ACM, 2016
Abstract


2015

Integrating GPU support for OpenMP offloading directives into Clang
Bertolli, Carlo and Antao, Samuel F and Bercea, Gheorghe-Teodor and Jacob, Arpith C and Eichenberger, Alexandre E and Chen, Tong and Sura, Zehra and Sung, Hyojin and Rokos, Georgios and Appelhans, David and others
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, pp. 5, 2015
Abstract

Cross-loop optimization of arithmetic intensity for finite element local assembly
Luporini, Fabio and Varbanescu, Ana Lucia and Rathgeber, Florian and Bercea, Gheorghe-Teodor and Ramanujam, J and Ham, David A and Kelly, Paul HJ
ACM Transactions on Architecture and Code Optimization (TACO) 11(4), 57, ACM, 2015
Abstract

Performance analysis of openmp on a gpu using a coral proxy application
Bercea, Gheorghe-Teodor and Bertolli, Carlo and Antao, Samuel F and Jacob, Arpith C and Eichenberger, Alexandre E and Chen, Tong and Sura, Zehra and Sung, Hyojin and Rokos, Georgios and Appelhans, David and others
Proceedings of the 6th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computing Systems, pp. 2, 2015
Abstract


2014

Generalizing run-time tiling with the loop chain abstraction
Strout, Michelle Mills and Luporini, Fabio and Krieger, Christopher D and Bertolli, Carlo and Bercea, Gheorghe-Teodor and Olschanowsky, Catherine and Ramanujam, J and Kelly, Paul HJ
Parallel and Distributed Processing Symposium, 2014 IEEE 28th International, pp. 1136--1145
Abstract

COFFEE: an Optimizing Compiler for Finite Element Local Assembly
Luporini, Fabio and Varbanescu, Ana Lucia and Rathgeber, Florian and Bercea, Gheorghe-Teodor and Ramanujam, J and Ham, David A and Kelly, Paul HJ
arXiv preprint arXiv:1407.0904, 2014
Abstract