Arpith C. Jacob  Arpith C. Jacob photo       

contact information

Research Staff Member
Thomas J. Watson Research Center, Yorktown Heights, NY USA

links



2017

Efficient Fork-Join on GPUs through Warp Specialization
Arpith Jacob, Alexandre Eichenberger, Hyojin Sung, Samuel Antao, Gheorghe-Teodor Bercea, Carlo Bertolli, Alexey Bataev, Tian Jin, Tong Chen, Zehra Sura, Georgios Rokos, Kevin O'Brien
IEEE International Conference on High Performance Computing, Data, and Analytics, 2017


2016

Offloading Support for OpenMP in Clang and LLVM
Antao, Samuel F and Bataev, Alexey and Jacob, Arpith C and Bercea, Gheorghe-Teodor and Eichenberger, Alexandre E and Rokos, Georgios and Martineau, Matt and Jin, Tian and Ozen, Guray and Sura, Zehra and others
Proceedings of the Third Workshop on LLVM Compiler Infrastructure in HPC, pp. 1--11, 2016
Abstract

Early Experiences Porting Three Applications to OpenMP 4.5
Karlin, Ian and Scogland, Tom and Jacob, Arpith C and Antao, Samuel F and Bercea, Gheorghe-Teodor and Bertolli, Carlo and de Supinski, Bronis R and Draeger, Erik W and Eichenberger, Alexandre E and Glosli, Jim and others
International Workshop on OpenMP, pp. 281--292, 2016
Abstract

From Describing to Prescribing Parallelism: Translating the SPEC ACCEL OpenACC Suite to OpenMP Target Directives
Juckeland, Guido and Hernandez, Oscar and Jacob, Arpith C and Neilson, Daniel and Larrea, Ver{\'o}nica G Vergara and Wienke, Sandra and Bobyr, Alexander and Brantley, William C and Chandrasekaran, Sunita and Colgrove, Mathew and others
International Conference on High Performance Computing, pp. 470--488, 2016
Abstract

Performance analysis and optimization of Clang's OpenMP 4.5 GPU support
Martineau, Matt and McIntosh-Smith, Simon and Bertolli, Carlo and Jacob, Arpith C and Antao, Samuel F and Eichenberger, Alexandre and Bercea, Gheorghe-Teodor and Chen, Tong and Jin, Tian and O'Brien, Kevin and others
Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), International Workshop on, pp. 54--64, 2016
Abstract

Scheduling for Clustered Vector Processors Near Memory
Arpith C. Jacob, Zehra Sura, Tong Chen, Carlo Bertolli, Samuel Antao, Olivier Sallenave, Kevin O'Brien, Ravi Nair, Jose R. Brunheroto, Philip Jacob, Bryan S. Rosenburg, Yoonho Park, Alexandre E. Eichenberger, Changhoan Kim
Technical Report, 2016
Abstract

Compiling for the Active Memory Cube
Arpith C. Jacob, Zehra Sura, Tong Chen, Carlo Bertolli, Samuel Antao, Olivier Sallenave, Kevin O Brien, Hans Jacobson, Ravi Nair, Jose R. Brunheroto, Philip Jacob, Bryan S. Rosenburg, Yoonho Park, Alexandre E. Eichenberger, Changhoan Kim
Technical Report, 2016
Abstract


2015

Towards Performance Portable GPU Programming with RAJA
Jacob, Arpith C and Antao, Samuel F and Sung, Hyojin and Eichenberger, Alexandre E and Bertolli, Carlo and Bercea, Gheorghe-Teodor and Chen, Tong and Sura, Zehra and Rokos, Georgios and O’Brien, Kevin
2015 - hpcport.alcf.anl.gov
Abstract

Exploiting fine-and coarse-grained parallelism using a directive based approach
Jacob, Arpith C and Nair, Ravi and Eichenberger, Alexandre E and Antao, Samuel F and Bertolli, Carlo and Chen, Tong and Sura, Zehra and O’Brien, Kevin and Wong, Michael
International Workshop on OpenMP, pp. 30--41, 2015
Abstract

Progressive codesign of an architecture and compiler using a proxy application
Jacob, Arpith and Nair, Ravi and Chen, Tong and Sura, Zehra and Kim, Changhoan and Bertolli, Carlo and Antao, Samuel and OBrien, Kevin
Computer Architecture and High Performance Computing (SBAC-PAD), 2015 27th International Symposium on, pp. 57--64
Abstract

Integrating GPU support for OpenMP offloading directives into Clang
Bertolli, Carlo and Antao, Samuel F and Bercea, Gheorghe-Teodor and Jacob, Arpith C and Eichenberger, Alexandre E and Chen, Tong and Sura, Zehra and Sung, Hyojin and Rokos, Georgios and Appelhans, David and others
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, pp. 5, 2015
Abstract

Performance analysis of openmp on a gpu using a coral proxy application
Bercea, Gheorghe-Teodor and Bertolli, Carlo and Antao, Samuel F and Jacob, Arpith C and Eichenberger, Alexandre E and Chen, Tong and Sura, Zehra and Sung, Hyojin and Rokos, Georgios and Appelhans, David and others
Proceedings of the 6th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computing Systems, pp. 2, 2015
Abstract

Data access optimization in a processing-in-memory system
Zehra Sura, Arpith Jacob, Tong Chen, Bryan Rosenburg, Olivier Sallenave, Carlo Bertolli, Samuel Antao, Jose Brunheroto, Yoonho Park, Kevin O'Brien, Ravi Nair
CF '15: Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

Active Memory Cube: A processing-in-memory architecture for exascale systems
Nair, R. ; Antao, S.F. ; Bertolli, C. ; Bose, P. ; Brunheroto, J.R. ; Chen, T. ; Cher, C. ; Costa, C.H.A. ; Doi, J. ; Evangelinos, C. ; Fleischer, B.M. ; Fox, T.W. ; Gallo, D.S. ; Grinberg, L. ; Gunnels, J.A. ; Jacob, A.C. ; Jacob, P. ; Jacobson, H.M. ; K
IBM Journal of Research and Development, 2015


2014

Coordinating GPU threads for OpenMP 4.0 in LLVM
Carlo Bertolli, Samuel F. Antao, Alexandre E. Eichenberger, Kevin O'Brien, Zehra Sura, Arpith C. Jacob, Tong Chen, Olivier Sallenave
LLVM-HPC '14: Proceedings of the 2014 LLVM Compiler Infrastructure in HPC


2010

Design space exploration of throughput-optimized arrays from recurrence abstractions
Jacob, Arpith C and Buhler, Jeremy D and Chamberlain, Roger D
Proceedings of the 18th annual ACM/SIGDA international symposium on Field programmable gate arrays, pp. 286--286, 2010
Abstract

Design of throughput-optimized arrays from recurrence abstractions
A C Jacob, J D Buhler, R D Chamberlain
Application-specific Systems Architectures and Processors (ASAP), 2010 21st IEEE International Conference on, pp. 133--140

Rapid rna folding: Analysis and acceleration of the zuker recurrence
A C Jacob, J D Buhler, R D Chamberlain
Field-Programmable Custom Computing Machines (FCCM), 2010 18th IEEE Annual International Symposium on, pp. 87--94


2009

Optimal runtime reconfiguration strategies for systolic arrays
A C Jacob, J D Buhler, R D Chamberlain
Field Programmable Logic and Applications, 2009, pp. 162--167


2008

Mercury BLASTP: Accelerating protein sequence alignment
A Jacob, J Lancaster, J Buhler, B Harris, R D Chamberlain
ACM Transactions on Reconfigurable Technology and Systems (TRETS) 1(2), 9, ACM, 2008

Accelerating Nussinov RNA secondary structure prediction with systolic arrays on FPGAs
A Jacob, J Buhler, R D Chamberlain
Application-Specific Systems, Architectures and Processors, 2008, pp. 191--196

Hardware technologies for high-performance data-intensive computing
M Gokhale, J Cohen, A Yoo, W M Miller, A Jacob, C Ulmer, R Pearce
Computer 41(4), 60--68, IEEE, 2008


2007

Language classification using n-grams accelerated by FPGA-based Bloom filters
A Jacob, M Gokhale
Proceedings of the 1st international workshop on High-performance reconfigurable computing technology and applications: held in conjunction with SC07, pp. 31--37, 2007

A banded Smith-Waterman FPGA accelerator for Mercury BLASTP
B Harris, A C Jacob, J M Lancaster, J Buhler, R D Chamberlain
Field Programmable Logic and Applications, 2007, pp. 765--769

Preliminary results in accelerating profile HMM search on FPGAs
A C Jacob, J M Lancaster, J D Buhler, R D Chamberlain
Parallel and Distributed Processing Symposium, 2007, pp. 1--8

FPGA-accelerated seed generation in Mercury BLASTP
A Jacob, J Lancaster, J Buhler, R D Chamberlain
Field-Programmable Custom Computing Machines, 2007, pp. 95--106

Mercury BLASTN: Faster DNA sequence comparison using a streaming hardware architecture
J D Buhler, J M Lancaster, A C Jacob, R D Chamberlain, others
Proc. of Reconfigurable Systems Summer Institute, 2007

Biosequence similarity search on the Mercury system
P Krishnamurthy, J Buhler, R Chamberlain, M Franklin, K Gyang, A Jacob, J Lancaster
The Journal of VLSI Signal Processing 49(1), 101--121, Springer, 2007


2006

Scalable softcore vector processor for biosequence applications
A C Jacob, B Harris, J Buhler, R Chamberlain, Y H Cho
Field-Programmable Custom Computing Machines, 2006, pp. 295--296