Supercomputing Publications



2016

X10 and APGAS at Petascale
Olivier Tardieu, Benjamin Herta, David Cunningham, David Grove, Prabhanjan Kambadur, Vijay Saraswat, Avraham Shinnar, Mikio Takeuchi, Mandana Vaziri, Wei Zhang
ACM Transactions on Parallel Computing (TOPC) 2(4), 25:1--25:32, 2016


2015

High performance computing enabling exhaustive analysis of higher order single nucleotide polymorphism interaction in Genome Wide Association Studies
Benjamin Goudey, Mani Abedini, John L Hopper, Michael Inouye, Enes Makalic, Daniel F Schmidt, John Wagner, Zeyu Zhou, Justin Zobel, Matthias Reumann
Health Information Science and Systems 3(1), 1, BioMed Central, 2015

System-wide power management control via clock distribution network
Paul W. Coteus, Alan Gara, Thomas M. Gooding, Rudolf A. Haring, Gerard V. Kopcsay, Thomas A. Liebsch, Don D. Reed
US patent 9037892


2014

Efficient Task Placement and Routing in Dragonfly Networks
B. Prisacari, G. Rodriguez, P. Heidelberger, D. Chen, C. Minkenberg, T. Hoefler
Proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'14), ACM, 2014

Performance Implications of Remote-only Load Balancing Under Adversarial Traffic in Dragonflies
Bogdan Prisacari, German Rodriguez, Marina Garcia, Enrique Vallejo, Ramon Beivide, Cyriel Minkenberg
Proceedings of the 8th International Workshop on Interconnection Network Architecture: On-Chip, Multi-Chip, pp. 5:1--5:4, ACM, 2014

GLB: Lifeline-based Global Load Balancing Library in X10
Wei Zhang, Olivier Tardieu, David Grove, Benjamin Herta, Tomio Kamada, Vijay Saraswat, Mikio Takeuchi
Proceedings of the First Workshop on Parallel Programming for Analytics Applications, pp. 31--40, ACM, 2014

Kernel Methods Match Deep Neural Networks on TIMIT
Po-Sen Huang, Haim Avron, Tara Sainath, Vikas Sindhwani, Bhuvana Ramabhadran
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
Best Student Paper Award

Resilient X10: Efficient failure-aware programming
David Cunningham, David Grove, Benjamin Herta, Arun Iyengar, Kiyokuni Kawachiya, Hiroki Murata, Vijay Saraswat, Mikio Takeuchi, Olivier Tardieu
Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming, pp. 67--80, ACM, 2014

Randomizing task placement and route selection do not randomize traffic (enough)
Bogdan Prisacari, German Rodriguez, Ana Jokanovic, Cyriel Minkenberg
Design Automation for Embedded Systems, 1-12, Springer US, 2014

Testing and operating a multiprocessor chip with processor redundancy
Ralph E. Bellofatto, Steven M. Douskey, Rudolf A. Haring, Moyra K. McManus, Martin Ohmacht, Dietmar Schmunkamp, Krishnan Sugavanam, Bryan J. Weatherford
US patent 8868975

Low Power, Massively Parallel, Energy Efficient Supercomputers
IBM BlueGene team
The Green Computing Book: Tackling Energy Efficiency at Large Scale, Chapman & Hall/CRC press (Taylor & Francis), 2014


2013

Fast Pattern-specific Routing for Fat Tree Networks
Bogdan Prisacari, German Rodriguez, Cyriel Minkenberg, Torsten Hoefler
ACM Trans. Archit. Code Optim. 10(4), 36:1--36:25, ACM, 2013

Randomizing task placement does not randomize traffic (enough)
Ana Jokanovic, Bogdan Prisacari, German Rodriguez, Cyriel Minkenberg
Proceedings of the 2013 Interconnection Network Architecture: On-Chip, Multi-Chip, pp. 9--12, ACM

Generalized Hierarchical All-to-All Exchange Patterns
Bogdan Prisacari, German Rodriguez, Cyriel Minkenberg
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on, pp. 537--547

Bandwidth-optimal All-to-all Exchanges in Fat Tree Networks
Bogdan Prisacari, German Rodriguez, Cyriel Minkenberg, Torsten Hoefler
Proceedings of the 27th ACM International Conference on Supercomputing, pp. 139--148, ACM, 2013

Application-level power and performance characterization and optimization on IBM Blue Gene/Q systems
Ramon Bertran, Yutaka Sugawara, Hans Jacobson, Alper Buyuktosunoglu, Pradip Bose
IBM Journal of Research and Development 57(1/2), 4--1, IBM, 2013

Reproducibility in a multiprocessor system
R A Bellofatto, D Chen, P W Coteus, N A Eisley, A Gara, T M Gooding, R A Haring, P Heidelberger, G V Kopcsay, T A Liebsch, M Ohmacht, D D Reed, R M Senger, B Steinmacher-Burow, Y Sugawara
US Patent 8595554

Blue Gene/Q: by co-design
IBM Blue Gene Team
Computer Science - Research and Development 28(2-3), 127-135, Springer-Verlag, 2013

Global synchronization of parallel processors using clock pulse width modulation
D Chen, M R Ellavsky, R L Franke, A Gara, T M Gooding, R A Haring, M J Jeanson, G V Kopcsay, T A Liebsch, D Littrell, M Ohmacht, D D Reed, B E Schenck, R A Swetz
US Patent 8412974

Design of the IBM Blue Gene/Q Compute chip
IBM Blue Gene team
IBM Journal of Research and Development 57(1/2), 1:1-1:13, 2013

The Blue Gene Project
IBM Blue Gene Team
IBM Journal of Research and Development 57(1/2), 0:1-6, 2013

Design for low power and power management in IBM Blue Gene/Q
K. Sugavanam, C.Y. Cher, J.A. Gunnels, R.A. Haring, P. Heidelberger, H.M. Jacobson, M.K. McManus, D.P. Paulsen, D.L. Satterfield, Y. Sugawara, R. Walkup
IBM Journal of Research and Development 57(1/2), 3:1-11, IBM, 2013

Modeling, validation, and co-design of IBM Blue Gene/Q: Tools and examples
IBM Blue Gene Team
IBM Journal of Research and Development 57(1/2), 6:1--6:12, 2013

Blue Gene/Q: Sequoia and Mira
P. Vranas (chapter editor) and co-authors from IBM/ANL/LLNL
Contemporary High Performance Computing: From Petascale toward Exascale, Chapman & Hall/CRC press (Taylor & Francis), 2013

AI-Ckpt: Leveraging Memory Access Patterns for Adaptive Asynchronous Incremental Checkpointing
Bogdan Nicolae, Franck Cappello
HPDC '13: 22nd International ACM Symposium on High-Performance Parallel and Distributed Computing, pp. 155-166, 2013

BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds
Bogdan Nicolae, Franck Cappello
J. Parallel Distrib. Comput. 73(5), 698-711, Academic Press, Inc., 2013

Faster Subset Selection for Matrices and Applications
Haim Avron, Christos Boutsidis
SIAM Journal on Matrix Analysis and Applications 34(4), 2013
Also available on arxiv: http://arxiv.org/abs/1201.0127

Solving Hermitian Positive Definite Systems Using Indefinite Incomplete Factorizations
H Avron, A Gupta, S Toledo
Journal of Computational and Applied Mathematics 243, 126-138, Elsevier B.V., 2013
Preliminary version appeared as IBM Research Report (W1107-050)

Towards Scalable Checkpoint Restart: A Collective Inline Memory Contents Deduplication Proposal
Bogdan Nicolae
IPDPS '13: The 27th IEEE International Parallel and Distributed Processing Symposium, pp. 19-28, 2013


2012

Performance implications of deadlock avoidance techniques in torus networks
Bogdan Prisacari, German Rodriguez, Cyriel Minkenberg, Ramon Beivide Palacio
High Performance Switching and Routing (HPSR), 2012 IEEE 13th International Conference on, pp. 115--121

Method and apparatus to debug an integrated circuit chip via synchronous clock stop and scan
Ralph E. Bellofatto, Matthew R. Ellavsky, Alan G. Gara, Mark E. Giampapa, Thomas M. Gooding, Rudolf A. Haring, Lance G. Hehenberger, Martin Ohmacht
US Patent 8140925

The IBM Blue Gene/Q Compute Chip
R.A. Haring, M. Ohmacht, T.W. Fox, M.K. Gschwind, D.L. Satterfield, K. Sugavanam, P.W. Coteus, P. Heidelberger, M.A. Blumrich, R.W. Wisniewski, A. Gara, G.L.-T. Chiu, P.A. Boyle, N.H. Christ, Changhoan Kim
Micro, IEEE 32(2), 48-60, IEEE, 2012

Scalable Reed-Solomon-based Reliable Local Storage for HPC Applications on IaaS Clouds
Leonardo Bautista Gomez, Bogdan Nicolae, Naoya Maruyama, Franck Cappello, Satoshi Matsuoka
Euro-Par '12: 18th International Euro-Par Conference on Parallel Processing, pp. 313-324, 2012

Alleviating Scalability Issues of Checkpointing Protocols
Rolf Riesen, Kurt Ferreira, Dilma Da Silva, Pierre Lemarinier, Dorian Arnold, Patrick G. Bridges
SC'12: Proceedings of the 2012 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE Computer Society

Detection and Correction of Silent Data Corruption for Large-Scale High-Performance Computing
David Fiala, Frank Mueller, Christian Engelmann, Rolf Riesen, Kurt Ferreira, Ron Brightwell
SC'12: Proceedings of the 2012 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE Computer Society

The Viability of Using Compression to Decrease Message Log Sizes
Kurt Ferreira, Rolf Riesen, Dorian Arnold, Dewan Ibtesham, Ron Brightwell
Euro-Par 2012 Workshops, Springer, Heidelberg
5th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids in conjunction with the 18th International European Conference on Parallel and Distributed Computing (Euro-Par 2012), Rhodes Island, G

Does partial replication pay off?
Jon Stearley, Kurt Ferreira, David Robinson, Dorian Arnold, Patrick Bridges, Jim Laros, Kevin Pedretti, Rolf Riesen
Fault Tolerance at Extreme Scale (FTXS) workshop in association with the 42nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2012

Simulating application resilience at exascale
Rolf Riesen, Kurt Ferreira, Maria Ruiz Varela, Michela Taufer, Arun Rodrigues
Euro-Par 2011 Workshops, Part II, pp. 221--230, Springer, Heidelberg, 2012
4th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids in conjunction with the 17th International European Conference on Parallel and Distributed Computing (Euro-Par 2011), Bordeaux, France, August

Topology Configuration in Hybrid EPS/OCS Interconnects
Konstantinos Christodoulopoulos, Donal O'Mahony, Marco Ruffini, and Kostas Katrinis
Proceedings of the International European Conference on Parallel and Distributed Computing (Euro-Par 2012 Distinguished Paper Award)

A high-productivity task-based programming model for clusters
Enric Tejedor, Montse Farreras, David Grove, Rosa M. Badia, Gheorghe Almasi, Jesus Labarta
Concurrency and Computation: Practice and Experience 24(18), 2421--2448, John Wiley & Sons, Ltd, 2012

SatX10: a scalable plug & play parallel SAT framework
Bard Bloom, David Grove, Benjamin Herta, Ashish Sabharwal, Horst Samulowitz, Vijay Saraswat
Proceedings of the 15th international conference on Theory and Applications of Satisfiability Testing, pp. 463--468, Springer-Verlag, 2012

Managing data-movement for effective shared-memory parallelization of out-of-core sparse solvers
Haim Avron, Anshul Gupta
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC'12), pp. 102:1--102:11, IEEE Computer Society Press, 2012

Data-driven Fault Tolerance for Work Stealing Computations
W. Ma, S. Krishnamoorthy
International Conference on Supercomputing (ICS), 2012

Collective Operations in a File System Based Execution Model
P Shinde, E V Hensbergen
US Patent 20,120,060,018


2011

A performance model for X10 applications: what's going on under the hood?
D. Grove, O. Tardieu, D. Cunningham, B. Herta, I. Peshansky, V. Saraswat
Proceedings of the 2011 ACM SIGPLAN X10 Workshop, pp. 1:1--1:8

Power throttling of collections of computing elements
Ralph E. Bellofatto, Paul W. Coteus, Paul G. Crumley, Alan G. Gara, Mark E. Giampapa, Thomas M. Gooding, Rudolf A. Haring, Mark G. Megerian, Martin Ohmacht, Don D. Reed, Richard A. Swetz, Todd Takken
US patent 8,001,401

The Blue Gene/Q Compute Chip
R Haring, others
Proceedings of the 23rd IEEE International Symposium on High Performance Chips (HotChips), 2011

BlobCR: Efficient Checkpoint-Restart for HPC Applications on IaaS Clouds using Virtual Disk Image Snapshots
Bogdan Nicolae, Franck Cappello
SC '11: 24th International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 34:1-34:12, 2011

Efficient support for MPI-I/O atomicity based on versioning
Viet-Trung Tran, Bogdan Nicolae, Gabriel Antoniu, Luc Bouge
CCGRID '11: 11th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, pp. 514-523, 2011

Encyclopedia of Parallel Computing

Springer, 2011

Job Scheduling
Arthur B. Maccabe, Rolf Riesen
Encyclopedia of Parallel Computing, pp. 997-1002, 2011

Operating System Strategies
Arthur B. Maccabe, Rolf Riesen
Encyclopedia of Parallel Computing, pp. 1391-1401, 2011

Single System Image
Arthur B. Maccabe, Rolf Riesen
Encyclopedia of Parallel Computing, pp. 1820-1827, 2011

Evaluating the Viability of Process Replication Reliability for Exascale Systems
Kurt Ferreira, Rolf Riesen, Patrick G. Bridges, Dorian Arnold, Jon Stearley, James H. Laros III, Ron Oldfield, Kevin Pedretti, Ron Brightwell
Proceedings of the 2011 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE Computer Society

Libhashckpt: Hash-based Incremental Checkpointing Using GPU's
Dorian Arnold, Kurt Ferreira
Recent Advances in the Message Passing Interface: 18th European MPI Users' Group Meeting, EuroMPI 2011, Santorini, Greece, September 2011. Proceedings, Springer Verlag

Cache injection for parallel applications
Arthur B. Maccabe, Edgar A. León
Proceedings of the 20th international symposium on High performance distributed computing, pp. 15--26, ACM, 2011

A framework for architecture-level power, area, and thermal simulation and its application to network-on-chip design exploration
M. Hsieh, K. Thompson, W. Song, A.F. Rodrigues, R. Riesen
SIGMETRICS Perform. Eval. Rev. 38, 63--68, ACM, 2011


Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix
Haim Avron, Sivan Toledo
J. ACM 58(8), 1-34, ACM, 2011

FOX: A Fault-oblivious Extreme-scale Execution Environment
Ron Minnich, Curtis L. Janssen, Sriram Krishnamoorthy, Maya Gokhale, P. Sadayappan, Jonathan Appavoo, Eric Van Hensbergen, Jim Mckie, Charles Forsyth
Proceedings of ASCR Exascale Research Kickoff, Department of Energy Office of Science, 2011

Fault Oblivious eXascale Whitepaper
Ron Minnich, Curtis L. Janssen, Sriram Krishnamoorthy, Andres Marquez, Maya Gokhale, P. Sadayappan, Jonathan Appavoo, Eric Van Hensbergen, Jim Mckie
International Workshop on Runtime and Operating Systems for Supercomputers, ACM/SIGARCH, 2011

Basic Resource Aggregation System Infrastructure Layer
Eric Van Hensbergen, Pravin Shinde, Noah Evans
International Workshop on Runtime and Operating Systems for Supercomputers, ACM/SIGARCH, 2011

Poster: FOX: a fault-oblivious extreme scale execution environment
R G Minnich, C L Janssen, S Krishnamoorthy, A Marquez, W Ma, M Gokhale, P Sadayappan, E V Hensbergen, J Appavoo, J Mckie
Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion, pp. 91--92

Lifeline-based global load balancing
Vijay A Saraswat, Prabhanjan Kambadur, Sreedhar Kodali, David Grove, Sriram Krishnamoorthy
Proceedings of the 16th ACM symposium on Principles and Practice of Parallel Programming (PPoPP), pp. 201--212, ACM, 2011

X10 as a parallel language for scientific computation: practice and experience
Josh Milthorpe, V Ganesh, Alistair P Rendell, David Grove
IEEE International Parallel and Distributed Processing Symposium, pp. 1080--1088, IEEE, 2011

Communication Optimizations for Distributed-Memory X10 Programs
Rajkishore Barik, Jisheng Zhao, David Grove, Igor Peshansky, Zoran Budimlic, Vivek Sarkar
Proceedings of the 25th IEEE International Parallel and Distributed Processing Symposium, IEEE, 2011

HARE: Final Report
Eric Van Hensbergen, Ron Minnich, Jim McKie, Charles Forsyth
RC25241, IBM Corp, 2011


2010

The Asynchronous Partitioned Global Address Space Model
V. Saraswat, G. Almasi, G. Bikshandi, C. Cascaval, D. Cunningham, D. Grove, S. Kodali, I. Peshansky, O. Tardieu
AMP'10: Proceedings of The First Workshop on Advances in Message Passing, 2010

Ultrascalable petaflop parallel supercomputer
Matthias A Blumrich, Dong Chen, George Chiu, Thomas M Cipolla, Paul W Coteus, Alan G Gara, Mark E Giampapa, Shawn Hall, Rudolf A Haring, Philip Heidelberger, others
Patent US7761687

Gathering Entropy at Large Scale with HAVEGE and BlobSeer
Alin Suciu, Bogdan Nicolae, Gabriel Antoniu, Zsolt Istvan, Istvan Szakats
Automat. Comput. Appl. Math. 19, 3-11, MEDIAMIRA Science Publisher, 2010

Transparent Redundant Computing with MPI
Rolf Riesen, Ron Brightwell, Kurt Ferreira
Recent Advances in the Message Passing Interface: 17th European MPI Users' Group Meeting, EuroMPI 2010, Stuttgart, Germany, September 2010. Proceedings, pp. 208--218, Springer Verlag

A Framework for Architecture-Level Power, Area and Thermal Simulation and its Application to Network-on-chip Design Exploration
R Riesen, M Hsieh
1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS 10) held as part of SC10, 2010

See Applications Run and Throughput Jump: The Case for Redundant Computing in HPC
Rolf Riesen, Kurt Ferreira, Jon Stearley
1st International Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS 2010) in conjunction with The 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2010)

The Structural Simulation Toolkit
B. Jacob, A.F. Rodrigues
1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS 10) held as part of SC10, 2010

Combinatorial Preconditioners
S Toledo, H Avron
Uwe Naumann, Olaf Schenk, eds.: Combinatorial Scientific Computing, Computational Science series, Chapman & Hall / CRC Press, 2010

Blendenpik: Supercharging LAPACK's Least-Squares Solver
Haim Avron, Petar Maymounkov, Sivan Toledo
SIAM Journal on Scientific Computing 32(3), 1217-1236, SIAM, 2010

IOMMU: Strategies for Mitigating the IOTLB Bottleneck
N Amit, M Ben-Yehuda, B A Yassour
WIOSCA '10: The Sixth Annual Workshop on the Interaction between Operating Systems and Computer Architecture, 2010

Isostack---highly efficient network processing on dedicated cores
L Shalev, E Borovik, J Satran, M Ben-Yehuda
USENIX ATC '10: The 2010 USENIX Annual Technical Conference, pp. 5--5

Grid broker selection strategies using aggregated resource information
I Rodero, F Guim, J Corbalan, L Fong, S M Sadjadi
Future Generation Computer Systems 26(1), 72--86, Elsevier, 2010

Reducing task creation and termination overhead in explicitly parallel programs
J Zhao, J Shirako, V K Nandivada, V Sarkar
Proceedings of the 19th international conference on Parallel architectures and compilation techniques, pp. 169--180, 2010


Block Storage Listener for Detecting File-Level Intrusions
M Allalouf, M Ben-Yehuda, J Satran, I Segall
Mass Storage Systems and Technologies (MSST), 2010 IEEE 26th Symposium on, pp. 1--12

The Turtles Project: Design and Implementation of Nested Virtualization
M Ben-Yehuda, M D Day, Z Dubitzky, M Factor, N Har'El, A Gordon, A Liguori, O Wasserman, B A Yassour
IBM Research Report H-0282, 2010

On the DMA Mapping Problem in Direct Device Assignment
B A Yassour, M Ben-Yehuda, O Wasserman
Proceedings of the 3rd Annual Haifa Experimental Systems Conference, pp. 1--12, 2010

Plugging the hypervisor abstraction leaks caused by virtual networking
Alex Landau, David Hadas, Muli Ben-Yehuda
Proceedings of the 3rd Annual Haifa Experimental Systems Conference, pp. 16, ACM, 2010

Power-efficient, reliable microprocessor architectures: modeling and design methods
Pradip Bose, Alper Buyuktosunoglu, Chen-Yong Cher, John A. Darringer, Meeta S. Gupta, Hendrik Hamann, Hans Jacobson, Prabhakar N. Kudva, Eren Kursun, Niti Madan, Indira Nair, Jude A. Rivers, Jeonghee Shin, Alan J. Weger, Victor Zyuban
Proceedings of the 20th Great Lakes Symposium on VLSI (GLSVLSI), 2010

A Wire-Speed Power (TM) Processor: 2.3GHz 45nm SOI with 16 Cores and 64 Threads
C. Johnson, D. H. Allen, J. Brown, S. Vanderwiel, R. Hoover, H. Achilles, C-Y. Cher, G. A. May, H. Franke, J. Xenedis, C. Basso
2010 IEEE International Solid-State Circuits Conference (ISSCC)

Performance and power evaluation of an in-line accelerator
Alejandro Rico, Jeff H Derby, Robert K Montoye, Timothy H Heil, Chen-Yong Cher, Pradip Bose
Proceedings of the 7th ACM international conference on Computing frontiers, pp. 81--82, 2010

Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework
Louis-Noel Pouchet, Uday Bondhugula, Cedric Bastoul, Albert Cohen, J Ramanujam, P Sadayappan
Supercomputing (SC) 2010

Molecular Dynamics with Multiple Time Scales: How to Avoid Pitfalls
J A Morrone, R Zhou, BJ Berne
Journal of Chemical Theory and Computation, 1465, ACS Publications, 2010

Inside Cover: Size Dependence of Nanoscale Confinement on Chiral Transformation (Chem. Eur. J. 22/2010)
Z Wang, C Wang, P Xiu, W Qi, Y Tu, Y Shen, R Zhou, R Zhang, H Fang
Chemistry-A European Journal 16(22), 6398, John Wiley & Sons, 2010

Design Exploration of Hybrid cache architecture with disparate memory technologies
X. Wu, J. Li, L. Zhang, E. Speight, R. Rajamony and Y. Xie
ACM Transactions on Architecture and Code Optimization (TACO), ACM, 2010

Enigma: Architectural Support and Operating System Support for Reducing the Impact of Address Translation
Lixin Zhang, Evan Speight, Ram Rajamony, Jiang Lin
International Conference on Supercomputing (ICS), ACM/SIGARCH, 2010

Providing a cloud network infrastructure on a supercomputer
Jonathan Appavoo, Amos Waterland, Dilma Da Silva, Volkmar Uhlig, Bryan Rosenburg, Eric Van Hensbergen, Jan Stoess, Robert Wisniewski, Udo Steinberg
1st Workshop on Scientific Cloud Computing, pp. 385--394, ACM, 2010

VirtFS: A virtualization aware File System pass-through
V Jujjuri, E Van Hensbergen, A Liguori, B Pulavarty
Ottawa Linux Symposium, 2010

Poster: XCPU3
Pravin Shinde, Eric Van Hensbergen
Eurosys, 2010

Poster: PUSH, a Dataflow Shell
N Evans, E Van Hensbergen
Eurosys, 2010

Statistically regulating program behavior via mainstream computing
M W Stephenson, R Rangan, E Yashchin, E Van Hensbergen
Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization, pp. 238--247, 2010

Multi-Resolution Modeling of Biological Macromolecules
Flores S, Bernauer J, Huang X, Zhou R, Shin S.
Pacific Symposium BioComputing, pp. 200-204, 2010

A Model for Fusion and Code Motion in an Automatic Parallelizing Compiler
Uday Bondhugula, Oktay Gunluk, Sanjeeb Dash, Lakshminarayanan Renganarayana
International Conference on Parallel Architectures and Compilation Techniques (PACT), 2010

Believe it or Not! Multicore CPUs can Match GPUs for FLOP-intensive Applications!
Rajesh Bordawekar, Uday Bondhugula, Ravi Rao
Research Report RC24982, IBM TJ Watson Research Center, Yorktown Heights, New York, 2010


Compiling for Reduced Bit-Width Queue Processors
Arquimedes Canedo, Ben A Abderazek, Masahiro Sowa
Journal of Signal Processing Systems 59(1), 45--55, Springer, 2010

Natural Instruction Level Parallelism-aware Compiler for High-Performance QueueCore Processor Architecture
Ben A Abderazek, Masashi Masuda, Arquimedes Canedo, Kenichi Kuroda
Journal of Supercomputing, Springer, 2010

Managing Faults for Distributed Workflows over Grids
O Ezenwoye, M B Blake, G Dasgupta, S M Sadjadi, S Kalayci, L L Fong
IEEE Internet Computing 14(2), 84--88, IEEE, 2010


Automatic Parallelization of Simulink Applications
Arquimedes Canedo, Takeo Yoshizawa, Hideaki Komatsu
International Symposium on Code Generation and Optimization 2010

RC2 - A Living Lab for Cloud Computing
Glenn Ammons, Vasanth Bala, Stefan Berger, Dilma M. Da Silva, Jim Doran, Frank Franco, Alexei Karve, Herb Lee, James A. Lindeman, Ajay Mohindra, Bob Oesterlin, Giovanni Pacifici, Dimitrios Pendarakis, Darrell Reimer, Kyung Dong Ryu, Mariusz Sabath, Xiaol
IBM Research Technical Report RC24947, 2010

Power and Thermal Characterization of POWER6 System
Victor Jimenez, Francisco J. Cazorla, Roberto Gioiosa, Eren Kursun, Canturk Isci, Chen-Yong Cher, Alper Buyuktosunoglu, Pradip Bose, and Mateo Valero
International Conference on Parallel Architectures and Compilation Techniques (PACT), Vienna, Austria, Sep. 2010

Automatic Creation of Tile Size Selection Models
Tomofumi Yuki, Lakshminarayanan Renganarayana, Sanjay Rajopadhye, Charles Anderson, Alexandre Eichenberger and Kevin O'Brien
International Symposium on Code Generation and Optimization (CGO), 2010

Skewed Pipelining for Parallel Simulink Simulations
Arquimedes Canedo, Takeo Yoshizawa, Hideaki Komatsu
Design, Automation and Test in Europe 2010

Observations on Tuning a Java Enterprise Application for Performance and Scalability
Erik Altman, Matthew Arnold, Rajesh Bordawekar, Robert Delmonico, Nick Mitchell, Peter F. Sweeney
IBM Journal of Research and Development 54(5), 2, IBM, 2010

A Unified Execution Model for Cloud Computing
Eric van Hensbergen, Noah Evans, Phillip Stanley-Marbell
Large Scale Distributed Systems and Middleware, (LADIS 2009), Co-located with the 22nd ACM Symposium on Operating Systems Principles (SOSP 2009), pp. 12--17, ACM, 2010

Inferring Arbitrary Distributions for Data and Computation
Soham S. Chakraborty, V K Nandivada
SPLASH Onward!, ACM, 2010


2009

PFunc: modern task parallelism for modern high performance computing
Prabhanjan Kambadur, Anshul Gupta, Amol Ghoting, Haim Avron, Andrew Lumsdaine
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC'09), pp. 43:1--43:11, ACM, 2009

Using perturbed QR factorizations to solve linear least-squares problems
H Avron, E Ng, S Toledo
SIAM Journal on Matrix Analysis and Applications 31(2), 674--693, SIAM, 2009

Combinatorial preconditioners for scalar elliptic finite-element problems
Haim Avron, Doron Chen, Gil Shklarski, Sivan Toledo
SIAM Journal on Matrix Analysis and Applications 31(2), 694--720, Society for Industrial and Applied Mathematics, 2009

Broker Selection Strategies in Interoperable Grid Systems
I Rodero, F Guim, J Corbalan, L Fong, S M Sadjadi
2009 International Conference on Parallel Processing, pp. 180--187

Scalability Analysis of Job Scheduling Using Virtual Nodes
N Bobroff, R Coppinger, L Fong, S Seelam, J Xu
Job Scheduling Strategies for Parallel Processing (JSSPP), pp. 190--206, 2009


Task decomposition for adaptive data staging in workflows for distributed environments
O Ezenwoye, B Viswanathan, S M Sadjadi, L Fong, G Dasgupta, S Kalayci
Proceedings of the 21st International Conference on Software Engineering and Knowledge Engineering, pp. 16--19, 2009

An experimental system for grid meta-broker evaluation
Y Liu, N Bobroff, L Fong, S Seelam, D Villegas, S M Sadjadi, I Rodero
Proceedings of the 1st ACM workshop on Large-Scale system and application performance, pp. 11--18, 2009

Sparse matrix factorization on massively parallel computers
A Gupta, S Koric, T George
SC 2009, pp. 1, ACM, 2009

Power-Performance evaluation of read-write aware hybrid caches
X. Wu, J. Li, E. Speight, L. Zhang and Y. Xie
Design, Automation & Test in Europe (DATE), 2009

Hybrid Cache Architecture with Disparate Memory Technologies
Xiaoxia Wu, Jian Li, Lixin Zhang, Evan Speight, Ram Rajamony, and Yuan Xie
36th International Symposium on Computer Architecture (ISCA), IEEE/ACM, 2009

Efficient, portable implementation of asynchronous multi-place programs
Ganesh Bikshandi, Jose G Castanos, Sreedhar B Kodali, V Krishna Nandivada, Igor Peshansky, Vijay A Saraswat, Sayantan Sur, Pradeep Varma, Tong Wen
ACM Symposium on Principles and Practice of Parallel Programming (PPoPP), pp. 271--282, ACM, 2009

Chunking parallel loops in the presence of synchronization
J Shirako, J M Zhao, V K Nandivada, V N Sarkar
Proceedings of the 23rd international conference on Supercomputing, pp. 181--192, 2009

Brief announcement: PUSH, a DISC shell
N P Evans, E Van Hensbergen
Proceedings of the 28th ACM symposium on Principles of distributed computing, pp. 306--307, 2009

Experiences with Hybrid Clusters
D Jamsek, E Van Hensbergen
Workshop on Parallel Programming on Accelerator Clusters (PPAC). held in conjunction with IEEE Cluster, 2009

Service Oriented File Systems
E Van Hensbergen, N Evans, P Stanley-Marbell
RC24788, IBM, 2009

Efficient Algorithms for Global Snapshots in Large Distributed Systems
R Garg, V K Garg, Y Sabharwal
IEEE Transactions on Parallel and Distributed Systems 21(5), 620-630, Published by the IEEE Computer Society, 2009

A Cluster Overlap Measure for Comparison of Activations in fMRI Studies
G Cecchi, R Garg, A Rao
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2009, pp. 1018--1025, Springer

HPCC RandomAccess benchmark for next generation supercomputers
V Aggarwal, Y Sabharwal, R Garg, P Heidelberger
2009 - computer.org, IEEE

Prediction and interpretation of distributed neural activity with sparse models
Melissa K Carroll, Guillermo A Cecchi, Irina Rish, Rahul Garg, A Ravishankar Rao
NeuroImage 44(1), 112--122, Elsevier, 2009

Gradient Descent with Sparsification: An iterative algorithm for sparse recovery with restricted isometry property
R Garg, R Khandekar
Proceedings of the 26th Annual International Conference on Machine Learning, pp. 337--344, 2009

Proc. 11th Intl. Conf. on Coordination Models and Languages
J Field, V T Vasconcelos, eds.
Proc. 11th Intl. Conf. on Coordination Models and Languages, Springer-Verlag, 2009

Compiler Support for Code Size Reduction using a Queue-based Processor
Arquimedes Canedo, Ben Abderazek, Masahiro Sowa
Lecture Notes in Computer Science, pp. 269--285, Springer Berlin, 2009

Design and implementation of a queue compiler
Arquimedes Canedo, Ben A Abderazek, Masahiro Sowa
Microprocess. Microsyst. 33(2), 129--138, Elsevier Science Publishers B. V., 2009

Efficient Compilation for Queue Size Constrained Queue Processors
Arquimedes Canedo, Ben A Abderazek, Masahiro Sowa
Parallel Comput. 35(4), 213--225, Elsevier Science Publishers B. V., 2009

Software and Hardware Design Issues for Low Complexity High Performance Processor Architecture
Masashi Masuda, Abderazek Ben Abdallah, Arquimedes Canedo
ICPP Workshops, pp. 558-565, 2009

Water-mediated signal multiplication with Y-shaped carbon nanotubes
Y Tu, P Xiu, R Wan, J Hu, R Zhou, H Fang
Proceedings of the National Academy of Sciences 106(43), 18120, National Acad Sciences, 2009

Recognition Mechanism of siRNA by Viral p19 Suppressor of RNA Silencing: A Molecular Dynamics Study
Z Xia, Z Zhu, J Zhu, R Zhou
Biophysical journal 96(5), 1761--1769, Elsevier, 2009



Dewetting and Hydrophobic Interaction in Physical and Biological Systems
B J Berne, J D Weeks, R Zhou
Annual Review of Physical Chemistry 60, 85--103, Annual Reviews, 2009

Urea’s action on hydrophobic interactions
R Zangi, R Zhou, BJ Berne
J. Am. Chem. Soc 131(4), 1535--1541, 2009

System Resilience at Extreme Scale -- White Paper
T El-Ghazawi, A Fox, B F Godfrey, M D Cray, A Hoisie, J Plank, J Simons, E N M Elnozahy, A IBM
U.S. Department of Defense, Defense Advanced Research Projects Agency (DARPA), 2009


Post-copy live migration of virtual machines
Michael R Hines, Umesh Deshpande, Kartik Gopalan
ACM SIGOPS operating systems review 43(3), 14--26, ACM, 2009

Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning
M R Hines, K Gopalan
Proceedings of the 2009 ACM SIGPLAN/SIGOPS international conference on Virtual execution environments, pp. 51--60

Executing Computer-Intensive Database User-Defined Programs on an Attached High-Performance Parallel Computer
R Natarajan, M Kochte
WO Patent WO/2009/038,911, 2009

The Parallel Machine Learning (PML) Framework and the Transform Regression Algorithm
S Asur, A Ghoting, R Natarajan, E Pednault
2009 - domino.watson.ibm.com

Advanced millimeter-wave technologies: antennas, packaging and circuits
D. Liu, U. Pfeiffer
Wiley, 2009

Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors
Q Lu, C Alias, U Bondhugula, T Henretty, S Krishnamoorthy, J Ramanujam, A Rountev, P Sadayappan, Y Chen, H Lin, others
Proceedings of the 18th International Conference on Parallel Architectures and Compilation Techniques, 2009

Hybrid Iterative and Model-Driven Optimization in the Polyhedral Model
L N Pouchet, U Bondhugula, C Bastoul, A Cohen, R Ramanujam, P Sadayappan
INRIA Research Report 6269, INRIA Saclay, France, 2009

Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors
M Baskaran, N Vydyanathan, Uday Bondhugula, J Ramanujam, A Rountev, P Sadayappan
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 219--228, 2009

Compact Multi-Dimensional Kernel Extraction for Register Tiling
L Renganarayana, Uday Bondhugula, Salem Dersavi, Alexandre E. Eichenberger, Kevin O'Brien
Proceedings of the 22nd International Conference on High Performance Networking and Computing (SC), pp. 1--12, 2009

Temperature Variation Characterization and Thermal Management of Multicore Architectures
Eren Kursun, Chen Yong Cher
IEEE Micro 29(1), Top Picks Special Issue on Most Significant and Relevant Papers in Computer Architecture, 116--126, IEEE Computer Society, 2009

On modeling some essential dynamics of the subprime mortgage crisis
L. An, D. Subramanian, A. King
27th International Conference of the System Dynamics Society, 2009

Requirements for systemic risk management in the financial sector: invited talk
D.N. Dillenberger, A.J. King, F.N. Parr
Proceedings of the 2nd Workshop on High Performance Computational Finance, pp. 3, 2009

Blue Eyes: Scalable and reliable system management for cloud computing
Sukhyun Song, Kyung Dong Ryu, Dilma Da Silva
Fifth International Workshop on System Management Techniques, Processes, and Services (SMTPS), co-located with IPDPS, pp. 1--8, IEEE Computer Society, 2009

Kittyhawk: Enabling cooperation and competition in a global, shared computational system
Jonathan Appavoo, Volkmar Uhlig, Amos Waterland, Bryan Rosenburg, Dilma Da Silva, Jose Moreira
IBM Journal of Research and Development 53(4), 2009

Portably solving file races with hardness amplification
Dan Tsafrir, Tomer Hertz, David Wagner, Dilma Da Silva
Trans. Storage 4(3), 1--30, ACM, 2009

Out-of-band detection of boot-sequence termination events
N Parush, D Pelleg, M Ben-Yehuda, P Ta-Shma
Proceedings of the 6th international conference on Autonomic computing, pp. 71--72, 2009


Small Retinal Blood Vessel Tracking Using an Adaptive Filter
S H Chang, D S Shim, L Gong, X Hu
Journal of Imaging Science and Technology 53, 020507, 2009

A Parallel Point Matching Algorithm for Landmark Based Image Registration Using Multicore Platform
L Yang, L Gong, H Zhang, J L Nosher, D J Foran
Proceedings of the 15th International Euro-Par Conference on Parallel Processing, pp. 947, 2009

Climbing the Plateau
C A Halverson, J Carver
PLATEAU 09, 2009

METHOD FOR MAPPING PRIVACY POLICIES TO CLASSIFICATION LABELS
C A Brodie, R H Guski, C N Karat, J Karat, P K Malkin



RTTS: Towards Enterprise-level Real-Time Speech Transcription and Translation Services
J M Huerta, C Wu, A Sakrajda, S Caskey, E E Jan, A Faisman, S Ben-David, W Liu, A Lee, O Stewart, others
Tenth Annual Conference of the International Speech Communication Association, 2009

Designing crowdsourcing community for the enterprise
O Stewart, J M Huerta, M Sader
Proceedings of the ACM SIGKDD Workshop on Human Computation, pp. 50--53, 2009

Handbook of software architecture
G Booch
Website and blog, http://www.handbookofsoftwarearchitecture.com/index.jsp, 2009

A Multicore Based Parallel Image Registration Method
Lin Yang, Leiguang Gong, Hong Zhang, John Nosher, David Foran
31st Annual International Conference of the IEEE EMBS, 2009

Accelerating 3D nonrigid registration using the Cell Broadband Engine processor
J. Rohrer, L. Gong
IBM Journal of Research and Development 53(5), IBM, 2009

Exploring an evolutionary medical analytic wallet
Aaron K Baughman, Mweene Monze, Christian Eggenberger, Peter Malkin, Neil Katz, Chris Dawson, Barry Graham
GECCO '09: Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference, pp. 1987--1992, ACM, 2009

Mobile Applications for the Next Billions: A Social Computing Perspective
C Danis, M Bailey, J Christensen, J Ellis, T Erickson, R Farrell, W A Kellogg
ics.uci.edu, 2009


True value: assessing and optimizing the cost of computing at the data center level
J Karidis, JE Moreira, Jaime H Moreno
6th ACM Conference on Computing Frontiers, 2009

Free energy simulations reveal a double mutant avian H5N1 virus hemagglutinin with altered receptor binding specificity
P Das, J Li, A K Royyuru, R Zhou
Journal of Computational Chemistry 30(11), 1654--1663, Wiley Online Library, 2009

Software hardening: a research agenda
T Wrigstad, P Eugster, J Field, N Nystrom, J Vitek
Proceedings for the 1st workshop on Script to Program Evolution, pp. 58--70, 2009

Reactors: A data-oriented synchronous/asynchronous programming model for distributed applications
J Field, M C Marinescu, C Stefansen
Coordination Models and Languages, pp. 76--95, Elsevier Science Publishers Ltd., 2009


2008

Overview of the IBM Blue Gene/P project
IBM Blue Gene Team
IBM Journal of Research and Development 52(1/2), 199--220, IBM, 2008

Parallel unsymmetric-pattern multifrontal sparse LU with column preordering
Haim Avron, Gil Shklarski, Sivan Toledo
ACM Trans. Math. Softw. 34, 8:1--8:31, ACM, 2008

Static detection of place locality and elimination of runtime checks
S Agarwal, R K Barik, V Nandivada, R Shyamasundar, P Varma
Asian Symposium on Programming Languages and Systems, 53--74, Springer, 2008

METHOD OF FORMING POLYMER FEATURES BY DIRECTED SELF-ASSEMBLY OF BLOCK COPOLYMERS
J. Cheng, W.D. Hinsberg, H.C. Kim, C.T. Rettner, D.P. Sanders
US Patent App. 12/061,693


2007

The Blue Gene/L supercomputer: a hardware and software story
Jose E Moreira, Valentina Salapura, George Almasi, Charles Archer, Ralph Bellofatto, Peter Bergner, Randy Bickford, Mathias Blumrich, José R Brunheroto, Arthur A Bright, others
International Journal of Parallel Programming 35(3), 181--206, Kluwer Academic Publishers-Plenum Publishers, 2007


2006

A holistic approach to system reliability in Blue Gene
M Blumrich, D Chen, GL-T Chiu, T Cipolla, P Coteus, P Crumley, A Gara, ME Giampapa, S Hall, RA Haring, others
Innovative Architecture for Future Generation High Performance Processors and Systems, 2006. IWIA'06. International Workshop on, pp. 3--12


2005

Overview of the Blue Gene System Architecture
A Gara, MA Blumrich, D Chen, GLT Chiu, P Coteus, ME Giampapa, RA Haring, P Heidelberger, D Hoenicke, GV Kopcsay, others
IBM Journal of Research and Development 49(2/3), 195--212, IBM, 2005

Blue Gene/L compute chip: Memory and Ethernet subsystem
Martin Ohmacht, Reinaldo A Bergamaschi, Subhrajit Bhattacharya, Alan Gara, ME Giampapa, Balaji Gopalsamy, Ruud A Haring, Dirk Hoenicke, David J Krolak, James A Marcella, others
IBM Journal of Research and Development 49(2/3), 255--264, IBM, 2005

Creating the BlueGene/L supercomputer from low-power SoC ASICs
Arthur A Bright, Matthew R Ellavsky, Alan Gara, Ruud A Haring, Gerard V Kopcsay, Robert F Lembach, James A Marcella, Martin Ohmacht, Valentina Salapura
Solid-State Circuits Conference, 2005. Digest of Technical Papers. ISSCC. 2005 IEEE International, pp. 188--189, IEEE

Blue Gene/L advanced diagnostics environment
ME Giampapa, Ralph Bellofatto, Matthias A Blumrich, Dong Chen, Marc Boris Dombrowa, Alan Gara, Ruud A Haring, Philip Heidelberger, Dirk Hoenicke, Gerard V Kopcsay, others
IBM Journal of Research and Development 49(2/3), 319--331, IBM, 2005

Blue Gene/L compute chip: Synthesis, timing, and physical design
AA Bright, RA Haring, MB Dombrowa, M. Ohmacht, D. Hoenicke, S. Singh, JA Marcella, RF Lembach, SM Douskey, MR Ellavsky, C.G. Zoellin, A. Gara
IBM Journal of Research and Development 49(2/3), 277--287, IBM, 2005

Verification strategy for the Blue Gene/L chip
ME Wazlowski, NR Adiga, DK Beece, R. Bellofatto, MA Blumrich, D. Chen, MB Dombrowa, A. Gara, ME Giampapa, RA Haring, P. Heidelberger, D. Hoenicke, B.J. Nathanson, M. Ohmacht, R. Sharrar, S. Singh, B.D. Steinmacher-Burow, R.B. Tremaine
IBM Journal of Research and Development 49(2/3), 303--318, IBM, 2005

Power and performance optimization at the system level
Valentina Salapura, Randy Bickford, Matthias Blumrich, Arthur A Bright, Dong Chen, Paul Coteus, Alan Gara, Mark Giampapa, Michael Gschwind, Manish Gupta, others
Proceedings of the 2nd conference on Computing frontiers, pp. 125--132, ACM, 2005

Blue Gene/L compute chip: Control, test, and bring-up infrastructure
RA Haring, R. Bellofatto, AA Bright, PG Crumley, MB Dombrowa, SM Douskey, MR Ellavsky, B. Gopalsamy, D. Hoenicke, TA Liebsch, J.A. Marcella, M. Ohmacht
IBM Journal of Research and Development 49(2/3), 289--301, IBM, 2005


2004

The eDRAM based L3-Cache of the BlueGene/L Supercomputer Processor Node
M Ohmacht, D Hoenicke, R Haring, A Gara
Proceedings 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD04), pp. 18--22, IEEE Computer Society, 2004


2002

An overview of the BlueGene/L supercomputer
N.R. Adiga, G. Almasi, G.S. Almasi, Y. Aridor, R. Barik, D. Beece, R. Bellofatto, G. Bhanot, R. Bickford, M. Blumrich, others
Supercomputing, ACM/IEEE 2002 Conference, pp. 60--60, IEEE Computer Society

Cellular supercomputing with system-on-a-chip
G Almasi, GS Almasi, D Beece, R Bellofatto, G Bhanot, R Bickford, M Blumrich, AA Bright, J Brunheroto, C Cascaval, others
Solid-State Circuits Conference, 2002. Digest of Technical Papers. ISSCC. 2002 IEEE International, pp. 196--197

Blue Gene/L, a system-on-a-chip
G. Almasi, G.S. Almasi, D. Beece, R. Bellofatto, G. Bhanot, R. Bickford, M. Blumrich, A.A. Bright, J. Brunheroto, C. Cascaval, others
Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on, pp. 349--350, IEEE Computer Society


2001

Blue Gene: a vision for protein science using a petaflop supercomputer
Frances Allen, G Almasi, Wanda Andreoni, D Beece, Bruce J. Berne, A Bright, Jose Brunheroto, Calin Cascaval, J Castanos, Paul Coteus
IBM Systems Journal 40(2), 310--327, IBM, 2001


1994

Compiler Transformations for High-Performance Computing
David F. Bacon, Susan L. Graham, Oliver J. Sharp
Computing Surveys 26(4), 345--420, ACM, 1994