Entering the Petaflop Era: The Architecture and Performance of Roadrunner (2009)
Kevin J. Barker, Kei Davis, Adolfy Hoisie, Darren J. Kerbyson, Mike Lang, Scott Pakin, ...
precision) hybrid-architecture supercomputer developed by LANL and IBM. It contains 12,240 IBM PowerXCell 8i processors and 12,240 AMD Opteron cores in 3,060 compute nodes. Roadrunner is the first...
Parallel Architectures and Performance Team (2008)
Energy under contract W-7405-ENG-36. By acceptance of this article, the publisher recognizes that the U.S. Government retains a nonexclusive, royalty-free license to publish or reproduce the...
Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg
The interconnection network and its associated software libraries are critical components for high-performance cluster computers and supercomputers, Web-server farms, and network-attached storage....
VERIFYING LARGE-SCALE SYSTEM PERFORMANCE DURING INSTALLATION USING MODELING (2008)
Darren J. Kerbyson, Adolfy Hoisie, Harvey J. Wasserman
Abstract In this paper we describe an important use of predictive application performance modeling- the validation of measured performance during a new large-scale system installation. Using a...
Fabrizio Petrini, Adolfy Hoisie, Wu-chun Feng, Richard Graham
We present an initial performance evaluation of the Quadrics interconnection network (QsNET). We describe the main hardware and software features of QsNET of relevance to the system designer and to...
Scalable Parallel Electronic Structure Calculations on the IBM SP2 (2007)
Stefan Goedecker, Adolfy Hoisie
We have developed an highly efficient and scalable electronic structure code for parallel computers using message passing. The algorithm takes advantage of the natural parallelism in quantum...
Darren J. Kerbyson, Adolfy Hoisie, Scott Pakin, Fabrizio Petrini, Harvey J. Wasserman
Energy under contract W-7405-ENG-36. By acceptance of this article, the publisher recognizes that the U.S. Government retains a nonexclusive, royalty-free license to publish or reproduce the...
Static Allocation of Multirail Networks (2007)
Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits
Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault-tolerance of current high-performance clusters. This report...
Fabrizio Petrini, Eitan Frachtenberg, Adolfy Hoisie, Salvador Coll
Abstract. In this paper we present an in-depth description of the Quadrics interconnection network (QsNET) and an experimental performance evaluation on a 64-node AlphaServer cluster. We explore...
PREDICTIVE PERFORMANCE AND SCALABILITY (2007)
J. Kerbyson, Henry J. Alme, Adolfy Hoisie
Energy under contract W-7405-ENG-36. By acceptance of this article, the publisher recognizes that the U.S. Government retains a nonexclusive, royaltyfree license to publish or reproduce the published...
Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg
The interconnection network and its associated software libraries are critical components for high-performance cluster computers and supercomputers, Web-server farms, and network-attached storage....
Fabrizio Petrini, Adolfy Hoisie, Wu-chun Feng, Richard Graham
We present an initial performance evaluation of the Quadrics interconnection network (QsNET). We describe the main hardware and software features of QsNET of relevance to the system designer and to...
Hoisie, Adolfy, Miller, Barton P., Mohr, Bernd
Given the exponential increase in the complexity of modern parallel systems, parallel applications often fail to exploit the full power of the underlying hardware. At scale, it is not uncommon for...
Hoisie, Adolfy, Miller, Barton P., Mohr, Bernd
From 20th to 24th August 2007, the Dagstuhl Seminar 07341 ``Code Instrumentation and Modeling for Parallel Performance Analysis'' was held in the International Conference and Research Center (IBFI),...
Workshop on Performance Analysis and Distributed Computing (2004)
Getov, Vladimir, Gerndt, Michael, Hoisie, Adolfy, Malony, Allen, Miller, Barton
A performance evaluation on an Alpha EV7 processing node, Int (2004)
Darren J. Kerbyson, Darren J. Kerbyson, Adolfy Hoisie, Adolfy Hoisie, Scott Pakin, Scott Pakin, ...
In this paper we detail the performance of a new Alpha-Server node containing 16 Alpha EV7 CPUs. The EV7 processor is based on the EV68 processor core that is used in terascale systems at Los Alamos...
A Performance and Scalability Analysis of the BlueGene/L Architecture (2004)
Kei Davis, Adolfy Hoisie, Greg Johnson, Darren J. Kerbyson, Mike Lang, Scott Pakin, ...
Based on a set of measurements done on the 512-node 500MHz prototype and early results on a 2048 node 700MHz BlueGene/L machine at IBM Watson, we present a performance and scalability analysis of the...
Salvador Coll, José Duato, Francisco J. Mora, Fabrizio Petrini, Adolfy Hoisie
Abstract The efficent implementation of collective communication is a key factor to provide good performance and scalability of communication patterns that involve global data movement and global...
Salvador Coll, José Duato, Francisco J. Mora, Fabrizio Petrini, Adolfy Hoisie
Abstract The efficient implementation of collective communication is a key factor to provide good performance and scalability of communication patterns that involve global data movement and global...
Salvador Coll, Fabrizio Petrini, Eitan Frachtenberg, Adolfy Hoisie
A common trend in the design of large-scale clusters is to use a high-performance data network to integrate the processing nodes in a single parallel computer. In these systems the performance of the...
Performance Evaluation of the Quadrics Interconnection Network (2001)
Fabrizio Petrini, Adolfy Hoisie, Wu-chun Feng, Richard Graham
We present an initial performance evaluation of the Quadrics interconnection network (QsNET). We describe the main hardware and software features of QsNET of relevance to the system designer and to...
The Quadrics Network (QsNet): High-Performance Clustering Technology (2001)
Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg
The Quadrics interconnection network (QsNet) contributes two novel innovations to the field of highperformance interconnects: (1) integration of the virtualaddress spaces of individual nodes into a...
The Quadrics Network (QsNet): High-Performance Clustering Technology (2001)
Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg
The Quadrics interconnection network (QsNet) contributes two novel innovations to the field of highperformance interconnects: (1) integration of the virtualaddress spaces of individual nodes into a...
Using multirail networks in high-performance clusters (2001)
Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits
Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current high-performance parallel computers. In...
Using multirail networks in high-performance clusters (2001)
Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits
Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current highperformance clusters. We present an...
Predictive performance and scalability modeling of a large-scale application (2001)
J. Kerbyson, Hank J. Alme, Adolfy Hoisie, Fabrizio Petrini, Harvey J. Wasserman, Michael Gittings
distribution is unlimited.
The Quadrics Network (QsNet): High-Performance Clustering Technology (2001)
Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg
The Quadrics interconnection network (QsNet) contributes two novel innovations to the field of highperformance interconnects: (1) integration of the virtualaddress spaces of individual nodes into a...
Performance Evaluation of the Quadrics Interconnection Network (2001)
Fabrizio Petrini, Salvador Coll, Eitan Frachtenberg, Adolfy Hoisie
In this paper we present an in-depth description of the Quadrics interconnection network (QsNET) and an experimental performance evaluation on a 64-node Alphaserver cluster. We expose the performance...
Using multirail networks in high-performance clusters (2001)
Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits
Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault-tolerance of current high-performance clusters. We present and...
Using multirail networks in high-performance clusters (2001)
Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits
Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current high-performance parallel computers. In...
Performance Evaluation of the Quadrics Interconnection Network (2001)
Fabrizio Petrini, Salvador Coll, Eitan Frachtenberg, Adolfy Hoisie
In this paper we present an in-depth description of the Quadrics interconnection network (QsNET) and an experimental performance evaluation on a 64-node AlphaServer cluster. We explore several...
Harvey Wasserman, Adolfy Hoisie, Adolfy Hoisie, Olaf Lubeck, Olaf Lubeck
The authors develop a model for the parallel performance of algorithms that consist of concurrent, two-dimensional wavefronts implemented in a message-passing environment. The model, based on a LogGP...
POEMS: End-to-end performance design of large parallel adaptive computational systems (1998)
Ewa Deelman, Aditya Dube, Adolfy Hoisie, Yong Luo, Richard L. Oliver, David Sundaram-stukel, ...
The POEMS project is creating an environment for endto-end performance modeling of complex parallel and distributed systems, spanning the domains of application software, runtime and operating system...
The Quadrics Network: High-Performance Clustering Technology (1998)
Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg
The interconnection network and its associated software libraries are critical components for high-performance cluster computers and supercomputers, Web-server farms, and network-attached storage....
Scalable Parallel Electronic Structure Calculations on the IBM SP2 (1996)
Goedecker, Stefan, Hoisie, Adolfy
We have developed a highly efficient and scalable electronic structure code for parallel computers using message passing. The algorithm takes advantages of the natural parallelism in quantum...
Scalable Parallel Electronic Structure Calculations on the IBM SP2 (1996)
Goedecker, Stefan, Hoisie, Adolfy
We have developed a highly efficient and scalable electronic structure code for parallel computers using message passing. The algorithm takes advantages of the natural parallelism in quantum...
Block Factorizations on a Cluster of RS/6000s (1992)
This paper discusses optimizing computational linear algebra algorithms on a ring cluster of IBM RS/6000s. We offer the results of a block Cholesky factorization and the underlying BLAS to...