Eitan Frachtenberg

An Idealistic Neuro-PPM Branch Predictor (2008)

Ram Srinivasan, Eitan Frachtenberg, Scott Pakin, Jeanine Cook

Historically, Markovian predictors have been very successful in predicting branch outcomes. In this work we propose a hybrid scheme that employs two Prediction by Partial Matching (PPM) Markovian...

Workshop on Scheduling and Resource Management for Cluster Computing (in conjunction with the ICPP01). LA-UR 01-3025 Abstract Gang Scheduling with Lightweight User-Level Communication (2008)

Eitan Frachtenberg, Fabrizio Petrini, Salvador Coll, Wu Chun Feng

In this paper we explore the performance of gang scheduling on a cluster using the Quadrics interconnection network. On such a cluster, the scheduler can take advantage of the unique capabilities of...

Neuro-PPM Branch Prediction (2008)

Ram Srinivasan, Eitan Frachtenberg, Olaf Lubeck, Scott Pakin, Jeanine Cook

Historically, Markovian predictors have been very successful in predicting branch outcomes. In this work we propose a hybrid scheme that employs two Prediction by Partial Matching (PPM) Markovian...

On the Feasibility of Incremental Checkpointing for Scientific Computing (2008)

José Carlos, Sancho Fabrizio, Petrini Greg, Johnson Juan Fernández, Eitan Frachtenberg

In the near future large-scale parallel computers will feature hundreds of thousands of processing nodes. In such systems, fault tolerance is critical as failures will occur very often. Checkpointing...

Scalable Resource Management in High-Performance Computers (2008)

Eitan Frachtenberg, Fabrizio Petrini, Juan Fern, Salvador Coll

Clusters and other loosely-coupled systems are becoming ubiquitous and larger

Advance Access published on May 25, 2006 doi:10.1093/comjnl/bxl020 An Abstract Interface for System Software on Large-Scale Clusters (2008)

Juan Fernández, Eitan Frachtenberg, Fabrizio Petrini, José-carlos Sancho

Scalable management of distributed resources is one of the major challenges when building largescale clusters for high-performance computing. This task includes transparent fault tolerance, efficient...

Seeking Optimal Search Strategy and Result Representation in BoW (2008)

Maayan Zhitomirsky-geffet, Eitan Frachtenberg, Yair Wiseman, Dror G. Feitelson

One of the biggest concerns of modern information retrieval systems is reducing the user effort required for manual traversal and filtering of long matching document lists. In this paper we propose...

Abstract Pitfalls in Parallel Job Scheduling Evaluation (2008)

Eitan Frachtenberg

There are many choices to make when evaluating the performance of a complex system. In the context of parallel job scheduling, one must decide what workload to use and what measurements to take....

Static Allocation of Multirail Networks (2007)

Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits

Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault-tolerance of current high-performance clusters. This report...

Scalable Resource Management in High Performance Computers (2007)

Eitan Frachtenberg, Fabrizio Petrini, Juan Fern, Salvador Coll

Clusters of workstations have emerged as an important platform for building cost-effective, scalable and highly-available computers. Although many hardware solutions are available today, the largest...

2 (2007)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fernandez

Fine-grained parallel applications require all their processes to run simultaneously on distinct processors to achieve good efficiency. This is typically accomplished by space slicing, wherein nodes...

© 2003 Kluwer Academic Publishers. Manufactured in The Netherlands. Performance Evaluation of the Quadrics Interconnection Network (2007)

Fabrizio Petrini, Eitan Frachtenberg, Adolfy Hoisie, Salvador Coll

Abstract. In this paper we present an in-depth description of the Quadrics interconnection network (QsNET) and an experimental performance evaluation on a 64-node AlphaServer cluster. We explore...

1 (2007)

Eitan Frachtenberg, Dror G. Feitelson, Juan Fernandez, Fabrizio Petrini

Jobs that run on parallel systems that use gang scheduling for multiprogramming may interact with each other in various ways. These interactions are affected by system parameters such as the level of...

2 (2007)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fern

Fine-grained parallel applications require all their processes to run simultaneously on distinct processors to make good progress. This is typically achieved by space slicing with variable...

An Abstract Interface for System Software on Large-Scale Clusters (2006)

Fernández, Juan, Frachtenberg, Eitan, Petrini, Fabrizio, Sancho, José-Carlos

Scalable management of distributed resources is one of the major challenges when building large-scale clusters for high-performance computing. This task includes transparent fault tolerance,...

An Abstract Interface for System Software on Large-Scale Clusters (2006)

Fernández, Juan, Frachtenberg, Eitan, Petrini, Fabrizio, Sancho, José-Carlos

Scalable management of distributed resources is one of the major challenges when building large-scale clusters for high-performance computing. This task includes transparent fault tolerance,...

NIC-based Reduction Algorithms for Large-scale Clusters (2005)

Fabrizio Petrini, Adam Moody, Juan Fernandez, Eitan Frachtenberg, Dhabaleswar K. Panda

Abstract — Efficient algorithms for reduction operations across a group of processes are crucial for good performance in many large-scale, parallel scientific applications. While previous...

● High Inter Process Communication Requires synchronization needs co- (2005)

Eitan Frachtenberg, Dror G. Feitelson, Senior Member, Fabrizio Petrini, Student Member

scheduling ● Two common Approaches: ➢ batch scheduling ➔ wherein nodes are dedicated for the duration of the run ➢ gang scheduling ➔ wherein time slicing is coordinated across processors...

Adaptive parallel job scheduling with flexible coscheduling (2005)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fern

Many scientific and high-performance computing applications consist of multiple processes running on different processors that communicate frequently. Because of their synchronization needs, these...

Adaptive parallel job scheduling with flexible coscheduling (2005)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fernández

Many scientific and high-performance computing applications consist of multiple processes running on different processors that communicate frequently. Because of their synchronization needs, these...

Adaptive parallel job scheduling with flexible coscheduling (2005)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fern

Many scientific and high-performance computing applications consist of multiple processes running on different processors that communicate frequently. These applications can suffer severe performance...

Designing Parallel Operating Systems via Parallel Programming (2004)

Eitan Frachtenberg, Kei Davis, Fabrizio Petrini, Juan Fern, José Carlos Sancho

Abstract. Ever-increasing demand for computing capability is driving the construction of ever-larger computer clusters, soon to be reaching tens of thousands of processors. Many functionalities of...

Architectural Support for System Software on Large-Scale Clusters (2004)

Juan Fernández, Eitan Frachtenberg, Fabrizio Petrini, Kei Davis, Jose Carlos Sancho

Scalable management of distributed resources is one of the major challenges in deployment of large-scale clusters. Man-agement includes transparent fault tolerance, efficient allocation of resources,...

Flexible CoScheduling: Mitigating Load Imbalance and Improving Utilization of Heterogeneous Resources (2003)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fern

Fine-grained parallel applications require all their processes to run simultaneously on distinct processors to achieve good efficiency. This is typically achieved by space slicing with variable...

Flexible CoScheduling: Mitigating Load Imbalance and Improving Utilization of Heterogeneous Resources (2003)

Eitan Frachtenberg, Dror G. Feitelson, Fabrizio Petrini, Juan Fern

Fine-grained parallel applications require all their processes to run simultaneously on distinct processors to achieve good efficiency. This is typically accomplished by space slicing, wherein nodes...

Parallel job scheduling under dynamic workloads (2003)

Eitan Frachtenberg, Dror G. Feitelson, Juan Fern, Fabrizio Petrini

Jobs that run on parallel systems that use gang scheduling for multiprogramming may interact with each other in various ways. These interactions are affected by system parameters such as the level of...

Scalable Collective Communication on the ASCI Q Machine (2003)

Fabrizio Petrini Juan, Juan Fernandez, Eitan Frachtenberg, Salvador Coll

Scientific codes spend a considerable part of their run time executing collective communication operations. Such operations can also be critical for efficient resource management in large-scale...

Scalable collective communication on the ASCI Q machine (2003)

Fabrizio Petrini, Juan Fernandez, Eitan Frachtenberg, Salvador Coll

Scientific codes spend a considerable part of their run time executing collective communication operations. Such operations can also be critical for efficient resource management in large-scale...

STORM: Lightning-Fast Resource Management (2002)

Eitan Frachtenberg, Fabrizio Petrini, Juan Fernandez, Scott Pakin, Salvador Coll

Although workstation clusters are a common platform for high-performance computing (HPC), they remain more difficult to manage than sequential systems or even symmetric multiprocessors. Furthermore,...

Performance Evaluation of I/O Traffic and Placement of I/O Nodes on a High Performance Network (2002)

Salvador Coll, Fabrizio Petrini, Eitan Frachtenberg, Adolfy Hoisie

A common trend in the design of large-scale clusters is to use a high-performance data network to integrate the processing nodes in a single parallel computer. In these systems the performance of the...

Scalable Resource Management in High-Performance Computers (2002)

Eitan Frachtenberg, Fabrizio Petrini, Juan Fern, Salvador Coll

Clusters and other loosely-coupled systems are becoming ubiquitous and larger

The Quadrics Network (QsNet): High-Performance Clustering Technology (2001)

Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg

The Quadrics interconnection network (QsNet) contributes two novel innovations to the field of highperformance interconnects: (1) integration of the virtualaddress spaces of individual nodes into a...

The Quadrics Network (QsNet): High-Performance Clustering Technology (2001)

Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg

The Quadrics interconnection network (QsNet) contributes two novel innovations to the field of highperformance interconnects: (1) integration of the virtualaddress spaces of individual nodes into a...

Using multirail networks in high-performance clusters (2001)

Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits

Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current high-performance parallel computers. In...

Using multirail networks in high-performance clusters (2001)

Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits

Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current highperformance clusters. We present an...

Gang scheduling with lightweight user-level communication (2001)

Eitan Frachtenberg, Fabrizio Petrini, Salvador Coll, Wu-chun Feng

In this paper, we explore the performance of gang scheduling on a cluster using the Quadrics interconnection network. In such a cluster, the scheduler can take advantage of this network’s unique...

The Quadrics Network (QsNet): High-Performance Clustering Technology (2001)

Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg

The Quadrics interconnection network (QsNet) contributes two novel innovations to the field of highperformance interconnects: (1) integration of the virtualaddress spaces of individual nodes into a...

Performance Evaluation of the Quadrics Interconnection Network (2001)

Fabrizio Petrini, Salvador Coll, Eitan Frachtenberg, Adolfy Hoisie

In this paper we present an in-depth description of the Quadrics interconnection network (QsNET) and an experimental performance evaluation on a 64-node Alphaserver cluster. We expose the performance...

Gang scheduling with lightweight user-level communication (2001)

Eitan Frachtenberg, Fabrizio Petrini, Salvador Coll, Wu-chun Feng

In this paper, we explore the performance of gang scheduling on a cluster using the Quadrics interconnection network. In such a cluster, the scheduler can take advantage of this network's unique...

Using multirail networks in high-performance clusters (2001)

Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits

Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault-tolerance of current high-performance clusters. We present and...

Using multirail networks in high-performance clusters (2001)

Salvador Coll, Eitan Frachtenberg, Fabrizio Petrini, Adolfy Hoisie, Leonid Gurvits

Using multiple independent networks (also known as rails) is an emerging technique to overcome bandwidth limitations and enhance fault tolerance of current high-performance parallel computers. In...

Performance Evaluation of the Quadrics Interconnection Network (2001)

Fabrizio Petrini, Salvador Coll, Eitan Frachtenberg, Adolfy Hoisie

In this paper we present an in-depth description of the Quadrics interconnection network (QsNET) and an experimental performance evaluation on a 64-node AlphaServer cluster. We explore several...

The Quadrics Network: High-Performance Clustering Technology (1998)

Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, Eitan Frachtenberg

The interconnection network and its associated software libraries are critical components for high-performance cluster computers and supercomputers, Web-server farms, and network-attached storage....