Effects of Packet Pacing for MPI Programs in a Grid Environment (2008)
Ryousei Takano, Motohiko Matsuda, Tomohiro Kudoh, Yuetsu Kodama, Fumihiro Okazaki, Yutaka Ishikawa, ...
Abstract — Improving the performance of TCP communication is the key to the successful deployment of MPI programs in a Grid environment in which multiple clusters are connected through high...
OBP Lib: An Object-Oriented Parallel Library and its Preliminary Performance (2007)
Motohiko Matsuda, Mitsuhisa Sato, Yutaka Ishikawa
OBP Lib (Object-Oriented Building Blocks in Parallel Programming) is a collection of parallel object classes including parallel array and matrix/vector classes in C+ + , which is useful for combining...
Communication Performance of Gigabit LAN Workstation Cluster RWC/WSC (2007)
Kazuto Kubota, Yoshio Tanaka, Motohiko Matsuda, Mitsuhisa Sato, Hiroshi Tezuka, Atsushi Hori, ...
This paper reports on the communication performance of the RWC/WSC. It is then compared with massively parallel machines: the CM5 and the Paragon. For comparison, specific collective communications...
Design and evaluation of precise software pacing mechanisms for fast long-distance networks (2005)
Ryousei Takano, Tomohiro Kudoh, Yuetsu Kodama, Motohiko Matsuda, Hiroshi Tezuka, Yutaka Ishikawa
Abstract — In this paper, we propose precise software pacing mechanisms at end nodes of communication paths. First, we propose an Inter Packet Gap (IPG) control mechanism, which inserts gap packets...
Motohiko Matsuda, Tomohiro Kudoh, Hiroshi Tazuka, Yutaka Ishikawa
Many implementations of an MPI communication library are realized on top of the socket interface which is based on connection-oriented stream communication. This paper addresses a mismatch between...
Evaluating of MPI Implementations on Grid-connected Clusters using an Emulated (2003)
Motohiko Matsuda, Tomohiro Kudoh
The MPICH-SCore high performance communication library for cluster computing is integrated into the MPICH-G2 library in order to adapt PC clusters to a Grid environment. The integrated library is...
Parallelization of Sparse Cholesky Factorization on an SMP Cluster (1999)
Shigehisa Satoh Kazuhiro, Kazuhiro Kusano, Yoshio Tanaka, Motohiko Matsuda, Mitsuhisa Sato
. In this paper, we present parallel implementations of the sparse Cholesky factorization kernel in the SPLASH-2 programs to evaluate performance of a Pentium Pro based SMP cluster. Solaris threads...
Network Interface Active Messages for Low Overhead Communication on SMP PC Clusters (1999)
Motohiko Matsuda, Yoshio Tanaka, Kazuto Kubota, Mitsuhisa Sato
. NICAM is a communication layer for SMP PC clusters connected via Myrinet, designed to reduce overhead and latency by directly utilizing a micro-processor equipped on the network interface. It...
COMPaS: A Pentium Pro PC-based SMP Cluster and its Experience (1998)
Yoshio Tanaka, Motohiko Matsuda, Makoto Ando, Kazuto Kubota, Mitsuhisa Sato
. We have built an eight node SMP cluster called COMPaS (Cluster Of Multi-Processor Systems), each node of which is a quadprocessor Pentium Pro PC. We have designed and implemented a remote memory...
Makoto Ando, Yoshio Tanaka, Kazuto Kubota, Motohiko Matsuda, Yutaka Akiyama, Mitsuhisa Sato
In this paper, we measure and compare the performance of shared- and distributed-memory multiprocessors using a parallel tree search problem to characterize these types of multiprocessors. We take...
OBP Lib: An Object-Oriented Parallel Library and its Preliminary Performance (1998)
Motohiko Matsuda, Mitsuhisa Sato, Yutaka Ishikawa
OBP Lib (Object-Oriented Building Blocks in Parallel Programming) is a collection of parallel object classes including parallel array and matrix/vector classes in C+ + , which is useful for combining...
Efficient Implementation of Portable C*-like Data-Parallel Library in C++ (1997)
Motohiko Matsuda, Mitsuhisa Sato, Yutaka Ishikawa
The C* language is a data-parallel extension of the C language which incorporates parallel data types. Since the C+ + language provides operator overloading, a C+ + library can implement the C*...
Parallel Array Class Implementation Using C++ STL Adaptors (1997)
Motohiko Matsuda, Mitsuhisa Sato, Yutaka Ishikawa
. STL Adaptors can combine operations and are used in elimination of temporaries in a C+ + array class; this technique is known as Expression Templates or Template Closures. Since the technique is...
Parallel Array Class Implementation Using C++ STL Adaptors (Extended Abstract) (1997)
Motohiko Matsuda, Mitsuhisa Sato, Yutaka Ishikawa
this paper, we present an implementation of a parallel array class with ghost-cells employing the Adaptor technique. Ghost-cells, also called as guard-cells or an envelope, are overlapping boundary...
Design and Implementation of Metalevel Architecture in C++ - MPC++ Approach (1996)
Yutaka Ishikawa, Atsushi Hori, Mitsuhisa Sato, Motohiko Matsuda, Jörg Nolte, Hiroshi Tezuka, ...
We developed a compile-time metalevel architecture in C++, called the MPC ++ metalevel architecture, to not only extend and modify language semantics, but also extend syntax. This architecture...
RWC Massively Parallel Software Environment and An Overview of MPC++ (1995)
Yutaka Ishikawa, Atsushi Hori, Mitsuhisa Sato, Motohiko Matsuda, Jörg Nolte, Hiroshi Tezuka, ...
this paper, we focus on MPC ++. First of all, the MPC ++ approach to a parallel programming system is described in section 2. Sections 3 and 4 show an overview of the control parallel and the MPC ++...