Yunhong Gu

Sector and Sphere: Towards Simplified Storage and Processing of Large Scale Distributed Data (2008)

Yunhong Gu, Robert L. Grossman

Cloud computing has demonstrated that processing very large datasets over commodity clusters can be done simply given the right programming model and infrastructure. In this paper, we describe the...

Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere (2008)

Robert L. Grossman, Yunhong Gu

We describe the design and implementation of a high performance cloud that we have used to archive, analyze and mine large distributed data sets. By a cloud, we mean an infrastructure that provides...

Compute and Storage Clouds Using Wide Area High Performance Networks (2008)

Robert L. Grossman, Yunhong Gu, Michael Sabala, Wanzhi Zhang

We describe a cloud based infrastructure that we have developed that is optimized for wide area, high performance networks and designed to support data mining applications. The infrastructure...

Experimental Studies of Data Transport and Data (2004)

Robert L. Grossman, Yunhong Gu, David Hanley, Xinwei Hong, Parthasarathy Krishnaswamy

Although the amount of earth science data is growing rapidly, as is the availability of high performance networks, our ability to access large remote earth science data sets is still very limited....

Future Generation Computer Systems 19 (2003) 897--908 (2004)

Joe Mambretti, Jeremy Weinberger, Jim Chen, Elizabeth Bacon, Fei Yeh, David Lillethun, ...

Traditionally, the design and implementation of network based applications, especially large-scale, high performance applications, have had to be compromised across multiple dimensions: interfaces,...

Photonic Data Services: Integrating Data, Network and Path Services to Support Next Generation Data Mining Applications (2004)

Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong, Dave Lillethun, Jorge Levera, ...

We describe an architecture for next generation, distributed data mining systems which integrates data services to facilitate remote data analysis and distributed data mining, network protocol...

Journal of Grid Computing, to appear. 1 (2004)

Yunhong Gu, Robert Grossman

This paper describes SABUL, an application-level data transfer protocol for data-intensive applications over high bandwidth-delay product networks. SABUL is designed for reliability, high...

Open DMIX: High Performance Web Services for (2004)

Robert Grossman, Chetan Gupta, Parthasarathy Krishnaswamy, L. Grossman, Yunhong Gu, David Hanley, ...

In this note, we introduce Open DMIX, an open source collection of web services for the mining, integration, and exploration of remote and distributed data. We also describe some preliminary...

Experimental Studies Using Application Layer Protocols Based Upon UDP to Support Applications Requiring Very High Volume Data Flows (2004)

Robert L. Grossman, Yunhong Gu, Xinwei Hong, Antony Antony, Johan Blom, Freek Dijkstra, ...

We describe a UDP based application level transport protocol, named UDT (UDP based Data Transfer), that is designed for high performance networking and computing. It is fast, fair, and friendly. UDT...

Data Webs for Earth Science Data (2003)

Rajiv Balach, Yunhong Gu, Robert Grossman, Xinwei Hong, Jorge Levera, Marco Mazzucco

We describe high performance data webs for earth science data which are designed for interactively analyzing small to moderate size remote data sets, as well as mining distributed data sets....

A Case for the Global Access to Large Distributed (2003)

Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong, Jorge Levera, Marco Mazzucco, ...

We argue that data webs employing specialized path services, network protocols, and data protocols can be an effective platform to analyze and access millions of distributed Gigabyte (and larger) size...

Experimental Studies Using (2003)

At Igrid, Robert L. Grossman, Yunhong Gu, Don Hamelburg, Dave Hanley, Xinwei Hong, ...

We describe an architecture for remote and distributed data intensive applications which integrates optical path services, network protocol services for high performance data transport, and data...