Yunhong Gu

The Open Cloud Testbed: A Wide Area Testbed for Cloud Computing Utilizing High Performance Network Services (2009)

Grossman, Robert, Gu, Yunhong, Sabala, Michal, Bennet, Collin, Seidman, Jonathan, Mambratti, Joe

Recently, a number of cloud platforms and services have been developed for data intensive computing, including Hadoop, Sector, CloudStore (formerly KFS), HBase, and Thrift. In order to benchmark the...

Manuscripts, submitted to IEEE COMMUNICATIONS LETTERS 1 SABUL: A High Performance Data Transfer Protocol (2008)

Yunhong Gu, Student Member, Xinwei Hong, Marco Mazzucco, Robert Grossman

Abstract-- The paper describes SABUL, an application level data transfer protocol for data intensive applications over high bandwidth-delay product networks. SABUL is designed for reliability, high...

End-to-End Congestion Control for High (2008)

Yunhong Gu, Student Member, Robert L. Grossman

Abstract—One of the headache problems in high performance computing area is the lack of a transport protocol to transfer bulk data fast over computational grids. TCP, the de facto transport...

Sector and Sphere: Towards Simplified Storage and Processing of Large Scale Distributed Data (2008)

Gu, Yunhong, Grossman, Robert L

Cloud computing has demonstrated that processing very large datasets over commodity clusters can be done simply given the right programming model and infrastructure. In this paper, we describe the...

Data Mining Using High Performance Data Clouds: Experimental Studies Using Sector and Sphere (2008)

Grossman, Robert L, Gu, Yunhong

We describe the design and implementation of a high performance cloud that we have used to archive, analyze and mine large distributed data sets. By a cloud, we mean an infrastructure that provides...

Distributed Discovery in E-Science: Lessons from the Angle Project ∗ (2008)

Robert L. Grossman, Michael Sabala, Yunhong Gu, Anushka An, Matt H, Rajmonda Sulo, ...

We describe the design of a system called Angle that detects emergent and anomalous behavior in distributed IP packet data. Currently, Angle sensors are collecting IP packet data at four locations,...

Abstract UDT: UDP-based Data Transfer for High-Speed Wide Area Networks (2008)

Yunhong Gu, Robert L. Grossman

In this paper, we summarize our work on the UDT high performance data transport protocol in the past four years. UDT was designed to effectively utilize the rapidly emerging high-speed wide area...

Compute and Storage Clouds Using Wide Area High Performance Networks (2008)

Grossman, Robert L., Gu, Yunhong, Sabala, Michael, Zhang, Wanzhi

We describe a cloud based infrastructure that we have developed that is optimized for wide area, high performance networks and designed to support data mining applications. The infrastructure...

Experimental Studies Using Photonic Data Services (2008)

At Igrid, Robert L. Grossman, Yunhong Gu, Don Hamelburg, Dave Hanley, Xinwei Hong, ...

We describe an architecture for remote and distributed data intensive applications which integrates path services for optical paths, network protocol services for high performance data transport, and...

Abstract for Second International Workshop on Protocols for Fast Long-Distance Networks UDT: An Application Level Transport Protocol for Grid Computing (2008)

Yunhong Gu, Robert L. Grossman

As network bandwidth and delays increase, TCP becomes inefficient [1, 5, 8, 9]. These problems are due to slow loss-recovery, a RTT bias inherent in its AIMD congestion-control algorithm, and the...

Manuscript submitted to Computer Communication Review Using UDP for Reliable Data Transfer over High Bandwidth-Delay Product Networks (2008)

Yunhong Gu, Robert Grossman

As the network bandwidth and delay increase, TCP becomes inefficient. Data intensive applications over high-speed networks such as the computational grids need new transport protocol to support them....

FastPara: a High-level Declarative Data-Parallel Programming Framework on Clusters ABSTRACT (2008)

Yong Mao, Yunhong Gu, Jia Chen, Robert L. Grossman

This paper presents FastPara, a C++ programming framework and associated runtime support for writing and running data-parallel applications in computer cluster environments. With FastPara, the user...

Open DMIX- Data Integration and Exploration Services for Data Grids, Data Web and Knowledge Grid Applications (2008)

Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong, Gokulnath Rao

The analysis and mining of remote and distributed data is critical for many applications being deployed on grid-based and web-based computing platforms. Broadly speaking, the

Abstract UDT: UDP-based Data Transfer for High-Speed Wide Area Networks (2008)

Yunhong Gu, Robert L. Grossman

In this paper, we summarize our work on the UDT high performance data transport protocol over the past four years. UDT was designed to effectively utilize the rapidly emerging high-speed wide area...

A Peer-to-Peer Infrastructure for Distributing Large Scientific Data Sets over Wide Area High-Performance Networks: Experimental Studies Using Wide Area Layer 2 Services (2008)

Yunhong Gu, Robert L. Grossman

www.ncdm.uic.edu This paper presents Sector, a distributed environment that was created specifically to address the challenges inherent in accessing, exploring, analyzing and transporting extremely...

Journal of Grid Computing, to appear. 1 (2008)

Sabul Transport Protocol, Yunhong Gu, Robert Grossman

This paper describes SABUL, an application-level data transfer protocol for data-intensive applications over high bandwidth-delay product networks. SABUL is designed for reliability, high...

Integrating Path, Network and Data Services to Support Next Generation Data Mining Applications, Data Mining: Next Generation (2007)

Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong, Jorge Levera, Marco Mazzucco, ...

We describe an architecture for next generation, distributed data mining systems which integrates data services to facilitate remote data analysis and distributed data mining, network protocol...

Experimental studies of data transport and data access of earth-science data over networks with high bandwidth delay products (2007)

Robert L. Grossman, Yunhong Gu, David Hanley, Xinwei Hong, Parthasarathy Krishnaswamy

Computer Networks, 2004. Although the amount of earth science data is growing rapidly, as is the availability of high performance networks, our ability to access large remote earth science data sets...

A case for global access to large distributed data sets using data webs employing photonic data services, in preparation (2007)

Dave Lillethun, Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong, Jorge Levera, ...

We argue that data webs employing specialized path services, network protocols, and data protocols can be an effective platform to analyze and access millions of distributed Gigabyte (and larger)...

Data mining middleware for wide-area high-performance networks,” Future Generation (2006)

Robert L. Grossman, Yunhong Gu, David Hanley, Michal Sabala, Joe Mambretti, Alex Szalay, ...

In this paper, we describe two distributed, data intensive applications that were demonstrated at iGrid 2005 (iGrid Demonstration US109 and iGrid Demonstration US121). One involves transporting...

UDT : a high performance data transport protocol / (2005)

Gu, Yunhong.

Thesis (Ph.D. in Computer Science)--University of Illinois at Chicago, 2005.

Experiences in Design and Implementation of a High Performance Transport (2004)

Yunhong Gu, Xinwei Hong, Robert L. Grossman

This paper describes our experiences in the development of the UDP-based Data Transport (UDT) protocol, an application level transport protocol used in distributed data intensive applications. The...

Open DMIX: High Performance Web Services for Distributed Data Mining (2004)

Robert Grossman, Chetan Gupta, Parthasarathy Krishnaswamy, L. Grossman, Yunhong Gu, Yunhong Gu, ...

In this note, we introduce Open DMIX, an open source collection of web services for the mining, integration, and exploration of remote and distributed data. We also describe some preliminary...

Experimental Studies Using Application Layer Protocols Based Upon UDP to Support Applications Requiring Requiring Very High Volume Data Flows (2004)

Robert L. Grossman, Yunhong Gu And Xinwei, Yunhong Gu, Xinwei Hong, Antony Antony, Johan Blom, ...

We describe a UDP based application level transport protocol, named UDT (UDP based Data Transfer) that is designed for high performance networking and computing. It is fast, fair, and friendly. UDT...

at Chicago “High Performance Data Streaming in Service Architecture (2004)

Geoffrey Fox, Harshawardhan Gadgil, Shrideep Pallickara, Marlon Pierce, Robert L. Grossman, Yunhong Gu, ...

Applications dealing with large data sets obtained via simulation or actual real-time sensor networks are increasing in abundance. The data obtained from real-time sources may contain certain...

The Photonic TeraStream: Enabling Next Generation Applications Through Intelligent Optical Networking at iGRID2002 (2003)

Joe Mambretti, Jeremy Weinberger, Jim Chen, Elizabeth Bacon, Fei Yeh, David Lillethun, ...

Traditionally, the design and implementation of network based applications, especially large-scale, high performance applications, have had to be compromised across multiple dimensions interfaces,...

Rate Based Congestion Control over High Bandwidth/Delay Links, 2002, submitted for publication (2003)

Yunhong Gu, Xinwei Hong, Marco Mazzucco, Robert Grossman

As the network bandwidth and delay increase, traditional dominant transfer protocols on the Internet like TCP have shown inefficiency and instability to many applications involved in huge amounts of...

SABUL: A high performance data transfer protocol (2003)

Yunhong Gu, Xinwei Hong, Marco Mazzucco, Robert Grossman

The paper describes a general purpose high performance data transfer protocol for data intensive applications over high bandwidth networks. The protocol, named SABUL, has demonstrated the efficiency...

Experimental Studies Using Photonic Data Services at IGrid 2002 (2002)

At Igrid, Robert L. Grossman, Yunhong Gu, Don Hamelburg, Dave Hanley, Xinwei Hong, ...

We describe an architecture for remote and distributed data intensive applications which integrates optical path services, network protocol services for high performance data transport, and data...

Data Webs for Earth Science Data (2002)

Asvin Ananthanarayan Rajiv, Rajiv Balach, Yunhong Gu, Robert Grossman, Xinwei Hong, Jorge Levera, ...

We describe high performance data webs for earth science data which are designed for interactively analyzing small to moderate size remote data sets, as well as mining distributed data sets....