Jim Gray

Diagnosing norovirus-associated infectious intestinal disease using viral load (2009)

Phillips, Gemma, Lopman, Ben, Tam, Clarence C, Iturriza-Gomara, Miren, Brown, David, Gray, Jim

Abstract Background Reverse transcription-polymerase chain reaction (RT-PCR) is the main method for laboratory diagnosis of norovirus-associated infectious intestinal disease (IID). However, up to...

Life Under your Feet: A Wireless Soil Ecology Sensor Network (2009)

Răzvan Musăloiu-e, Andreas Terzis, Katalin Szlavecz, Alex Szalay, Joshua Cogan, Jim Gray

Wireless sensor networks can revolutionize soil ecology by providing measurements at temporal and spatial granularities previously impossible. This paper presents an experimental soil monitoring...

The effects of LIPUS on soft-tissue healing: a review of literature (2009)

Khanna, Anil, Nelmes, Richard T. C., Gougoulias, Nikolaos, Maffulli, Nicola, Gray, Jim

Introduction Ultrasound is widely used for imaging purposes and as an adjunct to physiotherapy. Low-intensity pulsed ultrasound (LIPUS), having removed the thermal component found at higher...

Rotaviruses and rotavirus vaccines (2009)

Desselberger, Ulrich, Manktelow, Emily, Li, Wilson, Cheung, Winsome, Iturriza-Gómara, Miren, Gray, Jim

Background Rotaviruses (RVs) are an important cause of acute gastroenteritis in infants and young children worldwide, resulting in more than 600 000 deaths per annum, mainly in developing countries....

Data Management in the Worldwide Sensor Web (2008)

Magdalena Balazinska, Amol Deshpande, Michael J. Franklin, Phillip B. Gibbons, Jim Gray, Suman Nath, ...

Harvesting the benefits of a sensor-rich world presents many data management challenges. Recent advances in research and industry aim to address these challenges.

Reader Aids- (2008)

Jim Gray

Special math needed for explanations: None

Finding Galaxy Clusters: When SQL meets the grid (2008)

Er S. Szalay, Jim Gray, Aniruddha R. Thakar, William J. O’mullane, James Annis

We illustrate the benefits of combining database systems and grid technologies for data-intensive applications. Using a cluster of SQL servers, we reimplemented an existing grid application to find...

Categories and Subject Descriptors: H. 2.4 [Database Management]: Systems-distributed systems; (2008)

Irving L. Traiger, Jim Gray, Cesare A. Galtieri, Bruce G. Lindsay

The concepts of transaction and of data consistency are defined for a distributed system. The cases of partitioned data, where fragments of a file are stored at multiple nodes, and replicated data,...

Hash Join Algorithms in a Multiuser Environment (2008)

Hansjorg Zeller, Jim Gray

Summary. As main memory becomes a cheaper resource, hash joins are an alternative to the traditional methods of perfonning equi-joins: nested loop and merge joins. This paper introduces a modified,...

VIEWPOINT The World-Wide Telescope (2008)

C. H. Bennett, Er Szalay, Jim Gray

may enhance our ability to understand as well as control quantum systems. Bob: I thought all the fuss about quantum computing was about engineering—but that sounds like something you’d read in...

Abstract A Memory Model for Scientific Algorithms on Graphics Processors (2008)

Naga K. Govindaraju, Scott Larsen, Jim Gray, Dinesh Manocha

We present a memory model to analyze and improve the performance of scientific algorithms on graphics processing units (GPUs). Our memory model is based on texturing hardware, which uses a 2D...

• The revolution in Computational Science • The Virtual Observatory Concept = = World-Wide Telescope 2 Computational Science (2008)

Jim Gray, Alex Szalay, Ani Thakar, Roy Williams, George Djorgovski, Julian Bunn Caltech, ...

• In the beginning science was empirical. • Then theoretical branches evolved. • Now, we have computational branches. – Was primarily simulation – Growth areas: data analysis &...

The sixth data release of the Sloan Digital Sky Survey (2008)

Adelman-McCarthy, Jennifer K., Agüeros, Marcel A., Allam, Sahar S., Prieto, Carlos Allende, Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Sixth Data Release of the Sloan Digital Sky Survey. With this data release, the imaging of the northern Galactic cap is now complete. The survey contains images and...

Abstract A Performance Study of Sequential I/O on Windows NT ™ 4 (2008)

Erik Riedel, Catharine Van Ingen, Jim Gray

Large-scale database, data mining, and multimedia applications require large, sequential transfers and have bandwidth as a key requirement. This paper investigates the performance of reading and...

From FITS to SQL- Loading and Publishing the SDSS Data (2008)

Aniruddha R. Thakar, Er S. Szalay, Jim Gray

Abstract. For large astronomical databases like the SDSS Science Archive,data loading is potentially the most time-consuming and labor-intensive part of archive operations,and it is also the most...

ImgCutout, an Engine of Instantaneous Astronomical Discovery (2008)

Er S. Szalay, Jim Gray

Abstract. ImgCutout is a Web application that enables professional astronomers and the general public to interactively visualize and explore large, complex astronomical data sets. The application...

Abstract A Performance Study of Sequential I/O on Windows NT ™ 4 (2008)

Erik Riedel, Catharine Van Ingen, Jim Gray

Large-scale database, data mining, and multimedia applications require large, sequential transfers and have bandwidth as a key requirement. This paper investigates the performance of reading and...

Tandem TR 86.3 (2008)

Fastsort An External, Alex Tsukerman, Jim Gray, Michael Stewart, Susan Uren, Bonnie Vaughan

FastSort is an external sort that uses parallel processing, large main memories and parallel disc accesses to obtain high performance. FastSort can sort a file as quickly as it can read the input and...

Lecture Notes in Computer Science Edited by G. Goos and J. Hartmanis 60 M. J. Flynn, J. N. Gray, A. K. Jones, K. Lagally H. Opderbeck, G. J. Popek, B. Randell J. H. Saltzer, H. R. Wehle (2008)

Operating Systems An, M. J. Flynn, J. N. Gray, A. K. Jones, K. Lagally, H. Opderbeck, ...

This paper plagiarizes the work of the large and anonymous army of people working in the field. Because of the state of the field, there are few references to the literature (much of the...

Tandem Computers (2008)

Jim Gray Digital, Charles Levine, Jim Gray, Walt Kohler, Charles Levine, Jim Gray, ...

TPC benchmarks provide the de facto industry standard for measuring transaction processing performance. This paper argues that TPC-A and TPC-B have served their purpose and are effectively obsolete....

Tandem TR 89.1 (2008)

Hash Join Algorithms, Hansjörg Zeller, Jim Gray

This paper introduces a modified, adaptive hash join method that is designed to work with dynamic changes in the amount of available memory. The general idea of the algorithm is to regulate resource...

The sixth data release of the Sloan Digital Sky Survey (2008)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Prieto, Carlos Alende, Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Sixth Data Release of the Sloan Digital Sky Survey. With this data release, the imaging of the northern Galactic cap is now complete. The survey contains images and...

The sixth data release of the Sloan Digital Sky Survey (2008)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Prieto, Carlos Alende, Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Sixth Data Release of the Sloan Digital Sky Survey. With this data release, the imaging of the northern Galactic cap is now complete. The survey contains images and...

The sixth data release of the Sloan Digital Sky Survey (2008)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Prieto, Carlos Alende, Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Sixth Data Release of the Sloan Digital Sky Survey. With this data release, the imaging of the northern Galactic cap is now complete. The survey contains images and...

The sixth data release of the Sloan Digital Sky Survey (2008)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Prieto, Carlos Alende, Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Sixth Data Release of the Sloan Digital Sky Survey. With this data release, the imaging of the northern Galactic cap is now complete. The survey contains images and...

Draft Sequential IO Paper 1 07/04/99 A Performance Study of Sequential I/O on Windows NT ™ 4.0 (2007)

Erik Riedel (cmu, Erik Riedel, Catharine Van Ingen, Catharine Van Ingen, Jim Gray, Jim Gray

This paper investigates the most efficient way to read and write large sequential files using the Windows NT ™ 4.0 File System. The study explores the performance of Intel Pentium Pro ™ based...

1 Parallel Database Systems: The Future of High Performance Database Processing (2007)

David J. Dewitt, Jim Gray

Abstract: Parallel database machine architectures have evolved from the use of exotic hardware to a software parallel dataflow architecture based on conventional shared-nothing hardware. These new...

Digital Immortality 2 (2007)

Gordon Bell, Gordon Bell, Jim Gray, Jim Gray

1: This work has been submitted for publication to the Communications of the ACM. Copyright may be transferred without further notice and the publisher may then post the accepted version. A version...

Nsort: a Parallel Sorting Program for NUMA and SMP Machines (2007)

Version November Chris, Chris Nyberg, Charles Koester, Ordinal Technology Corp, Ordinal Technology Corp, Jim Gray

This paper describes Nsort's background, presents its performance sorting a terabyte of data, and compares its performance on an industry-standard benchmark. Nsort performance is presented for...

The Microsoft TerraServer™ (2007)

Tom Barclay, Robert Eberl, Jim Gray, Jim Gray, John Nordlinger, John Nordlinger, ...

The Microsoft TerraServer stores aerial and satellite images of the earth in a SQL Server Database served to the public via the Internet. It is the world's largest atlas, combining five...

Data Cube: ARelational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals (2007)

Jim Gray, Adam Bosworth, Andrew Layman, Hamid Pirahesh, Generalizing Group-by

: Data analysis applications typically aggregate data across many dimensions looking for unusual patterns. The SQL aggregate functions and the GROUP BY operator produce zero-dimensional or...

To mark (2007)

Roger Needham, Roger Needham, Martín Abadi, Ross Anderson, Jean Bacon, Andrew Birrell, ...

comprising in this compilation are copyright of the respective authors. All rights are reserved. This publication may not be copied, reproduced, published or distributed in whole or in part in any...

Performance of the 1-1 Data Pump (2007)

Tobias Mayr, Jim Gray

Abstract: This document describes the implementation and performance of a 1-1 data pump, i.e., a program transferring data between disks on one node or on two different nodes connected by a network....

A Transactional Approach to Redundant Disk Array Implementation (2007)

Martin Francis, Jim Gray, Daniel P. Siewiorek, Charles Richard Courtright

are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of sponsoring companies or the government of the United States of America.

1 High Performance Computing: Crays, Clusters, and Centers. What Next? (2007)

Gordon Bell, Gordon Bell, Jim Gray, Jim Gray, Gray Microsoft. Com

(1) clusters of “Cray-style ” vector supercomputers; (2) clusters of scalar uni- and multi-processors. Clusters are in transition from (a) massively parallel computers and clusters running...

1 High Performance Computing: Crays, Clusters, and Centers. What Next? (2007)

Gordon Bell, Gordon Bell, Jim Gray, Jim Gray, Gray Microsoft. Com

Abstract: After 50 years of building high performance scientific computers, two major architectures exist: (1) clusters of “Cray-style ” vector supercomputers; (2) clusters of scalar uni- and...

The Sloan Digital Sky Survey Quasar Catalog IV. Fifth Data Release (2007)

Schneider, Donald P., Hall, Patrick B., Richards, Gordon T., Strauss, Michael A., Berk, Daniel E. Vanden, Anderson, Scott F., ...

We present the fourth edition of the Sloan Digital Sky Survey (SDSS) Quasar Catalog. The catalog contains 77,429 objects; this is an increase of over 30,000 entries since the previous edition. The...

Life Under Your Feet: An End-to-End Soil Ecology Sensor Network, Database, Web Server, and Analysis Service (2007)

Szlavecz, Katalin, Terzis, Andreas, Ozer, Stuart, Musaloiu-E, Razvan, Cogan, Joshua, Small, Sam, ...

Wireless sensor networks can revolutionize soil ecology by providing measurements at temporal and spatial granularities previously impossible. This paper presents a soil monitoring system we...

The Zones Algorithm for Finding Points-Near-a-Point or Cross-Matching Spatial Datasets (2007)

Gray, Jim, Nieto-Santisteban, Maria A., Szalay, Alexander S.

Zones index an N-dimensional Euclidian or metric space to efficiently support points-near-a-point queries either within a dataset or between two datasets. The approach uses relational algebra and the...

Cross-Matching Multiple Spatial Observations and Dealing with Missing Data (2007)

Gray, Jim, Szalay, Alex, Budavari, Tamas, Lupton, Robert, Nieto-Santisteban, Maria, Thakar, Ani

Cross-match spatially clusters and organizes several astronomical point-source measurements from one or more surveys. Ideally, each object would be found in each survey. Unfortunately, the...

SkyServer Traffic Report - The First Five Years (2007)

Singh, Vik, Gray, Jim, Thakar, Ani, Szalay, Alexander S., Raddick, Jordan, Boroski, Bill, ...

The SkyServer is an Internet portal to the Sloan Digital Sky Survey Catalog Archive Server. From 2001 to 2006, there were a million visitors in 3 million sessions generating 170 million Web hits, 16...

Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals (2007)

Gray, Jim, Chaudhuri, Surajit, Bosworth, Adam, Layman, Andrew, Reichart, Don, Venkatrao, Murali, ...

Data analysis applications typically aggregate data across many dimensions looking for anomalies or unusual patterns. The SQL aggregate functions and the GROUP BY operator produce zero-dimensional or...

Data Management: Past, Present, and Future (2007)

Gray, Jim

Soon most information will be available at your fingertips, anytime, anywhere. Rapid advances in storage, communications, and processing allow us move all information into Cyberspace. Software to...

A Critique of ANSI SQL Isolation Levels (2007)

Berenson, Hal, Bernstein, Phil, Gray, Jim, Melton, Jim, O'Neil, Elizabeth, O'Neil, Patrick

ANSI SQL-92 defines Isolation Levels in terms of phenomena: Dirty Reads, Non-Repeatable Reads, and Phantoms. This paper shows that these phenomena and the ANSI SQL definitions fail to characterize...

Queues Are Databases (2007)

Gray, Jim

Message-oriented-middleware (MOM) has become an small industry. MOM offers queued transaction processing as an advance over pure client-server transaction processing. This note makes four points:...

Supporting Finite Element Analysis with a Relational Database Backend, Part I: There is Life beyond Files (2007)

Heber, Gerd, Gray, Jim

In this paper, we show how to use a Relational Database Management System in support of Finite Element Analysis. We believe it is a new way of thinking about data management in well-understood...

Supporting Finite Element Analysis with a Relational Database Backend, Part II: Database Design and Access (2007)

Heber, Gerd, Gray, Jim

This is Part II of a three article series on using databases for Finite Element Analysis (FEA). It discusses (1) db design, (2) data loading, (3) typical use cases during grid building, (4) typical...

Thousands of DebitCredit Transactions-Per-Second: Easy and Inexpensive (2007)

Gray, Jim, Levine, Charles

A $2k computer can execute about 8k transactions per second. This is 80x more than one of the largest US bank's 1970's traffic - it approximates the total US 1970's financial transaction volume. Very...

A Measure of Transaction Processing 20 Years Later (2007)

Gray, Jim

This provides a retrospective of the paper "A Measure of Transaction Processing" published in 1985. It shows that transaction processing peak performance and price-peformance have improved about...

Using Table Valued Functions in SQL Server 2005 To Implement a Spatial Data Library (2007)

Gray, Jim, Szalay, Alex, Fekete, Gyorgy

This article explains how to add spatial search functions (point-near-point and point in polygon) to Microsoft SQL Server 2005 using C# and table-valued functions. It is possible to use this library...

Indexing the Sphere with the Hierarchical Triangular Mesh (2007)

Szalay, Alexander S., Gray, Jim, Fekete, George, Kunszt, Peter Z., Kukol, Peter, Thakar, Ani

We describe a method to subdivide the surface of a sphere into spherical triangles of similar, but not identical, shapes and sizes. The Hierarchical Triangular Mesh (HTM) is a quad-tree that is...

Petascale Computational Systems (2007)

Bell, Gordon, Gray, Jim, Szalay, Alex

Computational science is changing to be data intensive. Super-Computers must be balanced systems; not just CPU farms but also petascale IO and networking arrays. Anyone building CyberInfrastructure...

Empirical Measurements of Disk Failure Rates and Error Rates (2007)

Gray, Jim, Van Ingen, Catharine

The SATA advertised bit error rate of one error in 10 terabytes is frightening. We moved 2 PB through low-cost hardware and saw five disk read error events, several controller failures, and many...

Large-Scale Query and XMatch, Entering the Parallel Zone (2007)

Nieto-Santisteban, Maria A., Thakar, Aniruddha R., Szalay, Alexander S., Gray, Jim

Current and future astronomical surveys are producing catalogs with millions and billions of objects. On-line access to such big datasets for data mining and cross-correlation is usually as highly...

The Sloan Digital Sky Survey Quasar Catalog. IV. Fifth Data Release (2007)

Schneider, Donald P., Hall, Patrick B., Richards, Gordon T., Strauss, Michael A., Vanden Berk, Daniel E., Anderson, Scott F., ...

We present the fourth edition of the Sloan Digital Sky Survey (SDSS) Quasar Catalog. The catalog contains 77,429 objects; this is an increase of over 30,000 entries since the previous edition. The...

The Fifth Data Release of the Sloan Digital Sky Survey (2007)

Adelman-McCarthy, Jennifer K., Agüeros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fifth Data Release (DR5) of the Sloan Digital Sky Survey (SDSS). DR5 includes all survey quality data taken through 2005 June and represents the completion of the SDSS-I...

The fifth data release of the Sloan Digital Sky Survey (2007)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fifth Data Release (DR5) of the Sloan Digital Sky Survey (SDSS). DR5 includes all survey quality data taken through 2005 June and represents the completion of the SDSS-I...

The Fifth Data Release of the Sloan Digital Sky Survey (2007)

Adelman-McCarthy, Jennifer K., Agueeros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fifth Data Release (DR5) of the Sloan Digital Sky Survey (SDSS). DR5 includes all survey quality data taken through 2005 June and represents the completion of the SDSS-I...

The Fifth Data Release of the Sloan Digital Sky Survey (2007)

Adelman-McCarthy, Jennifer K., Agueeros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fifth Data Release (DR5) of the Sloan Digital Sky Survey (SDSS). DR5 includes all survey quality data taken through 2005 June and represents the completion of the SDSS-I...

Data Cube: A Relational Aggregation Operator Generalizing Group By, Cross-Tab, and Sub-Totals. (2007)

Magdalena Balazinska, Jim Gray, Data Mining, Knowledge Discovery

Class projects are going very well! Project presentations: 15 minutes – On Wednesday in two weeks – There are 14 teams, so we will need to schedule extra time – We will give you the grading...

The fifth data release of the Sloan Digital Sky Survey (2007)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fifth Data Release (DR5) of the Sloan Digital Sky Survey (SDSS). DR5 includes all survey quality data taken through 2005 June and represents the completion of the SDSS-I...

Designing a Multi-petabyte Database for LSST (2006)

Becla, Jacek, Hanushevsky, Andrew, Nikolaev, Sergei, Abdulla, Ghaleb, Szalay, Alex, Nieto-Santisteban, Maria, ...

The 3.2 giga-pixel LSST camera will produce approximately half a petabyte of archive images every month. These data need to be reduced in under a minute to produce real-time transient alerts, and...

The SDSS Quasar Survey: Quasar Luminosity Function from Data Release Three (2006)

Richards, Gordon T., Strauss, Michael A., Fan, Xiaohui, Hall, Patrick B., Jester, Sebastian, Schneider, Donald P., ...

We determine the number counts and z=0-5 luminosity function for a well-defined, homogeneous sample of quasars from the Sloan Digital Sky Survey (SDSS). We conservatively define the most uniform...

The Sloan Digital Sky Survey quasar survey: Quasar luminosity function from data release 3 (2006)

Richards, Gordon T., Strauss, Michael A., Fan, Xiaohui, Hall, Patrick B., Jester, Sebastian, Schneider, Donald P., ...

We determine the number counts and z=0-5 luminosity function for a well-defined, homogeneous sample of quasars from the Sloan Digital Sky Survey (SDSS). We conservatively define the most uniform...

The Fourth Data Release of the Sloan Digital Sky Survey (2006)

Adelman-McCarthy, Jennifer K., Agüeros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fourth Data Release of the Sloan Digital Sky Survey (SDSS), including all survey-quality data taken through 2004 June. The data release includes five-band photometric data...

The fourth data release of the Sloan Digital Sky Survey (2006)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fourth Data Release of the Sloan Digital Sky Survey (SDSS), including all survey-quality data taken through 2004 June. The data release includes five-band photometric data...

The fourth data release of the Sloan Digital Sky Survey (2006)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fourth Data Release of the Sloan Digital Sky Survey (SDSS), including all survey-quality data taken through 2004 June. The data release includes five-band photometric data...

GPUTeraSort: High Performance Graphics Coprocessor Sorting for Large Database Management (2006)

Naga K. Govindaraju, Jim Gray, Ritesh Kumar, Dinesh Manocha

We present a new algorithm, GPUTeraSort, to sort billionrecord wide-key databases using a graphics processing unit (GPU) Our algorithm uses the data and task parallelism on the GPU to perform...

SkyServer Traffic Report – The First Five Years (2006)

Vik Singh, Jim Gray

Abstract The SkyServer is an Internet portal to the Sloan

Petascale Computational Systems: (2006)

Balanced Cyberinfrastructure In, Gordon Bell, Jim Gray, Alex Szalay

Computational science is changing to be data intensive. Super-Computers must be balanced system, not just CPU farms but also petascale IO and networking arrays. Anyone building CyberInfrastructure...

Petascale computational systems: balanced cyberinfrastructure in a data-centric world. Letter to NSF Cyberinfrastructure Directorate. http://research.microsoft.com/~gray/papers/Petascale%20computational%20systems (2006)

Gordon Bell, Jim Gray, Alex Szalay

Abstract: Computational science is changing to be data intensive. NSF should support balanced systems, not just CPU farms but also petascale IO and networking. NSF should allocate resources to...

The fourth data release of the Sloan Digital Sky Survey (2006)

Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., Annis, James, ...

This paper describes the Fourth Data Release of the Sloan Digital Sky Survey (SDSS), including all survey-quality data taken through 2004 June. The data release includes five-band photometric data...

Batch is back: CasJobs, serving multi-TB data on the Web (2005)

OMullane, William, Li, Nolan, Nieto-Santisteban, Maria, Szalay, Alex, Thakar, Ani, Gray, Jim

The Sloan Digital Sky Survey (SDSS) science database describes over 140 million objects and is over 1.5 TB in size. The SDSS Catalog Archive Server (CAS) provides several levels of query interface to...

When Database Systems Meet the Grid (2005)

Nieto-Santisteban, Maria A., Szalay, Alexander S., Thakar, Aniruddha R., O'Mullane, William J., Gray, Jim, Annis, James

We illustrate the benefits of combining database systems and Grid technologies for data-intensive applications. Using a cluster of SQL servers, we reimplemented an existing Grid application that...

TerraServer SAN-Cluster Architecture and Operations Experience (2005)

Barclay, Tom, Gray, Jim

Microsoft TerraServer displays aerial, satellite, and to-pographic images of the earth in a SQL database available via the Internet. It is one of the most popular online at-lases, presenting...

Sequential File Programming Patterns and Performance with .NET (2005)

Kukol, Peter, Gray, Jim

Programming patterns for sequential file access in the .NET Framework are described and the performance is measured. The default behavior provides excellent performance on a single disk - 50 MBps...

Scientific Data Management in the Coming Decade (2005)

Gray, Jim, Liu, David T., Nieto-Santisteban, Maria, Szalay, Alexander S., DeWitt, David, Heber, Gerd

This is a thought piece on data-intensive science requirements for databases and science centers. It argues that peta-scale datasets will be housed by science centers that provide substantial storage...

Performance Considerations for Gigabyte per Second Transcontinental Disk-to-Disk File Transfers (2005)

Kukol, Peter, Gray, Jim

Moving data from CERN to Pasadena at a gigabyte per second using the next generation Internet requires good networking and good disk IO. Ten Gbps Ethernet and OC192 links are in place, so now it is...

Where the Rubber Meets the Sky: Bridging the Gap between Databases and Science (2005)

Gray, Jim, Szalay, Alexander S.

Scientists in all domains face a data avalanche - both from better instruments and from improved simulations. We believe that computer science tools and computer scientists are in a position to help...

The Third Data Release of the Sloan Digital Sky Survey (2005)

Abazajian, Kevork, Adelman-McCarthy, Jennifer K., Agüeros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Third Data Release of the Sloan Digital Sky Survey (SDSS). This release, containing data taken up through 2003 June, includes imaging data in five bands over 5282 deg2,...

The third data release of the Sloan Digital Sky Survey (2005)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Third Data Release of the Sloan Digital Sky Survey (SDSS). This release, containing data taken up through 2003 June, includes imaging data in five bands over 5282 deg²,...

The third data release of the Sloan Digital Sky Survey (2005)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Third Data Release of the Sloan Digital Sky Survey (SDSS). This release, containing data taken up through 2003 June, includes imaging data in five bands over 5282 deg²,...

When Database Systems Meet The Grid (2005)

Jim Gray, Er S. Szalay, James Annis, Aniruddha R. Thakar, William J. O’mullane

We illustrate the benefits of combining database systems and Grid technologies for data-intensive applications. Using a cluster of SQL servers, we reimplemented an existing Grid application that...

Alternative Software Stacks for OGSA-based Grids (2005)

Marty Humphrey, Glenn Wasson, Yuliyan Kiryakov, Sang-min Park, David Del Vecchio, Jim Gray

has been a major step forward for Grid Computing, but its de facto reliance on the Web Services Resource Framework (WSRF) and WS-Notification have left some in the community questioning if the Grid...

A "Measure of Transaction Processing" 20 Years Later (2005)

Jim Gray, Jim Gray

This article will appear in the IEEE Data Engineering Bulletin ######## Figure 1 (by Charles Levine from [2]): Price/performance trend lines for TPC-A and TPC-C. The 15-year trend lines track...

Figure 1: A $10M Tandem 208 tps system (1, 2) and (2005)

Ibm Tps System, Jim Gray, Charles Levine, Microsoft Sql Server

A $2k computer can execute about 8k transactions per second. This is 80x more than one of the largest US bank's 1970's traffic -- it approximates the total US 1970's financial...

Part I: There is Life beyond Files (2005)

Gerd Heber Cornell, Gerd Heber, Gerd Heber, Gerd Heber, Jim Gray, Jim Gray, ...

In this paper, we show how to use a Relational Database Management System in support of Finite Element Analysis. We believe it is a new way of thinking about data management in well-understood...

Batch is back: CasJobs, . . . (2005)

William O'Mullane, Nolan Li, Nolan Li, Maria Nieto-Santisteban, María Nieto-santisteban, Alex Szalay, ...

The Sloan Digital Sky Survey (SDSS) science database describes over 230 million objects and is over 1.6 TB in size. The SDSS Catalog Archive Server (CAS) provides several levels of query interface to...

Alternative Software Stacks for OGSA-based Grids (2005)

Marty Humphrey, Glenn Wasson, Yuliyan Kiryakov, Sang-min Park, David Del Vecchio, Jim Gray

(OGSA) has been a major step forward for Grid Computing, but its de facto reliance on the Web Services Resource Framework (WSRF) and WS-Notification have left some in the community questioning if the...

Alternative Software Stacks for OGSA-based Grids (2005)

Marty Humphrey, Glenn Wasson, Yuliyan Kiryakov, Sang-min Park, David Del Vecchio, Jim Gray

has been a major step forward for Grid Computing, but its de facto reliance on the Web Services Resource Framework (WSRF) and WS-Notification have left some in the community questioning if the Grid...

The Lowell Database Research Self-Assessment (2005)

Abiteboul, Serge, Agrawal, Rakesh, Bernstein, Philip A., Carey, Michael J., Ceri, Stefano, Croft, W. Bruce, ...

Database needs are changing, driven by the Internet and increasing amounts of scientific and sensor data. In this article, the authors propose research into several important new directions for...

The third data release of the Sloan Digital Sky Survey (2005)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., ...

This paper describes the Third Data Release of the Sloan Digital Sky Survey (SDSS). This release, containing data taken up through 2003 June, includes imaging data in five bands over 5282 deg²,...

Consensus on Transaction Commit (2004)

Gray, Jim, Lamport, Leslie

The distributed transaction commit problem requires reaching agreement on whether a transaction is committed or aborted. The classic Two-Phase Commit protocol blocks if the coordinator fails....

The Revolution In Database System Architecture (2004)

Gray, Jim

Database system architectures are undergoing revolutionary changes. Algorithms and data are being unified by integrating programming languages with the database system. This gives an extensible...

There Goes the Neighborhood: Relational Algebra for Spatial Data Search (2004)

Gray, Jim, Szalay, Alexander S., Thakar, Aniruddha R., Fekete, Gyorgy, O'Mullane, William, Nieto-Santisteban, Maria A., ...

We explored ways of doing spatial search within a relational database: (1) hierarchical triangular mesh (a tessellation of the sphere), (2) a zoned bucketing system, and (3) representing areas as...

A Quick Look at SATA Disk Performance (2004)

Barclay, Tom, Chong, Wyman, Gray, Jim

We have been investigating the use of low-cost, commodity components for multi-terabyte SQL Server databases. Dubbed storage bricks, these servers are white box PCs containing the largest ATA drives,...

Extending the SDSS Batch Query System to the National Virtual Observatory Grid (2004)

Nieto-Santisteban, Maria A., O'Mullane, William, Gray, Jim, Li, Nolan, Budavari, Tamas, Szalay, Alexander S., ...

The Sloan Digital Sky Survey science database is approaching 2TB. While the vast majority of queries normally execute in seconds or minutes, this interactive execution time can be disproportionately...

The World Wide Telescope: An Archetype for Online Science (2004)

Gray, Jim, Szalay, Alexander S.

Most scientific data will never be directly examined by scientists; rather it will be put into online databases where it will be analyzed and summarized by computer programs. Scientists increasingly...

Distributed Computing Economics (2004)

Gray, Jim

Computing economics are changing. Today there is rough price parity between (1) one database access, (2) ten bytes of network traffic, (3) 100,000 instructions, (4) 10 bytes of disk storage, and (5)...

The Sloan Digital Sky Survey Science Archive: Migrating a Multi-Terabyte Astronomical Archive from Object to Relational DBMS (2004)

Thakar, Aniruddha R., Szalay, Alexander S., Kunszt, Peter Z., Gray, Jim

The Sloan Digital Sky Survey Science Archive is the first in a series of multi-Terabyte digital archives in Astronomy and other data-intensive sciences. To facilitate data mining in the SDSS archive,...

Cosmological Parameters from Eigenmode Analysis of Sloan Digital Sky Survey Galaxy Redshifts (2004)

Pope, Adrian C., Matsubara, Takahiko, Szalay, Alexander S., Blanton, Michael R., Eisenstein, Daniel J., Gray, Jim, ...

We present estimates of cosmological parameters from the application of the Karhunen-Loeve transform to the analysis of the 3D power spectrum of density fluctuations using Sloan Digital Sky Survey...

The second data release of the Sloan Digital Sky Survey (2004)

Abazajian, Kevork, Adelman-McCarthy, Jennifer K., Agüeros, Marcel A., Allam, Sahar S., Anderson, Kurt, Anderson, Scott F., ...

The Sloan Digital Sky Survey (SDSS) has validated and made publicly available its Second Data Release. This data release consists of 3324 deg2 of five-band (ugriz) imaging data with photometry for...

The second data release of the Sloan Digital Sky Survey (2004)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., ...

The Sloan Digital Sky Survey (SDSS) has validated and made publicly available its Second Data Release. This data release consists of 3324 deg(2) of five-band (ugriz) imaging data with photometry for...

Peter Kukol Jim Gray December 2004 (2004)

Peter Kukol, Jim Gray

Programming patterns for sequential file access in the .NET Framework are described and the performance is measured. The default behavior provides excellent performance on a single disk -- 50 MBps...

When Database Sytems Meet the Grid (2004)

Alexander S. Szalay, Aniruddha R. Thakar, William J. O’mullane, William J. O'Mullane, Jim Gray, James Annis, ...

We illustrate the benefits of combining database systems and Grid technologies for data-intensive applications. Using a cluster of SQL servers, we reimplemented an existing Grid application that...

Relational Algebra for Spatial Data Search (2004)

Jim Gray Microsoft, Jim Gray, Alexander S. Szalay, Gyorgy Fekete, Aniruddha R. Thakar, ...

We explored ways of doing spatial search within a relational database: (1) hierarchical triangular mesh (a tessellation of the sphere), (2) a zoned bucketing system, and (3) representing areas as...

Where the Rubber Meets the Sky: Bridging the Gap between Databases and Science (2004)

Jim Gray Alex, Alex Szalay, Jim Gray, Alex Szalay

Scientists in all domains face a data avalanche -- both from better instruments and from improved simulations. We believe that computer science tools and computer scientists are in a position to help...

Tom Barclay (2004)

Jim Gray Wyman, Tom Barclay, Jim Gray, Wyman Chong

Microsoft TerraServer stores aerial, satellite, and topographic images of the earth in a SQL database available via the Internet since June 1998. It is a popular online atlas, combining twenty-two...

TerraServer SAN-Cluster . . . (2004)

Tom Barclay, Jim Gray

Microsoft TerraServer displays aerial, satellite, and topographic images of the earth in a SQL database available via the Internet. It is one of the most popular online atlases, presenting seventeen...

Extending the SDSS Batch Query System to the National Virtual Observatory Grid (2004)

Alexander S. Szalay, William O'Mullane, Jim Gray, Jim Gray, ...

The Sloan Digital Sky Survey science database is approaching 2TB. While the vast majority of queries normally execute in seconds or minutes, this interactive execution time can be disproportionately...

A Minute with Nsort on a 32P NEC Windows Itanium2 Server (2004)

Chris Nyberg Ordinal, Itanium Server, Chris Nyberg, Ordinal Technology Corp, Jim Gray, Microsoft Corporation, ...

In March 2004, the Nsort# program was able to sort 34 GB of data (340,000,000 100-byte records) in 58 seconds on a 32 processor Itanium 2 NEC Express5800/1320Xd running Microsoft Windows Server 2003...

Consensus on transaction commit (2004)

Jim Gray, Leslie Lamport

The distributed transaction commit problem requires reaching agreement on whether a transaction is committed or aborted. The classic Two-Phase Commit protocol blocks if the coordinator fails....

Abstract A Minute with Nsort on a 32P NEC Windows (2004)

Itanium Server, Chris Nyberg, Ordinal Technology Corp, Jim Gray, Microsoft Corporation, Charles Koester, ...

seconds on a 32 processor Itanium ® 2 NEC ® Express5800/1320Xd running Microsoft ® Windows® Server 2003 Datacenter Edition. This set new records for the MinuteSort benchmark. The data was read...

Consensus on transaction commit (2004)

Jim Gray, Leslie Lamport

The distributed transaction commit problem requires reaching agreement on whether a transaction is committed or aborted. The classic Two-Phase Commit protocol blocks if the coordinator fails....

Consensus on transaction commit (2004)

Jim Gray, Leslie Lamport

The distributed transaction commit problem requires reaching agreement on whether a transaction is committed or aborted. The classic Two-Phase Commit protocol blocks if the coordinator fails....

The second data release of the Sloan Digital Sky Survey (2004)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Kurt S. J., Anderson, Scott F., ...

The Sloan Digital Sky Survey (SDSS) has validated and made publicly available its Second Data Release. This data release consists of 3324 deg(2) of five-band (ugriz) imaging data with photometry for...

The Lowell Database Research Self Assessment (2003)

Abiteboul, Serge, Agrawal, Rakesh, Bernstein, Phil, Carey, Mike, Ceri, Stefano, Croft, Bruce, ...

A group of senior database researchers gathers every few years to assess the state of database research and to point out problem areas that deserve additional focus. This report summarizes the...

The Sloan Digital Sky Survey Quasar Catalog II. First Data Release (2003)

Schneider, Donald P., Fan, Xiaohui, Hall, Patrick B., Jester, Sebastian, Richards, Gordon T., Stoughton, Chris, ...

We present the second edition of the Sloan Digital Sky Survey (SDSS) Quasar Catalog. The catalog consists of 16713 objects in the SDSS First Data Release (DR1) that have luminosities larger than...

The first data release of the Sloan Digital Sky Survey (2003)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Scott F., Annis, James, ...

The Sloan Digital Sky Survey (SDSS) has validated and made publicly available its First Data Release. This consists of 2099 deg2 of five-band (u, g, r, i, z) imaging data, 186,240 spectra of...

A Quick Look at Serial ATA (SATA) Disk Performance (2003)

Tom Barclay, Tom Barclay, Wyman Chong, Wyman Chong, Jim Gray, Jim Gray, ...

We have been investigating the use of low-cost, commodity components for multi-terabyte SQL Server databases [SQL]. Dubbed storage bricks, these servers are white box PCs containing the largest ATA...

Serge Abiteboul, Rakesh Agrawal, Phil Bernstein, Mike Carey, Stefano Ceri, Bruce Croft, David DeWitt, Mike Franklin, (2003)

Serge Abiteboul, Rakesh Agrawal, Phil Bernstein, Mike Carey, Stefano Ceri, Bruce Croft, ...

This report summarizes the discussion and conclusions of the sixth ad-hoc meeting held May 4-6, 2003 in Lowell, Mass. It observes that information management continues to be a critical component of...

Distributed computing economics (2003)

Jim Gray, Jim Gray

This paper makes br oad statements about the economics of computing. The numbers are fluid -- (costs change every day.) They are approximate to within factor of 3. For this specific fact: SETI@Home...

Consensus on Transaction Commit (2003)

Jim Gray And, Jim Gray, Leslie Lamport

The distributed transaction commit problem requires reaching agreement on whether a transaction is committed or aborted. The classic Two-Phase Commit protocol blocks if the coordinator fails....

The first data release of the Sloan Digital Sky Survey (2003)

Abazajian, Kevork N., Adelman-McCarthy, Jennifer K., Agueros, Marcel A., Allam, Sahar S., Anderson, Scott F., Annis, James, ...

The Sloan Digital Sky Survey (SDSS) has validated and made publicly available its First Data Release. This consists of 2099 deg2 of five-band (u, g, r, i, z) imaging data, 186,240 spectra of...

TerraService.NET: An Introduction to Web Services (2002)

Barclay, Tom, Gray, Jim, Strand, Eric, Ekblad, Steve, Richter, Jeffrey

This article explores the design and construction of a geo-spatial Internet web service application from the host web site perspective and from the perspective of an application using the web...

TeraScale SneakerNet: Using Inexpensive Disks for Backup, Archiving, and Data Exchange (2002)

Gray, Jim, Chong, Wyman, Barclay, Tom, Szalay, Alex, VandenBerg, Jan

Large datasets are most economically trnsmitted via parcel post given the current economics of wide-area networking. This article describes how the Sloan Digital Sky Survey ships terabyte scale...

Online Scientific Data Curation, Publication, and Archiving (2002)

Gray, Jim, Szalay, Alexander S., Thakar, Ani R., Stoughton, Christopher, VandenBerg, Jan

Science projects are data publishers. The scale and complexity of current and future science data changes the nature of the publication process. Publication is becoming a major project component. At...

Petabyte Scale Data Mining: Dream or Reality? (2002)

Szalay, Alexander S., Gray, Jim, VandenBerg, Jan

Science is becoming very data intensive1. Today's astronomy datasets with tens of millions of galaxies already present substantial challenges for data mining. In less than 10 years the catalogs are...

Web Services for the Virtual Observatory (2002)

Szalay, Alexander S., Budavari, Tamas, Malika, Tanu, Gray, Jim, Thakara, Ani

Web Services form a new, emerging paradigm to handle distributed access to resources over the Internet. There are platform independent standards (SOAP, WSDL), which make the developers? task...

Spatial Clustering of Galaxies in Large Datasets (2002)

Szalay, Alexander S., Budavari, Tamas, Connolly, Andrew, Gray, Jim, Matsubara, Takahiko, Pope, Adrian, ...

Datasets with tens of millions of galaxies present new challenges for the analysis of spatial clustering. We have built a framework that integrates a database of object catalogs, tools for creating...

The SDSS SkyServer: Public Access to the Sloan Digital Sky Server Data (2002)

Szalay, Alexander S., Gray, Jim, Thakar, Ani R., Kunszt, Peter Z., Malik, Tanu, Raddick, Jordan, ...

The SkyServer provides Internet access to the public Sloan Digi-tal Sky Survey (SDSS) data for both astronomers and for science education. This paper describes the SkyServer goals and archi-tecture....

Data Mining the SDSS SkyServer Database (2002)

Gray, Jim, Szalay, Alex S., Thakar, Ani R., Kunszt, Peter Z., Stoughton, Christopher, Slutz, Don, ...

An earlier paper (Szalay et. al. "Designing and Mining MultiTerabyte Astronomy Archives: The Sloan Digital Sky Survey," ACM SIGMOD 2000) described the Sloan Digital Sky Survey's (SDSS) data...

Sloan Digital Sky Survey: Early data release (2002)

Stoughton, Chirs, Lupton, Rpbert H., Bernardi, Mariangela, Blanton, Michael R., Burles, Scott, Castander, Francsico J., ...

The Sloan Digital Sky Survey (SDSS) is an imaging and spectroscopic survey that will eventually cover approximately one-quarter of the celestial sphere and collect spectra of 10 6 galaxies, 100,000...

Sloan Digital Sky Survey: early data release (2002)

Stoughton, Chris, Lupton, Robert H., Bernardi, Mariangela, Blanton, Michael R., Burles, Scott M., Castander, Francisco J., ...

The Sloan Digital Sky Survey (SDSS) is an imaging and spectroscopic survey that will eventually cover approximately one-quarter of the celestial sphere and collect spectra of ≈ 106 galaxies,...

The World-Wide Telescope, an Archetype for Online Science (2002)

Alex Szalay, Jim Gray

Most scientific data will never be directly examined by scientists; rather it will be put into online databases where it will be analyzed and summarized by computer programs. Scientists increasingly...

Petabyte Scale Data Mining: Dream or Reality? (2002)

Alexander S. Szalay, Jan Vandenberg, Jim Gray, Jan V, Enberg A

Science is becoming very data intensive 1 . Today's astronomy datasets with tens of millions of galaxies already present substantial challenges for data mining. In less than 10 years the...

Data Mining the SDSS SkyServer Database (2002)

Jim Gray, Don Slutz, Alex S. Szalay, Ani R. Thakar, Jan VandenBerg, Jan V, ...

An earlier paper described the Sloan Digital Sky Survey's (SDSS) data management needs [Szalay1] by defining twenty database queries and twelve data visualization tasks that a good data...

Web Services for the Virtual Observatory (2002)

Alexander S. Szalay, Er S. Szalay, Tams Budavári, Tanu Malik, Ani Thakar, ...

Web Services form a new, emerging paradigm to handle distributed access to resources over the Internet. There are platform independent standards (SOAP, WSDL), which make the developers' task...

Jim Gray, Microsoft Research (2002)

Alexander Szalay Johns, Jim Gray, Er S. Szalay, Er S. Szalay, Ani R. Thakar, Ani R. Thakar, ...

Science projects are data publishers. The scale and complexity of current and future science data changes the nature of the publication process. Publication is becoming a major project component. At...

Sloan Digital Sky Survey: early data release (2002)

Stoughton, Chris, Lupton, Robert H., Bernardi, Mariangela, Blanton, Michael R., Burles, Scott M., Castander, Francisco J., ...

The Sloan Digital Sky Survey (SDSS) is an imaging and spectroscopic survey that will eventually cover approximately one-quarter of the celestial sphere and collect spectra of ≈ 106 galaxies,...

The SDSS SkyServer, Public Access to the Sloan Digital Sky Server Data (2001)

Szalay, Alexander, Gray, Jim, Thakar, Ani, Kunszt, Peter Z., Malik, Tanu, Raddick, Jordan, ...

The SkyServer provides Internet access to the public Sloan Digital Sky Survey (SDSS) data for both astronomers and for science education. This paper describes the SkyServer goals and architecture. It...

Functionality, Availability, Agility, Manageability, Scalability -- The new priorities of application design (2001)

Jim Gray Microsoft, Jim Gray

Introduction Traditionally, enterprise systems have worried a great deal about scalability, availability, and manageability. There have been heated debates and competition in the scalability arena,...

The SDSS SkyServer - Public Access . . . (2001)

Er S. Szalay, Er S. Szalay, Jim Gray, Jim Gray, Ani R. Thakar, ...

Sky Survey (SDSS) data for both astronomers and for science education. This paper describes the SkyServer goals and architecture. It also describes our experience operating the SkyServer on the...

Microsoft TerraServer: A Spatial Data Warehouse (2000)

Tom Barclay, Jim Gray, Don Slutz, Tom Barclay, Jim Gray, Don Slutz

Microsoft ® TerraServer stores aerial, satellite, and topographic images of the earth in a SQL database available via the Internet. It is the world’s largest online atlas, combining five terabytes...

Nsort: A parallel sorting program for NUMA and SMP machines (2000)

Chris Nyberg, Charles Koester, Ordinal Technology Corp, Ordinal Technology Corp, Jim Gray, Microsoft Corporation

is a high-performance sort program for SGI IRIX, Sun Solaris and HP-UX servers. Nsort allows its users to realize the full processing potential of their multi-processor, multi-disk Unix

Fcast Multicast File Distribution (2000)

Jim Gemmell, Eve Schooler, Jim Gray

Reliable data multicast is problematic. ACK/NACK schemes do not scale to large audiences, and simple data replication wastes network bandwidth. Fcast, "file multicasting", combines...

Rules of Thumb in Data Engineering (2000)

Jim Gray Prashant, Jim Gray, Prashant Shenoy

This paper reexamines the rules of thumb for the design of data storage systems. Briefly, it looks at storage, processing, and networking costs, ratios, and trends with a particular focus on...

Designing and Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey (2000)

Peter Kunszt, Alexander S. Szalay, Alexander S. Szalay, Peter Z. Kunszt, Ani Thakar, Ani Thakar, ...

The next-generation astronomy digital archives will cover most of the sky at fine resolution in many wavelengths, from X-rays, through ultraviolet, optical, and infrared. The archives will be stored...

Windows 2000 Disk IO Performance (2000)

Leonard Chung, Jim Gray, Bruce Worthington, Robert Horst

This paper is an empirical study of the random and sequential I/O performance of Windows 2000 using the NT File System. It continues the work done by Riedel, et. al. in their 1997 paper exploring...

Designing and Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey (2000)

Alexander Szalay Szalay, Jim Gray, Alexander S, Peter Kunszt, Peter Kunszt, Ani Thakar, ...

The next-generation astronomy digital archives will cover most of the universe at fine resolution in many wavelengths, from X-rays to ultraviolet, optical, and infrared. The archives will be stored...

Tom Barclay (2000)

Jim Gray Don, Jim Gray, Don Slutz, Tom Barclay, Tom Barclay

Microsoft TerraServer stores aerial, satellite, and topographic images of the earth in a SQL database available via the Internet. It is the world's largest online atlas, combining eight...

December 1999 (2000)

Jim Gray, Prashant Shenoy

This paper reexamines the rules of thumb for the design of data storage systems. Briefly, it looks at storage, processing, and networking costs, ratios, and trends with a particular focus on...

Gordon Bell and Jim Gray 1 October 2000 (2000)

Gordon Bell, Jim Gray, Gordon Bell, Jim Gray

this article appears at http://research.microsoft.com/pubs/ Microsoft Research {GBell, Gray} @ Microsoft.com Digital immortality, like ordinary immortality, is a continuum from enduring fame at one...

Computer Technology Forecast for Virtual Observatories Extended Abstract of talk at Astronomy Virtual Observatories of the Future at (2000)

Jim Gray, Jim Gray

I was asked, as a computer scientist, to give a sense of what computer technologies the VOF can design for over the next decade. In designing the VOF we need to think in terms of how much storage,...

The Sloan Digital Sky Survey and its Archive (1999)

Szalay, Alexander S., Kunszt, Peter, Thakar, Anirudha, Gray, Jim, Slutz, Don

The next-generation astronomy archives will cover most of the universe at fine resolution in many wavelengths. One of the first of these projects, the Sloan Digital Sky Survey (SDSS) will create a...

Scalability Terminology: Farms, Clones, Partitions, Packs, RACS and RAPS (1999)

Devlin, Bill, Gray, Jim, Laing, Bill, Spix, George

Defines a vocabulary for scaleable systems: Geoplexes, Farms, Clones, RACS, RAPS, clones, partitions, and packs and dicusses the design tradeoffs of using clones, partitons, and packs.

What Next? A Dozen Information-Technology Research Goals (1999)

Gray, Jim

Charles Babbage's vision of computing has largely been realized. We are on the verge of realizing Vannevar Bush's Memex. But, we are some distance from passing the Turing Test. These three visions...

Designing and Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey (1999)

Szalay, Alexander S., Kunszt, Peter, Thakar, Ani, Gray, Jim

The next-generation astronomy digital archives will cover most of the universe at fine resolution in many wave-lengths, from X-rays to ultraviolet, optical, and infrared. The archives will be stored...

What Next? (1999)

Jim Gray, Jim Gray

: Charles Babbage's vision of computing has largely been realized. We are on the verge of realizing Vannevar Bush's Memex. But, we are some distance from passing the Turing Test. These...

Fcast Scalable Multicast File Distribution: Caching And Parameter Optimizations (1999)

Jim Gemmell, Eve Schooler, Jim Gray

Reliable data multicast is problematic. ACK/NACK schemes do not scale to large audiences, and simple data replication wastes network bandwidth. Fcast, "file multicasting", combines...

Microsoft TerraServer: A Spatial Data Warehouse (1999)

Tom Barclay, Jim Gray, Don Slutz

Microsoft TerraServer stores aerial, satellite, and topographic images of the earth in a SQL database available via the Internet. It is the world's largest online atlas, combining five terabytes...

What Next? A Dozen Information-Technology Research Goals (1999)

Jim Gray, Jim Gray

: Charles Babbage's vision of computing has largely been realized. We are on the verge of realizing Vannevar Bush's Memex. But, we are some distance from passing the Turing Test. These...

Fcast Scalable Multicast File Distribution: Caching And Parameter Optimizations (1999)

Jim Gemmell, Eve Schooler, Jim Gray

Reliable data multicast is problematic. ACK/NACK schemes do not scale to large audiences, and simple data replication wastes network bandwidth. Fcast, "file multicasting", combines...

Rules of Thumb in Data Engineering (1999)

Jim Gray, P. Shenoy, Jim Gray, Prashant Shenoy, Prashant Shenoy

This paper reexamines the rules of thumb for the design of data storage systems. Briefly, it looks at storage, processing, and networking costs, ratios, and trends with a particular focus on...

The Asilomar Report on Database Research (1998)

Bernstein, Phil, Brodie, Michael, Ceri, Stefano, DeWitt, David, Franklin, Mike, Garcia-Molina, Hector, ...

The database research community is rightly proud of success in basic research, and its remarkable record of technology transfer. Now the field needs to radically broaden its research focus to attack...

Microsoft TerraServer (1998)

Barclay, Tom, Eberl, Robert, Gray, Jim, Nordlinger, John, Raghavendran, Guru, Slutz, Don, ...

The Microsoft TerraServer stores aerial and satellite images of the earth in a SQL Server Database served to the public via the Internet. It is the world's largest atlas, combining five terabytes of...

Locally Served Network Computers (1998)

Gray, Jim

NCs are the natural evolution of PCs, ubiquitous computers everywhere. The current vision of NCs requires two improbable developments: (1) inexpensive high-bandwidth WAN links to the Internet, and...

The Revolution Yet to Happen (1998)

Bell, C. Gordon, Gray, Jim

All information about physical objects including humans, buildings, processes, and organizations will be online. This trend is both desirable and inevitable. Cyberspace will provide the basis for...

Performance / Price Sort (1998)

Gray, Jim, Coates, Joshua, Nyberg, Chris

NTsort is an external sort on WindowsNT 5.0. It has minimal functionality but excellent price performance. In particular, running on mail-order hardware it can sort 1.5 GB for a penny. For...

The Five-Minute Rule Ten Years Later, and Other Computer Storage Rules of Thumb (1998)

Gray, Jim, Graefe, Goetz

Simple economic and performance arguments suggest appropriate lifetimes for main memory pages and suggest optimal page sizes. The fundamental tradeoffs are the prices and bandwidths of RAMs and...

A Performance Study of Sequential I/O on Windows NT 4 (1998)

Erik Riedel, Catharine Van Ingen, Jim Gray, Erik Riedel, Catharine Van Ingen, Jim Gray

Large-scale database, data mining, and multimedia applications require large, sequential transfers and have bandwidth as a key requirement. This paper investigates the performance of reading and...

Icos Corporation (1998)

Jim Gray, Jim Gray

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or...

Performance / Price Sort and PennySort (1998)

Jim Gray, Joshua Coates, Chris Nyberg

: NTsort is an external sort on WindowsNT 5.0. It has minimal functionality but excellent price performance. In particular, running on mail-order hardware it can sort 1.5 GB for a penny. NT5.0 is not...

A Performance Study of Sequential I/O on Windows NT™ 4 (1998)

Erik Riedel, Catharine Van Ingen, Jim Gray

Large-scale database, data mining, and multimedia applications require large, sequential transfers and have bandwidth as a key requirement. This paper investigates the performance of reading and...

1 NTCluster DataPump, Rivers (1998)

Ntcluster Datapump Rivers, Joshua Coates, Joe Barrera, Ro Forin, Jim Gray

We report on the design, implementation, and performance of three distributed systems running on a cluster of WindowsNT nodes: DataPump, RiverSystem and NTClusterSort. The DataPump is a simple data...

The Revolution Yet to Happen (1997)

Gordon Bell, Gordon Bell, Jim Gray, Jim Gray

By 2047, almost all information will be in cyberspace (1984) -- including all knowledge and creative works. All information about physical objects including humans, buildings, processes, and...

Nsort: a Parallel Sorting Program for NUMA and SMP Machines - Version 2.1 (1997)

Chris Nyberg, Charles Koester, Ordinal Technology Corp, Ordinal Technology Corp, Jim Gray, Microsoft Corporation

This paper describes Nsort's background, presents its performance sorting a terabyte of data, and compares its performance on an industry-standard benchmark. Nsort performance is presented for...

A Transactional Approach to Redundant Disk Array Implementation (1997)

Martin Francis, Jim Gray, Daniel P. Siewiorek, Charles Richard Courtright

are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of sponsoring companies or the government of the United States of America.

A Transactional Approach to Redundant Disk Array Implementation (1997)

Martin Francis, Jim Gray, Daniel P. Siewiorek, Charles Richard Courtright

are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of sponsoring companies or the government of the United States of America.

The dangers of replication and a solution (1996)

Jim Gray, Pat Helland

Abstract: Update anywhere-anytime-anyway transactional replication has unstable behavior as the workload scales up: a ten-fold increase in nodes and traflc gives a thousand fold increase in deadlocks...

Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals (1996)

Jim Gray, Adam Bosworth, Andrew Layman, Don Reichart, Hamid Pirahesh

Abstract. Data analysis applications typically aggregate data across many dimensions looking for anomalies or unusual patterns. The SQL aggregate functions and the GROUP BY operator produce...

Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals (1996)

Jim Gray, Adam Bosworth, Andrew Layman, Don Reichart, Hamid Pirahesh

Abstract. Data analysis applications typically aggregate data across many dimensions looking for anomalies or unusual patterns. The SQL aggregate functions and the GROUP BY operator produce...

The dangers of replication and a solution (1996)

Jim Gray, Pat Helland

Abstract: Update anywhere-anytime-anyway transactional replication has unstable behavior as the workload scales up: a ten-fold increase in nodes and traflc gives a thousand fold increase in deadlocks...

The dangers of replication and a solution (1996)

Jim Gray, Pat Helland, Dennis Shasha

Abstract: Update anywhere-anytime-anyway transactional replication has unstable behavior as the workload scales up: a ten-fold increase in nodes and traffic gives a thousand fold increase in...

The dangers of replication and a solution (1996)

Jim Gray, Pat Helland, Dennis Shasha

Abstract: Update anywhere-anytime-anyway transactional replication has unstable behavior as the workload scales up: a ten-fold increase in nodes and traffic gives a thousand fold increase in...

Data Management: Past, Present, and Future (1996)

Jim Gray, Jim Gray

: Soon most information will be available at your fingertips, anytime, anywhere. Rapid advances in storage, communications, and processing allow us move all information into Cyberspace. Software to...

The Dangers of Replication and a Solution (1996)

Jim Gray Pat, Dennis Shasha (nyu, Jim Gray, Jim Gray, Pat Helland, Pat Helland, ...

ing with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Publications...

The Dangers of Replication and a Solution (1996)

Jim Gray, Pat Helland, Pat O'Neil, Dennis Shasha

: Update anywhere-anytime-anyway transactional replication has unstable behavior as the workload scales up: a ten-fold increase in nodes and traffic gives a thousand fold increase in deadlocks or...

Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals (1996)

Jim Gray, Adam Bosworth, Andrew Layman, Don Reichart, Hamid Pirahesh

Abstract. Data analysis applications typically aggregate data across many dimensions looking for anomalies or unusual patterns. The SQL aggregate functions and the GROUP BY operator produce...

A critique of ANSI SQL isolation levels (1995)

Jim Gray, U. C. Berkeley

Reads, and Phantoms. This paper shows that these phenomena and the ANSI SQL definitions fail to properly characterize several popular isolation levels, including the standard Ioeking implementations...

Advantages of COMA (1995)

John G. Robinson, David C. Baxter, Jim Gray

This reports summarizes the development of these machines and presents case studies that illustrate the benefits of implementations based on a Cache-Only Memory Architecture (COMA). The compelling...

Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals (1995)

Jim Gray, Adam Bosworth, Andrew Layman, Hamid Pirahesh, Jim Gray Microsoft, Adam Bosworth Microsoft, ...

: Data analysis applications typically aggregate data across many dimensions looking for unusual patterns. The SQL aggregate functions and the GROUP BY operator produce zero-dimensional or...

Queues Are Databases (1995)

Jim Gray, Jim Gray

: Message-oriented-middleware (MOM) has become an small industry. MOM offers queued transaction processing as an advance over pure client-server transaction processing. This note makes four points:...

A Critique of ANSI SQL Isolation Levels (1995)

Hal Berenson, Phil Bernstein, Jim Gray, Jim Melton, Elizabeth O’Neil, Patrick O'Neil, ...

ANSI SQL-92 [MS, ANSI] defines Isolation Levels in terms of phenomena: Dirty Reads, Non-Repeatable Reads, and Phantoms. This paper shows that these phenomena and the ANSI SQL definitions fail to...

A critique of ANSI SQL isolation levels (1995)

Jim Gray, U. C. Berkeley

Reads, and Phantoms. This paper shows that these phenomena and the ANSI SQL definitions fail to properly characterize several popular isolation levels, including the standard Ioeking implementations...

Alphasort: a RISC machine sort (1994)

Chris Nyberg, Tom Barclay, Ztrka Cvetanovic, Jim Gray, Dave Lomet

Abstract A new sort algorithm, called AlphaSort, demonstrates that commodity processors and disks can handle commercial batch workloads. Using Alpha AXP processors, commodi ~ memory, and arrays of...

Quickly Generating Billion-Record Synthetic Databases (1994)

Jim Gray, Prakash Sundaresan, Susanne Englert, Peter J. Weinberger

: Evaluating database system performance often requires generating synthetic databases -- ones having certain statistical properties but filled with dummy information. When evaluating different...

Desktop Batch Processing (1994)

Jim Gray And, Jim Gray, Chris Nyberg

Today, online transaction processing applications can downsize from mainframes to microprocessors. Commodity database systems, operating systems, and hardware came of age in 1993., -- they surpassed...

Jim Gray, Prakash Sundaresan (1994)

Digital San Francisco, Jim Gray, Prakash Sundaresan, Susanne Englert, Peter J. Weinberger

Evaluating database system performance often requires generating synthetic databases -- ones having certain statistical properties but filled with dummy information. When evaluating different...

QVLDB AlphaSort: A Cache-Sensitive Parallel External Sort (1994)

Chris Nyberg, Tom Barclay, Zarka Cvetanovic, Jim Gray, Dave Lomet

Abstract. A new sort algorithm, called AlphaSort, demonstrates that commodity processors and disks can handle commercial batch workloads. Using commod-ity processors, memory, and arrays of SCSI...

Database and Transaction Processing Performance Handbook. http://www. benchmarkresources.com/handbook (1993)

Jim Gray

Digital Equipment Corp This handbook is a compendium of the popular performance and price/performance metrics for database systems and transaction processing systems. Each benchmark tries to answer...

Steve Kiss (1993)

Charles Levine, Jim Gray, Steve Kiss, Walt Kohler, Charles Levine, Jim Gray, ...

Digital Equipment Corp.4 TPC benchmarks provide the de facto industry standard for measuring transaction processing performance. This paper argues that TPC-A and TPC-B have served their purpose and...

Parallel database systems: the future of high performance database systems (1992)

David J. Dewitt, Jim Gray

Abstract: Parallel database machine architectures have evolved from the use of exotic hardware to a software parallel dataflow architecture based on conventional shared-nothing hardware. These new...

Parallel database systems: the future of high performance database systems (1992)

David J. Dewitt, Jim Gray

Abstract: Parallel database machine architectures have evolved from the use of exotic hardware to a software parallel dataflow architecture based on conventional shared-nothing hardware. These new...

Parallel Database Systems: The Future of Database Processing or a Passing Fad? (1991)

David J. DeWitt, Jim Gray

Parallel database machine architectures have evolved from the use of exotic hardware to a software parallel dataflow architecture based on conventional shared-nothing hardware. These new designs...

SQL Access and IBM DRDA (1991)

Scott Newman, Jim Gray

Syntax Notation 1 (ASN.1) to define the messages that are used for communication between client and server. These definitions are independent of the transfer syntax (encoding) that is actually used...

A Census of Tandem System Availability (1990)

Jim Gray, Jim Gray, Jim Gray

Abstract: Tandem computer systems are designed to be single-fault tolerant. This paper takes a census of customer system outages reported to Tandem. The census shows a clear improvement in the...

A Census of Tandem System Availability (1990)

Jim Gray, Jim Gray, Jim Gray

Abstract: Tandem computer systems are designed to be single-fault tolerant. This paper takes a census of customer system outages reported to Tandem. The census shows a clear improvement in the...

1'TANDEM Fault Tolerance in Tandem Computer Systems (1990)

Joel Bartlett, Wendy Bartlett, Richard Carr, Dave Garcia, Jim Gray, Robert Horst, ...

Tandem produces high-availability, general-purpose computers that provide fault tolerance through failfast hardware modules and fault-tolerant software2. This chapter presents a historical...

Parity Striping of Disc Arrays: Low-cost Reliable Storage with Acceptable Throughput (1990)

Jim Gray, Bob Horst, Mark Walker

An analysis of mirrored discs and of RAIDS shows that mirrors have considerably better throughput, measured as requests/second on random requests of arbitrary size (up to IMB). Mirrors have...

A Benchmark of NonStop SQL Release 2 Demonstrating Near-Linear Speedup and Scaleup on Large Databases Susanne Englert Jim Gray Terrye Kocher Praful Shah (1989)

Tandem Part No, Susanne Englert, Jim Gray, Terrye Kocher, Praful Shah, Susanne Englert, ...

NonStop SQL is an implementation of ANSI/ISO SQL on Tandem Computer systems. In its second release, NonStop SQL transparently and automatically implements parallelism within an SQL statement. This...

A benchmark of NonStop SQL release 2 demonstrating near-linear speedup and scaleup on large databases (1989)

Susanne Englert, Susanne Englert, Susanne Englert, Jim Gray, Jim Gray, Jim Gray, ...

its second release, NonStop SQL transparently and automatically implements parallelism within an SQL statement. This parallelism allows query execution speed to increase almost linearly as processors...

A Comparison Of The Byzantine Agreement Problem And The Transaction Commit Problem (1988)

Jim Gray

Transaction commit and Byzantine agreement solve the problem of multiple processes reaching agreement in the presence of process and message failures. This paper summarizes the computation and fault...

Tandem TR 88.5 (1988)

Disk Shadowing Dina, Dina Bitton, Jim Gray

Disk shadowing is a technique for maintaining a set of two or more identical disk images on separate disk devices. Its primary purpose is to enhance reliability and availability of secondary storage...

The Cost Of Messages (1988)

Jim Gray March, Jim Gray, A Cost Model

Distributed systems can be modeled as processes communicating via messages. This model abstracts the three degrees of distribution: shared memory, local network, and wide area network. Although these...

An Execution Model 2 (1988)

Jim Gray

Abstract: Distributed systems can be modeled as processes communicating via messages. This model abstracts the three degrees of distribution: shared memory, local network, and wide area network....

Disk Shadowing (1988)

Dina Bitton, Jim Gray

cupertino California Disk shadowing is a technique for maintaining a set of two or more identical disk images on separate disk devices. Its primary purpose is to enhance reliability and availability...

The Case Against Transparent Access to Geographically Distributed Data Jim Gray Tandem Computers Inc. 19333 Vallco Parkway Cupertino Ca (1987)

Tandem Part Number, Jim Gray, Cupertino Ca, Jim Gray

Distributed database software offers transparent access to data -- no matter where in the network the data is located, an authorized program can access the data as though it is local. This article...

A comparison of the byzantine agreement problem and the transaction commit problem (1987)

Jim Gray, Jim Gray

Abstract: Transaction commit and Byzantine agreement solve the problem of multiple processes reaching agreement in the presence of process and message failures. This paper summarizes the computation...

Tandem TR 85.5 (1986)

Distributed Computer Systems, Jim Gray, Mark Anderton

Distributed computer applications built from off-the-shelf hardware and software are increasingly common. This paper examines four such distributed systems with contrasting degrees of decentralized...

Fault Tolerance In Tandem Computer Systems (1986)

Joel Bartlett, Jim Gray, Bob Horst

Tandem builds single-fault-tolerant computer systems. At the hardware level, the system is designed as a loosely coupled multi-processor with fail-fast modules connected via dual paths. It is...

Tandem TR 85.7 WHY DO COMPUTERS STOP AND WHAT CAN BE DONE ABOUT IT? (1985)

Jim Gray

An analysis of the failure statistics of a commercially available fault-tolerant system shows that administration and software are the major contributors to failure. Various approaches to software...

TABLE OF CONTENTS (1985)

Jim Gray

An analysis of the failure statistics of a commercially available fault-tolerant system shows that administration and software are the major contributors to failure. Various approaches to software...

An Approach To Decentralized Computer Systems (1985)

Jim Gray

The technology for distributed computing is available. However, decentralized systems still pose design and management problems. Decentralized systems will always require more careful design,...

Tandem TR 85.1 (1985)

One Thousand Transactions, Jim Gray, Bob Good, Dieter Gawlick, Pete Homan, Harald Sammer, ...

Several companies intend to provide general-purpose transaction processing systems capable of one thousand transactions per second. This paper surveys the need for such systems and contrasts the...

Tandemcomputers (1985)

Census Of Tandem, Jim Gray, Jim Gray, Jim Gray

Tandem computer systems are designed to be single-fault tolerant. This paper takes a census of customer system outages reported to Tandem. The census shows a clear improvement in the reliability of...

Why Do Computers Stop And What Can Be Done About It? (1985)

Jim Gray

An analysis of the failure statistics of a commercially available fault-tolerant system shows that administration and software are the major contributors to failure. Various approaches to software...

Tandem TR 85.4 AN APPROACH TO DECENTRALIZED COMPUTER SYSTEMS (1985)

Jim Gray, Jim Gray, Jim Gray, Jim Gray

The technology for distributed computing is available. However, decentralized systems still pose design and management problems. Decentralized systems will always require more careful design,...

One Thousand Transactions per Second (1985)

Jim Gray, Jim Gray, Jim Gray, Bob Good, Bob Good, Bob Good, ...

Several companies intend to provide general-purpose processing systems capable of one thousand transactions This paper surveys the need for such systems and approaches being taken by three different...

UPDATE IN PLACE: A poison apple?......................................................................................7 (1981)

Jim Gray

ABSTRACT: A transaction is a transformation of state which has the properties of atomicity (all or nothing), durability (effects survive failures) and consistency (a correct transformation). The...

Tandem TR 81.1 (1981)

An Approach To, Jim Gray

Soon every desk will have a computer on it. Software to do mundane things such as payroll, mail, and text processing exists and as a by-product produces vast quantities of on-line in formation. Many...

The Transaction Concept: Virtues and Limitations (1981)

Jim Gray

ABSTRACT: A transaction is a transformation of state which has the properties of atomicity (all or nothing), durability (effects survive failures) and consistency (a correct transformation). The...

The Recovery Manager of the System R Database Manager (1981)

Jim Gray, Paul Mcjones, Mike Blasgen, Bruce Lindsay, Raymond Lorie, Tom Price

The recovery subsystem of an experimental data management system is described and evaluated. The transactmn concept allows application programs to commit, abort, or partially undo their effects. The...

A transaction model (1980)

Jim Gray, Jim Gray, Jim Gray

Th~s report has been subm~tted for ~ubllcatlon outs~de of IBM and w~ll probably be copyrighted ~f accepted for publ~cat~on It has ken Issued as a Research Report for early dlssem~nat~on of ~ts...

The world-wide telescope (0000)

Gray , Jim

The article focuses on a new online way to see the global structure of the universe, the World-Wide Telescope, which promises to be not only a wonderful virtual telescope but an archetype for the...

The world-wide telescope

Gray , Jim

The article focuses on a new online way to see the global structure of the universe, the World-Wide Telescope, which promises to be not only a wonderful virtual telescope but an archetype for the...

Amino Acid Substitution within the VP7 Protein of G2 Rotavirus Strains Associated with Failure To Serotype

Gómara, Miren Iturriza, Cubitt, David, Desselberger, Ulrich, Gray, Jim

Rotavirus strains collected in the United Kingdom during the 1995-1996 season and genotyped as G2 by reverse transcription-PCR failed to serotype in enzyme-linked immunosorbent assays using three...

Reassortment In Vivo: Driving Force for Diversity of Human Rotavirus Strains Isolated in the United Kingdom between 1995 and 1999

Iturriza-Gómara, Miren, Isherwood, Beverley, Desselberger, Ulrich, Gray, Jim

The G and P genotypes of 3,601 rotavirus strains collected in the United Kingdom between 1995 and 1999 were determined (M. Iturriza-Gómara et al., J. Clin. Microbiol. 38:4394–4401, 2000). In 95.4%...

Molecular Characterization of VP6 Genes of Human Rotavirus Isolates: Correlation of Genogroups with Subgroups and Evidence of Independent Segregation

Iturriza Gómara, Miren, Wong, Cecilia, Blome, Sandra, Desselberger, Ulrich, Gray, Jim

A reverse transcription-PCR (RT-PCR) was established to amplify a 379-bp cDNA fragment (nucleotides 747 to 1126, coding for amino acids 241 to 367) of the VP6 gene of group A rotaviruses associated...

Evidence for Genetic Linkage between the Gene Segments Encoding NSP4 and VP6 Proteins in Common and Reassortant Human Rotavirus Strains

Iturriza-Gòmara, Miren, Anderton, Emma, Kang, Gagandeep, Gallimore, Chris, Phillips, Wendy, Desselberger, Ulrich, ...

NSP4-encoding genes of 78 human rotavirus strains of common or reassortant genotypes were characterized by reverse transcription-PCR followed by sequencing and phylogenetic analysis. It was found...

Characterization of G10P[11] Rotaviruses Causing Acute Gastroenteritis in Neonates and Infants in Vellore, India

Gómara, Miren Iturriza, Kang, Gagandeep, Mammen, Ajit, Jana, Atanu Kumar, Abraham, Mary, Desselberger, Ulrich, ...

Rotavirus G10P[11] strains, which are commonly found in cattle, have frequently been associated with asymptomatic neonatal infections in India. We report the finding of G10P[11] strains associated...

Amino Acid Substitution within the VP7 Protein of G2 Rotavirus Strains Associated with Failure To Serotype

Gómara, Miren Iturriza, Cubitt, David, Desselberger, Ulrich, Gray, Jim

Rotavirus strains collected in the United Kingdom during the 1995-1996 season and genotyped as G2 by reverse transcription-PCR failed to serotype in enzyme-linked immunosorbent assays using three...

Reassortment In Vivo: Driving Force for Diversity of Human Rotavirus Strains Isolated in the United Kingdom between 1995 and 1999

Iturriza-Gómara, Miren, Isherwood, Beverley, Desselberger, Ulrich, Gray, Jim

The G and P genotypes of 3,601 rotavirus strains collected in the United Kingdom between 1995 and 1999 were determined (M. Iturriza-Gómara et al., J. Clin. Microbiol. 38:4394–4401, 2000). In 95.4%...

Molecular Characterization of VP6 Genes of Human Rotavirus Isolates: Correlation of Genogroups with Subgroups and Evidence of Independent Segregation

Iturriza Gómara, Miren, Wong, Cecilia, Blome, Sandra, Desselberger, Ulrich, Gray, Jim

A reverse transcription-PCR (RT-PCR) was established to amplify a 379-bp cDNA fragment (nucleotides 747 to 1126, coding for amino acids 241 to 367) of the VP6 gene of group A rotaviruses associated...

Evidence for Genetic Linkage between the Gene Segments Encoding NSP4 and VP6 Proteins in Common and Reassortant Human Rotavirus Strains

Iturriza-Gòmara, Miren, Anderton, Emma, Kang, Gagandeep, Gallimore, Chris, Phillips, Wendy, Desselberger, Ulrich, ...

NSP4-encoding genes of 78 human rotavirus strains of common or reassortant genotypes were characterized by reverse transcription-PCR followed by sequencing and phylogenetic analysis. It was found...

Characterization of G10P[11] Rotaviruses Causing Acute Gastroenteritis in Neonates and Infants in Vellore, India

Gómara, Miren Iturriza, Kang, Gagandeep, Mammen, Ajit, Jana, Atanu Kumar, Abraham, Mary, Desselberger, Ulrich, ...

Rotavirus G10P[11] strains, which are commonly found in cattle, have frequently been associated with asymptomatic neonatal infections in India. We report the finding of G10P[11] strains associated...

AlphaSort: A Cache-Sensitive Parallel External Sort

Chris Nyberg Tom, Tom Barclay, Zarka Cvetanovic, Jim Gray, Dave Lomet

A new sort algorithm, called AlphaSort, demonstrates that commodity processors and disks can handle commercial batch workloads. Using commodity processors, memory, and arrays of SCSI disks, AlphaSort...