Efficient Path Query Processing on Encoded XML (2008)
As XML has become a popular format for information exchange, the efficient processing of broadcast XML data on a constrained device (for example, a cell phone or a PDA) becomes a critical task. In...
Efficiently Supporting Structure Queries on Phylogenetic Trees (2008)
Susan B. Davidson, Junhyong Kim, Yifeng Zheng
With phylogenetics becoming increasingly important in biomedical research, the number of phylogenetic studies is increasing rapidly and huge mount of phylogenetic data has been generated and stored...
Susan B. Davidson, Anthony S. Kosky
WOL is a Horn-clause language for specifying transformations involving complex types and recursive data-structures. Its declarative syntax makes programs easy to modify in response to schema...
ABSTRACT XKvalidator: A Constraint Validator For XML (2008)
Yi Chen, Susan B. Davidson, Yifeng Zheng
The role of XML in data exchange is evolving from one of merely conveying the structure of data to one that also conveys its semantics. In particular, several proposals for key and foreign key...
Yifeng Zheng, Stephen Fisher, Shirley Cohen, Sheng Guo, Junhyong Kim, Susan B. Davidson
Evolutionary and systems biology increasingly rely on the construction of large phylogenetic trees which represent the relationships between species of interest. As the number and size of such trees...
Provenance and scientific workflows: challenges and opportunities (2008)
Susan B. Davidson, Juliana Freire
Provenance in the context of workflows, both for the data they derive and for their specification, is an essential component to allow for result reproducibility, sharing, and knowledge re-use in the...
March Vol No, Serge Abiteboul, Sophie Cluet, Tova Milo, Pini Mogilevsky, Jerome Simeon, ...
A broad spectrum of data is available on the Web in distinct heterogeneous sources, stored under different formats. As the number of systems that utilize this data grows, the importance of data...
Addressing the Provenance Challenge using ZOOM (2007)
Cohen-Boulakia, Sarah, Biton, Olivier, Cohen, Shirley, Davidson, Susan B
ZOOM*UserViews presents a model of provenance for scientific workflows that is simple, generic, and yet sufficiently expressive to answer questions of data and step provenance that have been...
BioGuideSRS: Querying Multiple Sources with a user-centric perspective (2007)
Cohen-Boulakia, Sarah, Biton, Olivier, Davidson, Susan B, Froidevaux, Christine
Summary: Biologists are frequently faced with the problem of integrating information from multiple heterogeneous sources with their own experimental data. Given the large number of public sources, it...
Zoom*UserViews: Querying relevant provenance in workflow systems (demo (2007)
Olivier Biton, Susan B. Davidson
In this demonstration, we present the ZOOM*UserView system, and focus on the module which generates a “user view” based on what tasks the user perceives to be relevant in the workflow...
Zheng, Yifeng, Fisher, Stephen, Cohen, Shirley, Guo, Sheng, Kim, Junhyong, Davidson, Susan B
Evolutionary and systems biology increasingly rely on the construction of large phylogenetic trees which represent the relationships between species of interest. As the number and size of such trees...
PATAXÃ: A Framework to Allow Updates Through XML Views (2006)
Braganholo, Vanessa P., Davidson, Susan B., Heuser, Carlos A.
XML has become an important medium for data exchange, and is frequently used as an interface to (i.e., a view of) a relational database. Although a lot of work has been done on querying relational...
Path-based systems to guide scientists in the maze of biological data sources (2006)
Cohen-Boulakia, Sarah, Davidson, Susan B., Froidevaux, Christine, Lacroix, Zoe, Vidal, Maria-Esther
Fueled by novel technologies capable of producing massive amounts of data for a single experiment, scientists are faced with an explosion of information which must be rapidly analyzed and combined...
Towards a Model of Provenance and User Views in Scientific Workflows (2006)
Cohen, Shirley, Cohen-Boulakia, Sarah, Davidson, Susan B
Scientific experiments are becoming increasingly large and complex, with a commensurate increase in the amount and complexity of data generated. Data, both intermediate and final results, is derived...
A Model for User-Oriented Data Provenance in Pipelined Scientific Workflows (2006)
Bowers, Shawn, McPhillips, Timothy, Ludascher, Bertram, Cohen, Shirley, Davidson, Susan B.
Integrated provenance support promises to be a chief advantage of scientific workflow systems over script-based alternatives. While it is often recognized that information gathered during scientific...
Designing and Evaluating an XPath Dialect for Linguistic Queries (2006)
Bird, Steven, Chen, Yi, Davidson, Susan B., Lee, Haejoong, Zheng, Yifeng
Linguistic research and natural language processing employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for...
An Efficient XPath Query Processor for XML Streams (2006)
Chen, Yi, Davidson, Susan B, Zheng, Yifeng
Streaming XPath evaluation algorithms must record a potentially exponential number of pattern matches when both predicates and descendant axes are present in queries, and the XML data is recursive....
A model for user-oriented data provenance in pipelined scientific workflows (2006)
Shawn Bowers, Timothy Mcphillips, Bertram Ludäscher, Shirley Cohen, Susan B. Davidson
Abstract. Integrated provenance support promises to be a chief advantage of scientific workflow systems over script-based alternatives. While it is often recognized that information gathered during...
A model for user-oriented data provenance in pipelined scientific workflows (2006)
Shawn Bowers, Timothy Mcphillips, Bertram Ludäscher, Shirley Cohen, Susan B. Davidson
Abstract. Integrated provenance support promises to be a chief advantage of scientific workflow systems over script-based alternatives. While it is often recognized that information gathered during...
A User-centric Framework for Accessing Biological Sources and Tools (2005)
Cohen-Boulakia, Sarah, Davidson, Susan B., Froidevaux, Christine
Biologists face two problems in interpreting their experiments: the integration of their data with information from multiple heterogeneous sources and data analysis with bioinformatics tools. It is...
ViteX : a Streaming XPath Processing System (2005)
Chen, Yi, Davidson, Susan B., Zheng, Yifeng
We present ViteX, an XPath processing system on XML streams with polynomial time complexity. ViteX uses a polynomial-space data structure to encode an exponential number of pattern matches (in the...
Extending XPath to Support Linguistic Queries (2005)
Bird, Steven, Chen, Yi, Davidson, Susan B., Lee, Haejoong, Zheng, Yifeng
Linguistic research and language technology development employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for...
A User-centric Framework for Accessing Biological Sources and Tools (2005)
Susan B. Davidson, Sarah Cohen-boulakia, Sarah Cohen-boulakia, Susan Davidson, Christine Froidevaux, Christine Froidevaux
Abstract. Biologists face two problems in interpreting their experiments: the integration of their data with information from multiple heterogeneous sources and data analysis with bioinformatics...
ViteX: A Streaming XPath Processing System (2005)
Yi Chen, Susan B. Davidson, Yifeng Zheng
We present ViteX, an XPath processing system on XML streams with polynomial time complexity. ViteX uses a polynomial-space data structure to encode an exponential number of pattern matches (in the...
Extending XPath to support linguistic queries (2005)
Steven Bird, Yi Chen, Susan B. Davidson, Haejoong Lee, Yifeng Zheng
Linguistic research and language technology development employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for...
EXPedite: A System for Encoded XML Processing (2004)
Chen, Yi, Mihaila, George A, Davidson, Susan B, Padmanabhan, Sriram
As XML becomes an increasingly popular format for information exchange, the efficient processing of broadcast XML data on a constrained device (for example, a cell phone or a PDA) becomes a critical...
BLAS : An Efficient XPath Processing System (2004)
Chen, Yi, Davidson, Susan B, Zheng, Yifeng
We present BLAS, a Bi-LAbeling based System, for efficiently processing complex XPath queries over XML data. BLAS uses P-labeling to process queries involving consecutive child axes, and D-labeling...
Propagating XML Constraints to Relations (2003)
Davidson, Susan B., Fan, Wenfei, Hara, Carmem, Qin, Jing
We present a technique for refining the design of relational storage for XML data based on XML key propagation. Three algorithms are presented: one checks whether a given functional dependency is...
Using XQuery to build updatable XML views over relational databases (2003)
Braganholo, Vanessa P, Davidson, Susan B., Heuser, Carlos A
XML has become an important medium for data exchange, and is frequently used as an interface to - i.e. a view of - a relational database. Although much attention has been paid to the problem of...
Reasoning about the updatability of XML views over relational databases (2003)
Braganholo, Vanessa P, Davidson, Susan B., Heuser, Carlos A
XML has become an important medium for data exchange, and is also used as an interface to - i.e. a view of - a relational database. While previous work has considered XML views for the purpose of...
UXQuery: building updatable XML views over relational databases (2003)
Vanessa P. Braganholo, Susan B. Davidson, Carlos A. Heuser
XML has become an important medium for data exchange, and is frequently used as an interface to - i.e. a view of - a relational database. Although much attention has been paid to the problem of...
Reasoning about the updatability of XML views over relational databases (2003)
Vanessa P. Braganholo, Susan B. Davidson, Carlos A. Heuser
databases
XKvalidator: A Constraint Validator For XML (2002)
Chen, Yi, Davidson, Susan B., Zheng, Yifeng
The role of XML in data exchange is evolving from one of merely conveying the structure of data to one that also conveys its semantics. In particular, several proposals for key and foreign key...
Validating Constraints in XML (2002)
Chen, Yi, Davidson, Susan B., Zheng, Yifeng
The role of XML in data exchange is evolving from one of merely conveying the structure of data to one that also conveys its semantics. In particular, several proposals for key and foreign key...
Constraint Preserving XML Storage in Relations (2002)
Yi Chen Susan, Yi Chen, Susan B. Davidson, Yifeng Zheng
As XML becomes a standard for data representation on the internet, there is a growing interest in storing XML using relational database technology.
Reasoning about Keys for XML (2001)
Buneman, Peter, Davidson, Susan B., Fan, Wenfei, Hara, Carmem, Tan, Wang-Chiew
We study absolute and relative keys for XML, and investigate their associated decision problems. We argue that these keys are important to many forms of hierarchically structured data including XML...
K2/Kleisli and GUS: Experiments in Integrated Access to Genomic Data Sources (2001)
Davidson, Susan B, Crabtree, Jonathan, Brunk, Brian P., Schug, Jonathan, Tannen, Val, Overton, Chris, ...
The integration of heterogeneous data sources and software systems is a major issue in the biomed ical community and several approaches have been explored: linking databases, "on-the- fly"...
Indexing Keys in Hierarchical Data (2001)
Chen, Yi, Davidson, Susan B., Zheng, Yifeng
Building on a notion of keys for XML, we propose a novel indexing scheme for hierarchical data that is based not only on the structure but also the content of the data. The index can be used to check...
The Kleisli Approach to Data Transformation and Integration (2001)
Susan B. Davidson, Limsoon Wong
Kleisli is a data transformation and integration system that can be used for any application where the data is typed, but has proven especially useful for bioinformatics applications. It extends the...
K2/Kleisli and GUS: Experiments in Integrated Access to Genomic Data Sources (2001)
Susan B. Davidson, Jonathan Crabtree, Brian Brunk, Jonathan Schug, Val Tannen, Chris Overton, ...
The integration of heterogeneous data sources and software systems is a major issue in the biomedical community and several approaches have been explored: linking databases, \on-the- y "...
View Maintenance for Hierarchical Semistructured Data (2000)
Liefke, Hartmut, Davidson, Susan B.
Over the last few years, efficient access to heterogenous data sources has become tremendously important. One common technique for increasing efficiency is to maintain locally sorted views in data...
Reasoning about Keys for XML (2000)
Buneman, Peter, Davidson, Susan B., Fan, Wenfei, Hara, Carmem, Tan, Wang-Chiew
We study two classes of XML keys introduced in [6], and investigate their associated (finite) implication problems. In contrast to other proposals of keys for XML, these two classes of keys can be...
View Maintenance for Hierarchical Semistructured Data (2000)
Hartmut Liefke, Susan B. Davidson
Over the last few years, efficient access to heterogeneous data sources has become tremendously important. One common technique for increasing efficiency is to maintain locally stored views in data...
Inference Rules for Nested Functional Dependencies (1999)
Hara, Carmem S, Davidson, Susan B.
Functional dependencies add semantics to a database schema, and are useful for studying various problems, such as database design, query optimization and how dependencies are carried into a view. In...
Updates and nondeterminism in object-oriented databases (1999)
Hartmut Liefke, Susan B. Davidson
While the topic of object-oriented databases is by now fairly mature-- a standard for data definition and query languages has been developed [10]-- the question of updates has largely been ignored....
Inference Rules for Nested Functional Dependencies (1999)
Carmem S. Hara, Susan B. Davidson
Functional dependencies add semantics to a database schema# and are useful for studying vari# ous problems# such as database design# query optimization and how dependencies are carried into a view....
Updating Complex Value Databeses (1998)
Liefke, H., Davidson, Susan B.
Query languages and their optimizations have been a very important issue in the database community. Languages for updating databases, however, have not been studied to the same extent, although they...
An Execution Model for CPL+ (1998)
Hartmut Liefke, Susan B. Davidson
Query languages and their optimizations have been a very important issue in the database community. Languages for updating databases, however, have not been studied to the same extent, although they...
Querying an Object-Oriented Database Using CPL (1997)
Davidson, Susan B., Hara, Carmem, Popa, Lucian
The Collection Programming Language is based on a complex value model of data and has successfully been used for querying transforming and integrating data from a wide variety of structured data...
WOL: A Language for Database Transformations and Constraints (1997)
Davidson, Susan B, Kosky, Anthony S
The need to transform data between heterogeneous databases arises from a number of critical tasks in data management. These tasks are complicated by schema evolution in the underlying databases, and...
Adding Structure to Unstructured Data (1997)
Buneman, Peter, Davidson, Susan B, Fernandez, Mary, Suciu, Dan
We develop a new schema for unstructured data. Traditional schemas resemble the type systems of programming languages. For unstructured data, however, the underlying type may be much less constrained...
BioKleisli: A Digital Library for Biomedical Researchers (1997)
S.B. Davidson, Susan B. Davidson, V. Tannen, L. Wong, C. Overton, Val Tannen, ...
. Data of interest to biomedical researchers associated with the Human Genome Project (HGP) is stored all over the world in a number of different electronic data formats and accessible through a...
Adding Structure to Unstructured Data (1996)
Buneman, Peter, Davidson, Susan B, Fernandez, Mary, Suciu, Dan
We develop a new schema for unstructured data. Traditional schemas resemble the type systems of programming languages. For unstructured data, however, the underlying type may be much less constrained...
Effecting Database Transformations Using Morphase (1996)
Davidson, Susan B, Kosky, Anthony S
Database transformations are a frequent problem for data managers supporting scientific databases, particularly those connected with the Human Genome Project. The databases involved frequently...
Effecting Database Transformations Using Morphase (1996)
Davidson, Susan B, Kosky, Anthony
Database transformations are a frequent problem for data managers supporting scientific databases, particularly those connected with the Human Genome Project. The databases involved frequently...
Effecting Database Transformations Using Morphase (1996)
Susan B. Davidson, Anthony S. Kosky
Database transformations are a frequent problem for data managers supporting scientific databases, particularly those connected with the Human Genome Project. The databases involved frequently...
A Data Transformation System for Biological Data Sources (1995)
Buneman, Peter, Davidson, Susan B, Hart, Kyle, Overton, Chris, Wong, L.
Scientific data of importance to biologists in the Human Genome Project resides not only in conventional databases, but in structured files maintained in a number of different formats (e.g. ASN.1 and...
Semantics of Database Transformations (1995)
Kosky, Anthony, Davidson, Susan B, Buneman, Peter
Database transformations arise in many different settings including database integrations, evolution of database systems, and implementing user views and data-entry tools. This paper surveys...
Programming Constructs for Unstructured Data (1995)
Buneman, Peter, Davidson, Susan B, Suciu, Dan
We investigate languages for querying and transforming unstructured data, by which we mean languages than can be used without knowledge of the structure (schema) of the database. Such data can be...
A Data Transformation System for Biological Data Sources (1995)
Buneman, Peter, Davidson, Susan B, Hart, Kyle, Overton, Chris
Scientific data of importance to biologists in the Human Genome Project resides not only in conventional databases, but in structured files maintained in a number of different formats (e.g. ASN.1 and...
Facilitating Transformations in a Human Genome Project Database (1994)
Davidson, Susan B., Kosky, Anthony, Eckman, B.
Human Genome Project databases present a confluence of interesting database challenges: rapid schema and data evolution, complex data entry and constraint management, and the need to integrate...
Facilitating Transformations in a Human Genome Project Database (1993)
Davidson, Susan B, Kosky, Anthony S, Eckman, B.
Human Genome Project databases present a confluence of interesting database challenges: rapid schema and data evolution, complex data entry and constraint management, and the need to integrate...
Proving Properties of Real-Time Distributed Systems: A Comparison of Three Approaches (1992)
Brémond-Grégoire, Patrice, Davidson, Susan B, Lee, Insup
Three formal methods for specifying properties of real-time systems are reviewed and used in a common example. Two of them offer a graphical representation and the third is an algebraic language. The...
Theoretical Aspects of Schema Merging (1992)
Buneman, Peter, Davidson, Susan B., Kosky, Anthony
A general technique for merging database schemas is developed that has a number of advantages over existing techniques, the most important of which is that schemas are placed in a partial order that...
Theoretical Aspects Of Schema Merging (1991)
Buneman, Peter, Davidson, Susan B, Kosky, Anthony S
A general technique for merging database schemas is developed that has a number of advantages over existing techniques, the most important of which is that schemas are placed in a partial order that...
Timed Atomic Commitment (1991)
Davidson, Susan B, Lee, Insup, Wolfe, Victor
In a large class of hard-real-time control applications, components execute concurrently on distributed nodes and must coordinate, under timing constraints, to perform the control task. As such, they...
Lee, Insup, Davidson, Susan B, Gerber, Richard
The timed behavior of distributed real-time systems can be specified using a formalism called Communicating Shared Resources, or CSR. The underlying computation model of CSR is resource-based in...
RTC: Language Support For Real-Time Concurrency (1991)
Wolfe, Victor, Davidson, Susan B, Lee, Insup
This paper presents language constructs for the expression of timing and concurrency requirements in distributed real-time programs. Our programming paradigm combines an object-based paradigm for the...
RTC: Language Support for Real-Time Concurrency (1991)
Wolfe, Victor, Davidson, Susan B, Lee, Insup
This paper presents language constructs for the expression of timing and concurrency requirements in distributed real-time programs. The programming paradigm combines an object-based paradigm for the...
A Performance Analysis of Timed Synchronous Communication Primitives (1990)
Two algorithms for timed synchronous communication between a single sender and single receiver have recently appeared in the literature. Each weakens the definition of "correct" timed synchronous...
Timed Atomic Commitment (1989)
Davidson, Susan B, Lee, Insup, Wolfe, Victor
In a large class of hard-real-time control applications, components execute concurrently on distributed nodes and must coordinate, under timing constraints, to perform the control task. As such, they...
Replicated Data and Partition Failures (1989)
In a distributed database system, data is often replicated to improve performance and availability. By storing copies of shared data on processors where it is frequently accessed, the need for...
Partial Computation in Real-Time Database Systems: A Research Plan (1988)
State-of-the-art database management systems are inappropriate for real-time applications due to their lack of speed and predictability of response. To combat these problems, the scheduler needs to...
Formally Integrating Real-Time Specification: A Research Proposal (1988)
Lee, Insup, Davidson, Susan B, Gerber, Richard
To date, research in reasoning about timing properties of real-time programs has considered specification and implementation as separate issues. Specification uses formal methods; it abstracts out...
Modeling Reliable Distributed Real-time Programs (1988)
Wolfe, Victor, Davidson, Susan B, Lee, Insup
A model for distributed hard real-time programs should incorporate real-time characteristics and be capable of analyzing time-related reliability issues. We introduce a model called the Real-Tie...
Partial Computation in Real-Time Database Systems (1988)
Davidson, Susan B, Watters, Aaron
A critical component of real-time systems in the database, which is used to store external input such as environmental readings from sensors, as well as system information. Typically these databases...
Motivating Time as a First Class Entity (1987)
Lee, Insup, Davidson, Susan B, Fay-Wolfe, Victor
In hard real-time applications, programs must not only be functionally correct but must also meet timing constraints. Unfortunately, little work has been done to allow a high-level incorporation of...
Adding Time to Synchronous Process Communications (1987)
In distributed real-time systems, communicating processes cannot be delayed for arbitrary amounts of time while waiting for messages. Thus, communication primitives used for real-time programming...
Consistency in a Partitioned Network: A Survey (1984)
Davidson, Susan B., Garcia-Molina, Hector, Skeen, Dale
Recently, several strategies for transaction processing in partitioned distributed database systems with replicated data have been proposed. We survey these strategies in light of the goal of...
Consistency in a Partitioned Network: A Survey (1984)
Davidson, Susan B., Garcia-Molina, Hector, Skeen, Dale
Recently, several strategies for transaction processing in partitioned distributed database systems with replicated data have been proposed. We survey these strategies in light of the goal of...
Consistency in a Partitioned Network: A Survey (1984)
Davidson, Susan B, Garcia-Molina, Hector, Skeen, Dale
Recently, several strategies for transaction processing in partitioned distributed database systems with replicated data have been proposed. We survey these strategies in light of the competing goals...
Optimism and consistency in partitioned distributed database systems (1984)
A protocol for transaction processing during partition failures is presented which guarantees mutual consistency between copies of data-items after repair is completed. The protocol is “optimistic...
An optimistic protocol for partitioned distributed database systems /--Susan B. Davidson. (1982)
Thesis (Ph. D.)--Princeton University, 1982.
An optimistic protocol for partitioned distributed database systems / (1982)
Thesis (Ph. D.)--Princeton University, 1982.
An optimistic protocol for partitioned distributed database systems /--Susan B. Davidson. (1982)
Thesis (Ph. D.)--Princeton University, 1982.