Baden Hughes

An Intelligent Search Infrastructure for Language Resources on the Web, ARC Special Research Initiative (E-Research) SR0567353 (2008)

Chief Investigators: Timothy Baldwin, Steven Bird, Baden Hughes

Language occupies a central role on the web: most content is expressed in a given language, and most access takes place via natural language input and interfaces. Today, investigation of human...

Towards a Web Search Service for Minority Language Communities, Open Road 2006 (2008)

Baden Hughes

Locating resources of interest on the web in the general case is at best a low precision activity owing to the large number of pages on the web (for example, Google covers more than 8 billion web...

Towards Interoperable Secondary Annotations in the E-Social Science Domain (2008)

Baden Hughes, Desmond Schmidt, Andrew E. Smith

The sharing of data for secondary analysis has been very limited, especially in the social sciences. The reasons usually cited for this limited sharing are (1) strong privacy requirements...

Making Connections: First Year Transition for Computer Science and Software Engineering Students (2008)

Alistair Moffat, Baden Hughes, Harald Søndergaard, Paul Gruba

During the last decade, an increasing emphasis has been placed on the need for carefully planned transition programs to help first-year students integrate into university. In this paper we critically...

A Situated Learning Perspective on Learning Object Design (2008)

Roderick A. Farmer, Baden Hughes

Learning object design inclusive of instructional design and learning theories is seen as desirable (Daniel and Mohan,...

A Classification-Based Framework for (2008)

Roderick A. Farmer, Baden Hughes

Closed sets of properties are a requirement for automatic classification and...

Feature-based Encoding and Querying Language Resources with Character Semantics (2006)

Baden Hughes, Dafydd Gibbon

In this paper we discuss the explicit representation of character features pertaining to written language resources, which we argue are critically necessary in the long term of archiving language...

Building Computational Grids with Apple's Xgrid Middleware (2006)

Baden Hughes

Apple's release of the Xgrid framework for distributed computing introduces a new technology solution for loosely coupled distributed computation. In this paper we systematically describe, compare...

Reconsidering language identification for written language resources (2006)

Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson, Andrew Mackinlay

The task of identifying the language in which a given document (ranging from a sentence to thousands of pages) is written has been relatively well studied over several decades. Automated approaches...

A distributed architecture for interactive parse annotation (2005)

Baden Hughes

In this paper we describe a modular system architecture for distributed parse annotation using interactive correction. This involves interactively adding constraints to an existing parse until the...

Towards a general model for linguistic paradigms (2004)

David Penton, Catherine Bow, Steven Bird, Baden Hughes

Linguistic forms are inherently multi-dimensional. They exhibit a variety of phonological, orthographic, morphosyntactic, semantic and pragmatic properties. Accordingly, linguistic analysis involves...

Securing Interpretability: The Case of Ega Language Documentation (2004)

Dafydd Gibbon, Catherine Bow, Steven Bird, Baden Hughes

The prime consideration in designing sustainable language resources is to ensure that they remain interpretable for coming generations of users. In this paper we adopt a new perspective on resource...

Grid-based Indexing of a Newswire Corpus (2004)

Baden Hughes, Srikumar Venugopal, Rajkumar Buyya

In this paper we report experience in the use of computational grids in the domain of natural language processing, particularly in the area of information extraction, to create query indices for...

Experiments with Data-Intensive NLP on a Computational Grid (2004)

Baden Hughes, Steven Bird

Large databases of annotated text and speech are widely used for developing and testing language technologies. However, the size of these corpora and associated language models is outpacing the...

Ega Interlinear XML samples (2003)

Dafydd Gibbon, Steven Bird, Catherine Bow, Baden Hughes

Ega Interlinear XML samples, including a Python script to convert from table format

GOLD POS categories and their relevance to Ega markup (2003)

Dafydd Gibbon, Catherine Bow, Baden Hughes

Table showing correspondences between GOLD POS categories and Ega morphosyntax