Hamish Cunningham

Position Paper (2009)

Kalina Bontcheva, Christopher Brewster, Fabio Ciravegna, Hamish Cunningham, Louise Guthrie, Robert Gaizauskas, ...

AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...

ABSTRACT Large-scale, Parallel Automatic Patent Annotation (2009)

Milan Agatonovic, Niraj Aswani, Kalina Bontcheva, Hamish Cunningham, Yaoyong Li, Ian Roberts, ...

When researching new product ideas or filing new patents, inventors need to retrieve all relevant pre-existing know-how and/or to exploit and enforce patents in their technological domain. However,...

Creating Tools for Morphological Analysis of Sumerian (2009)

Valentin Tablan, Wim Peters, Diana Maynard, Hamish Cunningham

Sumerian is a long-extinct language documented throughout the ancient Middle East, arguably the first language for which we have written evidence, and is a language isolate (i.e. no related languages...

Christine Topfer, Office for Educational Review (2008)

Early Years, Allison Reid, Claire Lennon, Lady Gowrie, Campbell Street, Child Centre, ...

The examples in this book focus on the Thinking and Communicating Essentials only.

Cost Sensitive Evaluation Measures for F-term Patent Classification (2008)

Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

Some classification problems, such as the NTCIR-06 F-term patent classification task, have general or specific relations between the target class labels. Consequently, in such cases, it is desirable...

CLOnE: Controlled Language for Ontology Editing (2008)

Adam Funk, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Brian Davis, Siegfried H

Abstract. This paper presents a controlled language for ontology editing and a software implementation, based partly on standard NLP tools, for processing that language and manipulating an ontology....

Event-coreference across Multiple Multi-lingual Sources in the MUMIS Project ∗ (2008)

Jan Kuper, Horacio Saggion, Hamish Cunningham, Thierry Declerck, Marco Puts, Franciska De Jong, ...

This paper describes work done in the context of the MUMIS project, especially on multi-document information extraction. The MUMIS project aims at searching and indexing video material on the domain...

GATE- a General Architecture for Text Engineering (2008)

Kalina Bontcheva, Hamish Cunningham

GATE 1 is an architecture, development environment, and framework for building systems that process human language (Cunningham et al., 2002; Maynard et al., 2002b). It has been in development at the...

Content Augmentation for Mixed-Mode News Broadcasts (2008)

Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov, Ontotext Lab, Sirma Ai Ead

Rich News, a system that augments news broadcasts with textual content, is described. The system identifies individual stories in news broadcasts, and annotates them with related content from the...

Content Augmentation for Mixed-Mode News Broadcasts (2008)

Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov

Rich News, a system that augments news broadcasts with textual content, is described. The system identifies individual stories in news broadcasts, and annotates them with related content from the...

CLOnE: Controlled Language for Ontology Editing (2008)

Adam Funk, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Siegfried H

Abstract. This paper presents a controlled language for ontology editing and a software implementation, based partly on standard NLP tools, for processing that language and manipulating an ontology....

Cost Sensitive Evaluation Measures for F-term Patent Classification (2008)

Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

Some classification problems, such as the NTCIR-06 F-term patent classification task, have general or specific relations between the target class labels. Consequently, in such cases, it is desirable...

SEMANTIC ANALYSIS FOR TOMORROW’S AUDIO-VISUAL DIGITAL ARCHIVES (2008)

Cristian Ursu, Valentin Tablan, Hamish Cunningham

natural language processing PrestoSpace 1 is a European-funded research project that aims at addressing the problem of decaying audio-visual archives throughout Europe by means of digitisation for...

Indexing and Querying Linguistic Metadata and Document Content (2008)

Niraj Aswani, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham

The need for efficient corpus indexing and querying arises frequently both in machine learning-based and human-engineered natural language processing systems. This paper presents the ANNIC system,...

Experiments of Opinion Analysis On Two Corpora MPQA and NTCIR-6 (2008)

Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

This paper describes the algorithms and linguistic features used in our participating system for the opinion analysis pilot task at NTCIR-6. It presents and discusses the results of our system on the...

Yorick Wilks (2007)

Hamish Cunningham, Robert J. Gaizauskas

Much progress has been made in the provision of reusable data resources for Natural Language Engineering, such as grammars, lexicons, thesauruses. Although a number of projects have addressed the...

Abstract (2007)

Hamish Cunningham, Kevin Humphreys

We classify and review current approaches to software infrastructure for research, de-velopment and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of...

and Wilks Implementing a Sense Tagger (2007)

Hamish Cunningham, Mark Stevenson, Yorick Wilks

We describe two systems: GATE (General Ar-chitecture for Text Engineering), an architec-ture to aid in the production and delivery of language engineering systems which signifi-cantly reduces...

Yorick Wilks (2007)

Hamish Cunningham, Robert J. Gaizauskas

Much progress has been made in the provision of reusable data resources for Natural Language Engineering, such as grammars, lexicons, thesauruses. Al-though a number of projects have ad-dressed the...

Visual Execution and Data Visualisation in Natural Language Processing (2007)

Peter Rodgers, Robert Gaizauskas, Kevin Humphreys, Hamish Cunningham

We describe GGI, a visual system that allows the user to execute an automatically generated data flow graph containing code modules that perform natural language processing tasks. These code modules...

Acknowledgements (2007)

Hamish Cunningham

This thesis defines the boundaries of Software Architecture for Language Engineering (SALE), an area formed by the intersection of human language computation and software engineering. SALE covers all...

Experiments with Geographic Knowledge for Information Extraction (2007)

Dimitar Manov, Atanas Kiryakov, Borislav Popov, Ontotext Lab, Diana Maynard, Kalina Bontcheva, ...

Here we present work on using spatial knowledge in conjunction with information extraction (IE). Considerable volume of location data was imported in a knowledge base (KB) with entities of general...

MUSE: a MUlti-Source Entity recognition system (2007)

Diana Maynard, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Yorick Wilks

Abstract. This paper describes a robust and easily adaptable system for named entity recognition from a variety of different text types. Most information extraction systems need to be customised...

British Telecommunications plc. (2007)

Adam Funk, Brian Davis, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Ipswich Ip Re, ...

Deliverable D2.2.2 (WP2.2) This deliverable outlines an approach to automatic generation of metadata from natural language. The emphasis is on easy to use applications and controlled language...

SVM Based Learning System for F-term Patent Classification (2007)

Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

This paper describes our SVM-based system and the techniques we used to adapt the approach for the specifics of the F-term patent classification subtask at NTCIR-6 Patent Retrieval Task. Our system...

Mining Information for Instance Unification (2006)

Aswani, Niraj, Bontcheva, Kalina, Cunningham, Hamish

Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for person...

Mining information for instance unification (2006)

Niraj Aswani, Kalina Bontcheva, Hamish Cunningham

Abstract. Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for...

Automatic Extraction of Hierarchical Relations from Text (2006)

Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, Ji Wang

Abstract. Automatic extraction of semantic relationships between entity instances in an ontology is useful for attaching richer semantic metadata to documents. In this paper we propose an SVM based...

K.: User-friendly ontology authoring using a controlled language (2006)

Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva

In recent years, following the rapid development in the Semantic Web and Knowledge Management research, ontologies have become more in demand in Natural Language Processing. An increasing number of...

Mining information for instance unification (2006)

Niraj Aswani, Kalina Bontcheva, Hamish Cunningham

Abstract. Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for...

Indexing and Querying Linguistic Metadata and Document Content (2005)

Aswani, Niraj, Tablan, Valentin, Bontcheva, Kalina, Cunningham, Hamish

The need for efficient corpus indexing and querying arises frequently both in machine learning-based and human-engineered natural language processing systems. This paper presents the ANNIC system,...

SVM Based Learning System For Information Extraction (2005)

Li, Yaoyong, Bontcheva, Kalina, Cunningham, Hamish

We present an SVM-based learning system for information extraction (IE). One distinctive feature of our system is the use of a variant of the SVM, the SVM with uneven margins, which is particularly...

SVM Based Learning System For Information Extraction (2005)

Li, Yaoyong, Bontcheva, Kalina, Cunningham, Hamish

We present an SVM-based learning system for information extraction (IE). One distinctive feature of our system is the use of a variant of the SVM, the SVM with uneven margins, which is particularly...

Using Uneven Margins SVM and Perceptron for Information Extraction (2005)

Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

The classification problem derived from information extraction (IE) has an imbalanced training set. This is particularly true when learning from smaller datasets which often have a few positive...

Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News (2005)

Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov

The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recognition gives a...

Proceedings of the 9th Conference on Computational Natural Language Learning (CoNLL), (2005)

Pages Ann Arbor, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

The classification problem derived from information extraction (IE) has an imbalanced training set. This is particularly true when learning from smaller datasets which often have a few positive...

Web-assisted annotation, semantic indexing and search of television and radio news (2005)

Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov

The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recognition gives a...

SVM Based Learning System For Information Extraction (2005)

Yaoyong Li, Kalina Bontcheva, Hamish Cunningham

Abstract. This paper presents an SVM-based learning system for information extraction (IE). One distinctive feature of our system is the use of a variant of the SVM, the SVM with uneven margins,...

Perceptron Learning for Chinese Word Segmentation (2005)

Yaoyong Li, Chuanjiang Miao, Kalina Bontcheva, Hamish Cunningham

We explored a simple, fast and effective learning algorithm, the uneven margins Perceptron, for Chinese word segmentation. We adopted the character-based classification framework and transformed the...

B.: Semantically enhanced television news through web and video integration (2005)

Mike Dowman, Valentin Tablan, Cristian Ursu, Hamish Cunningham, Borislav Popov

Abstract. The Rich News system for semantically annotating television news broadcasts and augmenting them with additional web content is described. Online news sources were mined for material...

British Telecommunications plc. (2005)

Yaoyong Li, Kalina Bontcheva, Niraj Aswani, Wim Peters, Hamish Cunningham, Ipswich Ip Re, ...

Deliverable D2.5.2 (WP2.5) The first part of this deliverable presents the Pascal Challenge on evaluating machine learning methods for Information Extraction, the systems that we entered into the...

B.: Semantically enhanced television news through web and video integration (2005)

Mike Dowman, Valentin Tablan, Cristian Ursu, Hamish Cunningham, Borislav Popov

Abstract. The Rich News system for semantically annotating television news broadcasts and augmenting them with additional web content is described. On-line news sources were mined for material...

An SVM based learning algorithm for Information Extraction (2004)

Li, Yaoyong, Bontcheva, Kalina, Cunningham, Hamish

We present an SVM-based learning algorithm for information extraction. We conducted experiments for different settings of the algorithms and different combinations of NLP features. We also show that...

Introduction to the Special Issue on Software Architecture for Language Engineering (2004)

Cunningham, Hamish, Scott, Donia

Every building, and every computer program, has an architecture: structural and organisational principles that underpin its design and construction. The garden shed once built by one of the authors...

DOI: 10.1017/S1351324904003481 Printed in the United Kingdom Software Architecture for Language Engineering (2004)

Hamish Cunningham, Donia Scott

1 Software Architecture Every building, and every computer program, has an architecture: structural and organisational principles that underpin its design and construction. The garden shed once built...

Automatic Language-Independent Induction of Gazetteer Lists (2004)

Diana Maynard, Kalina Bontcheva, Hamish Cunningham

Adaptation of existing Information Extraction (IE) systems to new languages and domains is the focus of much current research, but progress is often hindered by the lack of available resources to...

Cross document ontology based information extraction for multimedia retrieval (2003)

Dennis Reidsma, Jan Kuper, Thierry Declerck, Horacio Saggion, Hamish Cunningham

Abstract. This paper describes the MUMIS project, which applies ontology based Information Extraction to improve the results of Information Retrieval in multimedia archives. It makes use of a domain...

The Cricket Indoor Location System http://nms.lcs.mit.edu/projects/cricket (2003)

Horacio Saggion, Kalina Bontcheva, Hamish Cunningham

We present a robust summarisation system developed within the GATE architecture that makes use of robust components for semantic tagging and coreference resolution provided by GATE. Our system...

The semantic web: A new opportunity and challenge for human language technology (2003)

Kalina Bontcheva, Hamish Cunningham

Abstract. This position paper motivates the need for Semantic Web enabled Human Language Technology (HLT) tools and discusses the major outstanding challenges in this area. It introduces the idea of...

P.: Intelligent multimedia indexing and retrieval through multi-source information extraction and merging (2003)

Jan Kuper, Horacio Saggion, Hamish Cunningham, Thierry Declerck, Franciska De Jong, Dennis Reidsma, ...

This paper reports work on automated meta-data creation for multimedia content. The approach results in the generation of a conceptual index of the content which may then be searched via semantic...

Cross document ontology based information extraction for multimedia retrieval (2003)

Dennis Reidsma, Jan Kuper, Thierry Declerck, Horacio Saggion, Hamish Cunningham

Abstract. This paper describes the MUMIS project, which applies ontology based Information Extraction to improve the results of Information Retrieval in multimedia archives. It makes use of a domain...

Towards a semantic extraction of Named Entities (2003)

Diana Maynard, Kalina Bontcheva, Hamish Cunningham

In this paper, we discuss the new challenges posed by the progression from information extraction to content extraction, as demonstrated by the ACE program. We explore whether traditional IE...

The Cricket Indoor Location System http://nms.lcs.mit.edu/projects/cricket (2003)

Horacio Saggion, Kalina Bontcheva, Hamish Cunningham

We present a robust summarisation system developed within the GATE architecture that makes use of robust components for semantic tagging and coreference resolution provided by GATE. Our system...

GATE: A Unicode-based infrastructure supporting multilingual information extraction (2003)

Kalina Bontcheva, Diana Maynard, Valentin Tablan, Hamish Cunningham

NLP infrastructures with comprehensive multilingual support can substantially decrease the overhead of developing Information Extraction (IE) systems in new languages by offering support for...

The Cricket Indoor Location System http://nms.lcs.mit.edu/projects/cricket (2003)

Horacio Saggion, Kalina Bontcheva, Hamish Cunningham

We present a robust summarisation system developed within the GATE architecture that makes use of robust components for semantic tagging and coreference resolution provided by GATE. Our system...

OLLIE: On-Line Learning for Information Extraction (2003)

Valentin Tablan, Kalina Bontcheva, Diana Maynard, Hamish Cunningham

This paper reports work aimed at developing an open, distributed learning environment, OLLIE, where researchers can experiment with different Machine Learning (ML) methods for Information Extraction....

Software Architectures for Language Engineering: a Critical Review (2003)

Hamish Cunningham, Kalina Bontcheva

This paper concerns infrastructural work in the fields of Language Engineering, Natural Language Processing and Computational Linguistics. We begin by defining the area...

P.: Intelligent multimedia indexing and retrieval through multi-source information extraction and merging (2003)

Jan Kuper, Horacio Saggion, Hamish Cunningham, Thierry Declerck, Franciska De Jong, Dennis Reidsma, ...

This paper reports work on automated meta-data creation for multimedia content. The approach results in the generation of a conceptual index of the content which may then be searched via semantic...

How feasible is the reuse of grammars for Named Entity Recognition (2002)

Katerina Pastra, Diana Maynard, Oana Hamza, Hamish Cunningham, Yorick Wilks

In this paper, we investigate whether reusing existing grammars for NE recognition instead of creating them from scratch is a viable solution to time constraints in developing grammars. We discuss...

How feasible is the reuse of grammars for Named Entity Recognition (2002)

Katerina Pastra, Diana Maynard, Oana Hamza, Hamish Cunningham, Yorick Wilks

In this paper, we investigate whether reusing existing grammars for NE recognition instead of creating them from scratch is a viable solution to time constraints in developing grammars. We discuss...

Adapting a robust multi-genre ne system for automatic content extraction (2002)

Diana Maynard, Hamish Cunningham, Kalina Bontcheva, Ontotext Lab

marinsirma. bg Abstract. Many current IE systems tend to be designed with particular applications and domains in mind. With the increasing need for robust NLP tools which can handle a variety of IE...

Architectural elements of language engineering robustness (2002)

Diana Maynard, Valentin Tablan, Hamish Cunningham, Cristian Ursu, Horacio Saggion, Kalina Bontcheva, ...

We discuss robustness in LE systems from the perspective of engineering, and the predictability of both outputs and construction process that this entails. We present an architectural system that...

Extracting Information for Automatic Indexing of Multimedia Material (2002)

Horacio Saggion, Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Oana Hamza, Christian Ursu, ...

This paper discusses our work on information extraction (IE) from multi-lingual, multi-media, multi-genre Language Resources, in a domain where there are many different event types. This work is...

A Light-weight Approach to Coreference Resolution for Named Entities in Text (2002)

Marin Dimitrov, Kalina Bontcheva, Hamish Cunningham, Diana Maynard, Ontotext Lab, Sirma Ai

This paper presents a lightweight approach to pronoun resolution in the case when the antecedent is named entity. It falls under the category of the so-called "knowledge poor "...

Using a text engineering framework to build an extendable and portable IE-based summarisation system (2002)

Diana Maynard, Kalina Bontcheva, Horacio Saggion, Hamish Cunningham, Oana Hamza

In this paper we describe how information extraction technology has been used to build a summarisation system in the domain of occupational health and safety. The core of the application is based on...

A unicode-based environment for creation and use of language resources (2002)

Valentin Tablan, Cristian Ursu, Kalina Bontcheva, Hamish Cunningham, Diana Maynard, Oana Hamza, ...

GATE is a Unicode-aware architecture, development environment and framework for building systems that process human language. It is often thought that the character sets problem has been solved by...

Using GATE as an environment for teaching NLP (2002)

Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza

In this paper we argue that the GATE architecture and visual development environment can be used as an effective tool for teaching language engineering and computational linguistics. Since GATE comes...

Gate: An architecture for development of robust hlt applications (2002)

Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan

In this paper we present GATE, a framework and graphical development environment which enables users to de-velop and deploy language engineering components and resources in a robust fashion. The GATE...

Adapting a robust multi-genre ne system for automatic content extraction (2002)

Diana Maynard, Hamish Cunningham, Kalina Bontcheva, Ontotext Lab

marinsirma. bg Abstract. Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering...

Developing reusable and robust language processing components for information systems using GATE (2002)

Kaling Bontcheva, Hamish Cunningham, Diana Maynard, Valentin Tablan, Horacio Saggion

In this paper we present GATE, an architecture and a graphical development environment which enables users to develop and deploy HLT applications in a robust fashion. GATE also provides reusable,...

Gate: An architecture for development of robust hlt applications (2002)

Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan

In this paper we present GATE, a framework and graphical development environment which enables users to de-velop and deploy language engineering components and resources in a robust fashion. The GATE...

Using human language technology for automatic annotation and indexing of digital library content (2002)

Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio Saggion

Abstract. In this paper we show how we used robust human language technology, such as our domain-independent and customisable named entity recogniser, for automatic content annotation and indexing in...

Using GATE as an environment for teaching NLP (2002)

Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza

In this paper we argue that the GATE architecture and visual development environment can be used as an effective tool for teaching language engineering and computational linguistics. Since GATE comes...

Using a text engineering framework to build an extendable and portable IE-based summarisation system (2002)

Diana Maynard, Kalina Bontcheva, Horacio Saggion, Hamish Cunningham, Oana Hamza

Under consideration for other conferences (specify)? In this paper we show how tools provided by a text engineering framework (GATE) have been used to build an IE-based summarisation system in the...

Gate: An architecture for development of robust hlt applications (2002)

Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan

In this paper we present GATE, a framework and graphical development environment which enables users to develop and deploy language engineering components and resources in a robust fashion. The GATE...

Using GATE as an environment for teaching NLP (2002)

Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza

In this paper we argue that the GATE architecture and visual development environment can be used as an effective tool for teaching language engineering and computational linguistics. Since GATE comes...

EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation (2002)

Paul Baker, Andrew Hardie, Tony Mcenery, Hamish Cunningham, Rob Gaizauskas

The paper describes developments to date on the EMILLE Project (Enabling Minority Language Engineering) being carried out at the Universities of Lancaster and Sheffield. EMILLE was established to...

A light-weight approach to coreference resolution for named entities in text. Unpublished M.Sc (2002)

Marin Dimitrov, Kalina Bontcheva, Hamish Cunningham, Diana Maynard

This paper presents a lightweight approach to pronoun resolution in the case when the antecedent is a named entity. It falls under the category of the so-called ‘knowledge poor ’ approaches that...

Using HLT for Acquiring, Retrieving and Publishing Knowledge in AKT: Position Paper (2001)

Boncheva, Kalina, Brewster, Christopher, Ciravegna, Fabio, Cunningham, Hamish, Guthrie, Louise, Gaizauskas, Robert, ...

AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...

Using HLT for Acquiring, Retrieving and Publishing Knowledge in AKT: Position Paper (2001)

Bontcheva, Dr. Kalina, Brewster, Mr. Christopher, Ciravegna, Dr. Fabio, Cunningham, Dr. Hamish, Guthrie, Dr. Louise, Gaizauskas, Dr. Rob, ...

AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...

Using HLT for Acquiring, Retrieving and Publishing Knowledge (2001)

Kalina Bontcheva, Christopher Brewster, Fabio Ciravegna, Hamish Cunningham, Louise Guthrie, Robert Gaizauskas, ...

AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...

Named Entity Recognition from Diverse Text Types (2001)

Diana Maynard, Valentin Tablan, Cristian Ursu, Hamish Cunningham, Yorick Wilks

Current research in Information Extraction tends to be focused on application-specific systems tailored to a particular domain. The Muse system is a multi-purpose Named Entity recognition system...

The Automatic Generation of Formal Annotations in a Multimedia Indexing and Searching Environment (2001)

Thierry Declerck, Peter Wittenburg, Hamish Cunningham

We describe in this paper the MUMIS Project (Multimedia Indexing and Searching Environment) , which is concerned with the development and integration of base technologies, demonstrated within a...

Software Architecture for Language Engineering (2000)

Hamish Cunningham

This thesis defines the boundaries of Software Architecture for Language Engineering (SALE), an area formed by the intersection of human language computation and software engineering. SALE covers all...

Uniform Language Resource Access and Distribution (2000)

Hamish Cunningham, Wim Peters, Clare Mccauley, Kalina Bontcheva, Yorick Wilks

The reuseability and accessibility of lexical and linguistic resources often requires substantial programming overhead and detailed knowledge of the structure and nature of resource-specific...

Experience of using GATE for NLP R&D (2000)

Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan, Yorick Wilks

GATE, a General Architecture for Text Engineering, aims to provide a software infrastructure for researchers and developers working in NLP. GATE has now been widely available for four years. In this...

Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis (2000)

Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks

This paper presents a taxonomy of previous work on infrastructures, architectures and development environments for representing and processing Language Resources (LRs), corpora, and annotations. This...

Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis. (2000)

Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks

This paper presents a taxonomy of previous work on infrastructures, architectures and development environments for representing and processing Language Resources (LRs), corpora, and annotations. This...

Information Extraction - a User Guide (1999)

Hamish Cunningham, Hamish Cunningham

Contents 1 Introduction 1 2 Types of IE 3 3 Performance levels 4 4 Named Entity recognition 5 5 Coreference resolution 7 6 Template Element production 9 7 Template Relation production 10 8 Scenario...

LaSIE jumps the GATE (1999)

Yorick Wilks, Robert Gaizauskas, Kevin Humphreys, Hamish Cunningham

The benefits of the effective creation of Information Extraction (IE) in the last ten years, driven by the ARPA TIPSTER programme and the associated MUC evaluations, have been enormous, but it must...

Experience with a Language Engineering Architecture: Three Years of GATE (1999)

Hamish Cunningham, Robert Gaizauskas, Kevin Humphreys, Yorick Wilks

GATE, the General Architecture for Text Engineering, aims to provide a software infrastructure for researchers and developers working in the area of natural language processing. A version of GATE has...

Implementing a Sense Tagger in General Architecture for Text Engineering (1998)

Hamish Cunningham, Mark Stevenson, Yorick Wilks

We describe two systems: GATE (General Architecture for Text Engineering), an architecture to aid in the production and delivery of language engineering systems which significantly reduces...

Implementing a Sense Tagger in a General Architecture for Text Engineering (1998)

Hamish Cunningham, Mark Stevenson, Yorick Wilks

We describe two systems: GATE (General Architecture for Text Engineering), an architecture to aid in the production and delivery of language engineering systems which significantly reduces...

Sense Tagging and Language Engineering (1998)

Mark Stevenson, Hamish Cunningham, Yorick Wilks

. Language engineering is a new, practical, branch of NLP and Computational Linguistics. We describe two language engineering systems: GATE (General Architecture for Text Engineering), an...

Software Infrastructure for Natural Language Processing (1997)

Cunningham, Hamish, Humphreys, Kevin, Gaizauskas, Robert, Wilks, Yorick

We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...

Information Extraction - A User Guide (1997)

Cunningham, Hamish

This technical memo describes Information Extraction from the point-of-view of a potential user of the technology. No knowledge of language processing is assumed. Information Extraction is a process...

GATE: A TIPSTER-based general architecture for text engineering (1997)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas

Building on the work of the TIPSTER architecture group, the University of Sheffield Natural Language Processing group have developed GATE, a General Architecture for Text Engineering. GATE implements...

Software Infrastructure for Natural Language Processing (1997)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...

Information Extraction - a User Guide (1997)

Hamish Cunningham, Hamish Cunningham

Contents 1 Introduction 1 2 Types of IE 2 3 Performance levels 3 4 Named Entity recognition 5 5 Coreference resolution 7 6 Template Element production 8 7 Scenario Template extraction 10 8 An...

Software Infrastructure for Natural Language Processing (1997)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas

We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...

New Methods, Current Trends and Software Infrastructure for NLP (1996)

Cunningham, Hamish, Wilks, Yorick, Gaizauskas, Robert J.

The increasing use of `new methods' in NLP, which the NeMLaP conference series exemplifies, occurs in the context of a wider shift in the nature and concerns of the discipline. This paper begins with...

A General Architecture for Language Engineering (GATE) - a new approach to Language Engineering R&D (1996)

Cunningham, Hamish, Gaizauskas, Robert J., Wilks, Yorick

This report argues for the provision of a common software infrastructure for NLP systems. Current trends in Language Engineering research are reviewed as motivation for this infrastructure, and...

TIPSTER-Compatible Projects at Sheffield (1996)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

more appropriately described by the term Language Engineering than the well-established labels of Nat-ural Language Processing or Computational Linguis-tics. This reflects an increased focus on...

A General Architecture for Text Engineering (1996)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas

For a variety of reasons NLP has recently spawned a related engineering discipline called language en-gineering (LE), whose orientation is towards the ap-plication of NLP techniques to solving...

GATE: an environment to support research and development in natural language engineering (1996)

Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys

We describe a software environment to support research and development in natural language (NL) engineering. This environment-- GATE (General Architecture for Text Engineering)-- aims to advance...

New Methods, Current Trends and Software Infrastructure for NLP (1996)

Hamish Cunningham, Yorick Wilks, Robert J. Gaizauskas

. The increasing use of `new methods' in NLP, which this conference series exemplifies, occurs in the context of a wider shift in the nature and concerns of the discipline. This paper begins...

Software Infrastructure for Language Engineering (1996)

Hamish Cunningham, Yorick Wilks, Robert J. Gaizauskas

This paper describes a freely-available system called GATE [Cunningham, Gaizauskas, Wilks 1995] -- a General Architecture for Text Engineering -- which integrates much of this work. GATE is an

CREOLE Developer's Manual (1996)

Hamish Cunningham, Kevin Humphreys, Rob Gaizauskas, Martin Stower

this document or its parent collection using the API defined by the Tipster people. The full specification of this API is given in [6] which is available from Sheffield's ftp site as

GATE: An Environment to Support Research and Development in Natural Language Engineering (1996)

Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys

We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance...

GATE - a General Architecture for Text Engineering (1996)

Hamish Cunningham, Yorick Wilks, Robert J. Gaizauskas

Much progress has been made in the provision of reusable data resources for Natural Language Engineering, such as grammars, lexicons, thesauruses. Although a number of projects have addressed the...

Tipster-Compatible Projects At Sheffield (1996)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

this paper at the April 1996 TIPSTER workshop, and for extensive comments during the preparation of this paper. References

GATE: An Environment to Support Research and Development in Natural Language Engineering (1996)

Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys

We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance...

A General Architecture for Text Engineering (GATE) - a new approach to Language Engineering R&D (1995)

Hamish Cunningham, Hamish Cunningham, Robert J. Gaizauskas, Robert J. Gaizauskas

this report is that the jury is still out on performance vs. competence. Thus, as well as a host of competing linguistic and lexicographic theories, LE is home to a thoroughgoing paradigm conflict....