Kalina Bontcheva, Christopher Brewster, Fabio Ciravegna, Hamish Cunningham, Louise Guthrie, Robert Gaizauskas, ...
AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...
ABSTRACT Large-scale, Parallel Automatic Patent Annotation (2009)
Milan Agatonovic, Niraj Aswani, Kalina Bontcheva, Hamish Cunningham, Yaoyong Li, Ian Roberts, ...
When researching new product ideas or filing new patents, inventors need to retrieve all relevant pre-existing know-how and/or to exploit and enforce patents in their technological domain. However,...
Creating Tools for Morphological Analysis of Sumerian (2009)
Valentin Tablan, Wim Peters, Diana Maynard, Hamish Cunningham
Sumerian is a long-extinct language documented throughout the ancient Middle East, arguably the first language for which we have written evidence, and is a language isolate (i.e. no related languages...
Christine Topfer, Office for Educational Review (2008)
Early Years, Allison Reid, Claire Lennon, Lady Gowrie, Campbell Street, Child Centre, ...
The examples in this book focus on the Thinking and Communicating Essentials only.
Cost Sensitive Evaluation Measures for F-term Patent Classification (2008)
Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
Some classification problems, such as the NTCIR-06 F-term patent classification task, have general or specific relations between the target class labels. Consequently, in such cases, it is desirable...
CLOnE: Controlled Language for Ontology Editing (2008)
Adam Funk, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Brian Davis, Siegfried H
Abstract. This paper presents a controlled language for ontology editing and a software implementation, based partly on standard NLP tools, for processing that language and manipulating an ontology....
Event-coreference across Multiple Multi-lingual Sources in the MUMIS Project ∗ (2008)
Jan Kuper, Horacio Saggion, Hamish Cunningham, Thierry Declerck, Marco Puts, Franciska De Jong, ...
This paper describes work done in the context of the MUMIS project, especially on multi-document information extraction. The MUMIS project aims at searching and indexing video material on the domain...
GATE- a General Architecture for Text Engineering (2008)
Kalina Bontcheva, Hamish Cunningham
GATE 1 is an architecture, development environment, and framework for building systems that process human language (Cunningham et al., 2002; Maynard et al., 2002b). It has been in development at the...
Content Augmentation for Mixed-Mode News Broadcasts (2008)
Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov, Ontotext Lab, Sirma Ai Ead
Rich News, a system that augments news broadcasts with textual content, is described. The system identifies individual stories in news broadcasts, and annotates them with related content from the...
Content Augmentation for Mixed-Mode News Broadcasts (2008)
Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov
Rich News, a system that augments news broadcasts with textual content, is described. The system identifies individual stories in news broadcasts, and annotates them with related content from the...
CLOnE: Controlled Language for Ontology Editing (2008)
Adam Funk, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Siegfried H
Abstract. This paper presents a controlled language for ontology editing and a software implementation, based partly on standard NLP tools, for processing that language and manipulating an ontology....
Cost Sensitive Evaluation Measures for F-term Patent Classification (2008)
Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
Some classification problems, such as the NTCIR-06 F-term patent classification task, have general or specific relations between the target class labels. Consequently, in such cases, it is desirable...
SEMANTIC ANALYSIS FOR TOMORROW’S AUDIO-VISUAL DIGITAL ARCHIVES (2008)
Cristian Ursu, Valentin Tablan, Hamish Cunningham
natural language processing PrestoSpace 1 is a European-funded research project that aims at addressing the problem of decaying audio-visual archives throughout Europe by means of digitisation for...
Indexing and Querying Linguistic Metadata and Document Content (2008)
Niraj Aswani, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham
The need for efficient corpus indexing and querying arises frequently both in machine learning-based and human-engineered natural language processing systems. This paper presents the ANNIC system,...
Experiments of Opinion Analysis On Two Corpora MPQA and NTCIR-6 (2008)
Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
This paper describes the algorithms and linguistic features used in our participating system for the opinion analysis pilot task at NTCIR-6. It presents and discusses the results of our system on the...
Hamish Cunningham, Robert J. Gaizauskas
Much progress has been made in the provision of reusable data resources for Natural Language Engineering, such as grammars, lexicons, thesauruses. Although a number of projects have addressed the...
JAPE: a Java Annotation Patterns Engine 1 Contents (2007)
Hamish Cunningham, Hamish Cunningham, Diana Maynard, Diana Maynard, Valentin Tablan, Valentin Tablan
Hamish Cunningham, Kevin Humphreys
We classify and review current approaches to software infrastructure for research, de-velopment and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of...
Multilingual adaptations of ANNIE, a reusable information extraction (2007)
Diana Maynard, Hamish Cunningham
tool
and Wilks Implementing a Sense Tagger (2007)
Hamish Cunningham, Mark Stevenson, Yorick Wilks
We describe two systems: GATE (General Ar-chitecture for Text Engineering), an architec-ture to aid in the production and delivery of language engineering systems which signifi-cantly reduces...
Hamish Cunningham, Robert J. Gaizauskas
Much progress has been made in the provision of reusable data resources for Natural Language Engineering, such as grammars, lexicons, thesauruses. Al-though a number of projects have ad-dressed the...
Visual Execution and Data Visualisation in Natural Language Processing (2007)
Peter Rodgers, Robert Gaizauskas, Kevin Humphreys, Hamish Cunningham
We describe GGI, a visual system that allows the user to execute an automatically generated data flow graph containing code modules that perform natural language processing tasks. These code modules...
This thesis defines the boundaries of Software Architecture for Language Engineering (SALE), an area formed by the intersection of human language computation and software engineering. SALE covers all...
Experiments with Geographic Knowledge for Information Extraction (2007)
Dimitar Manov, Atanas Kiryakov, Borislav Popov, Ontotext Lab, Diana Maynard, Kalina Bontcheva, ...
Here we present work on using spatial knowledge in conjunction with information extraction (IE). Considerable volume of location data was imported in a knowledge base (KB) with entities of general...
MUSE: a MUlti-Source Entity recognition system (2007)
Diana Maynard, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Yorick Wilks
Abstract. This paper describes a robust and easily adaptable system for named entity recognition from a variety of different text types. Most information extraction systems need to be customised...
Cristian Ursu, Valentin Tablan, Kalina Bontcheva, Oana Hamza, Hamish Cunningham
2.1 How to add supprt for a new document format........ 4 2.2 Detecting the right reader.................... 6
British Telecommunications plc. (2007)
Adam Funk, Brian Davis, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham, Ipswich Ip Re, ...
Deliverable D2.2.2 (WP2.2) This deliverable outlines an approach to automatic generation of metadata from natural language. The emphasis is on easy to use applications and controlled language...
SVM Based Learning System for F-term Patent Classification (2007)
Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
This paper describes our SVM-based system and the techniques we used to adapt the approach for the specifics of the F-term patent classification subtask at NTCIR-6 Patent Retrieval Task. Our system...
Mining Information for Instance Unification (2006)
Aswani, Niraj, Bontcheva, Kalina, Cunningham, Hamish
Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for person...
Mining information for instance unification (2006)
Niraj Aswani, Kalina Bontcheva, Hamish Cunningham
Abstract. Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for...
Automatic Extraction of Hierarchical Relations from Text (2006)
Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, Ji Wang
Abstract. Automatic extraction of semantic relationships between entity instances in an ontology is useful for attaching richer semantic metadata to documents. In this paper we propose an SVM based...
K.: User-friendly ontology authoring using a controlled language (2006)
Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva
In recent years, following the rapid development in the Semantic Web and Knowledge Management research, ontologies have become more in demand in Natural Language Processing. An increasing number of...
Mining information for instance unification (2006)
Niraj Aswani, Kalina Bontcheva, Hamish Cunningham
Abstract. Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for...
Indexing and Querying Linguistic Metadata and Document Content (2005)
Aswani, Niraj, Tablan, Valentin, Bontcheva, Kalina, Cunningham, Hamish
The need for efficient corpus indexing and querying arises frequently both in machine learning-based and human-engineered natural language processing systems. This paper presents the ANNIC system,...
SVM Based Learning System For Information Extraction (2005)
Li, Yaoyong, Bontcheva, Kalina, Cunningham, Hamish
We present an SVM-based learning system for information extraction (IE). One distinctive feature of our system is the use of a variant of the SVM, the SVM with uneven margins, which is particularly...
SVM Based Learning System For Information Extraction (2005)
Li, Yaoyong, Bontcheva, Kalina, Cunningham, Hamish
We present an SVM-based learning system for information extraction (IE). One distinctive feature of our system is the use of a variant of the SVM, the SVM with uneven margins, which is particularly...
Using Uneven Margins SVM and Perceptron for Information Extraction (2005)
Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
The classification problem derived from information extraction (IE) has an imbalanced training set. This is particularly true when learning from smaller datasets which often have a few positive...
Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News (2005)
Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov
The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recognition gives a...
Proceedings of the 9th Conference on Computational Natural Language Learning (CoNLL), (2005)
Pages Ann Arbor, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
The classification problem derived from information extraction (IE) has an imbalanced training set. This is particularly true when learning from smaller datasets which often have a few positive...
Web-assisted annotation, semantic indexing and search of television and radio news (2005)
Mike Dowman, Valentin Tablan, Hamish Cunningham, Borislav Popov
The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recognition gives a...
SVM Based Learning System For Information Extraction (2005)
Yaoyong Li, Kalina Bontcheva, Hamish Cunningham
Abstract. This paper presents an SVM-based learning system for information extraction (IE). One distinctive feature of our system is the use of a variant of the SVM, the SVM with uneven margins,...
Perceptron Learning for Chinese Word Segmentation (2005)
Yaoyong Li, Chuanjiang Miao, Kalina Bontcheva, Hamish Cunningham
We explored a simple, fast and effective learning algorithm, the uneven margins Perceptron, for Chinese word segmentation. We adopted the character-based classification framework and transformed the...
B.: Semantically enhanced television news through web and video integration (2005)
Mike Dowman, Valentin Tablan, Cristian Ursu, Hamish Cunningham, Borislav Popov
Abstract. The Rich News system for semantically annotating television news broadcasts and augmenting them with additional web content is described. Online news sources were mined for material...
British Telecommunications plc. (2005)
Yaoyong Li, Kalina Bontcheva, Niraj Aswani, Wim Peters, Hamish Cunningham, Ipswich Ip Re, ...
Deliverable D2.5.2 (WP2.5) The first part of this deliverable presents the Pascal Challenge on evaluating machine learning methods for Information Extraction, the systems that we entered into the...
B.: Semantically enhanced television news through web and video integration (2005)
Mike Dowman, Valentin Tablan, Cristian Ursu, Hamish Cunningham, Borislav Popov
Abstract. The Rich News system for semantically annotating television news broadcasts and augmenting them with additional web content is described. On-line news sources were mined for material...
An SVM based learning algorithm for Information Extraction (2004)
Li, Yaoyong, Bontcheva, Kalina, Cunningham, Hamish
We present an SVM-based learning algorithm for information extraction. We conducted experiments for different settings of the algorithms and different combinations of NLP features. We also show that...
Introduction to the Special Issue on Software Architecture for Language Engineering (2004)
Cunningham, Hamish, Scott, Donia
Every building, and every computer program, has an architecture: structural and organisational principles that underpin its design and construction. The garden shed once built by one of the authors...
Hamish Cunningham, Donia Scott
1 Software Architecture Every building, and every computer program, has an architecture: structural and organisational principles that underpin its design and construction. The garden shed once built...
Automatic Language-Independent Induction of Gazetteer Lists (2004)
Diana Maynard, Kalina Bontcheva, Hamish Cunningham
Adaptation of existing Information Extraction (IE) systems to new languages and domains is the focus of much current research, but progress is often hindered by the lack of available resources to...
2.3 Using Gaze............................ 5 (2004)
Valentin Tablan, Diana Maynard, Kalina Bontcheva, Hamish Cunningham
Large scale experiments for semantic labeling of noun phrases in raw text (2004)
Louise Guthrie, Roberto Basili, Fabio Zanzotto, Kalina Bontcheva, Hamish Cunningham, David Guthrie, ...
Large scale experiments for semantic labeling of noun phrases in raw text (2004)
Louise Guthrie, Roberto Basili, Fabio Zanzotto, Kalina Bontcheva, Hamish Cunningham, David Guthrie, ...
Cross document ontology based information extraction for multimedia retrieval (2003)
Dennis Reidsma, Jan Kuper, Thierry Declerck, Horacio Saggion, Hamish Cunningham
Abstract. This paper describes the MUMIS project, which applies ontology based Information Extraction to improve the results of Information Retrieval in multimedia archives. It makes use of a domain...
The Cricket Indoor Location System http://nms.lcs.mit.edu/projects/cricket (2003)
Horacio Saggion, Kalina Bontcheva, Hamish Cunningham
We present a robust summarisation system developed within the GATE architecture that makes use of robust components for semantic tagging and coreference resolution provided by GATE. Our system...
The semantic web: A new opportunity and challenge for human language technology (2003)
Kalina Bontcheva, Hamish Cunningham
Abstract. This position paper motivates the need for Semantic Web enabled Human Language Technology (HLT) tools and discusses the major outstanding challenges in this area. It introduces the idea of...
Workshop on Human Language Technology for the Semantic Web and Web Services (2003)
Hamish Cunningham, Ying Ding, Atanas Kiryakov, Programme Committee, Er Maedche, Robert Bosch Gmbh, ...
Proceedings of the
Jan Kuper, Horacio Saggion, Hamish Cunningham, Thierry Declerck, Franciska De Jong, Dennis Reidsma, ...
This paper reports work on automated meta-data creation for multimedia content. The approach results in the generation of a conceptual index of the content which may then be searched via semantic...
Cross document ontology based information extraction for multimedia retrieval (2003)
Dennis Reidsma, Jan Kuper, Thierry Declerck, Horacio Saggion, Hamish Cunningham
Abstract. This paper describes the MUMIS project, which applies ontology based Information Extraction to improve the results of Information Retrieval in multimedia archives. It makes use of a domain...
Towards a semantic extraction of Named Entities (2003)
Diana Maynard, Kalina Bontcheva, Hamish Cunningham
In this paper, we discuss the new challenges posed by the progression from information extraction to content extraction, as demonstrated by the ACE program. We explore whether traditional IE...
The Cricket Indoor Location System http://nms.lcs.mit.edu/projects/cricket (2003)
Horacio Saggion, Kalina Bontcheva, Hamish Cunningham
We present a robust summarisation system developed within the GATE architecture that makes use of robust components for semantic tagging and coreference resolution provided by GATE. Our system...
GATE: A Unicode-based infrastructure supporting multilingual information extraction (2003)
Kalina Bontcheva, Diana Maynard, Valentin Tablan, Hamish Cunningham
NLP infrastructures with comprehensive multilingual support can substantially decrease the overhead of developing Information Extraction (IE) systems in new languages by offering support for...
Louise Guthrie, Roberto Basili, Fabio Zanzotto, Kalina Bontcheva, Hamish Cunningham, Marco Cammisa, ...
1.1 Five Characteristics of the Work......................... 3
The Cricket Indoor Location System http://nms.lcs.mit.edu/projects/cricket (2003)
Horacio Saggion, Kalina Bontcheva, Hamish Cunningham
We present a robust summarisation system developed within the GATE architecture that makes use of robust components for semantic tagging and coreference resolution provided by GATE. Our system...
OLLIE: On-Line Learning for Information Extraction (2003)
Valentin Tablan, Kalina Bontcheva, Diana Maynard, Hamish Cunningham
This paper reports work aimed at developing an open, distributed learning environment, OLLIE, where researchers can experiment with different Machine Learning (ML) methods for Information Extraction....
Software Architectures for Language Engineering: a Critical Review (2003)
Hamish Cunningham, Kalina Bontcheva
This paper concerns infrastructural work in the fields of Language Engineering, Natural Language Processing and Computational Linguistics. We begin by defining the area...
Jan Kuper, Horacio Saggion, Hamish Cunningham, Thierry Declerck, Franciska De Jong, Dennis Reidsma, ...
This paper reports work on automated meta-data creation for multimedia content. The approach results in the generation of a conceptual index of the content which may then be searched via semantic...
GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications (2002)
How feasible is the reuse of grammars for Named Entity Recognition (2002)
Katerina Pastra, Diana Maynard, Oana Hamza, Hamish Cunningham, Yorick Wilks
In this paper, we investigate whether reusing existing grammars for NE recognition instead of creating them from scratch is a viable solution to time constraints in developing grammars. We discuss...
developing language processing components with GATE (2002)
Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan, Cristian Ursu, Marin Dimitrov, ...
Work on GATE has been partly supported by EPSRC grants GR/K25267 (Large-Scale
How feasible is the reuse of grammars for Named Entity Recognition (2002)
Katerina Pastra, Diana Maynard, Oana Hamza, Hamish Cunningham, Yorick Wilks
In this paper, we investigate whether reusing existing grammars for NE recognition instead of creating them from scratch is a viable solution to time constraints in developing grammars. We discuss...
Adapting a robust multi-genre ne system for automatic content extraction (2002)
Diana Maynard, Hamish Cunningham, Kalina Bontcheva, Ontotext Lab
marinsirma. bg Abstract. Many current IE systems tend to be designed with particular applications and domains in mind. With the increasing need for robust NLP tools which can handle a variety of IE...
Architectural elements of language engineering robustness (2002)
Diana Maynard, Valentin Tablan, Hamish Cunningham, Cristian Ursu, Horacio Saggion, Kalina Bontcheva, ...
We discuss robustness in LE systems from the perspective of engineering, and the predictability of both outputs and construction process that this entails. We present an architectural system that...
Extracting Information for Automatic Indexing of Multimedia Material (2002)
Horacio Saggion, Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Oana Hamza, Christian Ursu, ...
This paper discusses our work on information extraction (IE) from multi-lingual, multi-media, multi-genre Language Resources, in a domain where there are many different event types. This work is...
A Light-weight Approach to Coreference Resolution for Named Entities in Text (2002)
Marin Dimitrov, Kalina Bontcheva, Hamish Cunningham, Diana Maynard, Ontotext Lab, Sirma Ai
This paper presents a lightweight approach to pronoun resolution in the case when the antecedent is named entity. It falls under the category of the so-called "knowledge poor "...
Diana Maynard, Kalina Bontcheva, Horacio Saggion, Hamish Cunningham, Oana Hamza
In this paper we describe how information extraction technology has been used to build a summarisation system in the domain of occupational health and safety. The core of the application is based on...
A unicode-based environment for creation and use of language resources (2002)
Valentin Tablan, Cristian Ursu, Kalina Bontcheva, Hamish Cunningham, Diana Maynard, Oana Hamza, ...
GATE is a Unicode-aware architecture, development environment and framework for building systems that process human language. It is often thought that the character sets problem has been solved by...
Using GATE as an environment for teaching NLP (2002)
Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza
In this paper we argue that the GATE architecture and visual development environment can be used as an effective tool for teaching language engineering and computational linguistics. Since GATE comes...
Gate: An architecture for development of robust hlt applications (2002)
Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan
In this paper we present GATE, a framework and graphical development environment which enables users to de-velop and deploy language engineering components and resources in a robust fashion. The GATE...
Adapting a robust multi-genre ne system for automatic content extraction (2002)
Diana Maynard, Hamish Cunningham, Kalina Bontcheva, Ontotext Lab
marinsirma. bg Abstract. Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering...
Kaling Bontcheva, Hamish Cunningham, Diana Maynard, Valentin Tablan, Horacio Saggion
In this paper we present GATE, an architecture and a graphical development environment which enables users to develop and deploy HLT applications in a robust fashion. GATE also provides reusable,...
Gate: An architecture for development of robust hlt applications (2002)
Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan
In this paper we present GATE, a framework and graphical development environment which enables users to de-velop and deploy language engineering components and resources in a robust fashion. The GATE...
Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio Saggion
Abstract. In this paper we show how we used robust human language technology, such as our domain-independent and customisable named entity recogniser, for automatic content annotation and indexing in...
Using GATE as an environment for teaching NLP (2002)
Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza
In this paper we argue that the GATE architecture and visual development environment can be used as an effective tool for teaching language engineering and computational linguistics. Since GATE comes...
Diana Maynard, Kalina Bontcheva, Horacio Saggion, Hamish Cunningham, Oana Hamza
Under consideration for other conferences (specify)? In this paper we show how tools provided by a text engineering framework (GATE) have been used to build an IE-based summarisation system in the...
Gate: An architecture for development of robust hlt applications (2002)
Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan
In this paper we present GATE, a framework and graphical development environment which enables users to develop and deploy language engineering components and resources in a robust fashion. The GATE...
Using GATE as an environment for teaching NLP (2002)
Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza
In this paper we argue that the GATE architecture and visual development environment can be used as an effective tool for teaching language engineering and computational linguistics. Since GATE comes...
Paul Baker, Andrew Hardie, Tony Mcenery, Hamish Cunningham, Rob Gaizauskas
The paper describes developments to date on the EMILLE Project (Enabling Minority Language Engineering) being carried out at the Universities of Lancaster and Sheffield. EMILLE was established to...
Marin Dimitrov, Kalina Bontcheva, Hamish Cunningham, Diana Maynard
This paper presents a lightweight approach to pronoun resolution in the case when the antecedent is a named entity. It falls under the category of the so-called ‘knowledge poor ’ approaches that...
Using HLT for Acquiring, Retrieving and Publishing Knowledge in AKT: Position Paper (2001)
Boncheva, Kalina, Brewster, Christopher, Ciravegna, Fabio, Cunningham, Hamish, Guthrie, Louise, Gaizauskas, Robert, ...
AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...
Using HLT for Acquiring, Retrieving and Publishing Knowledge in AKT: Position Paper (2001)
Bontcheva, Dr. Kalina, Brewster, Mr. Christopher, Ciravegna, Dr. Fabio, Cunningham, Dr. Hamish, Guthrie, Dr. Louise, Gaizauskas, Dr. Rob, ...
AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...
Using HLT for Acquiring, Retrieving and Publishing Knowledge (2001)
Kalina Bontcheva, Christopher Brewster, Fabio Ciravegna, Hamish Cunningham, Louise Guthrie, Robert Gaizauskas, ...
AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...
Named Entity Recognition from Diverse Text Types (2001)
Diana Maynard, Valentin Tablan, Cristian Ursu, Hamish Cunningham, Yorick Wilks
Current research in Information Extraction tends to be focused on application-specific systems tailored to a particular domain. The Muse system is a multi-purpose Named Entity recognition system...
Thierry Declerck, Peter Wittenburg, Hamish Cunningham
We describe in this paper the MUMIS Project (Multimedia Indexing and Searching Environment) , which is concerned with the development and integration of base technologies, demonstrated within a...
Software Architecture for Language Engineering (2000)
This thesis defines the boundaries of Software Architecture for Language Engineering (SALE), an area formed by the intersection of human language computation and software engineering. SALE covers all...
Uniform Language Resource Access and Distribution (2000)
Hamish Cunningham, Wim Peters, Clare Mccauley, Kalina Bontcheva, Yorick Wilks
The reuseability and accessibility of lexical and linguistic resources often requires substantial programming overhead and detailed knowledge of the structure and nature of resource-specific...
Experience of using GATE for NLP R&D (2000)
Hamish Cunningham, Diana Maynard, Kalina Bontcheva, Valentin Tablan, Yorick Wilks
GATE, a General Architecture for Text Engineering, aims to provide a software infrastructure for researchers and developers working in NLP. GATE has now been widely available for four years. In this...
Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks
This paper presents a taxonomy of previous work on infrastructures, architectures and development environments for representing and processing Language Resources (LRs), corpora, and annotations. This...
JAPE: a Java Annotation Patterns Engine (2000)
Hamish Cunningham, Hamish Cunningham, Diana Maynard, Diana Maynard, Valentin Tablan, Valentin Tablan
Contents
Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks
This paper presents a taxonomy of previous work on infrastructures, architectures and development environments for representing and processing Language Resources (LRs), corpora, and annotations. This...
Information Extraction - a User Guide (1999)
Hamish Cunningham, Hamish Cunningham
Contents 1 Introduction 1 2 Types of IE 3 3 Performance levels 4 4 Named Entity recognition 5 5 Coreference resolution 7 6 Template Element production 9 7 Template Relation production 10 8 Scenario...
Yorick Wilks, Robert Gaizauskas, Kevin Humphreys, Hamish Cunningham
The benefits of the effective creation of Information Extraction (IE) in the last ten years, driven by the ARPA TIPSTER programme and the associated MUC evaluations, have been enormous, but it must...
Experience with a Language Engineering Architecture: Three Years of GATE (1999)
Hamish Cunningham, Robert Gaizauskas, Kevin Humphreys, Yorick Wilks
GATE, the General Architecture for Text Engineering, aims to provide a software infrastructure for researchers and developers working in the area of natural language processing. A version of GATE has...
Implementing a Sense Tagger in General Architecture for Text Engineering (1998)
Hamish Cunningham, Mark Stevenson, Yorick Wilks
We describe two systems: GATE (General Architecture for Text Engineering), an architecture to aid in the production and delivery of language engineering systems which significantly reduces...
Implementing a Sense Tagger in a General Architecture for Text Engineering (1998)
Hamish Cunningham, Mark Stevenson, Yorick Wilks
We describe two systems: GATE (General Architecture for Text Engineering), an architecture to aid in the production and delivery of language engineering systems which significantly reduces...
Sense Tagging and Language Engineering (1998)
Mark Stevenson, Hamish Cunningham, Yorick Wilks
. Language engineering is a new, practical, branch of NLP and Computational Linguistics. We describe two language engineering systems: GATE (General Architecture for Text Engineering), an...
Software Infrastructure for Natural Language Processing (1997)
Cunningham, Hamish, Humphreys, Kevin, Gaizauskas, Robert, Wilks, Yorick
We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...
Information Extraction - A User Guide (1997)
This technical memo describes Information Extraction from the point-of-view of a potential user of the technology. No knowledge of language processing is assumed. Information Extraction is a process...
A Design for Multilingual Information Extraction (poster) (1997)
Azzam, S., Humphreys, K., Gaizauskas, R., Cunningham, Hamish, Wilks, Y.
GATE: A TIPSTER-based general architecture for text engineering (1997)
Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas
Building on the work of the TIPSTER architecture group, the University of Sheffield Natural Language Processing group have developed GATE, a General Architecture for Text Engineering. GATE implements...
Software Infrastructure for Natural Language Processing (1997)
Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks
We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...
Information Extraction - a User Guide (1997)
Hamish Cunningham, Hamish Cunningham
Contents 1 Introduction 1 2 Types of IE 2 3 Performance levels 3 4 Named Entity recognition 5 5 Coreference resolution 7 6 Template Element production 8 7 Scenario Template extraction 10 8 An...
Software Infrastructure for Natural Language Processing (1997)
Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas
We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...
New Methods, Current Trends and Software Infrastructure for NLP (1996)
Cunningham, Hamish, Wilks, Yorick, Gaizauskas, Robert J.
The increasing use of `new methods' in NLP, which the NeMLaP conference series exemplifies, occurs in the context of a wider shift in the nature and concerns of the discipline. This paper begins with...
Cunningham, Hamish, Gaizauskas, Robert J., Wilks, Yorick
This report argues for the provision of a common software infrastructure for NLP systems. Current trends in Language Engineering research are reviewed as motivation for this infrastructure, and...
TIPSTER-Compatible Projects at Sheffield (1996)
Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks
more appropriately described by the term Language Engineering than the well-established labels of Nat-ural Language Processing or Computational Linguis-tics. This reflects an increased focus on...
A General Architecture for Text Engineering (1996)
Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas
For a variety of reasons NLP has recently spawned a related engineering discipline called language en-gineering (LE), whose orientation is towards the ap-plication of NLP techniques to solving...
GATE: an environment to support research and development in natural language engineering (1996)
Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys
We describe a software environment to support research and development in natural language (NL) engineering. This environment-- GATE (General Architecture for Text Engineering)-- aims to advance...
New Methods, Current Trends and Software Infrastructure for NLP (1996)
Hamish Cunningham, Yorick Wilks, Robert J. Gaizauskas
. The increasing use of `new methods' in NLP, which this conference series exemplifies, occurs in the context of a wider shift in the nature and concerns of the discipline. This paper begins...
Software Infrastructure for Language Engineering (1996)
Hamish Cunningham, Yorick Wilks, Robert J. Gaizauskas
This paper describes a freely-available system called GATE [Cunningham, Gaizauskas, Wilks 1995] -- a General Architecture for Text Engineering -- which integrates much of this work. GATE is an
CREOLE Developer's Manual (1996)
Hamish Cunningham, Kevin Humphreys, Rob Gaizauskas, Martin Stower
this document or its parent collection using the API defined by the Tipster people. The full specification of this API is given in [6] which is available from Sheffield's ftp site as
GATE: An Environment to Support Research and Development in Natural Language Engineering (1996)
Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys
We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance...
GATE - a General Architecture for Text Engineering (1996)
Hamish Cunningham, Yorick Wilks, Robert J. Gaizauskas
Much progress has been made in the provision of reusable data resources for Natural Language Engineering, such as grammars, lexicons, thesauruses. Although a number of projects have addressed the...
Tipster-Compatible Projects At Sheffield (1996)
Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks
this paper at the April 1996 TIPSTER workshop, and for extensive comments during the preparation of this paper. References
GATE: An Environment to Support Research and Development in Natural Language Engineering (1996)
Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys
We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance...
Hamish Cunningham, Hamish Cunningham, Robert J. Gaizauskas, Robert J. Gaizauskas
this report is that the jury is still out on performance vs. competence. Thus, as well as a host of competing linguistic and lexicographic theories, LE is home to a thoroughgoing paradigm conflict....