Robert Gaizauskas

Position Paper (2009)

Kalina Bontcheva, Christopher Brewster, Fabio Ciravegna, Hamish Cunningham, Louise Guthrie, Robert Gaizauskas, ...

AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...

Acquiring Sense Tagged Examples using Relevance Feedback (2009)

Mark Stevenson, Yikun Guo, Robert Gaizauskas

Supervised approaches to Word Sense Disambiguation (WSD) have been shown to outperform other approaches but are hampered by reliance on labeled training examples (the data acquisition bottleneck)....

Disambiguation of biomedical text using diverse sources of information (2008)

Stevenson, Mark, Guo, Yikun, Gaizauskas, Robert, Martinez, David

Abstract Background Like text in other domains, biomedical documents contain a range of terms with more than one possible meaning. These ambiguities form a significant obstacle to the automatic...

Mining clinical relationships from patient narratives (2008)

Roberts, Angus, Gaizauskas, Robert, Hepple, Mark, Guo, Yikun

Abstract Background The Clinical E-Science Framework (CLEF) project has built a system to extract clinically significant information from the textual component of medical records in order to support...

2005b. A hybrid approach to align sentences and words in English-Hindi parallel corpora (2008)

Niraj Aswani, Robert Gaizauskas

In this paper we describe an alignment system that aligns English-Hindi texts at the sentence and word level in parallel corpora. We describe a simple sentence length approach to sentence alignment...

�Computational Linguistics Society of R.O.C. Information Extraction: Beyond Document Retrieval (2008)

Robert Gaizauskas, Yorick Wilks

In this paper we give a synoptic view of the growth text processing technology of information extraction (IE) whose function is to extract information about a pre-specified set of entities, relations...

Abstract (2008)

José Castaño, Robert Ingria, Roser Saurí, Robert Gaizauskas, Andrea Setzer, Graham Katz

In this paper we provide a description of TimeML, a rich specification language for event and temporal expressions in natural language text, developed in the context of the AQUAINT program on...

SUSSEX UNIVERSITY: DESCRIPTION OF THE SUSSEX SYSTEM USED FOR MUC- 5 (2008)

Introductio N, Robert Gaizauskas, Lynne Cahill, Roger Evan S

This paper describes the system used for the University of Sussex team's participation in the MUC-5 messag e understanding trials. What is described below is the result of 12 person-months of...

ABSTRACT Intelligent Access to Text: Integrating Information Extraction Technology into Text Browsers (2008)

Robert Gaizauskas, Patrick Herring, Michael Oakes, Michelline Beaulieu, Peter Willett, Helene Fowkes, ...

initial.surname£ In this paper we show how two standard outputs from information extraction (IE) systems – named entity annotations and scenario templates – can be used to enhance access to text...

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives (2008)

Emma Barker, Ryuichiro Higashinaka, François Mairesse, Robert Gaizauskas, Marilyn Walker, Jonathan Foster

This paper describes a dialogue data collection experiment and resulting corpus for dialogues between a senior mobile journalist and a junior cub reporter back at the office. The purpose of the...

redundancy (2008)

Horacio Saggion, Robert Gaizauskas

summarization by cluster/profile relevance and

The University of Sheffield’s TREC 2005 Q&A Experiments (2008)

Robert Gaizauskas, Mark A. Greenwood, Henk Harkema, Horacio Saggion, Atheesh Sanka

Introduction Our entries in the TREC 2005 QA evaluation continue experiments carried out as part of TREC 2004. Hence here, as there, we report work on multiple approaches to both the main and...

Benjamin AdrianWorkshop Organization Programme Chairs (2008)

Benjamin Adrian, Günter Neumann, Alexander Troussov, Borislav Popov Preface, Benjamin Adrian, Guenter Neumann, ...

More and more information extraction (IE) systems use ontologies for extraction tasks. These systems use knowledge representation techniques for extracting information from unstructured or...

Mining clinical relationships from patient narratives (2008)

Bmc Bioinformatics, Angus Roberts, Robert Gaizauskas, Yikun Guo

© 2008 Roberts et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License

Automatically Extracting Enzyme Interaction and Protein Structure Information from Biological Science Journal Articles (2007)

Kevin Humphreys, George Demetriou, Robert Gaizauskas

With the explosive growth of scientific literature in the area of molecular biology, the need to automatically process and extract information from on-line text sources has become increasingly...

University of Shefield TREC-8 Q&A System (2007)

Kevin Humphreys, Robert Gaizauskas, Mark Hepple, Mark Sanderson

this paper concentrates on describing the modi#cations made to our existing information extraction system to allow it to participate in the Q & A task.

University of Sheffield TREC-8 Q & A System (2007)

Kevin Humphreys, Robert Gaizauskas, Mark Hepple, Mark Sanderson

this paper concentrates on describing the modifications made to our existing information extraction system to allow it to participate in the Q & A task.

A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment (2007)

Brian Mitchell, Robert Gaizauskas

This paper presents work which extends previous corpus-based work on training Machine Learning Algorithms to perform Prepositional Phrase attachment. Besides recreating others ’ experiments to see...

Investigations into the Grammar Underlying the Penn (2007)

Underlying Penn, R. Gaizauskas, Treebank Ii, Treebank Ii, Robert Gaizauskas

Given a bracketed corpus like the Penn Treebank II (Marcus et al., 1995) an obvious thing to want to do is to extract the grammar underlying the bracketing and investigate some of its properties. How...

TREC 2002 Q&A System (2007)

Mark A. Greenwood, Ian Roberts, Robert Gaizauskas

The system entered by the University of Sheffield in the question answering track of TREC 2002 represents a significant development over the Sheffield system entered into TREC-8 [9] and TREC9 [15],...

Visual Execution and Data Visualisation in Natural Language Processing (2007)

Peter Rodgers, Robert Gaizauskas, Kevin Humphreys, Hamish Cunningham

We describe GGI, a visual system that allows the user to execute an automatically generated data flow graph containing code modules that perform natural language processing tasks. These code modules...

University of Sheeld TREC-9 Q & A System (2007)

Sam Scott, Robert Gaizauskas

The system entered by the University of Sheeld in the question answering track of TREC9 represents a signicant development over the Sheeld system entered for TREC-8 [6] and, satisfyingly, achieved...

University of Sheffield, (2007)

Mark Stevenson, Robert Gaizauskas

Lists of names are an important knowledge source for many systems which carry out named entity recognition. It is shown that augmenting hand-crafted lists with those derived from corpora can improve...

a (2007)

Kevin Humphreys, Robert Gaizauskas, Mark Sanderson

a Department of Computer Science / b

Proceedings of the LREC Workshop "Information Extraction meets Corpus Linguistics" (2007)

Athens Greece Improving, Mark Stevenson, Robert Gaizauskas

Lists of names are an important knowledge source for many systems which carry out named entity recognition. It is shown that augmenting hand-crafted lists with those derived from corpora can improve...

Intelligent Access to Text: Integrating Information (2007)

Extraction Technology Into, Robert Gaizauskas, Patrick Herring, Michael Oakes, Michelline Beaulieu, Peter Willett, ...

In this paper we show how two standard outputs from information extraction (IE) systems -- named entity annotations and scenario templates -- can be used to enhance access to text collections via a...

Computational Linguistics Society of R.O.C. Information Extraction: Beyond Document Retrieval (2007)

Robert Gaizauskas, Yorick Wilks

In this paper we give a synoptic view of the growth text processing technology of information extraction (IE) whose function is to extract information about a pre-specified set of entities, relations...

SemEval2007 task 15: TempEval temporal relation identification (2007)

Marc Verhagen, Robert Gaizauskas, Frank Schilder, Graham Katz, James Pustejovsky

The TempEval task proposes a simple way to evaluate automatic extraction of temporal relations. It avoids the pitfalls of evaluating a graph of inter-related labels by defining three sub tasks that...

Proceedings of the Ninth International Workshop on Parsing Technologies (IWPT), pages 200--201, (2005)

Vancouver October Association, Robert Gaizauskas, Horacio Saggion, Mark A. Greenwood, Kevin Humphreys

We describe SUPPLE, a freely-available, open source natural language parsing system, implemented in Prolog, and designed for practical use in language engineering (LE) applications. SUPPLE can be run...

Using a named entity tagger to generalise surface matching text patterns for question answering (2003)

Mark A. Greenwood, Robert Gaizauskas

This paper explores one particular limitation common to question answering systems which operate by using induced surface matching text patterns – namely the problems concerned with question...

Evaluating Passage Retrieval Approaches for Question Answering (2003)

Ian Roberts, Robert Gaizauskas

Automatic open domain question answering (QA) has been the focus of much recent research, stimulated by the introduction of a QA track in TREC in 1999. Many QA systems have been developed and most...

CLEF—joining up healthcare with clinical and post-genomic research (2003)

Alan Rector, Jeremy Rogers, Adel Taweel, David Ingram, Dipak Kalra, Jo Milan, ...

www.clinical-escience.org CLEF aims to join up clinical care and biomedical research. It is developing methods for managing and using pseudonymised repositories of the long-term patient histories...

Using HLT for Acquiring, Retrieving and Publishing Knowledge (2001)

Kalina Bontcheva, Christopher Brewster, Fabio Ciravegna, Hamish Cunningham, Louise Guthrie, Robert Gaizauskas, ...

AKT is a major research project applying a variety of technologies to knowledge management. Knowledge is a dynamic, ubiquitous resource, which is to be found equally in an expert's head, under...

The meter corpus: A corpus for analysing journalistic text reuse (2001)

Robert Gaizauskas, Jonathan Foster, Yorick Wilks, John Arundel, Paul Clough, Scott Piao

As a part of the METER (MEasuring TExt Reuse) project we have built a new type of comparable corpus consisting of annotated examples of related newspaper texts. Texts in the corpus were manually...

Information Extraction from biological science journal articles: Enzyme interactions and protein structures (2001)

Kevin Humphreys, George Demetriou, Robert Gaizauskas

Information extraction technology, as defined and developed through the U.S. DARPA Message Understanding Conferences (MUCs) [ DARPA, 1998] , has proved

Two applications of information extraction to biological science journal articles: Enzyme interactions and protein structures (2000)

Kevin Humphreys, George Demetriou, Robert Gaizauskas

Information extraction technology, as de ned and developed through the U.S. DARPA Message Understanding Conferences (MUCs), has proved successful at extracting information primarily from newswire...

Experiments on sentence boundary detection (2000)

Mark Stevenson, Robert Gaizauskas

This paper explores the problem of identifying sen-tence boundaries in the transcriptions produced by automatic speech recognition systems. An experi-ment which determines the level of human...

Using corpus-derived named lists for named entity recognition (2000)

Mark Stevenson, Robert Gaizauskas

This paper describes experiments to establish the performance of a named entity recognition system which builds categorized lists of names from manu-ally annotated training data. Names in text are...

A combined IR/NLP approach to question answering against large text collections (2000)

Robert Gaizauskas, Kevin Humphreys

r.gaizauskas,k.humphreys¡ We describe an approach to finding literal answer strings to natural language questions in large text collections. The approach involves linking an IR system with an NLP...

Automatically augmenting terminological lexicons from untagged text (2000)

George Demetriou, Robert Gaizauskas

Lexical resources play a crucial role in language technology but lexical acquisition can often be a time-consuming, laborious and costly exercise. In this paper, we describe a method for the...

Using corpus-derived named lists for named entity recognition (2000)

Mark Stevenson, Robert Gaizauskas

This paper describes experiments to establish the performance of a named entity recognition system which builds categorized lists of names from manually annotated training data. Names in text are...

Term Recognition and Classification in Biological Science Journal Articles (2000)

Robert Gaizauskas, George Demetriou, Kevin Humphreys

In this paper we describe the application of automatic terminology recognition and classification techniques for two bioinformatics projects: extraction of information about enzymes and metabolic...

Two applications of information extraction to biological science journal articles: Enzyme interactions and protein structures (2000)

Kevin Humphreys, George Demetriou, Robert Gaizauskas

Information extraction technology, as defined and developed through the U.S. DARPA Message Understanding Conferences (MUCs), has proved successful at extracting information primarily from newswire...

A Combined IR/NLP Approach to Question Answering Against Large Text Collections (2000)

Robert Gaizauskas, Kevin Humphreys

We describe an approach to finding literal answer strings to natural language questions in large text collections. The approach involves linking an IR system with an NLP system that performs...

Experiments on Sentence Boundary Detection (2000)

Mark Stevenson, Robert Gaizauskas

This paper explores the problem of identifying sentence boundaries in the transcriptions produced by automatic speech recognition systems. An experiment which determines the level of human...

Using Corpus-derived Name Lists for Named Entity Recognition (2000)

Mark Stevenson, Robert Gaizauskas

This paper describes experiments to establish the performance of a named entity recognition system which naively builds categorized lists of names from manually annotated training data and then...

Experiments on Sentence Boundary Detection (2000)

Mark Stevenson, Robert Gaizauskas

This paper explores the problem of identifying sentence boundaries in the transcriptions produced by automatic speech recognition systems. An experiment which determines the level of human...

Using Corpus-derived Name Lists for Named Entity Recognition (2000)

Mark Stevenson, Robert Gaizauskas

This paper describes experiments to establish the performance of a named entity recognition system which builds categorized lists of names from manually annotated training data. Names in text are...

Two applications of information extraction to biological science journal articles: Enzyme interactions and protein structures (2000)

Kevin Humphreys, George Demetriou, Robert Gaizauskas

Information extraction technology, as de ned and developed through the U.S. DARPA Message Understanding Conferences (MUCs), has proved successful at extracting information primarily from newswire...

using coreference chains for text summarization (1999)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We describe the use of coreference chains for the production of text summaries, using a variety of criteria to select a 'best ' chain to represent the main topic of a text. The approach has...

Baseline IE-NE experiments using the SPRACH/LaSIE system (1999)

Steve Renals, Yoshihiko Gotoh, Robert Gaizauskas, Mark Stevenson

We have developed two conceptually different systems that are able to identify named entities from spoken audio. One (referred to as SPRACH-S) has a stochastic finite state machine structure for use...

Baseline IE-NE experiments using the SPRACH/LaSIE system (1999)

Steve Renals, Yoshihiko Gotoh, Robert Gaizauskas, Mark Stevenson

We have developed two conceptually different systems that are able to identify named entities from spoken audio. One (referred to as SPRACH-S) has a stochastic finite state machine structure for use...

Baseline IE-NE Experiments Using the Sprach/LaSIE System (1999)

Steve Renals, Yoshihiko Gotoh, Robert Gaizauskas, Mark Stevenson

We have developed two conceptually different systems that are able to identify named entities from spoken audio. One (referred to as SPRACH-S) has a stochastic finite state machine structure for use...

using coreference chains for text summarization (1999)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We describe the use of coreference chains for the production of text summaries, using a variety of criteria to select a `best ' chain to represent the main topic of a text. The approach has been...

LaSIE jumps the GATE (1999)

Yorick Wilks, Robert Gaizauskas, Kevin Humphreys, Hamish Cunningham

The benefits of the effective creation of Information Extraction (IE) in the last ten years, driven by the ARPA TIPSTER programme and the associated MUC evaluations, have been enormous, but it must...

Using Coreference Chains for Text Summarization (1999)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We describe the use of coreference chains for the production of text summaries, using a variety of criteria to select a `best' chain to represent the main topic of a text. The approach has been...

Baseline Ie-Ne Experiments Using The Sprach/lasie System (1999)

Steve Renals Yoshihiko, Yoshihiko Gotoh, Robert Gaizauskas, Mark Stevenson

We have developed two conceptually different systems that are able to identify named entities from spoken audio. One (referred to as SPRACH-S) has a stochastic finite state machine structure for use...

Experience with a Language Engineering Architecture: Three Years of GATE (1999)

Hamish Cunningham, Robert Gaizauskas, Kevin Humphreys, Yorick Wilks

GATE, the General Architecture for Text Engineering, aims to provide a software infrastructure for researchers and developers working in the area of natural language processing. A version of GATE has...

Baseline IE-NE experiments using the SPRACH/LaSIE system (1999)

Steve Renals, Yoshihiko Gotoh, Robert Gaizauskas, Mark Stevenson

We have developed two conceptually different systems that are able to identify named entities from spoken audio. One (referred to as SPRACH-S) has a stochastic finite state machine structure for use...

Evaluating a focus-based approach to anaphora resolution (1998)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We present an approach to anaphora resolution based on a focusing algorithm, and implemented within an existing MUC (Message Understand-ing Conference) Information Extraction system, allowing...

Evaluating a focus-based approach to anaphora resolution (1998)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We present an approach to anaphora resolution based on a focusing algorithm, and implemented within an existing MUC (Message Understand-ing Conference) Information Extraction system, allowing...

Coreference resolution in a multilingual information extraction (1998)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We present in this paper the coreference mechanism implemented in the M-LaSIE system, a prototype multilingual Information Extraction (IE) system. We describe an experiment in which texts from a...

Information Extraction: Beyond Document Retrieval (1998)

Y. Wilks, Robert Gaizauskas, Yorick Wilks

In this paper we give a synoptic view of the growth text processing technology of information extraction (IE) whose function is to extract information about a pre-specified set of entities, relations...

Extending a Simple Coreference Algorithm with a Focusing Mechanism (1998)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

this paper we present results about an evaluation of both approaches on real-world texts to determine the main drawbacks and advantages of each. We see this as a first step towards answering the...

Information Extraction: Beyond Document Retrieval (1998)

Robert Gaizauskas, Yorick Wilks

In this paper we give a synoptic view of the growth text processing technology of information extraction (IE) whose function is to extract information about a pre-specified set of entities, relations...

Evaluating a Focus-Based Approach to Anaphora Resolution (1998)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas

We present an approach to anaphora resolution based on a focusing algorithm, and implemented within an existing MUC (Message Understanding Conference) Information Extraction system, allowing...

Extraction (1997)

Jonathan Furner, School Of Information, Media Studies, A. M. Robertson, R. Gaizauskas, Alexander M. Robertson, ...

©Copyright in this paper belongs to the author(s) Published in collaboration with the

Event coreference for information extraction (1997)

Kevin Humphreys, Robert Gaizauskas, Saliha Azzam

We propose a general approach for per-forming event coreference and for con-structing complex event representations, such as those required for information ex-traction tasks. Our approach is based on...

Using a language independent domain model for multilingual information extraction (1997)

Saliha Azzam, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

The volume of electronic text in different languages, particularly on the World Wide Web, is growing significantly, and the problem of users who are restricted in the number of languages they read...

GATE: A TIPSTER-based general architecture for text engineering (1997)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas

Building on the work of the TIPSTER architecture group, the University of Sheffield Natural Language Processing group have developed GATE, a General Architecture for Text Engineering. GATE implements...

Coupling Information Retrieval and Information Extraction: A New Text Technology for Gathering Information from the Web (1997)

Robert Gaizauskas, Alexander M. Robertson, Er M. Robertson

The techniques of information retrieval and information extraction are complementary, but to date there has been little concrete work aimed at integrating the two. We describe how each of these...

Event coreference for information extraction (1997)

Kevin Humphreys, Robert Gaizauskas, Saliha Azzam

We propose a general approach for performing event coreference and for constructing complex event representations, such as those required for information extraction tasks. Our approach is based on a...

Compacting the Penn Treebank Grammar (1997)

Alexander Krotov, Robert Gaizauskas, Mark Hepple, Yorick Wilks

Treebanks, such as the Penn Treebank (PTB), afford a simple approach to obtaining a broad coverage grammar: simply read the grammar off the parse trees in the treebank. While such a grammar is easy...

Software Infrastructure for Natural Language Processing (1997)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...

Event Coreference for Information Extraction (1997)

Kevin Humphreys, Robert Gaizauskas, Saliha Azzam

We propose a general approach for performing event coreference and for constructing complex event representations, such as those required for information extraction tasks. Our approach is based on a...

Compacting the Penn Treebank Grammar (1997)

Alexander Krotov, Er Krotov, Robert Gaizauskas, Yorick Wilks

Treebanks, such as the Penn Treebank (PTB), offer a simple approach to obtaining a broad coverage grammar: one can simply read the grammar off the parse trees in the treebank. While such a grammar is...

Event Coreference for Information Extraction (1997)

Kevin Humphreys, Robert Gaizauskas, Saliha Azzam

We propose a general approach for performing event coreference and for constructing complex event representations, such as those required for information extraction tasks. Our approach is based on a...

Using A Semantic Network for Information Extraction (1997)

Robert Gaizauskas, Kevin Humphreys

This paper describes the approach to knowledge representation taken in the LaSIE information extraction (IE) system. Unlike many IE systems that skim texts and use large collections of shallow,...

Software Infrastructure for Natural Language Processing (1997)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas

We classify and review current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the field of NLP...

Report of the Study Group on Assessment and Evaluation (1996)

Crouch, Richard, Gaizauskas, Robert, Netter, Klaus

This is an interim report discussing possible guidelines for the assessment and evaluation of projects developing speech and language systems. It was prepared at the request of the European...

TIPSTER-Compatible Projects at Sheffield (1996)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

more appropriately described by the term Language Engineering than the well-established labels of Nat-ural Language Processing or Computational Linguis-tics. This reflects an increased focus on...

A General Architecture for Text Engineering (1996)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas

For a variety of reasons NLP has recently spawned a related engineering discipline called language en-gineering (LE), whose orientation is towards the ap-plication of NLP techniques to solving...

GATE: an environment to support research and development in natural language engineering (1996)

Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys

We describe a software environment to support research and development in natural language (NL) engineering. This environment-- GATE (General Architecture for Text Engineering)-- aims to advance...

Evaluation of an Algorithm for the Recognition and Classification of Proper Names (1996)

Takahiro Wakao, Robert Gaizauskas, Yorick Wilks

We describe an information extraction system in which four classes of naming expressions -- organisation, person, location and time names -- are recognised and classified with nearly 92% combined...

Using Verb Semantic Role Information to Extend Partial Parses via a Co-reference Mechanism (1996)

Robert Gaizauskas, Kevin Humphreys

We describe a technique for the robust interpretation of newswire texts which uses semantic role information about verb complements together with a general co-reference mechanism to extend the...

Quantitative Evaluation of Coreference Algorithms in an Information Extraction System (1996)

Robert Gaizauskas, Kevin Humphreys

Algorithms for performing coreference resolution can only be precisely evaluated given a benchmark corpus of coreference-annotated texts, together with techniques for evaluating the algorithms'...

Using Verb Semantic Role Information to Extend Partial Parses via a Co-reference Mechanism (1996)

Robert Gaizauskas, Kevin Humphreys

We describe a technique for the robust interpretation of newswire texts which uses semantic role information about verb complements together with a general co-reference mechanism to extend the...

Quantitative Evaluation of Coreference Algorithms in an Information Extraction System (1996)

Robert Gaizauskas, Kevin Humphreys

Algorithms for performing coreference resolution can only be precisely evaluated given a benchmark corpus of coreference-annotated texts, together with techniques for evaluating the algorithms'...

GATE: An Environment to Support Research and Development in Natural Language Engineering (1996)

Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys

We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance...

Tipster-Compatible Projects At Sheffield (1996)

Hamish Cunningham, Kevin Humphreys, Robert Gaizauskas, Yorick Wilks

this paper at the April 1996 TIPSTER workshop, and for extensive comments during the preparation of this paper. References

GATE: An Environment to Support Research and Development in Natural Language Engineering (1996)

Robert Gaizauskas, Hamish Cunningham, Yorick Wilks, Peter Rodgers, Kevin Humphreys

We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance...

Evaluation of an algorithm for the recognition and classification of proper names (1996)

Takahiro Wakao, Robert Gaizauskas, Yorick Wilks

We describe an information extraction system in which four classes of naming expressions organisation, person, location and tinm names are recognised and classified with nearly 92 % combined...

Acquiring a Stochastic Context-Free Grammar from the Penn Treebank (1994)

Alexander Krotov, Robert Gaizauskas, Yorick Wilks

In this paper we present preliminary results of investigating the structure of the Penn Treebank and how these results can be used in probabilistic parsing of English. Penn Treebank is a corpus of...

The CLEF Corpus: Semantic Annotation of Clinical Text

Roberts, Angus, Gaizauskas, Robert, Hepple, Mark, Davis, Neil, Demetriou, George, Guo, Yikun, ...

The Clinical E-Science Framework (CLEF) project is building a framework for the capture, integration and presentation of clinical information: for clinical research, evidence-based health care and...