| Automatic term list generation for entity tagging (2005) | |||||||||||||||
Abstract | |||||||||||||||
| Entity tagging is perhaps the most basic tasks in information extraction. Consequently, if any sophisticated information extraction is to be done, this task must be done well. Currently, state of the art taggers are trained with hand-labeled training data that resembles the text to which they will be applied. However, such training data is rarely abundant enough to contain more than a modest subset of entities from the target entity class. To remedy this, designers of information extraction systems frequently augment their systems with large lists of known members of the entity class so as to cover most of the prototypical, and perhaps not so prototypical, positive instances that the training data may not contain. Such entity lists, or term lists | |||||||||||||||
Publication details | |||||||||||||||
| |||||||||||||||