Publication View

ABSTRACT The Case for Explicit Knowledge in Documents (2008)

Abstract
The Web is full of documents which must be interpreted by human readers and by software agents (search engines, recommender systems, clustering processes etc.). Although Web standards have addressed format obfuscation by using XML schemas and stylesheets to specify unambiguous structure and presentation semantics, interpretation is still hampered by the fundamental ambiguity of information in #PCDATA text. Even the most easily distinguishable kinds of knowledge such as article citations and proper nouns (referring to people, organisations, projects, products, technical concepts) have to be identified by fallible, post-hoc extraction processes. The WiCK project has investigated the writing process in a Semantic Web environment where knowledge services exist and actively assist the author. In this paper we discuss the need to make knowledge an explicit part of the document representation and the advantages and disadvantages of this step.

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=?doi=10.1.1.105.5222
Source http://eprints.ecs.soton.ac.uk/9360/1/pl-04-carr.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords Document Structure, Semantic Web, Knowledge Writing
Type text
Language English
Relation 10.1.1.70.1622, 10.1.1.20.2437, 10.1.1.66.33, 10.1.1.43.6959, 10.1.1.12.8297, 10.1.1.90.1351, 10.1.1.67.5433, 10.1.1.15.3571, 10.1.1.18.3755