Using Empirical Methods for Evaluating Expression and Content Similarity (2004)

Abstract
Despite the lack of any significant quantifiable similarities between documents, people can intuitively compare documents and evaluate their similarity. To understand how people evaluate text similarity, we asked subjects to rate the content similarity and the expression similarity of pairs of documents. Using these similarity judgments as ground truth, we automated the evaluation of similarity.

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.60.7047
Source http://people.csail.mit.edu/ozlem/Uzuner-HICSS04.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.5.5668, 10.1.1.48.6880, 10.1.1.51.2820, 10.1.1.51.6064, 10.1.1.54.5414, 10.1.1.19.8925, 10.1.1.26.3053, 10.1.1.59.9404, 10.1.1.63.5578, 10.1.1.138.8295, 10.1.1.60.2247, 10.1.1.60.6579