| Using Empirical Methods for Evaluating Expression and Content Similarity (2004) | |||||||||||||||
Abstract | |||||||||||||||
| Despite lack of any significant quantifiable similarities between documents, people can intuitively compare documents and evaluate their similarity. To understand how people evaluate text similarity, we queried subjects about the level of content similarity and expression similarity of pairs of documents. Using these judgments on similarity as ground truth, we automated evaluation of similarity. | |||||||||||||||
Publication details | |||||||||||||||
| |||||||||||||||