Vocabulary Patterns in Free-for-all Collaborative Indexing Systems (2009)

Abstract
In collaborative indexing systems, users generate a large amount of metadata by labelling web-based content. These labels are known as tags and form a shared vocabulary. To understand the characteristics of that vocabulary, we study structural patterns of these tags by applying the theory of self-organizing systems. To this end, we use graph-theoretic notions to model the network of tags and their weighted connections, which represent how frequently tags co-occur. Empirical data is provided by the free-for-all collaborative indexing systems Delicious, Connotea and CiteULike. First, we measure the frequency distribution of co-occurring tags. Second, we correlate these tags with their rank over time. Results indicate a strong relationship among a few tags as well as a notable persistence of these tags over time. We therefore make the educated guess that the observed collaborative indexing systems are self-organizing systems that build towards a shared vocabulary. The results imply the presence of semantic domains based on high co-occurrence frequencies, which reflect topics of interest among the user community. Observed over time, those semantic domains can be used to trace the historical development of, or emerging trends in, the community’s interests, thus enhancing collaborative indexing systems in general while also providing a new tool for developing community-based products and services.
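
The two measurements named in the abstract (co-occurrence frequencies, and the stability of tag ranks over time) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy tag lists, and the choice of Spearman's rank correlation are all assumptions made for illustration, since the abstract does not state which correlation measure was used.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(posts):
    """Count how often each unordered tag pair appears on the same post.

    `posts` is an iterable of tag lists (hypothetical input format).
    The result is the edge-weight table of the tag network.
    """
    counts = Counter()
    for tags in posts:
        # Deduplicate and sort so ("a", "b") and ("b", "a") share one key.
        for a, b in combinations(sorted(set(tags)), 2):
            counts[(a, b)] += 1
    return counts


def ranks(counts):
    """Rank tag pairs by frequency (1 = most frequent); ties broken by name."""
    ordered = sorted(counts, key=lambda pair: (-counts[pair], pair))
    return {pair: i + 1 for i, pair in enumerate(ordered)}


def spearman(rank_a, rank_b):
    """Spearman rank correlation over the pairs present in both periods.

    Assumption: Spearman's rho without tie correction stands in for the
    unspecified correlation measure of the paper.
    """
    common = sorted(set(rank_a) & set(rank_b))
    n = len(common)
    if n < 2:
        raise ValueError("need at least two shared tag pairs")
    # Re-rank within the shared set so both rankings run from 1 to n.
    ra = {p: i + 1 for i, p in enumerate(sorted(common, key=lambda p: rank_a[p]))}
    rb = {p: i + 1 for i, p in enumerate(sorted(common, key=lambda p: rank_b[p]))}
    d2 = sum((ra[p] - rb[p]) ** 2 for p in common)
    return 1 - 6 * d2 / (n * (n ** 2 - 1))


# Toy data for two observation periods (invented, not from the paper).
period_1 = [["web", "design", "css"], ["web", "design"], ["python", "web"]]
period_2 = [["web", "design"], ["web", "design", "css"],
            ["web", "python"], ["web", "python"]]

rho = spearman(ranks(cooccurrence_counts(period_1)),
               ranks(cooccurrence_counts(period_2)))
print(f"rank correlation between periods: {rho:.2f}")
```

A value of rho close to 1 would correspond to the persistence of co-occurring tags over time that the abstract reports; the frequency distribution itself can be read directly from the values returned by `cooccurrence_counts`.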

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.142.9594
Source http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-292/paper6.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords Metadata, tagging, shared vocabulary, online community
Type text
Language English
Relation 10.1.1.103.8364, 10.1.1.135.1000, 10.1.1.142.7750, 10.1.1.134.4929, 10.1.1.79.1933, 10.1.1.97.4194, 10.1.1.134.296, 10.1.1.128.6045