Publication View

ABSTRACT Optimizing Multi-Graph Learning: Towards A Unified Video Annotation Scheme (2008)

Abstract
Learning based semantic video annotation is a promising approach for enabling content-based video search. However, severe difficulties, such as insufficiency of training data and curse of dimensionality, are frequently encountered. This paper proposes a novel unified scheme, Optimized Multi-Graph-based Semi-Supervised Learning (OMG-SSL), to simultaneously attack these difficulties. Instead of only using a single graph, OMG-SSL integrates multiple graphs into a regularization and optimization framework to sufficiently explore their complementary nature. We then show that various crucial factors in video annotation, including multiple modalities, multiple distance metrics, and temporal consistency, in fact all correspond to different correlations among samples, and hence they can be represented by different graphs. Therefore, OMG-SSL is able to simultaneously deal with these factors within a unified framework. Experiments on the TRECVID benchmark demonstrate the effectiveness of our proposed approach.

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.73.1383
Source http://research.microsoft.com/~xshua/publications/pdf/2007_ACMMM_OMGSSL.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords Categories and Subject Descriptors H.3.1 [Information Storage and Retrieval, Content Analysis and Index—indexing methods, I.2.10 [Artificial Intelligence, Video and Scene Understanding—video analysis General Terms Algorithms, Experimentation Keywords Video annotation
Type text
Language English
Relation 10.1.1.41.2789, 10.1.1.31.1422, 10.1.1.14.8561, 10.1.1.115.3219, 10.1.1.103.1693, 10.1.1.23.7409, 10.1.1.66.3905, 10.1.1.110.3004, 10.1.1.4.1570, 10.1.1.78.5928, 10.1.1.85.9105, 10.1.1.91.8166, 10.1.1.69.5704, 10.1.1.107.4728, 10.1.1.124.8036, 10.1.1.90.4005