Towards responsive Sensitive Artificial Listeners (2009)
Marc Schröder, Roddy Cowie, Dirk Heylen, Maja Pantic, Catherine Pelachaud, Björn Schuller
Abstract. This paper describes work in the recently started project SEMAINE, which aims to build a set of Sensitive Artificial Listeners – conversational agents designed to sustain an interaction...
Björn Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll
Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise inside a car. In contrast to existing works, we aim to improve noise robustness...
LOW-LEVEL FUSION OF AUDIO AND VIDEO FEATURE FOR MULTI-MODAL EMOTION RECOGNITION (2008)
Matthias Wimmer, Björn Schuller, Dejan Arsic, Gerhard Rigoll, Bernd Radig
Abstract: Bimodal emotion recognition through audiovisual feature fusion has been shown superior over each individual modality in the past. Still, synchronization of the two streams is a challenge,...
Combining Efforts for Improving Automatic Classification of Emotional User States (2008)
Batliner, Anton, Steidl, Stefan, Schuller, Björn, Seppi, Dino, Laskowski, Kornel, Vogt, Thurid, ...
TOWARDS MORE REALITY IN THE RECOGNITION OF EMOTIONAL SPEECH (2008)
Björn Schuller, Anton Batliner, Andreas Maier, Stefan Steidl
As automatic emotion recognition based on speech matures, new challenges can be faced. We therefore address the major aspects in view of potential applications in the �eld, to benchmark today’s...
Combining Frame and Turn-Level Information for Robust Recognition of Emotions within Speech (2008)
Bogdan Vlasenko, Björn Schuller, Andreas Wendemuth, Gerhard Rigoll
Current approaches to the recognition of emotion within speech usually use statistic feature information obtained by application of functionals on turn- or chunk levels. Yet, it is well known that...
Bogdan Vlasenko, Björn Schuller, Andreas Wendemuth, Gerhard Rigoll
Abstract. Opposing the pre-dominant turn-wise statistics of acoustic Low-Level-Descriptors followed by static classification we re-investigate dynamic modeling directly on the frame-level in...
Multiple Camera Person Tracking in Multiple Layers Combining 2D and 3D Information (2008)
Arsié, Dejan, Schuller, Björn, Rigoll, Gerhard
CCTV systems have been introduced in most public spaces in order to increase security. Video outputs are observed by human operators if possible but mostly used as a forensic tool. Therefore it seems...
Multiple Camera Person Tracking in Multiple Layers Combining 2D and 3D Information (2008)
Arsié, Dejan, Schuller, Björn, Rigoll, Gerhard
CCTV systems have been introduced in most public spaces in order to increase security. Video outputs are observed by human operators if possible but mostly used as a forensic tool. Therefore it seems...
Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection (2008)
Björn Schuller, Florian Eyben, Gerhard Rigoll
Rhythmic information plays an important role in Music Information Retrieval. Example applications include automatically annotating large databases by genre, meter, ballroom dance style or tempo,...
Automatische Emotionserkennung aus sprachlicher und manueller Interaktion (2007)
Integration emotionaler Aspekte ist Basis natürlicher und zukunftsweisender Mensch-Maschine-Kommunikation. Vor diesem Hintergrund werden innovative Verfahren zur robusten maschinellen Erkennung...
Combining Efforts for Improving Automatic Classification of Emotional User States. (2006)
Batliner, Anton, Steidl, Stefan, Schuller, Björn, Seppi, Dino, Laskowski, Kornel, Vogt, Thurid, ...
Combining Efforts for Improving Automatic Classification of Emotional User States (2006)
Anton Batliner, Stefan Steidl, Björn Schuller, Dino Seppi, Kornel Laskowski, Thurid Vogt, ...
Classification performance of emotional user states found in realistic, spontaneous speech is not very high, compared to the performance reported for acted speech in the literature. This might be...
Using multimodal interaction to navigate in arbitrary virtual VRML worlds (2001)
Frank Althoff, Gregor Mcglaun, Björn Schuller, Peter Morguet, Manfred Lang
In this paper we present a multimodal interface for navigating in arbitrary virtual VRML worlds. Conventional haptic devices like keyboard, mouse, joystick and touchscreen can freely be combined with...