Imed Zitouni

OrienTel – Arabic speech resources for the IT market (2008)

Rainer Siemund, Barbara Heuft, Khalid Choukri, Ossama Emam, Herbert Tropf, Oren Gedge, ...

A survey of the language resources market clearly shows that the Arabic language is still a stepchild of international R&D efforts in the field of speech recognition. OrienTel for the first time...

Maximum entropy based restoration of Arabic diacritics (2006)

Imed Zitouni, Jeffrey S. Sorensen, Ruhi Sarikaya

Short vowels and other diacritics are not part of written Arabic scripts. Exceptions are made for important political and religious texts and in scripts for beginning students of Arabic. Script...

Factorizing complex models: A case study in mention detection (2006)

Radu Florian, Hongyan Jing, A Kambhatla, Imed Zitouni

As natural language understanding research advances towards deeper knowledge modeling, the tasks become more and more complex: we are interested in more nuanced word characteristics, more linguistic...

The Impact of Morphological Stemming on Arabic Mention Detection and Coreference Resolution (2005)

Imed Zitouni, Jeff Sorensen, Xiaoqiang Luo, Radu Florian

Arabic presents an interesting challenge to natural language processing, being a highly inflected and agglutinative language. In particular, this paper presents an in-depth investigation of the...

Adaptive language models for spoken dialogue systems (2002)

Roger Argiles Solsona, Eric Fosler-lussier, Ros Potamianos, Imed Zitouni

In this paper, we investigate both generative and statistical approaches for language modeling in spoken dialogue systems. Semantic class-based finite state ¢ and-gram grammars are used for...

Multilingual Access to Interactive Communication (2002)

Rainer Siemund, Barbara Heuft, Khalid Choukri, Ossama Emam, Herbert Tropf, Oren Gedge, ...

OrienTel is a project funded within the European Commission’s IST framework that focuses on collecting linguistic data for telephony-based IT applications across the Mediterranean and the Middle...

A comparative study of topic identification on newspaper and e-mail (2001)

Brigitte Bigi, Armelle Brun, Jean-paul Haton, Kamel Smaïli, Imed Zitouni

This paper presents several statistical methods for topic identification on two kinds of textual data: newspaper articles and e-mails. Five methods are tested on these two corpora: topic unigrams,...

A comparative study of topic identification on newspaper and e-mail (2001)

Brigitte Bigi, Armelle Brun, Jean-paul Haton, Kamel Smali, Imed Zitouni

This paper presents several statistical methods for topic identification on two kinds of textual data: newspaper articles and e-mails. Five methods are tested on these two corpora: topic unigrams,...