Adrià de Gispert

TALP Phrase-Based System and TALP System Combination for IWSLT 2006 (2008)

Josep M. Crego, Adrià De Gispert, Patrik Lambert, Maxim Khalilov, ...

This paper describes the TALP phrase-based statistical machine translation system, enriched with the statistical machine reordering technique. We also report the combination of this system and the...

INTERSPEECH 2006- ICSLP Linguistic tuple segmentation in ngram-based statistical machine translation (2008)

Adrià De Gispert, Joséb. Mariño

Ngram-based Statistical Machine Translation relies on a standard Ngram language model of tuples to estimate the translation process. In training, this translation model requires a segmentation of...

Low–cost Named Entity Classification for Catalan: Exploiting Multilingual Resources and Unlabeled Data (2007)

Lluís Màrquez, Adrià De Gispert, Xavier Carreras, Lluís Padró

This work studies Named Entity Classification (NEC) for Catalan without making use of large annotated resources of this language. Two views are explored and compared, namely exploiting solely the...

Segmentación lingüística de tuplas para el modelado de la traducción estocástica mediante n-gramas (2006)

Gispert, Adrià De, Mariño, José B.

Ngram-based Statistical Machine Translation relies on a standard Ngram language model of tuples to estimate the translation process. In training, this translation model requires a segmentation of...

Improving statistical word alignments with morpho-syntactic transformations (2006)

Adrià De Gispert, Deepa Gupta, Maja Popović, Patrik Lambert, Jose B. Mariño, Marcello Federico, ...

Abstract. This paper presents a wide range of statistical word alignment experiments incorporating morphosyntactic information. By means of parallel corpus transformations according to information of...

Morpho-syntactic information for automatic error analysis of statistical machine translation output (2006)

Maja Popović, Hermann Ney, Adrià De Gispert, José B. Mariño, Deepa Gupta, Marcello Federico, ...

Evaluation of machine translation output is an important but difficult task. Over the last years, a variety of automatic evaluation measures have been studied, some of them like Word Error Rate...

Statistical machine translation without parallel corpus: bridging through Spanish (2006)

Adrià De Gispert, José B. Mariño

This paper presents a full experiment on large-vocabulary Catalan-English statistical machine translation without an English-Catalan parallel corpus, in the context of the debates of the European...

Morpho-syntactic Information for Automatic Error Analysis of Statistical Machine Translation Output (2006)

Maja Popovic, Hermann Ney, Adrià De Gispert, José B. Mariño, Deepa Gupta, Marcello Federico, ...

Evaluation of machine translation output is an important but difficult task. Over the last years, a variety of automatic evaluation measures have been studied, some of them like Word Error Rate...

Proceedings of the Workshop on Statistical Machine Translation, pages 162--165, (2006)

New York City, Josep M. Crego, José B. Mariño, Adrià De Gispert, Maxim Khalilov, ...

This work presents translation results for the three data sets made available in the shared task "Exploiting Parallel Texts for Statistical Machine Translation" of the HLT-NAACL 2006...

Ngram-based smt system enhanced with reordering patterns (2006)

Josep M. Crego, José B. Mariño, Adrià De Gispert, Maxim Khalilov, ...

This work presents translation results for the three data sets made available in the shared task “Exploiting Parallel Texts for Statistical Machine Translation ” of the HLT-NAACL 2006 Workshop on...

Modelo estocástico de traducción basado en N-gramas de tuplas bilingües y combinación log-lineal de características (2005)

Mariño Acebal, José B., Crego, Josep Maria, Lambert, Patrik, Banchs, Rafael, Gispert, Adrià De, Fonollosa, José A. R., ...

This communication introduces a stochastic machine translation system based on Ngram modelling of the joint probability of bilingual texts. The basic unit of this model is called a tuple and consists...

Algoritmo de Decodificación de Traducción Automática Estocástica basado en N-gramas (2005)

Crego, Josep Maria, Mariño Acebal, José B., Gispert, Adrià De

In this paper we describe MARIE, an N-gram-based stochastic machine translation decoder. It is implemented using a beam search strategy, with distortion (or reordering) capabilities. The underlying...

Clasificación y generalización de formas verbales en sistemas de traducción estocástica (2005)

Gispert, Adrià De, Mariño Acebal, José B., Crego, Josep Maria

En esta comunicación se propone un método para incorporar conocimiento lingüístico relativo a las formas verbales en sistemas estocásticos de traducción. Por medio de una clasificación basada...

Improving statistical machine translation by classifying and generalizing inflected verb forms (2005)

Adrià De Gispert, José B. Mariño, Josep M. Crego

This paper introduces a rule-based classification of single-word and compound verbs into a statistical machine translation approach. By substituting verb forms by the lemma of their head verb, the...

An ngrambased statistical machine translation decoder (2005)

Josep M. Crego, José B. Mariño, Adrià De Gispert

In this paper we describe MARIE, an Ngram-based statistical machine translation decoder. It is implemented using a beam search strategy, with distortion (or reordering) capabilities. The underlying...

Finitestate-based and phrase-based statistical machine translation (2004)

Josep M. Crego, José B. Mariño, Adrià De Gispert

This paper shows the common framework that underlies the translation systems based on phrases or driven by finite state transducers, and summarizes a first comparison between them. In both approaches...