Geographical Scope Resolution (2009)
Geoffrey Andogah, Gosse Bouma, John Nerbonne, Elwin Koster
It is common for placenames to reference other named entities (e.g., names of people, names of organizations, etc.) and to be used as vocabulary words (e.g., city of Split). Apart from reference...
Multiple sequence alignments in linguistics (2009)
Jelena Prokić, Martijn Wieling, John Nerbonne
In this study we apply and evaluate an iterative pairwise alignment program for producing multiple sequence alignments, ALPHAMALIG (Alonso et al., 2004), using as material the phonetic transcriptions...
Applying Language Technology to Detect Shift Effects (2009)
John Nerbonne, Timo Lauttamus, Wybo Wiersma
We discuss an application of a technique from language technology to tag a corpus automatically and to detect syntactic differences between two varieties of Finnish Australian English, one spoken by...
The Forests behind the Trees (2009)
Syntactic databases are increasingly available and are put to a variety of uses, including serving as organized reference material for descriptive and theoretical syntacticians. Dense...
Detecting Contact Effects in Pronunciation (2009)
Wilbert Heeringa, John Nerbonne, Petya Osenova
We investigate language contact effects between Bulgarian dialects on the one hand, and the languages of the countries bordering Bulgaria on the other. The Bulgarian data comes from Stojkov's...
Evaluating the pairwise string alignment of pronunciations (2009)
Martijn Wieling, Jelena Prokić, John Nerbonne
Pairwise string alignment (PSA) is an important general technique for obtaining a measure of similarity between two strings, used e.g., in dialectology, historical linguistics, transliteration, and...
Mapping Aggregate Variation (2009)
We proceed from the view that linguistic variation must be examined from an aggregate perspective, i.e. from a perspective which encompasses as much of the variation between language varieties as...
In many theoretical and applied areas of computational linguistics researchers operate with a notion of linguistic distance or, conversely, linguistic similarity, which is the focus of the present...
Data is from Reeks Nederlands(ch)e Dialectatlassen (2008)
John Nerbonne, Wilbert Heeringa, Peter Kleiweg
This project measures and classifies language variation. In contrast to earlier dialectology, we seek a comprehensive characterization of (potentially gradual) differences between dialects, rather...
Methods for the Extraction of Hungarian Multi-Word Lexemes (2008)
Balázs Kis, Begoña Villada Moirón, Tamás Bíró, Gosse Bouma, Gábor Pohl, Gábor Ugray, ...
This paper describes an experiment on extracting Hungarian multi-word lexemes from a corpus, using statistical methods. Corpus preparation—the addition of POS tags and stems—was done...
Conditional Entropy Measures Intelligibility among Related Languages (2008)
Jens Moberg, Charlotte Gooskens, John Nerbonne, Nathan Vaillette
The Scandinavian languages are so alike that their speakers often communicate, each using their own language, which Haugen (1966) dubbed SEMICOMMUNICATION. The success of semi-communication depends...
Detecting Syntactic Substratum Effects Automatically in Interlanguage (2008)
Timo Lauttamus, Wybo Wiersma, John Nerbonne
This paper applies techniques to obtain an aggregate measure of syntactic distance between two varieties of English spoken by first- and second-generation Finnish...
Projecting Dialect Distances to Geography: Bootstrap Clustering vs. Noisy Clustering (2008)
John Nerbonne, Peter Kleiweg, Franz Manni, Wilbert Heeringa
Dialectometry produces aggregate distance matrices in which a distance is specified for each pair of sites. By projecting groups obtained by clustering onto geography one compares results...
£57,50, £22,50 (paperback). Review (2008)
Evolutionary biologists have developed powerful, largely automated techniques for inferring the common history of groups of organisms or species based on their shared characteristics, a field known...
Learning Computational Grammars (2008)
Miles Osborne, Franck Thollard, Erik Tjong Kim Sang, John Nerbonne, Anja Belz, ...
This paper reports on the LEARNING COMPUTATIONAL GRAMMARS (LCG) project, a postdoc network devoted to studying the application of machine learning techniques to grammars suitable for computational...
John Unsworth. Thanks also to (2008)
John Nerbonne, Geoffrey Rockwell, Koen De Smedt, Lisa Lena Opas
the Humanities* * This paper under the title ‘Date Deluge’, was delivered as the keynote plenary address
Preliminary Identification of Language Groups and Loan Words in Central Asia (2008)
We use Levenshtein distance to classify language groups as opposed to dialects. If successful, this technique could be usefully applied in the preliminary analysis of linguists' field notes. We...
The linguistic situation in Gabon is highly complex as the various varieties form long chains, and multilingualism is common. Most previous classifications of Gabon varieties have used lexical...
Robert A. Wilson, John Nerbonne
The MIT Encyclopedia of the Cognitive Sciences (MITECS) brings together 471 brief articles on a very wide range of topics within cognitive science. The general editors worked with advisory editors in...
Various Variation Aggregates in the LAMSAS (2008)
John Nerbonne, Michael Picone (eds.)
www.let.rug.nl/~nerbonne
Feature-Based Inheritance Networks for Computational Lexicons (2008)
Hans-Ulrich Krieger, John Nerbonne
The virtues of viewing the lexicon as an inheritance network are its succinctness and its tendency to highlight significant clusters of linguistic properties. From its succinctness follow two...
Chapter 18 Parallel texts in computer-assisted language learning (2008)
advanced language learners, particularly in the area of lexis, subtle lexical dependencies. Typically this information is either not available or sporadically available only in very large...
Martijn Wieling, Wilbert Heeringa, John Nerbonne
Contemporary Dutch dialects are compared using the Levenshtein distance, a measure of pronunciation difference. The material consists of data from the most recent Dutch dialect source available: the...
Interfaces Area: Natural Language and Speech Understanding. (2008)
Abdel Kader Diagne, John Nerbonne
We consider communication between modules in an integrated architecture for Speech and Natural Language (NL), in particular the communication with the semantics module. In an integrated...
Interrogative Investigations (II) is a big, important book about the syntax and semantics of questions. It suggests innovations in a part of semantic theory in which consensus has reigned for...
Computing and Historical Phonology (2008)
John Nerbonne, T. Mark Ellison, Grzegorz Kondrak
We introduce the proceedings of the workshop
Wilbert Heeringa, John Nerbonne, Hermann Niebaum, Rogier Nieuweboer, Peter Kleiweg
Up to the middle of the 20th century, for people on both sides of the Dutch-German border, the border was no impediment for understanding each other. The Low Saxon dialects on both sides of the...
Proceedings of EACL '99 Comparison and Classification of Dialects (2008)
John Nerbonne, Wilbert Heeringa, Peter Kleiweg
This project measures and classifies language variation. In contrast to earlier dialectology, we seek a comprehensive characterization of (potentially gradual) differences between dialects, rather...
UMR 5145, Group of population genetics National Museum of Natural History (2008)
Franz Manni, Wilbert Heeringa, John Nerbonne
To what extent are surnames words? Comparing geographic patterns of surname and dialect variation in the Netherlands
Do Surname Differences Mirror Dialect Variation (2008)
Franz Manni, Wilbert Heeringa, Bruno Toupance, John Nerbonne
Our focus in this paper is the analysis of surnames, which have been proven to be reliable genetic markers because in patrilineal systems they are transmitted along generations virtually unchanged,...
Quantitatively Detecting Semantic Relations: The Case of Aspectual Affinity (2008)
The explosion in the creation of text corpora in recent years suggests that the opportunity may be ripe to examine quantitative techniques for their value in semantics. The present paper aims to...
Introducing Theory and Evidence in Semantics (2008)
Erhard Hinrichs, John Nerbonne
This is a collection of papers from a one-day symposium, Theory and
Measuring the Diffusion of Linguistic Change (2008)
We examine situations in which linguistic changes have likely been propagated via normal contact as opposed to via conquest, recent settlement and large-scale migration. We proceed then from...
The Gershwins celebrated linguistic variation famously in Let’s call the whole (2008)
Most studies of language variation proceed from the geographic or social distribution of single elements (features), and find it difficult to proceed further. Data-driven dialectology, and...
A Semantics for Nominal Comparatives (2007)
This work adopts the perspective of plural logic and measurement theory in order first to focus on the microstructure of comparative determiners; and second, to derive the properties of comparative...
rticles at an accessible level on this still very current topic. An updated version with discussion of typed theories would be welcome. The second half of Fenstad et al.'s paper is a 24-pp....
Language Teaching and Language Technology (2007)
John Nerbonne, Sake Jager, Arthur Van Essen
ion of lemmatization given an inflected word, find its lemma (dictionary form) syntactic categorisation (also known as "Part-of-Speech (POS) Disambiguation"). Given a word in context,...
Learning the Logic of Simple Phonotactics (2007)
Erik F. Tjong Kim Sang, John Nerbonne
We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the...
The Journal of Language and Computation (2007)
Michele Abrusci, Nick Asher, Johan Van Benthem, Robert Berwick, Peter Bosch, ...
In this paper we show that the dynamic interpretation techniques of Janssen (assignment modalities), Groenendijk and Stokhof (dynamic binding), and Hendriks (flexibly scoping rules) enable a rigorous...
John Nerbonne, Elena Paskaleva, Lauri Karttunen, Gabor Proszeky, Tiit Roosmaa
GLOSSER is designed to support reading and learning to read in a foreign language. There are four language pairs currently supported by GLOSSER: English-Bulgarian, English-Estonian,...
Michael Rosner, Roderick Johnson (editors), John Nerbonne
This is a collection of excellent papers from a workshop, chaired by the editors, held in
1-881526-29-1, $21.95, £17.50 Reviewed by (2007)
Carl Pollard, Ivan A. Sag, Edited John Goldsmith, James D, Jerrold M. Sadock, John Nerbonne, ...
It has been almost ten years since the classic Generalized Phrase Structure Grammar (Gazdar, Klein, Pullum, and Sag 1985) appeared. And like its predecessor, Head-driven Phrase Structure Grammar will...
Sergio Balari, Jim Barnett, Stephan Busemann, Martin Kay, Hans-Ulrich Krieger, John Nerbonne, ...
This research began while I was working at the department of Computational Linguistics
Ivelin Stoianov, John Nerbonne
The language phonotactics describes the possible combinations of phonemes in order to form syllables and words, and it is typical for every language. Kaplan & Kay (1994) showed that phonological...
Learning Computational Grammars (2007)
John Nerbonne, Erik Tjong Kim Sang
This document describes the second year's progress of the TMR Project Learning Computational Grammars (LCG). In brief, LCG now has a full complement of postdocs, and work is ongoing in areas as...
Connectionist Grapheme to Phoneme Conversion: Exploring Distributed Representations (2007)
Ivelin Stoianov, John Nerbonne
In this paper we explore Simple Recurrent Networks with feature-based letter and phoneme encoding to transform orthographic representations to phonological ones of Dutch words, which is a part of the...
Flexible Semantics Communication in Integrated Speech/Language Architectures (2007)
Abdel Kader Diagne, John Nerbonne
We consider communication between modules in an integrated architecture for Speech and Natural Language (NL), in particular the communication with the semantics module. In an integrated...
Alfa Informatica and Centre for Behavioral, Cognitive and Neuro-sciences (2007)
John Nerbonne, Klaus Netter, Gunter Neumann, Stephen Spackman, Harald Trost
A successful natural language interface (NLI) would carry on a conversation with a user in much the same way that a human being would---in other words, it would constitute a proof of machine...
Alfa Informatica and Centre for Behavioral, Cognitive and Neuro-sciences (2007)
Phrase structure analyses of partial verb phrase (hence: PVP) fronting in German recognize PVPs as potential constituents---i.e, they are constituents not only in the Vorfeld, which they can and must...
A Feature-Based Syntax/Semantics Interface (2007)
John Nerbonne
Learning Computational Grammars (2007)
John Nerbonne
This report presents a general overview of the network related activities at this site and specific reports for the postdoc, the PhD student, the local coordinator and others. An overview of the...
Detecting Aspectual Relations Quantitatively (2007)
The explosion in the creation of text corpora in recent years suggests that the opportunity may be ripe to examine quantitative techniques for their value in semantics. The present paper aims to...
Variation in the Aggregate: An Alternative Perspective for Variationist Linguistics (2007)
We argue that an aggregate view of language variation is needed to supplement the usual characterizations based on single features. Aggregate characterizations are compatible with the...
A Quantitative Analysis of Bulgarian Dialect Pronunciation ∗ (2007)
Petya Osenova, Wilbert Heeringa, John Nerbonne
We apply a computational measure of pronunciation difference to a database of 36 word pronunciations from 490 sites throughout Stoykov's Bulgarian Dialect Atlases. The result is a comprehensive...
The Exact Analysis of Text (2007)
More than forty years after its initial publication, Frederick Mosteller and David Wallace's Inference and Disputed Authorship: The Federalist is an excellent introduction to the application of...
Detecting syntactic contamination in emigrants. The English of Finnish Australians (2007)
Timo Lauttamus, John Nerbonne, Wybo Wiersma
The paper discusses an application of a technique to tag a corpus containing the English of Finnish Australians automatically and to analyse the frequency vectors of part-of-speech (POS) trigrams...
Evaluation of string distance algorithms for dialectology (2006)
Wilbert Heeringa, Peter Kleiweg, Charlotte Gooskens, John Nerbonne
We examine various string distance measures for suitability in modeling dialect distance, especially its perception. We find measures superior which do not normalize for word length, but which are...
Toward a dialectological yardstick (2006)
Dialectometry measures the differences between dialects in ways which may involve many independently varying parameters which must be specified in combination in order to arrive at measures of...
Literary and Linguistic Computing (2006)
John Nerbonne, William Kretzschmar
www.let.rug.nl/nerbonne
Cognitive and Cultural Perspectives on Language (2006)
Cognitive science normally defines itself as the “science of the mind,” meaning in particular the science of mental processes abstracted away from the concrete realization of such processes in...
A measure of aggregate syntactic distance (2006)
We compare vectors containing counts of trigrams of part-of-speech (POS) tags in order to obtain an aggregate measure of syntax difference. Since lexical syntactic categories reflect more abstract...
Progress in Dialectometry: Toward Explanation (2006)
John Nerbonne, William Kretzschmar
Dialectometric techniques analyze linguistic variation quantitatively, allowing one to aggregate over what are frequently rebarbative geographic patterns of individual linguistic variants, such as...
Identifying Linguistic Structure in Aggregate Comparison (2006)
Dialectometric techniques for analyzing variation in the aggregate are maturing rapidly, but there is still little agreement on how to extract linguistic structure from aggregate comparison. The...
Franz Manni, Wilbert Heeringa, John Nerbonne
Since the early studies by Sokal (1988) and Cavalli-Sforza et al. (1989), there has been an increasing interest in depicting the history of human migrations by comparing genetic and linguistic...
Crosstalk in Humanities Computing (2005)
We would like to stimulate crosstalk among the humanities in this paper, i.e., interdisciplinary investigation, and more specifically to show how humanities computing might stimulate this. We use the...
Computational Contributions to the Humanities (2005)
At the University of Groningen we have emphasized a simple view of humanities computing as computing in service of the humanities. This means that we seek to answer scholarly questions in...
Geographic Projection of Cluster Composites (2004)
Peter Kleiweg, John Nerbonne, Leonie Bosveld
A composite cluster map displays a fuzzy categorisation of geographic areas. It combines information from several sources to provide a visualisation of the significance of cluster borders....
Linguistic variation and computation (2003)
Language variationists study how languages vary along geographical or social lines or along lines of age and gender. Variationist data is available and challenging, in particular for DIALECTOLOGY,...
Natural language processing in computer-assisted language learning (2003)
This chapter examines the application of natural language processing to computer-assisted language learning including the history of work in this field over the last thirty-five years but with a focus...
Computer-Assisted Language Learning And Natural Language Processing (2002)
This chapter examines the application of natural language processing to computer-assisted language learning including the history of work in this field over the last thirty-five years but with a focus...
Learning Computational Grammars (2002)
This document describes the fourth and final year's progress of the TMR Project Learning Computational Grammars (LCG). Although different sites wound down their activities during the course of...
Learning Computational Grammars (2001)
John Nerbonne, Anja Belz, Nicola Cancedda, Herve Dejean, James Hammerton, Rob Koeling, ...
This paper reports on the "Learning Computational Grammars" (LCG) project, a postdoc network devoted to studying the application of machine learning techniques to grammars suitable for computational...
Learning computational grammars (2001)
John Nerbonne, Anja Belz, Nicola Cancedda, Hervé Déjean, James Hammerton, Rob Koeling, ...
This paper reports on the LEARNING COMPUTATIONAL GRAMMARS (LCG) project, a postdoc network devoted to studying the application of machine learning techniques to grammars suitable for computational...
Learning Computational Grammars - 3rd Annual Report (2001)
This document describes the third year's progress of the TMR Project Learning Computational Grammars (LCG). In brief, LCG continues with full complement of postdocs and predocs, and work in...
Computational comparison and classification of dialects. Dialectologia et Geolinguistica (2001)
John Nerbonne, Wilbert Heeringa
In this paper a range of methods for measuring the phonetic distance between dialectal variants are described. It concerns variants of the frequency method, the frequency per word method and...
Change, convergence and divergence among Dutch and Frisian (2000)
Wilbert Heeringa, John Nerbonne
The Algemeen Nederduitsch en Friesch dialecticon (Winkler, 1874) (ANFD) contains 186 translations of the parable of 'the prodigal son' into dialects of the Netherlands, northern Belgium...
Learning the Logic of Simple Phonotactics (2000)
Erik F. Tjong Kim Sang, John Nerbonne
We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the...
Computational Comparison and Classification of Dialects (2000)
John Nerbonne, Wilbert Heeringa
In this paper a range of methods for measuring the phonetic distance between dialectal variants are described. It concerns variants of the frequency method, the frequency per word method and...
Null-Headed Nominals in German and English (2000)
In this paper we argue that certain nominal phrase constructions in German and English are best considered as having empty lexical heads. We propose a feature LP, which gives the status of the LEFT...
Comparison and classification of dialects (1999)
John Nerbonne, Wilbert Heeringa, Peter Kleiweg
This project measures and classifies language variation. In contrast to earlier dialectology, we seek a comprehensive characterization of (potentially gradual) differences between dialects, rather...
An FGREP investigation into phonotactics (1999)
Peter Kleiweg, John Nerbonne
We discuss experiments with neural networks being trained in a phonotactic processing task. A recurrent network not only learns to predict the next letter given a partial processed word, but also...
An FGREP Investigation into Phonotactics (1999)
We discuss experiments with neural networks being trained in a phonotactic processing task. A recurrent network not only learns to predict the next letter given a partial processed word, but also...
Learning Simple Phonotactics (1999)
Erik Tjong Kim Sang, John Nerbonne
The present paper compares stochastic learning (Hidden Markov Models), symbolic learning (Inductive Logic Programming), and connectionist learning (Simple Recurrent Networks using backpropagation)...
Learning Computational Grammars (1999)
John Nerbonne, Miles Osborne
This document describes the first year's progress of the TMR Project Learning Computational Grammars (LCG). In brief, despite all sites experiencing delays recruiting staff, LCG now has a full...
Connectionist Learning to Read Aloud and Correlation to Human Data (1999)
Ivelin Stoianov, Laurie Stowe, John Nerbonne
Research on connectionist mapping from written to spoken forms in natural language is presented. A Simple Recurrent Network, which is more plausible for this task, was used instead of a static one. The...
Learning Simple Phonotactics (1999)
Erik Tjong Kim Sang, John Nerbonne
Cause: signal/noise ratio in network goes down when the grammar becomes more complex. Concluding remarks: 1. What machine learning technique generates the...
Exploring Phonotactics with Simple Recurrent Networks (1998)
Ivelin Stoianov, John Nerbonne
graphotactics of Dutch monosyllabic words, overcoming shortcomings of previous implementations. The current report is a continuation of our earlier research, but using phonetic data representations...
Morphological processing and computer-assisted language learning (1998)
John Nerbonne, Duco Dokter, Petra Smit
Contrary to most current practice and contrary to the explicit comments of some practitioners, natural language processing (NLP) can now play a valuable role in computer-assisted language learning...
Modelling the phonotactic structure of natural language words with Simple Recurrent Networks (1998)
Ivelin Stoianov, John Nerbonne, Huub Bouma
Simple Recurrent Networks (SRN) are Neural Network (connectionist) models able to process natural language. Phonotactics concerns the order of symbols in words. We continued an earlier unsuccessful...
A Session with Glosser-RuG (1998)
This paper describes the French-Dutch demonstrator.
Glosser-RuG: A User Study (1998)
Duco Dokter, John Nerbonne, Lily Schurcks-Grozeva, Petra Smit
This paper supports Last's (1992) claim that relatively straightforward programs such as Glosser-RuG can achieve a great deal of success within the general framework of CALL. One of the key...
Connectionist Learning of Natural Language Lexical Phonotactics (1998)
Ivelin Stoianov, John Nerbonne
Connectionist learning of natural language words and their phonetic regularities is presented. The Neural Network (NN) model we employ in this problem is the Simple Recurrent Network, trained with...
this paper, the authors report on work in progress. They set up a general database for phonotactic statements for (at the moment) more than 200 languages. Queries supported includes examples such as...
Measuring Dialect Distance Phonetically (1997)
John Nerbonne, Wilbert Heeringa
We describe ongoing work in the experimental evaluation of a range of methods for measuring the phonetic distance between the dialectal variants of pronunciations. All are variants of Levenshtein...
Reading more into Foreign Languages (1997)
John Nerbonne, Lauri Karttunen, Elena Paskaleva, Gabor Proszeky, Tiit Roosmaa
GLOSSER is designed to support reading and learning to read in a foreign language. There are four language pairs currently supported by GLOSSER: English-Bulgarian, English-Estonian, English-Hungarian...
Modelling the phonotactic structure of natural language words with Simple Recurrent Networks (1997)
Ivelin Stoianov, Huub Bouma, John Nerbonne
The very promising reported results of Neural Network grammar modelling have motivated a lot of researchers to use them in linguistic analysis at each level of natural language - semantics, syntax,...
Computational semantics--linguistics and processing (1996)
Computational linguistics assumes from linguistics a characterization of grammar---the relation between meaning and form---in order to focus on processing, e.g., the algorithms needed to compute...
GLOSSER-RuG: in Support of Reading (1996)
John Nerbonne, Petra Smit
This paper reports on ongoing work on a CALL system to facilitate foreign language learning: GLOSSER-RuG. The system is particularly dependent on advanced morphological analysis. Following a brief...
Phonetic Distance between Dutch Dialects (1996)
John Nerbonne, Wilbert Heeringa, Simone Otten, ...
Traditional dialectology relies on identifying language features which are common to one dialect area while distinguishing it from others. It has difficulty in dealing with partial matches of...
Phonetic Distance between Dutch Dialects (1996)
John Nerbonne, Wilbert Heeringa, Erik Hout, Peter Kooi, Simone Otten
Traditional dialectology relies on identifying language features which are common to one dialect area while distinguishing it from others. It has difficulty in dealing with partial matches of...
Lexicons for Feature-Based Systems (1994)
Feature description languages including relational constraints and default inheritance allow lexical specifications which are transparent, concise, and capture all relevant generalizations; e.g.,...
Feature-based allomorphy (1993)
Hans-Ulrich Krieger, Hannes Pirker, John Nerbonne
Morphotactics and allomorphy are usually modeled in different components, leading to interface problems. To describe both uniformly, we define finite automata (FA) for allomorphy in the same feature...
Software for Applied Semantics (1993)
John Nerbonne, Joachim Laubsch, Abdel Kader Diagne, Stephan Oepen
This paper recommends an approach to the implementation of semantic representation languages (SRLs) which exploits a parallelism between SRLs and programming languages (PLs). The design requirements...
Feature Based Allomorphy (1993)
Hans-Ulrich Krieger, John Nerbonne, Hannes Pirker
Morphotactics and allomorphy are usually modeled in different components, leading to interface problems. To describe both uniformly, we define finite automata (FA) for allomorphy in the same feature...
A Diagnostic Tool for German Syntax (1993)
John Nerbonne, Klaus Netter, Abdel Kader Diagne, Judith Klein, Ludwig Dickmann
In this paper we describe an ongoing effort to construct a catalogue of syntactic data exemplifying the major syntactic patterns of German. The purpose of the corpus is to support the diagnosis of...
A Feature-Based Syntax/Semantics Interface (1993)
Syntax/Semantics interfaces using unification-based or feature-based formalisms are increasingly common in the existing computational linguistics literature. The primary reason for attempting to...
A Diagnostic Tool for German Syntax (1992)
John Nerbonne, Klaus Netter, Judith Klein, Abdel Kader Diagne, Ludwig Dickmann, ...
In this paper we describe an effort to construct a catalogue of syntactic data, exemplifying the major syntactic patterns of German. The purpose of the corpus is to support the diagnosis of errors in...
Feature-Based Lexicons: An Example and a Comparison to DATR (1992)
John Nerbonne
Among the expertises relevant for successful natural language understanding are grammar, meaning and background knowledge, all of which must be represented in order to decode messages from text (or...
Constraint-Based Semantics (1991)
Montague's famous characterization of the homomorphic relation between syntax and semantics naturally gives way in computational applications to constraint-based formulations. This was...
Inheritance and Complementation: A Case Study of Easy Adjectives and Related Nouns (1991)
Dan Flickinger, John Nerbonne
Mechanisms for representing lexically the bulk of syntactic and semantic information for a language have been under active development, as is evident in the recent studies contained in this volume....
Lewis G. Creary, J. Mark Gawron, John Nerbonne
We propose a semantics for locative expressions such as near Jones or west of Denver, an important subsystem for NLP applications. Locative expressions denote regions of space, and serve as arguments...
Reading more into Foreign Languages
John Nerbonne, Lauri Karttunen, Elena Paskaleva, Gabor Proszeky, Tiit Roosmaa