Kurt Hornik

Isotone Optimization in R: Pool-Adjacent-Violators Algorithm (PAVA) and Active Set Methods (2009)

De Leeuw, Jan, Hornik, Kurt, Mair, Patrick

In this paper we give a general framework for isotone optimization. First we discuss a generalized version of the Pool-Adjacent-Violators Algorithm (PAVA) to minimize a separable convex function with...

Generalized and Customizable Sets in R (2009)

Meyer, David, Hornik, Kurt

We present data structures and algorithms for sets and some generalizations thereof (fuzzy sets, multisets, and fuzzy multisets) available for R through the sets package. Fuzzy (multi-)sets are based...

Social Network Analysis of Weighted Telecommunications Graphs (2009)

Bohn, Angela, Walchhofer, Norbert, Mair, Patrick, Hornik, Kurt

SNA provides a wide range of tools that allow examination of telecommunications graphs. Those graphs contain vertices representing cell phone users and lines standing for established connections....

Bookmaker Consensus and Agreement for the UEFA Champions League 2008/09 (2009)

Leitner, Christoph, Zeileis, Achim, Hornik, Kurt

Bookmakers' odds are an easily available source of "prospective" information that is thus often employed for forecasting the outcome of sports events. To investigate the statistical properties of...

Dissimilarity Plots (2009)

Hahsler, Michael, Hornik, Kurt

For hierarchical clustering, dendrograms provide convenient and powerful visualization. Although many visualization methods have been suggested for partitional clustering, their usefulness...

Generalized and Customizable Sets in R (2009)

David Meyer, Kurt Hornik

We present data structures and algorithms for sets and some generalizations thereof (fuzzy sets, multisets, and fuzzy multisets) available for R through the sets package. Fuzzy (multi-)sets are based...

Isotone Optimization in R: Pool-Adjacent-Violators Algorithm (PAVA) and Active Set Methods (2009)

Jan De Leeuw, Kurt Hornik, Patrick Mair

In this paper we give a general framework for isotone optimization. First we discuss a generalized version of the pool-adjacent-violators algorithm (PAVA) to minimize a separable convex function with...

Is Federer Stronger in a Tournament Without Nadal? An Evaluation of Odds and Seedings for Wimbledon 2009 (2009)

Leitner, Christoph, Zeileis, Achim, Hornik, Kurt

Wimbledon is one of the most popular annual sports tournaments. In the Gentlemen's Singles 2009, the top-seeded defending champion Rafael Nadal withdrew from the tournament due to injury days prior...

Vienna and R: Love, Marriage and the Future, in Festschrift 50 Jahre Österreichische Statistische Gesellschaft, pages 61-70 (2008)

Kurt Hornik, Friedrich Leisch

Vienna is the home of many contributions to the R Project for statistical computing. In this paper we describe the current status of the R data analysis environment and give an overview of...

TSP – A R-Package for the Traveling Salesperson Problem (2008)

Michael Hahsler, Kurt Hornik

2. Definition and types of TSPs 3. Basic infrastructure in R 4. Algorithms and Heuristics

Model-Based Recursive Partitioning (2008)

Achim Zeileis, Torsten Hothorn, Kurt Hornik

Recursive partitioning is embedded into the general and well-established class of parametric models that can be fitted using M-type estimators (including maximum likelihood). An algorithm for...

Selective association rule generation (2008)

Hahsler, Michael, Buchta, Christian, Hornik, Kurt

Mining association rules is a popular and well researched method for discovering interesting relations between variables in large databases. A practical problem is that at medium to low support...

New probabilistic interest measures for association rules (2008)

Hahsler, Michael, Hornik, Kurt

Mining association rules is an important technique for discovering meaningful patterns in transaction databases. Many different measures of interestingness have been proposed for association rules....

Selective Association Rule Generation (2008)

Michael Hahsler, Christian Buchta, Kurt Hornik

Mining association rules is a popular and well researched method for discovering interesting relations between variables in large databases. A practical problem is that at medium to low support...

New probabilistic interest measures for association rules (2008)

Michael Hahsler, Kurt Hornik

Mining association rules is an important technique for discovering meaningful patterns in transaction databases. Many different measures of interestingness have been proposed for association rules....

Building on the arules infrastructure for analyzing transaction data with R (2008)

Michael Hahsler, Kurt Hornik

The free and extensible statistical computing environment R with its enormous number of extension packages already provides many state-of-the-art techniques for data analysis. Support for...

R News: The Newsletter of the R Project, Volume 1/1, January 2001, Editorial (2008)

Kurt Hornik, Friedrich Leisch

Welcome to the first volume of R News, the newsletter of the R project for statistical computing. R News will feature short to medium length articles covering topics that might be of interest to...

R News: The Newsletter of the R Project, Volume 2/3, December 2002, Editorial (2008)

Kurt Hornik

Time to say good-bye... with the new system of rolling three-year periods of office for the members of the Editorial Board, my term comes to an end on Dec 31, with Doug Bates joining the board in...

R News: The Newsletter of the R Project, Volume 2/2, June 2002, Editorial (2008)

Kurt Hornik

Welcome to this first regular issue of R News in 2002, following the special issue on applications of R in medical statistics. In the future, it is planned to have two non-patch releases of R per...

R News: The Newsletter of the R Project, Volume 1/2, June 2001, Editorial (2008)

Kurt Hornik, Friedrich Leisch

Welcome to the second issue of R News, the newsletter of the R project for statistical computing. First we would like to thank you for all the positive feedback we got on the first volume, which...

R News: The Newsletter of the R Project, Volume 2/1, March 2002, Editorial (2008)

Kurt Hornik

Welcome to the fourth issue of R News, the newsletter of the R project for statistical computing. This is the first special issue of R News, with an emphasis on applying R in medical statistics. It...

Rasp: A Package for Spatial Statistics (2008)

Barry Rowlingson, Rolf Turner, Peter Diggle (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

This paper describes the development of a new R package for spatial data and statistics. We review existing packages that deal with spatial information and comment on how they handle the basic data....

Quality Assurance for Graphics in R (2008)

Paul Murrell, Kurt Hornik

An important part of software development involves testing the reliability of the software. R has good tools for checking that the software runs without catastrophic failure, but provides less...

Converting a Large R Package to S4 Classes and Methods (2008)

Douglas M. Bates, Saikat DebRoy (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

The nlme package for fitting and examining linear and nonlinear mixed-effects models in R is a required package and also one of the largest R packages, based on source package size. In the first phase...

Yellow (2008)

Ross Ihaka (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Choosing a “good” set of colours for a graphical display is an important problem, but one which many data analysis practitioners are ill-equipped to solve. Most current graphics systems provide...

TSP – Infrastructure for the Traveling Salesperson Problem (2008)

Michael Hahsler, Kurt Hornik

The traveling salesperson (or, salesman) problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...

Who is Going to Win the EURO 2008? (2008)

Leitner, Christoph, Zeileis, Achim, Hornik, Kurt

This June one of the biggest and most popular sports tournaments will take place in Austria and Switzerland, the European soccer championship 2008 (UEFA EURO 2008). Therefore millions of soccer fans...

Getting Things in Order: An Introduction to the R Package seriation (2008)

Michael Hahsler, Kurt Hornik, Christian Buchta

Seriation, i.e., finding a suitable linear order for a set of objects given data and a loss or merit function, is a basic problem in data analysis. Caused by the problem's combinatorial nature, it is...

Text Mining Infrastructure in R (2008)

Ingo Feinerer, Kurt Hornik, David Meyer

During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining...

Predicting the Winner of the EURO 2008 (2008)

Leitner, Christoph, Zeileis, Achim, Hornik, Kurt

In June 2008 one of the biggest and most popular sports tournaments took place in Austria and Switzerland, the European football championship 2008 (UEFA EURO 2008). Before the tournament started...

On Two-Periodic Random Walks with Boundaries (2008)

Böhm, Walter, Hornik, Kurt

Two-periodic random walks are models for the one-dimensional motion of particles in which the jump probabilities depend on the parity of the currently occupied state. Such processes have interesting...

Implementing a Class of Permutation Tests: The coin Package (2008)

Torsten Hothorn, Kurt Hornik, Achim Zeileis

The R package coin implements a unified approach to permutation tests providing a huge class of independence tests for nominal, ordered, numeric, and censored data as well as multivariate data at...

NeuroCOLT Coordinating Partner (2007)

Kurt Hornik, Maxwell Stinchcombe, Halbert White, Peter Auer

White's participation was supported by National Science Foundation Grant SES-8921382. We are grateful to Andrew Barron for helpful comments. Affiliation: Technische Universität Wien.

Neural Network Algorithms for Online Principal Component Analysis and Singular Value Decomposition (2007)

Andreas Weingessel, Kurt Hornik

Introduction to PCA and SVD; overview of neural network algorithms for PCA; neural network algorithms for SVD: experimental comparison...

Quality Assurance for Graphics in R (2007)

Paul Murrell, Kurt Hornik (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

An important part of software development involves testing the reliability of the software. R has good tools for checking that the software runs without catastrophic failure, but provides less...

Opportunities and Requirements for Open-Source/Zero-Cost Statistical Software in Companies Under Permanent Reorganization (2007)

Christian Ritter (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

In companies under permanent reorganization, stable software environments are hard or even impossible to maintain. Parts of the company break off into joint ventures and licences need to be split,...

Some Notes on lattice (2007)

Deepayan Sarkar (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Trellis Graphics is implemented in S-PLUS using the traditional S graphics engine. In contrast, lattice, the implementation of Trellis Graphics in R, uses grid graphics as the underlying mechanism....

kernlab – A Kernel Methods Package (2007)

Alexandros Karatzoglou, Achim Zeileis, ... (eds.: Friedrich Leisch, Achim Zeileis)

Designing software for Support Vector Machines (SVM) and kernel methods in general poses an interesting design problem. Our aim is to provide one possible solution using R object oriented features....

Statistical Computing and Databases: Distributed Computing Near the Data (2007)

Fei Chen, Brian D. Ripley (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

This paper addresses the following question: “how do we fit statistical models efficiently with very large data sets that reside in databases?” Nowadays it is quite common to encounter a...

A Graphical Users Interface to Normalize Microarray Data (2007)

Fatima Sanchez Cabo, Zlatko Trajanoski, Kwang-hyun Cho, ... (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Microarray technology is becoming an essential tool in functional genomics. The possibility of monitoring the expression level of thousands of genes simultaneously, as the response to a particular...

A False Discovery Rate Approach to Separate the Score Distributions of Induced and Non-induced Genes (2007)

Stefanie Scheid, Rainer Spang (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

The distribution of scores for differential gene expression observed in microarray experiments gives rise to the assumption that the underlying score distributions of induced and non-induced genes...

Visualizing Independence Using Extended Association and Mosaic Plots (2007)

David Meyer, Achim Zeileis, Kurt Hornik

Two visualization techniques for the independence problem in 2-way contingency tables (association and mosaic plots) are extended in two directions: 1. The visualization is enhanced by displaying...

GIS and the Random Forest Predictor: Integration in R for Tick-Borne Disease Risk Assessment (2007)

C. Furlanello, M. Neteler, S. Merler, ... (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

We discuss how sophisticated machine learning methods may be rapidly integrated within a GIS for the development of new approaches in landscape epidemiology. A multitemporal predictive map is...

svlab – A Kernel Methods Package (2007)

Kurt Hornik

svlab is an extensible, object oriented, package for kernel based learning in R. Its main objective is to provide a tool kit consisting of basic kernel functionality, optimizers and high level...

Generalized M-Fluctuation Tests for Parameter Instability (2007)

Achim Zeileis, Kurt Hornik

Papers published in this report series are preliminary versions of journal articles. This piece of research was supported by the Austrian Science Foundation (FWF) under

License GPL (2007)

David Meyer, Achim Zeileis, Ros Karatzoglou, Kurt Hornik, Michael Friendly

Description: Functions and data sets based on the book “Visualizing Categorical Data” by

Programmer's Niche: nifty hints for programming (2007)

Friedrich Leisch, Kurt Hornik

Welcome to the first volume of R News, the newsletter of the R project for statistical computing. R News will feature short to medium length articles covering topics that might be of interest to...

Editorial (2007)

Kurt Hornik, Friedrich Leisch

Welcome to the second issue of R News, the newsletter of the R project for statistical computing. First we would like to thank you for all the positive feedback we got on the first volume, which...

Depends: R (>= 1.3.0); License: GPL (2007)

Achim Zeileis, Friedrich Leisch, Bruce Hansen, Kurt Hornik, Christian Kleiber, Andrea Peters

Description: Testing for structural change in linear regression relationships. It features tests/methods from the generalized fluctuation test framework as well as from the F test (Chow test)...

Benchmarking Support Vector Machines (2007)

David Meyer, Friedrich Leisch, Kurt Hornik

Support Vector Machines (SVMs) are rarely benchmarked against other classification or regression methods. We compare a popular SVM implementation (libsvm) to 16 classification methods and 9...

StatDataML: An XML Format for Statistical Data (2007)

David Meyer, Friedrich Leisch, Torsten Hothorn, Kurt Hornik

In order to circumvent common difficulties in exchanging statistical data between heterogeneous applications (format incompatibilities, technocentric data representation), we introduce an XML-based...

Prediction Analysis of Microarrays in Excel (2007)

Balasubramanian Narasimhan (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Prediction Analysis of Microarrays (PAM) is a statistical technique for class prediction from gene expression data using shrunken centroids. It is described in Tibshirani, Hastie, Narasimhan, and...

Quantian: A Scientific Computing Environment (2007)

Dirk Eddelbuettel (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

This paper introduces Quantian, a scientific computing environment. Quantian is directly bootable from CD-ROM, self-configuring without user intervention, and comprises hundreds of applications,...

Revealing Predictive Gene Clusters with Supervised Algorithms (2007)

Marcel Dettling (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Microarray technology allows the measurement of expression levels of thousands of genes simultaneously and is expected to contribute significantly to advances in fundamental questions of biology and...

Fractals and Statistics: An R Package Called ifs (2007)

Stefano Maria Iacus, Davide La Torre (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Iterated Function Systems (IFSs) were born in the mid-eighties (see Hutchinson, 1981; Barnsley and Demko, 1985) as applications of the theory of discrete dynamical systems and as useful tools to build...

Putting RGtk to Work (2007)

James Robison-Cox (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

The RGtk package built by Duncan Temple Lang provides one way to build statistical applications in R which can be controlled by the user through a graphical user interface. RGtk is an interface to...

R: Windows Component Services Integrating R and Excel on the COM Layer (2007)

Thomas Baier (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

On the Windows platform a standardized model for component services and interprocess and inter-machine communication has been established by Microsoft—COM. As many applications provide their own or...

Sweave and Beyond: Computations on Text Documents (2007)

Friedrich Leisch (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

Sweave is a tool for embedding R code in LaTeX documents. The code can be evaluated and the resulting console output, figures and tables are automatically inserted into the final document. In...

Learning Bayesian Networks with R (2007)

Susanne G. Bøttcher, Claus Dethlefsen (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

deal is a software package freely available for use with R. It includes several methods for analysing data using Bayesian networks with variables of discrete and/or continuous types but restricted to...

Detecting Clusters of Diseases with R (2007)

V. Gómez-Rubio, J. Ferrándiz, A. López (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

One of the main concerns in Public Health surveillance (Aylin, Maheswaran, Wakefield, Cockings, Jarup, Arnold, Wheeler, and Elliot, 1999) is detection of clusters of diseases, i.e., the presence of...

Database and R Interfacing for Annotated Microarray Data (2007)

Michael Mader, Werner Mewes (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

A framework for expression data repositories (MAGE-ML compatibility) emerged during the last years. Focus is now shifting towards the integration of expression databases into genomic and other...

JAGS: A Program for Analysis of Bayesian Graphical Models Using Gibbs Sampling (2007)

Martyn Plummer (eds.: Kurt Hornik, Friedrich Leisch, Achim Zeileis)

JAGS is a program for Bayesian Graphical modelling which aims for compatibility with classic BUGS. The program could eventually be developed as an R package. This article explains the motivations for...

Open-Source Machine Learning: R Meets Weka (2007)

Hornik, Kurt, Buchta, Christian, Zeileis, Achim

Two of the prime open-source environments available for machine/statistical learning in data mining and knowledge discovery are the software packages Weka and R which have emerged from the machine...

Text Mining of Supreme Administrative Court Jurisdictions (2007)

Feinerer, Ingo, Hornik, Kurt

Within the last decade text mining, i.e., extracting sensitive information from text corpora, has become a major factor in business intelligence. The automated textual analysis of law corpora is...

Benchmarking Open-Source Tree Learners in R/RWeka (2007)

Schauerhuber, Michael, Zeileis, Achim, Meyer, David, Hornik, Kurt

The two most popular classification tree algorithms in machine learning and statistics - C4.5 and CART - are compared in a benchmark experiment together with two other more recent constant-fit tree...

Implementing a Class of Permutation Tests: The coin Package (2007)

Hothorn, Torsten, Hornik, Kurt, Van De Wiel, Mark A., Zeileis, Achim

The R package coin implements a unified approach to permutation tests providing a huge class of independence tests for nominal, ordered, numeric, and censored data as well as multivariate data at...

Getting Things in Order: An Introduction to the R package seriation (2007)

Hahsler, Michael, Hornik, Kurt, Buchta, Christian

Seriation, i.e., finding a linear order for a set of objects given data and a loss or merit function, is a basic problem in data analysis. Caused by the problem’s combinatorial nature, it is hard to...

Let's Have a party! An Open-Source Toolbox for Recursive Partytioning (2007)

Hothorn, Torsten, Zeileis, Achim, Hornik, Kurt

Package party, implemented in the R system for statistical computing, provides basic classes and methods for recursive partitioning along with reference implementations for three recently-suggested...

Escaping RGBland: Selecting Colors for Statistical Graphics (2007)

Zeileis, Achim, Hornik, Kurt, Murrell, Paul

Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines) - e.g., as in bar...

TSP – Infrastructure for the Traveling Salesperson Problem (2007)

Michael Hahsler, Kurt Hornik

The traveling salesperson (or, salesman) problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...

SystemRequirements: Java (>= 5.0) (2007)

Kurt Hornik, Torsten Hothorn, David Meyer, Achim Zeileis

Description An R interface to Weka (Version 3.5.7). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing,...

SystemRequirements: Java (>= 5.0) (2007)

Kurt Hornik, Torsten Hothorn, David Meyer, Achim Zeileis

Description An R interface to Weka (Version 3.5.6). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing,...

The Traveling Salesperson Problem (2007)

Michael Hahsler, Kurt Hornik

1. The traveling salesperson problem 2. TSP package for R 3. Applications for data analysis

License GPL-2 (2007)

David Meyer, Achim Zeileis, Kurt Hornik

Description Visualization techniques, data sets, summary and inference procedures aimed particularly at categorical data. Special emphasis is given to highly extensible grid graphics. The package was...

Evaluating Model-based Trees in Practice (2006)

Zeileis, Achim, Hothorn, Torsten, Hornik, Kurt

A recently suggested algorithm for recursive partitioning of statistical models (Zeileis, Hothorn and Hornik, 2005), such as models estimated by maximum likelihood or least squares, is evaluated in...

Deriving Consensus Rankings from Benchmarking Experiments (2006)

Hornik, Kurt, Meyer, David

Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses...

Deriving Consensus Rankings from Benchmarking Experiments (2006)

Hornik, Kurt; Meyer, David

Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses...

New Probabilistic Interest Measures for Association Rules (2006)

Hahsler, Michael, Hornik, Kurt

Mining association rules is an important technique for discovering meaningful patterns in transaction databases. Many different measures of interestingness have been proposed for association rules....

Choosing Color Palettes for Statistical Graphics (2006)

Zeileis, Achim, Hornik, Kurt

Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines) - e.g., as in bar...

TSP - Infrastructure for the Traveling Salesperson Problem (2006)

Hahsler, Michael, Hornik, Kurt

The traveling salesperson or salesman problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...

Support Vector Machines in R (2006)

Alexandros Karatzoglou, David Meyer, Kurt Hornik

Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language....

The Strucplot Framework: Visualizing Multi-way (2006)

David Meyer, Achim Zeileis, Kurt Hornik

This paper describes the “strucplot” framework for the visualization of multi-way contingency tables. Strucplot displays include hierarchical conditional plots such as mosaic, association, and...

Title: Misc Functions of the Department of Statistics (e1071), TU Wien. Depends: class. Imports: graphics, stats. Suggests: cluster, mlbench, nnet, randomForest, rpart, SparseM, xtable (2006)

Evgenia Dimitriadou, Kurt Hornik, Friedrich Leisch, David Meyer, Andreas Weingessel (maintainer: Friedrich Leisch)

Description Functions for latent class analysis, short time Fourier transform, fuzzy clustering, support vector machines, License GPL version 2. See COPYRIGHT.svm.cpp for the copyright of the svm C++...

TSP – Infrastructure for the Traveling Salesperson Problem (2006)

Michael Hahsler, Kurt Hornik

The traveling salesperson or salesman problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...

Implications of probabilistic data modeling for mining association rules (2006)

Michael Hahsler, Kurt Hornik, Thomas Reutterer

Mining association rules is an important technique for discovering meaningful patterns in transaction databases. In the current literature, the properties of algorithms to mine association...

The RWeka Package: R/Weka interface, Version 0.2-4, 2006-05-07 (2006)

Kurt Hornik, Torsten Hothorn, David Meyer, Achim Zeileis

Description An R interface to Weka (Version 3.5.2). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing,...

Choosing Color Palettes for Statistical Graphics (2006)

Achim Zeileis, Kurt Hornik

Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines)— e.g., as in bar...

A GABRB3 promoter haplotype associated with childhood absence epilepsy impairs transcriptional activity (2006)

Urak, Lydia, Feucht, Martha, Fathi, Nahid, Hornik, Kurt, Fuchs, Karoline

Childhood absence epilepsy (CAE) is considered to exhibit a complex non-Mendelian pattern of inheritance. So far, only few CAE susceptibility genes have been identified. In a previous study of our...

A GABRB3 promoter haplotype associated with childhood absence epilepsy impairs transcriptional activity (2006)

Urak, Lydia, Feucht, Martha, Fathi, Nahid, Hornik, Kurt, Fuchs, Karoline

Childhood absence epilepsy (CAE) is considered to exhibit a complex non-mendelian pattern of inheritance. So far, only few CAE susceptibility genes have been identified. In a previous study of our...

Implications of probabilistic data modeling for rule mining (2005)

Hahsler, Michael, Hornik, Kurt, Reutterer, Thomas

Mining association rules is an important technique for discovering meaningful patterns in transaction databases. In the current literature, the properties of algorithms to mine associations are...

A computational environment for mining association rules and frequent item sets (2005)

Hahsler, Michael, Grün, Bettina, Hornik, Kurt

Mining frequent itemsets and association rules is a popular and well researched approach to discovering interesting relationships between variables in large databases. The R package arules presented...

A CLUE for CLUster ensembles (2005)

Hornik, Kurt

Cluster ensembles are collections of individual solutions to a given clustering problem which are useful or necessary to consider in a wide range of applications. The R package clue provides an...

Model-based recursive partitioning (2005)

Zeileis, Achim, Hothorn, Torsten, Hornik, Kurt

Recursive partitioning is embedded into the general and well-established class of parametric models that can be fitted using M-type estimators (including maximum likelihood). An algorithm for...

Residual-based shadings for visualizing (conditional) independence (2005)

Zeileis, Achim, Meyer, David, Hornik, Kurt

Residual-based shadings for enhancing mosaic and association plots to visualize independence models for contingency tables are extended in two directions: (a) perceptually uniform HCL colors are used...

Support Vector Machines in R (2005)

Karatzoglou, Alexandros, Meyer, David, Hornik, Kurt

Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language....

The Strucplot Framework: Visualizing Multi-way Contingency Tables with vcd (2005)

Meyer, David, Zeileis, Achim, Hornik, Kurt

This paper describes the 'strucplot' framework for the visualization of multi-way contingency tables. Strucplot displays include hierarchical conditional plots such as mosaic, association, and sieve...

A Lego System for Conditional Inference (2005)

Hothorn, Torsten, Hornik, Kurt, Zeileis, Achim

Conditioning on the observed data is an important and flexible design principle for statistical test procedures. Although generally applicable, permutation tests currently in use are limited to the...

A CLUE for CLUster Ensembles (2005)

Kurt Hornik

Cluster ensembles are collections of individual solutions to a given clustering problem which are useful or necessary to consider in a wide range of applications. The R package clue provides an...

arules – A computational environment for mining association rules and frequent item sets (2005)

Michael Hahsler, Bettina Grün, Kurt Hornik, ...

Mining frequent itemsets and association rules is a popular and well researched approach to discovering interesting relationships between variables in large databases. The R package arules presented...

arules – A computational environment for mining association rules and frequent item sets (2005)

Michael Hahsler, Kurt Hornik, Bettina Grün

Mining frequent itemsets and association rules is a popular and well researched approach for discovering interesting relationships between variables in large databases. The R package arules presented...

The design and analysis of benchmark experiments (2005)

Torsten Hothorn, Friedrich Leisch, Kurt Hornik, Achim Zeileis, ...

The assessment of the performance of learners by means of benchmark experiments is an established exercise. In practice, benchmark studies are a tool to compare the performance of several competing...

Unbiased recursive partitioning: A conditional inference framework (2005)

Torsten Hothorn, Kurt Hornik, Achim Zeileis

Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of exhaustive search procedures usually applied to fit such models have been known for a long time:...

Mining Association Rules and Frequent Itemsets (2005)

Michael Hahsler, Bettina Gruen, Kurt Hornik

Provides the basic infrastructure for mining and analyzing association rules and an interface to C implementations of the mining algorithms Apriori and Eclat. Depends: R (>= 2.1.0),...

Bioconductor: open software development for computational biology and bioinformatics (2004)

Gentleman, Robert C, Carey, Vincent J, Bates, Douglas M, Bolstad, Ben, Dettling, Marcel, Dudoit, Sandrine, ...

The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering...

Bioconductor: Open software development for computational biology and bioinformatics (2004)

Gentleman, Robert C., Carey, Vincent J., Bates, Douglas M., Bolstad, Benjamin M., Dettling, Marcel, Dudoit, Sandrine, ...

The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. We detail some of the design decisions, software...

Unbiased Recursive Partitioning: A Conditional Inference Framework (2004)

Hothorn, Torsten, Hornik, Kurt, Zeileis, Achim

Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of exhaustive search procedures usually applied to fit such models have been known for a long time:...

kernlab - An S4 package for kernel methods in R (2004)

Karatzoglou, Alexandros, Smola, Alex, Hornik, Kurt, Zeileis, Achim

kernlab is an extensible package for kernel-based machine learning methods in R. It takes advantage of R's new S4 object model and provides a framework for creating and using kernel-based algorithms....

The vcd Package - Visualizing Categorical Data (2004)

David Meyer, Achim Zeileis, Alexandros Karatzoglou, Kurt Hornik

R topics documented: SexualFun, SpaceShuttle, Suicide, ...

Generalized M-Fluctuation Tests for Parameter Instability (2003)

Zeileis, Achim, Hornik, Kurt

A general class of fluctuation tests for parameter instability in an M-estimation framework is suggested. The tests are based on partial sum processes of M-estimation scores for which functional...

The design and analysis of benchmark experiments (2003)

Hothorn, Torsten, Leisch, Friedrich, Zeileis, Achim, Hornik, Kurt

The assessment of the performance of learners by means of benchmark experiments is an established exercise. In practice, benchmark studies are a tool to compare the performance of several competing...

For the Attainment of the Academic Degree (Zur Erlangung des akademischen Grades) (2003)

Kurt Hornik, David Meyer

In this place, I would like to acknowledge several people who contributed, in one way or another, to this work. First and foremost, I would like to thank my supervisor (and employer) Kurt Hornik for...

A simulation framework for heterogeneous agents (2003)

David Meyer, Christian Buchta, Alexandros Karatzoglou, Friedrich Leisch, Kurt Hornik

Papers published in this report series are preliminary versions of journal articles and not for quotations.

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.)

SciViews aims to provide a unique, polymorph object-oriented interface (called "Plug") to different data analysis software like R, Octave, Scilab, MATLAB, S-PLUS, Ox, Mathematica, ... It is...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Florian Markowetz, Rainer Spang

Introduction A genetic network is a set of genes in which individual genes influence the activity of other genes. The core task in identifying genetic networks is to distinguish direct from indirect...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Henrik Bengtsson

An easy to install and platform independent package named R.oo, which provides support for references and mutable objects via a specific class model using standard R code, has been developed. The...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Simon Urbanek, Martin Theus

This paper describes the integration of interactive statistical graphics within R using state-of-the-art R-connectivity. The iPlots project uses SJava to deliver interactive plots for R and an...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Günter Tirler, Josef Leydold

Random variate generation is an important tool in statistical computing. Many programs for simulation or statistical computing (e.g. R) provide a collection of random variate generators for many...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003), March, Vienna, Austria (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Richard M. Heiberger, Burt Holl

We have designed several multi-panel graphical displays for which the panels are defined by the Cartesian product of one set of variables or parameters with another set. The traditional scatterplot...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Roger Bivand

Access to well-structured and sometimes self-describing spatial position data with associated data attributes in geographical scales domains is increasing, and is expected to increase further. Until...

RArcInfo: Using G.I.S. Data with R (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Virgilio Gomez Rubio

Geographical Information Systems (G.I.S.) have become powerful tools for integration, visualisation and analysis of spatial data, also providing the capability of integrating data from different...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Friedrich Leisch, Achim Zeileis (eds.), David Meyer, Achim Zeileis, Kurt Hornik, ...

Association plots (a visualization technique for the independence problem in 2-way contingency tables) are extended in three directions: 1. The visualization is enhanced by using colors for the...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Uwe Ligges

In the first part of this paper the status quo of R-WinEdt, a plug-in for the shareware editor WinEdt for Windows, is described. This description includes the interface between R and WinEdt,...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Simon Urbanek

Rserve is a TCP/IP server which allows other programs to use facilities of R from various languages without the need to initialize R or link to the R library.

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Peter Bühlmann

We present an extended abstract about boosting. We describe first in section 1 (in a self-contained way) a generic functional gradient descent algorithm, which yields a general representation of...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Thomas Baier, Erich Neuwirth

Spreadsheet programs in general and Microsoft Excel in particular implement a very special paradigm of computing: automatic creation of dependency structures for calculation, and automatic...

The Design and Analysis of Benchmark Experiments (2003)

Torsten Hothorn, Friedrich Leisch, Achim Zeileis, Kurt Hornik, ...

Papers published in this report series are preliminary versions of journal articles. This piece of research was supported by the Austrian Science Foundation

Exploratory Visual Analysis of Graphs in GGobi (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Deborah Swayne, Andreas Buja, Duncan Temple Lang

Graphs have long been of interest in telecommunications and social network analysis, and they are now receiving increasing attention from statisticians working in other areas, particularly in...

GIS and the Random Forest Predictor: Integration in R for Tick-Borne Disease Risk Assessment (2003)

Cesare Furlanello, Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Steno Fontanari, Angela Donini, ...

We discuss how sophisticated machine learning methods may be rapidly integrated within a GIS for the development of new approaches in landscape epidemiology. A multitemporal predictive map is...

PVM Kriging with R (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Albrecht Gebhardt

Kriging is one of the most frequently used prediction methods in spatial data analysis. This paper examines which steps of the underlying algorithms can be performed in parallel on a PVM cluster. It...

Gstat: Multivariable Geostatistics for S (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Edzer J. Pebesma

This paper introduces the gstat package for the S language (R, S-PLUS).

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Paulo J. Ribeiro, Peter J. Diggle, ...

The packages geoR and geoRglm are contributed packages to the statistical software system R, implementing methods for model-based geostatistical data-analysis. In this paper we focus on the...

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Ernst Glatzer (Österreichische Nationalbank), ...

There exists a growing number of spatial statistics software packages covering a broad range of methods including the classical technique of kriging following variogram fitting (for a review see e.g....

Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)

Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Christian Stratowa

Novel high-throughput technologies such as DNA microarray analyses are allowing biologists to generate sets of data in the terabyte realm. Many of these data will be deposited in the public domain,...

Testing and dating of structural changes in practice (2003)

Achim Zeileis, Christian Kleiber, Walter Krämer, Kurt Hornik

The paper presents an approach to the analysis of data that contains (multiple) structural changes in a linear regression setup. We implement various strategies which have been suggested in the...

Monitoring structural change in dynamic econometric models (2002)

Zeileis, Achim, Leisch, Friedrich, Kleiber, Christian, Hornik, Kurt

The classical approach to testing for structural change employs retrospective tests using a historical data set of a given length. Here we consider a wide array of fluctuation-type tests in a...

A Simulation Framework for Heterogeneous Agents (2002)

Meyer, David, Buchta, Christian, Karatzoglou, Alexandros, Leisch, Friedrich, Hornik, Kurt

We introduce a generic simulation framework suitable for agent-based simulations featuring the support of heterogeneous agents, hierarchical scheduling and flexible specification of design...

StatDataML (2002)

Meyer, David, Leisch, Friedrich, Hothorn, Torsten, Hornik, Kurt

In order to circumvent common difficulties in exchanging statistical data between heterogeneous applications (format incompatibilities, technocentric data representation), we introduce an XML-based...

A Mixed Ensemble Approach for the Semi-Supervised Problem (2002)

Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt

In this paper we introduce a mixed approach for the semi-supervised data problem. Our approach consists of an ensemble unsupervised learning part where the labeled and unlabeled points are segmented...

Benchmarking Support Vector Machines (2002)

Meyer, David, Leisch, Friedrich, Hornik, Kurt

Support Vector Machines (SVMs) are rarely benchmarked against other classification or regression methods. We compare a popular SVM implementation (libsvm) to 16 classification methods and 9...

A mixed ensemble approach for the semi-supervised problem (2002)

Evgenia Dimitriadou, Andreas Weingessel, Kurt Hornik, ...

Papers published in this report series are preliminary versions of journal articles and not for quotations. This paper was accepted for publication in:

strucchange: An R Package for Testing for Structural Change in Linear Regression Models (2002)

Achim Zeileis, Friedrich Leisch, Kurt Hornik, Christian Kleiber

This paper introduces ideas and methods for testing for structural change in linear regression models and presents how these have been realized in an R package called strucchange. It features tests...

strucchange: An R Package for Testing for Structural Change in Linear Regression Models (2002)

Achim Zeileis, Friedrich Leisch, Kurt Hornik, ...

Papers published in this report series are preliminary versions of journal articles. This paper was accepted for publication in:

Monitoring Structural Change in Dynamic Econometric Models (2002)

Christian Kleiber, Kurt Hornik, Achim Zeileis, Friedrich Leisch, ...

The classical approach to testing for structural change employs retrospective tests using a historical data set of a given length. Here we consider a wide array of fluctuation-type tests in a...

Anxiety as a Predictor of Relapse in Detoxified Alcohol-Dependent Patients (2002)

Willinger, Ulrike, Lenzinger, Elisabeth, Hornik, Kurt, Fischer, Gabriele, Schönbeck, Georg, Aschauer, Harald N., ...

Aims: To evaluate the impact of mood, affect, and personality on predicting relapse in detoxified alcohol-dependent patients to uncontrolled drinking during a 1-year treatment study. Methods: A...

Emacs Speaks Statistics: A Universal Interface for Statistical Analysis (2001)

Rossini, Anthony, Maechler, Martin, Hornik, Kurt, Heiberger, Richard M., Sparapani, Rodney

Emacs Speaks Statistics (ESS) is a user interface for developing statistical applications and performing data analysis using any of several common statistical programming languages. ESS falls in the...

strucchange (2001)

Zeileis, Achim, Leisch, Friedrich, Hornik, Kurt, Kleiber, Christian

This paper introduces ideas and methods for testing for structural change in linear regression models and presents how these have been realized in an R package called strucchange. It features tests...

Running agent-based simulations (2001)

Meyer, David, Karatzoglou, Alexandros, Buchta, Christian, Leisch, Friedrich, Hornik, Kurt

When running agent-based simulations using ready-made components, one usually faces heterogeneity problems both for the agents’ implementation and for the underlying platform. To circumvent these...

Emacs speaks statistics: A universal interface for statistical analysis (2001)

A. J. Rossini, Martin Mächler, Kurt Hornik, Richard M. Heiberger, Rodney Sparapani

Emacs Speaks Statistics (ESS) is a user interface for developing statistical applications and performing data analysis using any of several common statistical programming languages. ESS falls in the...

Voting-merging: An ensemble method for clustering. ICANN (2001)

Andreas Weingessel, Evgenia Dimitriadou, Kurt Hornik

Combination strategies in classification are a popular way of overcoming instabilities in classification algorithms. A direct application of ideas such as “voting” to cluster analysis problems...

A voting-merging clustering algorithm (1999)

Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt

In this paper we propose an unsupervised voting-merging scheme that is capable of clustering data sets, and also of finding the number of clusters existing in them. The voting part of the algorithm...

On the ergodicity and stationarity of the ARMA (1,1) recurrent neural network process (1999)

Trapletti, Adrian, Leisch, Friedrich, Hornik, Kurt

In this note we consider the autoregressive moving average recurrent neural network ARMA-NN(1, 1) process. We show that in contrast to the pure autoregressive process simple ARMA-NN processes exist...

Fuzzy voting in clustering (1999)

Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt

In this paper we present a fuzzy voting scheme for cluster algorithms. This fuzzy voting method allows us to combine several runs of cluster algorithms resulting in a common fuzzy partition. This...

Voting in clustering and finding the number of clusters (1999)

Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt

In this paper we present an unsupervised algorithm which performs clustering given a data set and which can also find the number of clusters existing in it. This algorithm consists of two techniques....

Voting in Clustering and Finding the Number of Clusters (1999)

Evgenia Dimitriadou, Andreas Weingessel, Kurt Hornik, ...

In this paper we present an unsupervised algorithm which performs clustering given a data set and which can also find the number of clusters existing in it. This algorithm consists of two techniques....

Optimal Linear Compression Under Unreliable Representation and Robust PCA Neural Models (1999)

Konstantinos I. Diamantaras, Kurt Hornik, Michael G. Strintzis

In a typical linear data compression system the representation variables resulting from the coding operation are assumed totally reliable and therefore the solution in the mean-squared-error sense is...

Stationarity and Stability of Autoregressive Neural Network Processes (1999)

Friedrich Leisch, Adrian Trapletti, Kurt Hornik

We analyze the asymptotic behavior of autoregressive neural network (AR-NN) processes using techniques from Markov chains and non-linear time series analysis. It is shown that standard AR-NNs without...

Using independent component analysis for feature extraction and multivariate data projection (1998)

Weingessel, Andreas, Natter, Martin, Hornik, Kurt

Deriving low-dimensional perceptual spaces from data consisting of many variables is of crucial interest in strategic market planning. A frequently used method in this context is Principal Components...

On the stationarity of autoregressive neural network models (1998)

Leisch, Friedrich, Trapletti, Adrian, Hornik, Kurt

We analyze the asymptotic behavior of autoregressive neural network (AR-NN) processes using techniques from Markov chains and non-linear time series analysis. It is shown that standard AR-NNs without...

On the generation of correlated artificial binary data (1998)

Leisch, Friedrich, Weingessel, Andreas, Hornik, Kurt

The generation of random variates from multivariate binary distributions has not gained as much interest in the literature as, e.g., multivariate normal or Poisson distributions. Binary variables are...

Stationary and integrated autoregressive neural network processes (1998)

Trapletti, Adrian, Leisch, Friedrich, Hornik, Kurt

We consider autoregressive neural network (ARNN) processes driven by additive noise. Sufficient conditions on the network weights (parameters) are derived for the ergodicity and stationarity of the...

Cross-Validation with Active Pattern Selection for Neural Network Classifiers (1998)

Friedrich Leisch, Lakhmi C. Jain, Kurt Hornik

We propose a new approach for leave-one-out cross-validation of neural network classifiers called "cross-validation with active pattern selection" (CV/APS). In CV/APS, the contribution of...

Stationary and Integrated Autoregressive Neural Network Processes (1998)

Adrian Trapletti, Friedrich Leisch, Kurt Hornik, ...

We consider autoregressive neural network (ARNN) processes driven by additive noise. Sufficient conditions on the network weights (parameters) are derived for the ergodicity and stationarity of the...

Using Independent Component Analysis for Feature Extraction and Multivariate Data Projection (1998)

Andreas Weingessel, Martin Natter, Kurt Hornik, ...

Deriving low-dimensional perceptual spaces from data consisting of many variables is of crucial interest in strategic market planning. A frequently used method in this context is Principal Components...

On the Stationarity of Autoregressive Neural Network Models (1998)

Adrian Trapletti, Friedrich Leisch, Kurt Hornik, ...

We analyze the asymptotic behavior of autoregressive neural network (AR-NN) processes using techniques from Markov chains and non-linear time series analysis. It is shown that standard AR-NNs without...

On the Generation of Correlated Artificial Binary Data (1998)

Friedrich Leisch, Andreas Weingessel, Kurt Hornik

In this paper we present a computationally fast method to simulate multivariate binary distributions with a given correlation structure. After some remarks on multivariate binary distributions in...

Combining Neural Network Voting Classifiers and Error Correcting Output Codes (1997)

Friedrich Leisch, Kurt Hornik

We show that error correcting output codes (ECOC) can further improve the effects of error dependent adaptive resampling methods such as arc-lh. In traditional one-in-n coding, the distance between...

SVD Algorithms: APEX-like versus Subspace Methods (1997)

Kurt Hornik, Andreas Weingessel

We compare several new SVD learning algorithms which are based on the subspace method in principal component analysis with the APEX-like algorithm proposed by Diamantaras. It is shown...

ARC-LH: A New Adaptive Resampling Algorithm for Improving ANN Classifiers (1997)

Friedrich Leisch, Kurt Hornik

We introduce arc-lh, a new algorithm for improvement of ANN classifier performance, which measures the importance of patterns by aggregated network output errors. On several artificial benchmark...

Adaptive Combination of PCA and VQ Networks (1997)

Andreas Weingessel, Horst Bischof, Kurt Hornik, Friedrich Leisch

In this paper we consider Principal Component Analysis (PCA) and Vector Quantization (VQ) neural networks for image compression. We present a method where the PCA and VQ steps are adaptively...

The Adaptive Network Combined for Data Compression (1997)

Andreas Weingessel, Horst Bischof, Kurt Hornik

We present a method to adaptively combine multilayer perceptrons and vector quantization. We show that this combined network yields better results than the sequential application. 1 Introduction In...

Combining Neural Network Voting Classifiers and Error Correcting Output Codes (1997)

Friedrich Leisch, Kurt Hornik

We show that error correcting output codes (ECOC) can further improve the effects of error dependent adaptive resampling methods such as arc-lh. In traditional one-in-n coding, the distance between...

Error-Dependent Resampling for Artificial Neural Network Classifiers (1997)

Friedrich Leisch, Kurt Hornik

In this paper we investigate several ways of utilizing error-dependent resampling for artificial neural network (ANN) classifiers. The main idea of all methods is that the output error of the trained...

Learning in Linear Neural Networks: a Survey (1995)

Pierre Baldi, Kurt Hornik

Networks of linear units are the simplest kind of networks, where the basic questions related to learning, generalization, and self-organisation can sometimes be answered analytically. We survey most...

NN Classifiers: Reducing the Computational Cost of Cross-Validation by Active Pattern Selection (1995)

Friedrich Leisch, Lakhmi C. Jain, Kurt Hornik

We propose a new approach for leave-one-out cross-validation of neural network classifiers called "cross-validation with active pattern selection" (CV/APS). In CV/APS, the contribution of the...

MOSUM tests for parameter constancy (1995)

Chu, Chia-Shang J., Hornik, Kurt, Kuan, Chung-Ming

In this paper, tests for structural change based on moving sums (MOSUMS) of recursive and least-squares residuals are investigated. We obtain and tabulate the asymptotic critical values of the MOSUM...

Recurrent back-propagation and Newton algorithms for training recurrent neural networks (1993)

Chung-Ming Kuan, Kurt Hornik, Tung Liu

In this paper the recurrent back-propagation and Newton algorithms for an important class of recurrent networks and their convergence properties are discussed. To ensure proper convergence behavior,...

Noisy Linear Networks (1993)

Kurt Hornik

In this paper, we are concerned with situations where a linear feature extraction process is contaminated by noise. More precisely, suppose that upon presentation of an input x, the network actually...

Noisy Principal Component Analysis (1993)

Kostas Diamantaras, Kurt Hornik

In this paper, we are concerned with situations in which the output y of the linear network with weight matrix A from an input x actually is y = Ax + e,

Bioconductor: open software development for computational biology and bioinformatics

Gentleman, Robert C, Carey, Vincent J, Bates, Douglas M, Bolstad, Ben, Dettling, Marcel, Dudoit, Sandrine, ...

A detailed description of the aims and methods of the Bioconductor project, an initiative for the collaborative creation of extensible software for computational biology and bioinformatics.

A Simulation Framework for Heterogeneous Agents

David Meyer, Alexandros Karatzoglou, Christian Buchta, Friedrich Leisch, Kurt Hornik

Agent Based Simulations, Simulation Framework, Heterogeneous Agents, Simulation Design

Monitoring Structural Changes with the Generalized Fluctuation Test

Leisch, Friedrich, Hornik, Kurt, Kuan, Chung-Ming

In this paper we introduce the generalized fluctuation test for monitoring structural changes and establish a result characterizing the limiting behavior of this class of tests. As applications of...

The Strucplot Framework: Visualizing Multi-way

David Meyer, Achim Zeileis, Kurt Hornik

This paper describes the “strucplot” framework for the visualization of multi-way contingency tables. Strucplot displays include hierarchical conditional plots such as mosaic, association,...

strucchange: An R Package for Testing for Structural Change in Linear Regression Models

Achim Zeileis, Friedrich Leisch, Kurt Hornik, Christian Kleiber

This paper reviews tests for structural change in linear regression models from the generalized fluctuation test framework as well as from the F test (Chow test) framework. It introduces a unified...

kernlab - An S4 Package for Kernel Methods in R

Alexandros Karatzoglou, Alexandros Smola, Kurt Hornik, Achim Zeileis

kernlab is an extensible package for kernel-based machine learning methods in R. It takes advantage of R’s new S4 object model and provides a framework for creating and using kernel-based...

A CLUE for CLUster Ensembles

Kurt Hornik

Cluster ensembles are collections of individual solutions to a given clustering problem which are useful or necessary to consider in a wide range of applications. The R package clue provides an...

arules - A Computational Environment for Mining Association Rules and Frequent Item Sets

Michael Hahsler, Bettina Grün, Kurt Hornik

Mining frequent itemsets and association rules is a popular and well researched approach for discovering interesting relationships between variables in large databases. The R package arules presented...

Support Vector Machines in R

Alexandros Karatzoglou, David Meyer, Kurt Hornik

Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language....

TSP: Infrastructure for the Traveling Salesperson Problem

Michael Hahsler, Kurt Hornik

The traveling salesperson (or, salesman) problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...

The generalized fluctuation test: A unifying view

Chung-Ming Kuan, Kurt Hornik

In this paper, a general principle of constructing tests for parameter constancy without assuming a specific alternative is introduced. A unified asymptotic result is established to analyze this...

A Simulation Framework for Heterogeneous Agents

David Meyer, Alexandros Karatzoglou, Friedrich Leisch, Christian Buchta, Kurt Hornik

We introduce a generic simulation framework suitable for agent-based simulations featuring the support of heterogeneous agents, hierarchical scheduling, and flexible specification of design...

Monitoring structural change in dynamic econometric models

Kurt Hornik, Friedrich Leisch, Christian Kleiber, Achim Zeileis

The classical approach to testing for structural change employs retrospective tests using a historical data set of a given length. Here we consider a wide array of fluctuation-type tests in a...

Selective association rule generation

Michael Hahsler, Christian Buchta, Kurt Hornik

Data mining, Association rules, Rule generation,

On the number of points of a homogeneous Poisson process

Auer, Peter, Hornik, Kurt

We investigate the dense and sparse regions of a d-dimensional Poisson process and establish strong laws for both the maximal and the minimal number of points in families of sets of a certain volume.

On the distribution of functionals of stationary Gaussian processes

Hornik, Kurt

Let (X(t), 0 ≤ t ≤ 1) be a zero mean, continuous-path stationary Gaussian process with autocovariance function ρ_X(s) = 1 - ? s , where...

The Moving-Estimates Test for Parameter Stability

Chu, Chia-Shang James, Hornik, Kurt, Kuan, Chung-Ming

In this paper a new class of tests for parameter stability, the moving-estimates (ME) test, is proposed. It is shown that in the standard situation the ME test is asymptotically equivalent to the...

Escaping RGBland: Selecting colors for statistical graphics

Zeileis, Achim, Hornik, Kurt, Murrell, Paul

Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines)--e.g., as in bar...

Generalized and Customizable Sets in R

Kurt Hornik

We present data structures and algorithms for sets and some generalizations thereof (fuzzy sets, multisets, and fuzzy multisets) available for R through the sets package. Fuzzy (multi-)sets are based...