Isotone Optimization in R: Pool-Adjacent-Violators Algorithm (PAVA) and Active Set Methods (2009)
De Leeuw, Jan, Hornik, Kurt, Mair, Patrick
In this paper we give a general framework for isotone optimization. First we discuss a generalized version of the Pool-Adjacent-Violators Algorithm (PAVA) to minimize a separable convex function with...
Generalized and Customizable Sets in R (2009)
We present data structures and algorithms for sets and some generalizations thereof (fuzzy sets, multisets, and fuzzy multisets) available for R through the sets package. Fuzzy (multi-)sets are based...
Social Network Analysis of Weighted Telecommunications Graphs (2009)
Bohn, Angela, Walchhofer, Norbert, Mair, Patrick, Hornik, Kurt
SNA provides a wide range of tools that allow examination of telecommunications graphs. Those graphs contain vertices representing cell phone users and lines standing for established connections....
Bookmaker Consensus and Agreement for the UEFA Champions League 2008/09 (2009)
Leitner, Christoph, Zeileis, Achim, Hornik, Kurt
Bookmakers odds are an easily available source of "prospective" information that is thus often employed for forecasting the outcome of sports events. To investigate the statistical properties of...
Hahsler, Michael, Hornik, Kurt
For hierarchical clustering, dendrograms provide convenient and powerful visualization. Although many visualization methods have been suggested for partitional clustering, their usefulness...
Isotone Optimization in R: Pool-Adjacent-Violators Algorithm (PAVA) and Active Set Methods (2009)
Jan De Leeuw, Kurt Hornik, Patrick Mair
In this paper we give a general framework for isotone optimization. First we discuss a generalized version of the pool-adjacent-violators algorithm (PAVA) to minimize a separable convex function with...
Leitner, Christoph, Zeileis, Achim, Hornik, Kurt
Wimbledon is one of the most popular annual sports tournaments. In the Gentlemen's Single 2009 the top-seeded and defending champion Rafael Nadal withdrew from the tournament due to injury days prior...
Testing and dating of structural changes in practice (2008)
Achim Zeileis, Christian Kleiber, Kurt Hornik, Walter Krämer
Abstract: Vienna is the home of many contributions to the R Project for statistical computing. In this paper we describe the current status of the R data analysis environment and give an overview of...
TSP – A R-Package for the Traveling Salesperson Problem (2008)
2. Definition and types of TSPs 3. Basic infrastructure in R 4. Algorithms and Heuristics
Achim Zeileis, Torsten Hothorn, Kurt Hornik
Recursive partitioning is embedded into the general and well-established class of parametric models that can be fitted using M-type estimators (including maximum likelihood). An algorithm for...
Selective association rule generation (2008)
Hahsler, Michael, Buchta, Christian, Hornik, Kurt
Mining association rules is a popular and well researched method for discovering interesting relations between variables in large databases. A practical problem is that at medium to low support...
New probabilistic interest measures for association rules (2008)
Hahsler, Michael, Hornik, Kurt
Mining association rules is an important technique for discovering meaningful patterns in transaction databases. Many different measures of interestingness have been proposed for association rules....
Selective Association Rule Generation (2008)
Michael Hahsler, Christian Buchta, Kurt Hornik
Mining association rules is a popular and well researched method for discovering interesting relations between variables in large databases. A practical problem is that at medium to low support...
RDBMS in Bioinformatics: The Bioconductor Experience (2008)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Vincent J. Carey
Bioconductor
Building on the arules infrastructure for analyzing transaction data with R (2008)
Abstract. The free and extensible statistical computing environment R with its enormous number of extension packages already provides many state-of-the-art techniques for data analysis. Support for...
News The Newsletter of the R Project Volume 1/1, January 2001 Editorial (2008)
Welcome to the first volume of R News, the newsletter of the R project for statistical computing. R News will feature short to medium length articles covering topics that might be of interest to...
News The Newsletter of the R Project Volume 2/3, December 2002 Editorial (2008)
Time to say good-bye... with the new system of rolling three-year periods of office for the members of the Editorial Board, my term comes to an end on Dec 31, with Doug Bates joining the board in...
News The Newsletter of the R Project Volume 2/2, June 2002 Editorial (2008)
Welcome to this first regular issue of R News in 2002, following the special issue on applications of R in medical statistics. In the future, it is planned to have two non-patch releases of R per...
News The Newsletter of the R Project Volume 1/2, June 2001 Editorial (2008)
Welcome to the second issue of R News, the newsletter of the R project for statistical computing. First we would like to thank for all the positive feedback we got on the first volume, which...
News The Newsletter of the R Project Volume 2/1, March 2002 Editorial (2008)
Welcome to the fourth issue of R News, the newsletter of the R project for statistical computing. This is the first special issue of R News, with an emphasis on applying R in medical statistics. It...
Rasp: A Package for Spatial Statistics (2008)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Barry Rowlingson, Rolf Turner, Peter Diggle
This paper describes the development of a new R package for spatial data and statistics. We review existing packages that deal with spatial information and comment on how they handle the basic data....
Quality Assurance for Graphics in R (2008)
An important part of software development involves testing the reliability of the software. R has good tools for checking that the software runs without catastrophic failure, but provides less...
Converting a Large R Package to S4 Classes and Methods (2008)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Douglas M. Bates, Saikat Debroy
The nlme package for fitting and examining linear and nonlinear mixed-effects models in R is a required package and also one of the largest R packages, based on source package size. In the first phase...
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Ross Ihaka
Choosing a “good” set of colours for a graphical display is an important problem, but one which many data analysis practitioners are ill-equipped to solve. Most current graphics systems provide...
TSP – Infrastructure for the Traveling Salesperson Problem (2008)
The traveling salesperson (or, salesman) problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...
Who is Going to Win the EURO 2008? (2008)
Leitner, Christoph, Zeileis, Achim, Hornik, Kurt
This June one of the biggest and most popular sports tournaments will take place in Austria and Switzerland, the European soccer championship 2008 (UEFA EURO 2008). Therefore millions of soccer fans...
Getting Things in Order: An Introduction to the R Package seriation (2008)
Michael Hahsler, Kurt Hornik, Christian Buchta
Seriation, i.e., finding a suitable linear order for a set of objects given data and a loss or merit function, is a basic problem in data analysis. Caused by the problem's combinatorial nature, it is...
Text Mining Infrastructure in R (2008)
Ingo Feinerer, Kurt Hornik, David Meyer
During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining...
Predicting the Winner of the EURO 2008 (2008)
Leitner, Christoph, Zeileis, Achim, Hornik, Kurt
In June 2008 one of the biggest and most popular sports tournaments took place in Austria and Switzerland, the European football championship 2008 (UEFA EURO 2008). Before the tournament started...
On Two-Periodic Random Walks with Boundaries (2008)
Two-periodic random walks are models for the one-dimensional motion of particles in which the jump probabilities depend on the parity of the currently occupied state. Such processes have interesting...
Implementing a Class of Permutation Tests: The coin Package (2008)
Torsten Hothorn, Kurt Hornik, Achim Zeileis
The R package coin implements a unified approach to permutation tests providing a huge class of independence tests for nominal, ordered, numeric, and censored data as well as multivariate data at...
vector machines, License GPL-2 (2008)
Evgenia Dimitriadou, Kurt Hornik, Friedrich Leisch, David Meyer, Andreas Weingessel
R topics documented:
ARMA(1,1) Recurrent Neural Network Process (2007)
Adrian Trapletti, Friedrich Leisch, Kurt Hornik
This piece of research was supported by the Austrian Science
NeuroCOLT Coordinating Partner (2007)
Kurt Hornik, Maxwell Stinchcombe, Halbert White, Peter Auer
White's participation was supported by National Science Foundation Grant SES-8921382. We are grateful to Andrew Barron for helpful comments. Technische Universität Wien
Andreas Weingessel, Kurt Hornik
Introduction to PCA and SVD; overview of neural network algorithms for PCA; neural network algorithms for SVD: experimental comparison. NN Algos for Online PCA & SVD TU Wien --- Center...
Quality Assurance for Graphics in R (2007)
Friedrich Leisch, Achim Zeileis (eds.), Paul Murrell, Kurt Hornik
An important part of software development involves testing the reliability of the software. R has good tools for checking that the software runs without catastrophic failure, but provides less...
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Christian Ritter
In companies under permanent reorganization, stable software environments are hard or even impossible to maintain. Parts of the company break off into joint ventures and licences need to be split,...
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Deepayan Sarkar
Trellis Graphics is implemented in S-PLUS using the traditional S graphics engine. In contrast, lattice, the implementation of Trellis Graphics in R, uses grid graphics as the underlying mechanism....
kernlab – A Kernel Methods Package (2007)
Friedrich Leisch, Achim Zeileis (eds.), Alexandros Karatzoglou, Achim Zeileis, ...
Designing software for Support Vector Machines (SVM) and kernel methods in general poses an interesting design problem. Our aim is to provide one possible solution using R object oriented features....
Statistical Computing and Databases: Distributed Computing Near the Data (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Fei Chen, Brian D. Ripley
This paper addresses the following question: “how do we fit statistical models efficiently with very large data sets that reside in databases?” Nowadays it is quite common to encounter a...
A Graphical Users Interface to Normalize Microarray Data (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Fatima Sanchez Cabo, Zlatko Trajanoski, Kwang-hyun Cho, ...
Microarray technology is becoming an essential tool in functional genomics. The possibility of monitoring the expression level of thousands of genes simultaneously, as the response to a particular...
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Stefanie Scheid, Rainer Spang
The distribution of scores for differential gene expression observed in microarray experiments gives rise to the assumption that the underlying score distributions of induced and non-induced genes...
Visualizing Independence Using Extended Association and Mosaic Plots (2007)
David Meyer, Achim Zeileis, Kurt Hornik
Two visualization techniques for the independence problem in 2-way contingency tables—association and mosaic plots—are extended in two directions: 1. The visualization is enhanced by displaying...
GIS and the Random Forest Predictor: Integration in R for Tick-Borne Disease Risk Assessment (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), C. Furlanello, M. Neteler, S. Merler, ...
We discuss how sophisticated machine learning methods may be rapidly integrated within a GIS for the development of new approaches in landscape epidemiology. A multitemporal predictive map is...
svlab – A Kernel Methods Package (2007)
svlab is an extensible, object oriented, package for kernel based learning in R. Its main objective is to provide a tool kit consisting of basic kernel functionality, optimizers and high level...
Generalized M-Fluctuation Tests for Parameter Instability (2007)
Achim Zeileis, Kurt Hornik
Papers published in this report series are preliminary versions of journal articles. This piece of research was supported by the Austrian Science Foundation (FWF) under
David Meyer, Achim Zeileis, Alexandros Karatzoglou, Kurt Hornik, Michael Friendly
Description Functions and data sets based on the book "Visualizing Categorical Data" by
Programmer's Niche: nifty hints for programming (2007)
Friedrich Leisch, Kurt Hornik
Welcome to the first volume of R News, the newsletter of the R project for statistical computing. R News will feature short to medium length articles covering topics that might be of interest to...
Welcome to the second issue of R News, the newsletter of the R project for statistical computing. First we would like to thank for all the positive feedback we got on the first volume, which...
Depends R (>= 1.3.0) License GPL R topics documented: (2007)
Achim Zeileis, Friedrich Leisch, Bruce Hansen, Kurt Hornik, Christian Kleiber, Andrea Peters
Description Testing on structural change in linear regression relationships. It features tests/methods from the generalized fluctuation test framework as well as from the F test (Chow test)...
Support Vector Machines, David Meyer, Friedrich Leisch, Kurt Hornik, In Cooperation
Support Vector Machines (SVMs) are rarely benchmarked against other classification or regression methods. We compare a popular SVM implementation (libsvm) to 16 classification methods and 9...
StatDataML--An XML Format for Statistical Data (2007)
David Meyer, Friedrich Leisch, Torsten Hothorn, Kurt Hornik
In order to circumvent common difficulties in exchanging statistical data between heterogeneous applications (format incompatibilities, technocentric data representation), we introduce an XML-based...
Prediction Analysis of Microarrays in Excel (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Balasubramanian Narasimhan
Prediction Analysis of Microarrays (PAM) is a statistical technique for class prediction from gene expression data using shrunken centroids. It is described in Tibshirani, Hastie, Narasimhan, and...
Quantian: A Scientific Computing Environment (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Dirk Eddelbuettel
This paper introduces Quantian, a scientific computing environment. Quantian is directly bootable from CD-ROM, self-configuring without user intervention, and comprises hundreds of applications,...
Revealing Predictive Gene Clusters with Supervised Algorithms (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Marcel Dettling
Microarray technology allows the measurement of expression levels of thousands of genes simultaneously and is expected to contribute significantly to advances in fundamental questions of biology and...
Fractals and Statistics: An R Package Called ifs (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Stefano Maria Iacus, Davide La Torre
Iterated Function Systems (IFSs) were born in the mid-eighties (see Hutchinson, 1981; Barnsley and Demko, 1985) as applications of the theory of discrete dynamical systems and as useful tools to build...
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), James Robison-Cox
The RGtk package built by Duncan Temple Lang provides one way to build statistical applications in R which can be controlled by the user through a graphical user interface. RGtk is an interface to...
R: Windows Component Services Integrating R and Excel on the COM Layer (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Thomas Baier
On the Windows platform a standardized model for component services and interprocess and inter-machine communication has been established by Microsoft—COM. As many applications provide their own or...
Sweave and Beyond: Computations on Text Documents (2007)
Kurt Hornik, Achim Zeileis (eds.), Friedrich Leisch
Sweave is a tool that allows embedding R code in LaTeX documents. The code can be evaluated and the resulting console output, figures and tables are automatically inserted into the final document. In...
Learning Bayesian Networks with R (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Susanne G. Bøttcher, Claus Dethlefsen
deal is a software package freely available for use with R. It includes several methods for analysing data using Bayesian networks with variables of discrete and/or continuous types but restricted to...
Detecting Clusters of Diseases with R (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), V. Gómez-Rubio, J. Ferrándiz, A. López
One of the main concerns in Public Health surveillance (Aylin, Maheswaran, Wakefield, Cockings, Jarup, Arnold, Wheeler, and Elliot, 1999) is detection of clusters of diseases, i.e., the presence of...
Ten Years of Octave— Recent Developments and Plans for the Future (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), John W. Eaton, James B. Rawlings
GNU Octave
Database and R Interfacing for Annotated Microarray Data (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Michael Mader, Werner Mewes
A framework for expression data repositories (MAGE-ML compatibility) has emerged in recent years. Focus is now shifting towards the integration of expression databases into genomic and other...
JAGS: A Program for Analysis of Bayesian Graphical Models Using Gibbs Sampling (2007)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Martyn Plummer
JAGS is a program for Bayesian Graphical modelling which aims for compatibility with classic BUGS. The program could eventually be developed as an R package. This article explains the motivations for...
Open-Source Machine Learning: R Meets Weka (2007)
Hornik, Kurt, Buchta, Christian, Zeileis, Achim
Two of the prime open-source environments available for machine/statistical learning in data mining and knowledge discovery are the software packages Weka and R which have emerged from the machine...
Text Mining of Supreme Administrative Court Jurisdictions (2007)
Within the last decade text mining, i.e., extracting sensitive information from text corpora, has become a major factor in business intelligence. The automated textual analysis of law corpora is...
Benchmarking Open-Source Tree Learners in R/RWeka (2007)
Schauerhuber, Michael, Zeileis, Achim, Meyer, David, Hornik, Kurt
The two most popular classification tree algorithms in machine learning and statistics - C4.5 and CART - are compared in a benchmark experiment together with two other more recent constant-fit tree...
Implementing a Class of Permutation Tests: The coin Package (2007)
Hothorn, Torsten, Hornik, Kurt, Van De Wiel, Mark A., Zeileis, Achim
The R package coin implements a unified approach to permutation tests providing a huge class of independence tests for nominal, ordered, numeric, and censored data as well as multivariate data at...
Getting Things in Order: An Introduction to the R package seriation (2007)
Hahsler, Michael, Hornik, Kurt, Buchta, Christian
Seriation, i.e., finding a linear order for a set of objects given data and a loss or merit function, is a basic problem in data analysis. Caused by the problem's combinatorial nature, it is hard to...
Let's Have a party! An Open-Source Toolbox for Recursive Partytioning (2007)
Hothorn, Torsten, Zeileis, Achim, Hornik, Kurt
Package party, implemented in the R system for statistical computing, provides basic classes and methods for recursive partitioning along with reference implementations for three recently-suggested...
Escaping RGBland: Selecting Colors for Statistical Graphics (2007)
Zeileis, Achim, Hornik, Kurt, Murrell, Paul
Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines) - e.g., as in bar...
TSP – Infrastructure for the Traveling Salesperson Problem (2007)
The traveling salesperson (or, salesman) problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...
Alexandros Karatzoglou, Alex Smola, Kurt Hornik
detection, quantile regression and dimensionality reduction. Among other methods kernlab
SystemRequirements Java (>= 5.0) (2007)
Kurt Hornik, Torsten Hothorn, David Meyer, Achim Zeileis
Description An R interface to Weka (Version 3.5.7). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing,...
SystemRequirements Java (>= 5.0) (2007)
Kurt Hornik, Torsten Hothorn, David Meyer, Achim Zeileis
Description An R interface to Weka (Version 3.5.6). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing,...
vector machines, License GPL-2 (2007)
Evgenia Dimitriadou, Kurt Hornik, Friedrich Leisch, David Meyer, Andreas Weingessel, ...
R topics documented:
The Traveling Salesperson Problem (2007)
Michael Hahsler, Kurt Hornik
1. The traveling salesperson problem 2. TSP package for R 3. Applications for data analysis
David Meyer, Achim Zeileis, Kurt Hornik
Description Visualization techniques, data sets, summary and inference procedures aimed particularly at categorical data. Special emphasis is given to highly extensible grid graphics. The package was...
Evaluating Model-based Trees in Practice (2006)
Zeileis, Achim, Hothorn, Torsten, Hornik, Kurt
A recently suggested algorithm for recursive partitioning of statistical models (Zeileis, Hothorn and Hornik, 2005), such as models estimated by maximum likelihood or least squares, is evaluated in...
Deriving Consensus Rankings from Benchmarking Experiments (2006)
Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses...
New Probabilistic Interest Measures for Association Rules (2006)
Hahsler, Michael, Hornik, Kurt
Mining association rules is an important technique for discovering meaningful patterns in transaction databases. Many different measures of interestingness have been proposed for association rules....
Choosing Color Palettes for Statistical Graphics (2006)
Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines) - e.g., as in bar...
TSP - Infrastructure for the Traveling Salesperson Problem (2006)
Hahsler, Michael, Hornik, Kurt
The traveling salesperson or salesman problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...
Support Vector Machines in R (2006)
Alexandros Karatzoglou, David Meyer, Kurt Hornik
Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language....
The Strucplot Framework: Visualizing Multi-way (2006)
David Meyer, Achim Zeileis, Kurt Hornik
This paper describes the “strucplot” framework for the visualization of multi-way contingency tables. Strucplot displays include hierarchical conditional plots such as mosaic, association, and...
Evgenia Dimitriadou, Kurt Hornik, Friedrich Leisch, David Meyer, Andreas Weingessel, Maintainer Friedrich Leisch
Description Functions for latent class analysis, short time Fourier transform, fuzzy clustering, support vector machines, License GPL version 2. See COPYRIGHT.svm.cpp for the copyright of the svm C++...
TSP – Infrastructure for the Traveling Salesperson Problem (2006)
Michael Hahsler, Kurt Hornik
The traveling salesperson or salesman problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...
Implications of probabilistic data modeling for mining association rules (2006)
Michael Hahsler, Kurt Hornik, Thomas Reutterer
Abstract. Mining association rules is an important technique for discovering meaningful patterns in transaction databases. In the current literature, the properties of algorithms to mine association...
Version 0.2-4 Date 2006-05-07 Title R/Weka interface The RWeka Package (2006)
Kurt Hornik, Torsten Hothorn, David Meyer, Achim Zeileis
Description An R interface to Weka (Version 3.5.2). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing,...
Choosing Color Palettes for Statistical Graphics (2006)
Achim Zeileis, Kurt Hornik
Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines)— e.g., as in bar...
Urak, Lydia, Feucht, Martha, Fathi, Nahid, Hornik, Kurt, Fuchs, Karoline
Childhood absence epilepsy (CAE) is considered to exhibit a complex non-Mendelian pattern of inheritance. So far, only few CAE susceptibility genes have been identified. In a previous study of our...
Urak, Lydia, Feucht, Martha, Fathi, Nahid, Hornik, Kurt, Fuchs, Karoline
Childhood absence epilepsy (CAE) is considered to exhibit a complex non-mendelian pattern of inheritance. So far, only few CAE susceptibility genes have been identified. In a previous study of our...
Implications of probabilistic data modeling for rule mining (2005)
Hahsler, Michael, Hornik, Kurt, Reutterer, Thomas
Mining association rules is an important technique for discovering meaningful patterns in transaction databases. In the current literature, the properties of algorithms to mine associations are...
A computational environment for mining association rules and frequent item sets (2005)
Hahsler, Michael, Grün, Bettina, Hornik, Kurt
Mining frequent itemsets and association rules is a popular and well researched approach to discovering interesting relationships between variables in large databases. The R package arules presented...
A CLUE for CLUster ensembles (2005)
Cluster ensembles are collections of individual solutions to a given clustering problem which are useful or necessary to consider in a wide range of applications. The R package clue provides an...
Model-based recursive partitioning (2005)
Zeileis, Achim, Hothorn, Torsten, Hornik, Kurt
Recursive partitioning is embedded into the general and well-established class of parametric models that can be fitted using M-type estimators (including maximum likelihood). An algorithm for...
Residual-based shadings for visualizing (conditional) independence (2005)
Zeileis, Achim, Meyer, David, Hornik, Kurt
Residual-based shadings for enhancing mosaic and association plots to visualize independence models for contingency tables are extended in two directions: (a) perceptually uniform HCL colors are used...
Support Vector Machines in R (2005)
Karatzoglou, Alexandros, Meyer, David, Hornik, Kurt
Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language....
The Strucplot Framework: Visualizing Multi-way Contingency Tables with vcd (2005)
Meyer, David, Zeileis, Achim, Hornik, Kurt
This paper describes the `strucplot' framework for the visualization of multi-way contingency tables. Strucplot displays include hierarchical conditional plots such as mosaic, association, and sieve...
A Lego System for Conditional Inference (2005)
Hothorn, Torsten, Hornik, Kurt, Zeileis, Achim
Conditioning on the observed data is an important and flexible design principle for statistical test procedures. Although generally applicable, permutation tests currently in use are limited to the...
A CLUE for CLUster Ensembles (2005)
Cluster ensembles are collections of individual solutions to a given clustering problem which are useful or necessary to consider in a wide range of applications. The R package clue provides an...
arules – A computational environment for mining association rules and frequent item sets (2005)
Michael Hahsler, Bettina Grün, Kurt Hornik, ...
Mining frequent itemsets and association rules is a popular and well researched approach to discovering interesting relationships between variables in large databases. The R package arules presented...
arules – A computational environment for mining association rules and frequent item sets (2005)
Michael Hahsler, Kurt Hornik, Bettina Grün
Mining frequent itemsets and association rules is a popular and well researched approach for discovering interesting relationships between variables in large databases. The R package arules presented...
Implications of probabilistic data modeling for rule mining (2005)
Thomas Reutterer, Michael Hahsler, Kurt Hornik, ...
The design and analysis of benchmark experiments (2005)
Torsten Hothorn, Friedrich Leisch, Kurt Hornik, Achim Zeileis, ...
The assessment of the performance of learners by means of benchmark experiments is an established exercise. In practice, benchmark studies are a tool to compare the performance of several competing...
Unbiased recursive partitioning: A conditional inference framework (2005)
Torsten Hothorn, Kurt Hornik, Achim Zeileis
Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of exhaustive search procedures usually applied to fit such models have been known for a long time:...
Mining Association Rules and Frequent Itemsets (2005)
Michael Hahsler, Bettina Gruen, Kurt Hornik (maintainer: Michael Hahsler)
Provides the basic infrastructure for mining and analyzing association rules and an interface to C implementations of the mining algorithms Apriori and Eclat. Depends: R (>= 2.1.0),...
Bioconductor: open software development for computational biology and bioinformatics (2004)
Gentleman, Robert C, Carey, Vincent J, Bates, Douglas M, Bolstad, Ben, Dettling, Marcel, Dudoit, Sandrine, ...
The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering...
Bioconductor: Open software development for computational biology and bioinformatics (2004)
Gentleman, Robert C., Carey, Vincent J., Bates, Douglas M., Bolstad, Benjamin M., Dettling, Marcel, Dudoit, Sandrine, ...
The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. We detail some of the design decisions, software...
kernlab - An S4 Package for Kernel Methods in R (2004)
Alexandros Karatzoglou, Alex Smola, Kurt Hornik, Achim Zeileis
Unbiased Recursive Partitioning: A Conditional Inference Framework (2004)
Hothorn, Torsten, Hornik, Kurt, Zeileis, Achim
Recursive binary partitioning is a popular tool for regression analysis. Two fundamental problems of exhaustive search procedures usually applied to fit such models have been known for a long time:...
kernlab - An S4 package for kernel methods in R (2004)
Karatzoglou, Alexandros, Smola, Alex, Hornik, Kurt, Zeileis, Achim
kernlab is an extensible package for kernel-based machine learning methods in R. It takes advantage of R's new S4 object model and provides a framework for creating and using kernel-based algorithms....
The vcd Package - Visualizing Categorical Data (2004)
David Meyer, Achim Zeileis, Alexandros Karatzoglou, Kurt Hornik
R topics documented include SexualFun, SpaceShuttle, Suicide, ...
Generalized M-Fluctuation Tests for Parameter Instability (2003)
A general class of fluctuation tests for parameter instability in an M-estimation framework is suggested. The tests are based on partial sum processes of M-estimation scores for which functional...
The design and analysis of benchmark experiments (2003)
Hothorn, Torsten, Leisch, Friedrich, Zeileis, Achim, Hornik, Kurt
The assessment of the performance of learners by means of benchmark experiments is an established exercise. In practice, benchmark studies are a tool to compare the performance of several competing...
Zur Erlangung des akademischen Grades (2003)
In this place, I would like to acknowledge several people who contributed, in one way or another, to this work. First and foremost, I would like to thank my supervisor (and employer) Kurt Hornik for...
A simulation framework for heterogeneous agents (2003)
David Meyer, Christian Buchta, Alexandros Karatzoglou, Friedrich Leisch, Kurt Hornik
Papers published in this report series are preliminary versions of journal articles and not for quotations.
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.)
SciViews aims to provide a unique, polymorph object-oriented interface (called "Plug") to different data analysis software like R, Octave, Scilab, MATLAB, S-PLUS, Ox, Mathematica, ... It is...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Florian Markowetz, Rainer Spang
Introduction A genetic network is a set of genes in which individual genes influence the activity of other genes. The core task in identifying genetic networks is to distinguish direct from indirect...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Henrik Bengtsson
An easy to install and platform independent package named R.oo, which provides support for references and mutable objects via a specific class model using standard R code, has been developed. The...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Simon Urbanek, Martin Theus
This paper describes the integration of interactive statistical graphics within R using state-of-the art R-connectivity. The iPlots project uses SJava to deliver interactive plots for R and an...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Günter Tirler, Josef Leydold
Random variate generation is an important tool in statistical computing. Many programs for simulation or statistical computing (e.g. R) provide a collection of random variate generators for many...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) (2003)
March, Vienna, Austria, Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Richard M. Heiberger, Burt Holl
We have designed several multi-panel graphical displays for which the panels are defined by the Cartesian product of one set of variables or parameters with another set. The traditional scatterplot...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), In R, Roger Bivand
Access to well-structured and sometimes self-describing spatial position data with associated data attributes in geographical scales domains is increasing, and is expected to increase further. Until...
RArcInfo: Using G.I.S. Data with R (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Virgilio Gomez Rubio
Geographical Information Systems (G.I.S.) have become powerful tools for integration, visualisation and analysis of spatial data, also providing the capability of integrating data from different...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), David Meyer, Achim Zeileis, Kurt Hornik, ...
Association plots, a visualization technique for the independence problem in 2-way contingency tables, are extended in three directions: 1. The visualization is enhanced by using colors for the...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Uwe Ligges
In the first part of this paper the status quo of R-WinEdt, a plug-in for the shareware editor WinEdt for Windows, is described. This description includes the interface between R and WinEdt,...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Simon Urbanek
Rserve is a TCP/IP server which allows other programs to use facilities of R from various languages without the need to initialize R or link to the R library.
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Peter Bühlmann
We present an extended abstract about boosting. We describe first in section 1 (in a self-contained way) a generic functional gradient descent algorithm, which yields a general representation of...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Thomas Baier, Erich Neuwirth
Spreadsheet programs in general and Microsoft Excel in particular implement a very special paradigm of computing: automatic creation of dependency structures for calculation, and automatic...
The Design and Analysis of Benchmark Experiments (2003)
Torsten Hothorn, Friedrich Leisch, Achim Zeileis, Kurt Hornik, ...
This piece of research was supported by the Austrian Science Foundation
Exploratory Visual Analysis of Graphs in GGobi (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Deborah Swayne, Andreas Buja, Duncan Temple Lang
Graphs have long been of interest in telecommunications and social network analysis, and they are now receiving increasing attention from statisticians working in other areas, particularly in...
GIS and the Random Forest Predictor: Integration in R for Tick-Borne Disease Risk Assessment (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Cesare Furlanello, Steno Fontanari, Angela Donini, ...
We discuss how sophisticated machine learning methods may be rapidly integrated within a GIS for the development of new approaches in landscape epidemiology. A multitemporal predictive map is...
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), PVM Kriging R, Albrecht Gebhardt
Kriging is one of the most frequently used prediction methods in spatial data analysis. This paper examines which steps of the underlying algorithms can be performed in parallel on a PVM cluster. It...
Gstat: Multivariable Geostatistics for S (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Edzer J. Pebesma
This paper introduces the gstat package for the S language (R, S-PLUS).
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Paulo J. Ribeiro, Peter J. Diggle, ...
The packages geoR and geoRglm are contributed packages to the statistical software system R, implementing methods for model-based geostatistical data-analysis. In this paper we focus on the...
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Ernst Glatzer, Österreichische Nationalbank, ...
There exists a growing number of spatial statistics software packages covering a broad range of methods including the classical technique of kriging following variogram fitting (for a review see e.g....
Proceedings of the 3rd International Workshop on Distributed Statistical Computing (2003)
Kurt Hornik, Friedrich Leisch, Achim Zeileis (eds.), Alternative To Bioconductor, Christian Stratowa
Novel high-throughput technologies such as DNA microarray analyses are allowing biologists to generate sets of data in the terabyte realm. Many of these data will be deposited in the public domain,...
Testing and dating of structural changes in practice (2003)
Achim Zeileis, Christian Kleiber, Walter Krämer, Kurt Hornik
The paper presents an approach to the analysis of data that contains (multiple) structural changes in a linear regression setup. We implement various strategies which have been suggested in the...
strucchange: an R package for Testing for Structural Change in Linear Regression Models (2002)
Achim Zeileis, Friedrich Leisch, Kurt Hornik, Christian Kleiber
Monitoring structural change in dynamic econometric models (2002)
Zeileis, Achim, Leisch, Friedrich, Kleiber, Christian, Hornik, Kurt
The classical approach to testing for structural change employs retrospective tests using a historical data set of a given length. Here we consider a wide array of fluctuation-type tests in a...
A Simulation Framework for Heterogeneous Agents (2002)
Meyer, David, Buchta, Christian, Karatzoglou, Alexandros, Leisch, Friedrich, Hornik, Kurt
We introduce a generic simulation framework suitable for agent-based simulations featuring the support of heterogeneous agents, hierarchical scheduling and flexible specification of design...
Meyer, David, Leisch, Friedrich, Hothorn, Torsten, Hornik, Kurt
In order to circumvent common difficulties in exchanging statistical data between heterogeneous applications (format incompatibilities, technocentric data representation), we introduce an XML-based...
A Mixed Ensemble Approach for the Semi-Supervised Problem (2002)
Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt
In this paper we introduce a mixed approach for the semi-supervised data problem. Our approach consists of an ensemble unsupervised learning part where the labeled and unlabeled points are segmented...
Benchmarking Support Vector Machines (2002)
Meyer, David, Leisch, Friedrich, Hornik, Kurt
Support Vector Machines (SVMs) are rarely benchmarked against other classification or regression methods. We compare a popular SVM implementation (libsvm) to 16 classification methods and 9...
A mixed ensemble approach for the semi-supervised problem (2002)
Evgenia Dimitriadou, Andreas Weingessel, Kurt Hornik, ...
strucchange: An R Package for Testing for Structural Change in Linear Regression Models (2002)
Achim Zeileis, Friedrich Leisch, Kurt Hornik, Christian Kleiber
This paper introduces ideas and methods for testing for structural change in linear regression models and presents how these have been realized in an R package called strucchange. It features tests...
strucchange: An R Package for Testing for Structural Change in Linear Regression Models (2002)
Achim Zeileis, Friedrich Leisch, Kurt Hornik, ...
Monitoring Structural Change in Dynamic Econometric Models (2002)
Achim Zeileis, Friedrich Leisch, Christian Kleiber, Kurt Hornik
The classical approach to testing for structural change employs retrospective tests using a historical data set of a given length. Here we consider a wide array of fluctuation-type tests in a...
Monitoring Structural Change in Dynamic Econometric Models (2002)
Hornik, Kurt, Kleiber, Christian, Leisch, Friedrich, Zeileis, Achim
Testing and Dating of Structural Changes in Practice (2002)
Hornik, Kurt, Kleiber, Christian, Krämer, Walter, Zeileis, Achim
Anxiety as a Predictor of Relapse in Detoxified Alcohol-Dependent Patients (2002)
Willinger, Ulrike, Lenzinger, Elisabeth, Hornik, Kurt, Fischer, Gabriele, Schönbeck, Georg, Aschauer, Harald N., ...
Aims: To evaluate the impact of mood, affect, and personality on predicting relapse in detoxified alcohol-dependent patients to uncontrolled drinking during a 1-year treatment study. Methods: A...
Emacs Speaks Statistics: A Universal Interface for Statistical Analysis (2001)
Rossini, Anthony, Maechler, Martin, Hornik, Kurt, Heiberger, Richard M., Sparapani, Rodney
Emacs Speaks Statistics (ESS) is a user interface for developing statistical applications and performing data analysis using any of several common statistical programming languages. ESS falls in the...
Zeileis, Achim, Leisch, Friedrich, Hornik, Kurt, Kleiber, Christian
This paper introduces ideas and methods for testing for structural change in linear regression models and presents how these have been realized in an R package called strucchange. It features tests...
Running agent-based simulations (2001)
Meyer, David, Karatzoglou, Alexandros, Buchta, Christian, Leisch, Friedrich, Hornik, Kurt
When running agent-based simulations using ready-made components, one usually faces heterogeneity problems both for the agents' implementation and for the underlying platform. To circumvent these...
Emacs speaks statistics: A universal interface for statistical analysis (2001)
A. J. Rossini, Martin Mächler, Kurt Hornik, Richard M. Heiberger, Rodney Sparapani
Emacs Speaks Statistics (ESS) is a user interface for developing statistical applications and performing data analysis using any of several common statistical programming languages. ESS falls in the...
Voting-merging: An ensemble method for clustering. ICANN (2001)
Andreas Weingessel, Evgenia Dimitriadou, Kurt Hornik
Combination strategies in classification are a popular way of overcoming instabilities in classification algorithms. A direct application of ideas such as “voting” to cluster analysis problems...
Strucchange: An R Package for Testing for Structural Change in Linear Regression Models (2001)
Hornik, Kurt, Kleiber, Christian, Leisch, Friedrich, Zeileis, Achim
A voting-merging clustering algorithm (1999)
Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt
In this paper we propose an unsupervised voting-merging scheme that is capable of clustering data sets, and also of finding the number of clusters existing in them. The voting part of the algorithm...
On the ergodicity and stationarity of the ARMA (1,1) recurrent neural network process (1999)
Trapletti, Adrian, Leisch, Friedrich, Hornik, Kurt
In this note we consider the autoregressive moving average recurrent neural network ARMA-NN(1, 1) process. We show that in contrast to the pure autoregressive process simple ARMA-NN processes exist...
Fuzzy voting in clustering (1999)
Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt
In this paper we present a fuzzy voting scheme for cluster algorithms. This fuzzy voting method allows us to combine several runs of cluster algorithms resulting in a common fuzzy partition. This...
Voting in clustering and finding the number of clusters (1999)
Dimitriadou, Evgenia, Weingessel, Andreas, Hornik, Kurt
In this paper we present an unsupervised algorithm which performs clustering given a data set and which can also find the number of clusters existing in it. This algorithm consists of two techniques....
Voting in Clustering and Finding the Number of Clusters (1999)
In Cooperation, Evgenia Dimitriadou, Evgenia Dimitriadou, Andreas Weingessel, Andreas Weingessel, Kurt Hornik, ...
In this paper we present an unsupervised algorithm which performs clustering given a data set and which can also find the number of clusters existing in it. This algorithm consists of two techniques....
Optimal Linear Compression Under Unreliable Representation and Robust PCA Neural Models (1999)
Konstantinos I. Diamantaras, Kurt Hornik, Michael G. Strintzis
In a typical linear data compression system the representation variables resulting from the coding operation are assumed totally reliable and therefore the solution in the mean-squared-error sense is...
Stationarity and Stability of Autoregressive Neural Network Processes (1999)
Friedrich Leisch, Adrian Trapletti, Kurt Hornik
We analyze the asymptotic behavior of autoregressive neural network (AR-NN) processes using techniques from Markov chains and non-linear time series analysis. It is shown that standard AR-NNs without...
Using independent component analysis for feature extraction and multivariate data projection (1998)
Weingessel, Andreas, Natter, Martin, Hornik, Kurt
Deriving low-dimensional perceptual spaces from data consisting of many variables is of crucial interest in strategic market planning. A frequently used method in this context is Principal Components...
On the stationarity of autoregressive neural network models (1998)
Leisch, Friedrich, Trapletti, Adrian, Hornik, Kurt
We analyze the asymptotic behavior of autoregressive neural network (AR-NN) processes using techniques from Markov chains and non-linear time series analysis. It is shown that standard AR-NNs without...
On the generation of correlated artificial binary data (1998)
Leisch, Friedrich, Weingessel, Andreas, Hornik, Kurt
The generation of random variates from multivariate binary distributions has not gained as much interest in the literature as, e.g., multivariate normal or Poisson distributions. Binary variables are...
Stationary and integrated autoregressive neural network processes (1998)
Trapletti, Adrian, Leisch, Friedrich, Hornik, Kurt
We consider autoregressive neural network (ARNN) processes driven by additive noise. Sufficient conditions on the network weights (parameters) are derived for the ergodicity and stationarity of the...
Cross-Validation with Active Pattern Selection for Neural Network Classifiers (1998)
Friedrich Leisch, Lakhmi C. Jain, Kurt Hornik
We propose a new approach for leave-one-out cross-validation of neural network classifiers called "crossvalidation with active pattern selection" (CV/APS). In CV/APS, the contribution of...
Stationary and Integrated Autoregressive Neural Network Processes (1998)
Adrian Trapletti, Friedrich Leisch, Kurt Hornik, ...
We consider autoregressive neural network (ARNN) processes driven by additive noise. Sufficient conditions on the network weights (parameters) are derived for the ergodicity and stationarity of the...
Using Independent Component Analysis for Feature Extraction and Multivariate Data Projection (1998)
Andreas Weingessel, Martin Natter, Kurt Hornik, ...
Deriving low-dimensional perceptual spaces from data consisting of many variables is of crucial interest in strategic market planning. A frequently used method in this context is Principal Components...
On the Stationarity of Autoregressive Neural Network Models (1998)
Adrian Trapletti, Friedrich Leisch, Kurt Hornik, ...
We analyze the asymptotic behavior of autoregressive neural network (AR-NN) processes using techniques from Markov chains and non-linear time series analysis. It is shown that standard AR-NNs without...
On the Generation of Correlated Artificial Binary Data (1998)
Friedrich Leisch, Andreas Weingessel, Kurt Hornik
In this paper we present a computationally fast method to simulate multivariate binary distributions with a given correlation structure. After some remarks on multivariate binary distributions in...
Combining Neural Network Voting Classifiers and Error Correcting Output Codes (1997)
Friedrich Leisch, Kurt Hornik
We show that error correcting output codes (ECOC) can further improve the effects of error dependent adaptive resampling methods such as arc-lh. In traditional one-in-n coding, the distance between...
SVD Algorithms: APEX-like versus Subspace Methods (1997)
Kurt Hornik, Andreas Weingessel
We compare several new SVD learning algorithms which are based on the subspace method in principal component analysis with the APEX-like algorithm proposed by Diamantaras. It is shown...
ARC-LH: A New Adaptive Resampling Algorithm for Improving ANN Classifiers (1997)
We introduce arc-lh, a new algorithm for improvement of ANN classifier performance, which measures the importance of patterns by aggregated network output errors. On several artificial benchmark...
Adaptive Combination of PCA and VQ Networks (1997)
Andreas Weingessel, Horst Bischof, Kurt Hornik, Friedrich Leisch
In this paper we consider Principal Component Analysis (PCA) and Vector Quantization (VQ) neural networks for image compression. We present a method where the PCA and VQ steps are adaptively...
The Adaptive Network Combined for Data Compression (1997)
Andreas Weingessel, Horst Bischof, Kurt Hornik
We present a method to adaptively combine multilayer perceptrons and vector quantization. We show that this combined network yields better results than the sequential application.
Error-Dependent Resampling for Artificial Neural Network Classifiers (1997)
In this paper we investigate several ways of utilizing error-dependent resampling for artificial neural network (ANN) classifiers. The main idea of all methods is that the output error of the trained...
Learning in Linear Neural Networks: a Survey (1995)
Networks of linear units are the simplest kind of networks, where the basic questions related to learning, generalization, and self-organisation can sometimes be answered analytically. We survey most...
Friedrich Leisch, Lakhmi C. Jain, Kurt Hornik
We propose a new approach for leave-one-out crossvalidation of neural network classifiers called "crossvalidation with active pattern selection" (CV/APS). In CV/APS, the contribution of the...
MOSUM tests for parameter constancy (1995)
Chu, Chia-Shang J., Hornik, Kurt, Kuan, Chung-Ming
In this paper, tests for structural change based on moving sums (MOSUMS) of recursive and least-squares residuals are investigated. We obtain and tabulate the asymptotic critical values of the MOSUM...
Recurrent back-propagation and Newton algorithms for training recurrent neural networks (1993)
Chung-Ming Kuan, Kurt Hornik, Tung Liu
In this paper the recurrent back-propagation and Newton algorithms for an important class of recurrent networks and their convergence properties are discussed. To ensure proper convergence behavior,...
In this paper, we are concerned with situations where a linear feature extraction process is contaminated by noise. More precisely, suppose that upon presentation of an input x, the network actually...
Noisy Principal Component Analysis (1993)
Kostas Diamantaras, Kurt Hornik
In this paper, we are concerned with situations in which the output y of the linear network with weight matrix A from an input x actually is y = Ax + e,
Bioconductor: open software development for computational biology and bioinformatics
Gentleman, Robert C, Carey, Vincent J, Bates, Douglas M, Bolstad, Ben, Dettling, Marcel, Dudoit, Sandrine, ...
A detailed description of the aims and methods of the Bioconductor project, an initiative for the collaborative creation of extensible software for computational biology and bioinformatics.
A Lego System for Conditional Inference
Hothorn, Torsten, Hornik, Kurt, Van De Wiel, Mark A., Zeileis, Achim
A Simulation Framework for Heterogeneous Agents
David Meyer, Alexandros Karatzoglou, Christian Buchta, Friedrich Leisch, Kurt Hornik
Agent Based Simulations, Simulation Framework, Heterogeneous Agents, Simulation Design
Monitoring Structural Changes with the Generalized Fluctuation Test
Leisch, Friedrich, Hornik, Kurt, Kuan, Chung-Ming
In this paper we introduce the generalized fluctuation test for monitoring structural changes and establish a result characterizing the limiting behavior of this class of tests. As applications of...
The Strucplot Framework: Visualizing Multi-way
David Meyer, Achim Zeileis, Kurt Hornik
This paper describes the “strucplot” framework for the visualization of multi-way contingency tables. Strucplot displays include hierarchical conditional plots such as mosaic, association,...
strucchange: An R Package for Testing for Structural Change in Linear Regression Models
Achim Zeileis, Friedrich Leisch, Kurt Hornik, Christian Kleiber
This paper reviews tests for structural change in linear regression models from the generalized fluctuation test framework as well as from the F test (Chow test) framework. It introduces a unified...
kernlab - An S4 Package for Kernel Methods in R
Alexandros Karatzoglou, Alexandros Smola, Kurt Hornik, Achim Zeileis
kernlab is an extensible package for kernel-based machine learning methods in R. It takes advantage of R's new S4 object model and provides a framework for creating and using kernel-based...
arules - A Computational Environment for Mining Association Rules and Frequent Item Sets
Michael Hahsler, Bettina Grün, Kurt Hornik
Mining frequent itemsets and association rules is a popular and well researched approach for discovering interesting relationships between variables in large databases. The R package arules presented...
Alexandros Karatzoglou, David Meyer, Kurt Hornik
Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language....
TSP : Infrastructure for the Traveling Salesperson Problem
The traveling salesperson (or, salesman) problem (TSP) is a well known and important combinatorial optimization problem. The goal is to find the shortest tour that visits each city in a given list...
The generalized fluctuation test: A unifying view
In this paper, a general principle of constructing tests for parameter constancy without assuming a specific alternative is introduced. A unified asymptotic result is established to analyze this...
Testing and dating of structural changes in practice
Zeileis, Achim, Kleiber, Christian, Krämer, Walter, Hornik, Kurt
A Simulation Framework for Heterogeneous Agents
David Meyer, Alexandros Karatzoglou, Friedrich Leisch, Christian Buchta, Kurt Hornik
We introduce a generic simulation framework suitable for agent-based simulations featuring the support of heterogeneous agents, hierarchical scheduling, and flexible specification of design...
Monitoring structural change in dynamic econometric models
Kurt Hornik, Friedrich Leisch, Christian Kleiber, Achim Zeileis
The classical approach to testing for structural change employs retrospective tests using a historical data set of a given length. Here we consider a wide array of fluctuation-type tests in a...
Selective association rule generation
Michael Hahsler, Christian Buchta, Kurt Hornik
Data mining, Association rules, Rule generation,
On the number of points of a homogeneous Poisson process
We investigate the dense and sparse regions of a d-dimensional Poisson process and establish strong laws for both the maximal and the minimal number of points in families of sets of a certain volume.
On the distribution of functionals of stationary Gaussian processes
Let (X(t), 0 ≤ t ≤ 1) be a zero mean, continuous-path stationary Gaussian process with autocovariance function ρ_X(s) = 1 - ? s , where...
The Moving-Estimates Test for Parameter Stability
Chu, Chia-Shang James, Hornik, Kurt, Kuan, Chung-Ming
In this paper a new class of tests for parameter stability, the moving-estimates (ME) test, is proposed. It is shown that in the standard situation the ME test is asymptotically equivalent to the...
Escaping RGBland: Selecting colors for statistical graphics
Zeileis, Achim, Hornik, Kurt, Murrell, Paul
Statistical graphics are often augmented by the use of color coding information contained in some variable. When this involves the shading of areas (and not only points or lines)--e.g., as in bar...