A generalized linear model for principal component analysis of binary data (2003)

Abstract
We investigate a generalized linear model for dimensionality reduction of binary data. The model is related to principal component analysis (PCA) in the same way that logistic regression is related to linear regression. Thus we refer to the model as logistic PCA. In this paper, we derive an alternating least squares method to estimate the basis vectors and generalized linear coefficients of the logistic PCA model. The resulting updates have a simple closed form and are guaranteed at each iteration to improve the model’s likelihood. We evaluate the performance of logistic PCA, as measured by reconstruction error rates, on data sets drawn from four real-world applications. In general, we find that logistic PCA is much better suited to modeling binary data than conventional PCA.
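
Illustrative sketch (not part of the original record): the abstract describes a low-rank model whose entries are passed through a logistic link and scored by Bernoulli likelihood. The Python below is a minimal sketch of that idea under stated assumptions. It fits the factors by plain gradient ascent, which is a swapped-in technique and not the paper's alternating least squares updates, so it carries no guarantee of improving the likelihood at every iteration. The names `fit_logistic_pca`, `log_likelihood`, `reconstruction_error`, and all parameter values are hypothetical.

```python
import numpy as np


def sigmoid(z):
    # Numerically stable logistic function.
    return 0.5 * (1.0 + np.tanh(0.5 * z))


def log_likelihood(X, theta):
    # Bernoulli log-likelihood of binary X given logits theta.
    return float(np.sum(X * theta - np.logaddexp(0.0, theta)))


def reconstruction_error(X, theta):
    # Fraction of entries whose thresholded prediction disagrees with X.
    return float(np.mean((theta > 0) != (X > 0.5)))


def fit_logistic_pca(X, rank=2, steps=300, lr=0.01, seed=0):
    # Assumption: gradient ascent on the log-likelihood, NOT the paper's
    # closed-form alternating least squares updates.
    rng = np.random.default_rng(seed)
    n, d = X.shape
    U = 0.01 * rng.standard_normal((n, rank))  # per-row coefficients
    V = 0.01 * rng.standard_normal((rank, d))  # basis vectors
    b = np.zeros(d)                            # per-column bias
    for _ in range(steps):
        theta = U @ V + b
        R = X - sigmoid(theta)  # gradient of the log-likelihood w.r.t. theta
        U, V = U + lr * (R @ V.T), V + lr * (U.T @ R)
        b = b + lr * R.sum(axis=0)
    return U, V, b
```

A possible usage, on synthetic data invented purely for illustration: build a binary matrix `X` with NumPy, call `U, V, b = fit_logistic_pca(X, rank=2)`, form the logits `U @ V + b`, and compare `reconstruction_error` against a baseline such as thresholding a rank-2 linear PCA reconstruction.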

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.79.4306
Source http://www.cis.upenn.edu/~ais/publications/ssu-aistat2003.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.21.4665, 10.1.1.33.4726, 10.1.1.130.6341, 10.1.1.28.4279, 10.1.1.137.6073, 10.1.1.29.210, 10.1.1.45.4317, 10.1.1.1.3688, 10.1.1.64.1567, 10.1.1.100.1570, 10.1.1.103.1538, 10.1.1.103.6856, 10.1.1.104.7530, 10.1.1.64.3077, 10.1.1.66.5928, 10.1.1.122.702, 10.1.1.128.8294, 10.1.1.5.3301, 10.1.1.59.8796, 10.1.1.61.7078, 10.1.1.61.9750