Publication View

92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster (2000)

Abstract
Artificial neural networks with millions of adjustable parameters and a similar number of training examples are a potential solution for difficult, large-scale pattern recognition problems in areas such as speech and face recognition, classification of large volumes of web data, and finance. The bottleneck is that neural network training involves iterative gradient descent and is extremely computationally intensive. In this paper we present a technique for distributed training of Ultra Large Scale Neural Networks 1 (ULSNN) on Bunyip, a Linux-based cluster of 196 Pentium III processors. To illustrate ULSNN training we describe an experiment in which a neural network with 1.73 million adjustable parameters was trained to recognize machineprinted Japanese characters from a database containing 9 million training patterns. The training runs with a average performance of 163.3 GFlops/s (single precision). With a machine cost of $150,913, this yields a price/performance ...

Publication details
Download http://citeseer.ist.psu.edu/395228.html
Source http://www.sc2000.org/techpapr/papers/pap.pap255.pdf
Publisher unknown
Contributors The Pennsylvania State University CiteSeer Archives
Repository CiteSeer (United States)
Keywords Jonathan Baxter,Robert Edwards 92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster
Language Englisch
Relation oai:CiteSeerPSU:161393, oai:CiteSeerPSU:263587, oai:CiteSeerPSU:312269