Neural Networks for Pattern RecognitionThis book provides the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition. After introducing the basic concepts of pattern recognition, the book describes techniques for modelling probability density functions, and discusses the properties and relative merits of the multi-layer perceptron and radial basis function network models. It also motivates the use of various forms of error functions, and reviews the principal algorithms for error function minimization. As well as providing a detailed discussion of learning and generalization in neural networks, the book also covers the important topics of data processing, feature extraction, and prior knowledge. The book concludes with an extensive treatment of Bayesian techniques and their applications to neural networks. |
Contents
1 Statistical Pattern Recognition | 1 |
2 Probability Density Estimation | 33 |
3 SingleLayer Networks | 77 |
4 The Multilayer Perceptron | 116 |
5 Radial Basis Functions | 164 |
6 Error Functions | 194 |
7 Parameter Optimization Algorithms | 253 |
8 Preprocessing and Feature Extraction | 295 |
10 Bayesian Techniques | 385 |
Symmetric Matrices | 440 |
Gaussian Integrals | 444 |
Lagrange Multipliers | 448 |
Calculus of Variations | 451 |
Principal Components | 454 |
| 457 | |
| 477 | |
Other editions - View all
Common terms and phrases
activation function algorithm approach approximation back-propagation basis function network Bayes Bayesian bias C₁ Chapter class Ck classification problems coefficients component computational consider corresponding covariance matrix data points data set decision boundary defined denotes density estimation density function derivatives dimensionality discriminant function discussed in Section eigenvalues eigenvectors equations evaluate example expression feed-forward Gaussian give given gradient descent Hessian matrix hidden units input data input space input variables input vector kernel linear discriminant maximum likelihood minimization minimum mixture model multi-layer perceptron network mapping network outputs neural networks non-linear normal obtain optimal output units output variables parameters polynomial posterior distribution posterior probabilities pre-processing prior probabilities probability density procedure quadratic radial basis function regression regularization represents result search direction sigmoid simple solution sum-of-squares error function target data target values techniques term theorem training data training set transformation variance w₁ weight space weight vector Yk(x



