Discussion
Mean subtraction (a.k.a. "mean centering") is necessary for performing PCA to ensure that the first principal component describes the direction of maximum variance. If mean subtraction is not performed, the first principal component might instead correspond more or less to the mean of the data. A mean of zero is needed for finding a basis that minimizes the mean square error of the approximation of the data.
Assuming zero empirical mean (the empirical mean of the distribution has been subtracted from the data set), the principal component w1 of a data set X can be defined as:
(See arg max for the notation.) With the first k − 1 components, the kth component can be found by subtracting the first principal components from X:
and by substituting this as the new data set to find a principal component in
PCA is equivalent to empirical orthogonal functions (EOF), a name which is used in meteorology.
An autoencoder neural network with a linear hidden layer is similar to PCA. Upon convergence, the weight vectors of the K neurons in the hidden layer will form a basis for the space spanned by the first K principal components. Unlike PCA, this technique will not necessarily produce orthogonal vectors.
PCA is a popular primary technique in pattern recognition. It is not, however, optimized for class separability. An alternative is the linear discriminant analysis, which does take this into account.
Read more about this topic: Principal Component Analysis
Famous quotes containing the word discussion:
“Americans, unhappily, have the most remarkable ability to alchemize all bitter truths into an innocuous but piquant confection and to transform their moral contradictions, or public discussion of such contradictions, into a proud decoration, such as are given for heroism on the battle field.”
—James Baldwin (19241987)
“We cannot set aside an hour for discussion with our children and hope that it will be a time of deep encounter. The special moments of intimacy are more likely to happen while baking a cake together, or playing hide and seek, or just sitting in the waiting room of the orthodontist.”
—Neil Kurshan (20th century)
“It was heady stuff, recognizing ourselves as an oppressed class, but the level of discussion was poor. We explained systemic discrimination, and men looked prettily confused and said: But, I like women.”
—Jane OReilly, U.S. feminist and humorist. The Girl I Left Behind, ch. 2 (1980)