General
Feature extraction involves simplifying the amount of resources required to describe a large set of data accurately. When performing analysis of complex data one of the major problems stems from the number of variables involved. Analysis with a large number of variables generally requires a large amount of memory and computation power or a classification algorithm which overfits the training sample and generalizes poorly to new samples. Feature extraction is a general term for methods of constructing combinations of the variables to get around these problems while still describing the data with sufficient accuracy.
Best results are achieved when an expert constructs a set of application-dependent features. Nevertheless, if no such expert knowledge is available general dimensionality reduction techniques may help. These include:
- Principal component analysis
- Semidefinite embedding
- Multifactor dimensionality reduction
- Multilinear subspace learning
- Nonlinear dimensionality reduction
- Isomap
- Kernel PCA
- Multilinear PCA
- Latent semantic analysis
- Partial least squares
- Independent component analysis
- Autoencoder
Read more about this topic: Feature Extraction
Famous quotes containing the word general:
“The following general definition of an animal: a system of different organic molecules that have combined with one another, under the impulsion of a sensation similar to an obtuse and muffled sense of touch given to them by the creator of matter as a whole, until each one of them has found the most suitable position for its shape and comfort.”
—Denis Diderot (17131784)
“We have left undone those things which we ought to have done; and we have done those things which we ought not to have done.”
—Morning Prayer, General Confession, Book of Common Prayer (1662)
“He who never sacrificed a present to a future good or a personal to a general one can speak of happiness only as the blind do of colors.”
—Olympia Brown (18351900)