Use in Artificial Intelligence, Machine Learning, and Statistics
In artificial intelligence or machine learning, a training set consists of an input vector and an answer vector, and is used together with a supervised learning method to train a knowledge database (e.g. a neural net or a naive bayes classifier) used by an AI machine.
In statistical modeling, a training set is used to fit a model that can be used to predict a "response value" from one or more "predictors." The fitting can include both variable selection and parameter estimation. Statistical models used for prediction are often called regression models, of which linear regression and logistic regression are two examples.
In these fields, a major emphasis is placed on avoiding overfitting, so as to achieve the best possible performance on an independent test set that follows the same probability distribution as the training set.
Read more about this topic: Training Set
Famous quotes containing the words artificial, machine and/or statistics:
“Before I finally went into winter quarters in November, I used to resort to the north- east side of Walden, which the sun, reflected from the pitch pine woods and the stony shore, made the fireside of the pond; it is so much pleasanter and wholesomer to be warmed by the sun while you can be, than by an artificial fire. I thus warmed myself by the still glowing embers which the summer, like a departed hunter, had left.”
—Henry David Thoreau (18171862)
“The machine unmakes the man. Now that the machine is perfect, the engineer is nobody. Every new step in improving the engine restricts one more act of the engineer,unteaches him.”
—Ralph Waldo Emerson (18031882)
“He uses statistics as a drunken man uses lamp-postsfor support rather than illumination.”
—Andrew Lang (18441912)