Use in Artificial Intelligence, Machine Learning, and Statistics
In artificial intelligence or machine learning, a training set consists of an input vector and an answer vector, and is used together with a supervised learning method to train a knowledge database (e.g. a neural net or a naive bayes classifier) used by an AI machine.
In statistical modeling, a training set is used to fit a model that can be used to predict a "response value" from one or more "predictors." The fitting can include both variable selection and parameter estimation. Statistical models used for prediction are often called regression models, of which linear regression and logistic regression are two examples.
In these fields, a major emphasis is placed on avoiding overfitting, so as to achieve the best possible performance on an independent test set that follows the same probability distribution as the training set.
Read more about this topic: Training Set
Famous quotes containing the words artificial, machine and/or statistics:
“You must recollect however that I know nothing of painting & that I detest it, unless it reminds me of something I have seen or think it possible to see.... Depend upon it of all the arts it is the most artificial & unnatural& that by which the nonsense of mankind is the most imposed upon.”
—George Gordon Noel Byron (17881824)
“What is man, when you come to think upon him, but a minutely set, ingenious machine for turning, with infinite artfulness, the red wine of Shiraz into urine?”
—Isak Dinesen [Karen Blixen] (18851962)
“and Olaf, too
preponderatingly because
unless statistics lie he was
more brave than me: more blond than you.”
—E.E. (Edward Estlin)