Training Set - Use in Artificial Intelligence, Machine Learning, and Statistics

Use in Artificial Intelligence, Machine Learning, and Statistics

In artificial intelligence or machine learning, a training set consists of an input vector and an answer vector, and is used together with a supervised learning method to train a knowledge database (e.g. a neural net or a naive bayes classifier) used by an AI machine.

In statistical modeling, a training set is used to fit a model that can be used to predict a "response value" from one or more "predictors." The fitting can include both variable selection and parameter estimation. Statistical models used for prediction are often called regression models, of which linear regression and logistic regression are two examples.

In these fields, a major emphasis is placed on avoiding overfitting, so as to achieve the best possible performance on an independent test set that follows the same probability distribution as the training set.

Read more about this topic:  Training Set

Famous quotes containing the words artificial, machine and/or statistics:

    Nothing strengthens the judgment and quickens the conscience like individual responsibility. Nothing adds such dignity to character as the recognition of one’s self-sovereignty; the right to an equal place, everywhere conceded—a place earned by personal merit, not an artificial attainment by inheritance, wealth, family and position.
    Elizabeth Cady Stanton (1815–1902)

    Man is a shrewd inventor, and is ever taking the hint of a new machine from his own structure, adapting some secret of his own anatomy in iron, wood, and leather, to some required function in the work of the world.
    Ralph Waldo Emerson (1803–1882)

    and Olaf, too

    preponderatingly because
    unless statistics lie he was
    more brave than me: more blond than you.
    —E.E. (Edward Estlin)