Ground Truth - Statistics and Machine Learning

Statistics and Machine Learning

In machine learning, the term "ground truth" refers to the accuracy of the training set's classification for supervised learning techniques. This is used in statistical models to prove or disprove research hypotheses. The verb "ground truthing" refers to the process of gathering the proper objective data for this test. Compare with Gold standard (test).

Bayesian spam filtering is a common example of supervised learning. In this system, the algorithm is manually taught the differences between spam and non-spam. This depends on the ground truth of the messages used to train the algorithm; inaccuracies in that ground truth will correlate to inaccuracies in the resulting spam/non-spam verdicts.

Read more about this topic:  Ground Truth

Famous quotes containing the words statistics, machine and/or learning:

    and Olaf, too

    preponderatingly because
    unless statistics lie he was
    more brave than me: more blond than you.
    —E.E. (Edward Estlin)

    The chrysanthemums’ astringent fragrance comes
    Each year to disguise the clanking mechanism
    Of machine within machine within machine.
    Wallace Stevens (1879–1955)

    Our goal as a parent is to give life to our children’s learning—to instruct, to teach, to help them develop self-discipline—an ordering of the self from the inside, not imposition from the outside. Any technique that does not give life to a child’s learning and leave a child’s dignity intact cannot be called discipline—it is punishment, no matter what language it is clothed in.
    Barbara Coloroso (20th century)