Statistics and Machine Learning
In machine learning, the term "ground truth" refers to the accuracy of the training set's classification for supervised learning techniques. This is used in statistical models to prove or disprove research hypotheses. The verb "ground truthing" refers to the process of gathering the proper objective data for this test. Compare with Gold standard (test).
Bayesian spam filtering is a common example of supervised learning. In this system, the algorithm is manually taught the differences between spam and non-spam. This depends on the ground truth of the messages used to train the algorithm; inaccuracies in that ground truth will correlate to inaccuracies in the resulting spam/non-spam verdicts.
Read more about this topic: Ground Truth
Famous quotes containing the words statistics, machine and/or learning:
“We ask for no statistics of the killed,
For nothing political impinges on
This single casualty, or all those gone,
Missing or healing, sinking or dispersed,
Hundreds of thousands counted, millions lost.”
—Karl Shapiro (b. 1913)
“Man is a beautiful machine that works very badly.”
—H.L. (Henry Lewis)
“The best way of learning to be an independent sovereign state is to be an independent sovereign state.”
—Kwame Nkrumah (19001972)