Machine Learning - Machine Learning, Knowledge Discovery in Databases (KDD) and Data Mining

Machine Learning, Knowledge Discovery in Databases (KDD) and Data Mining

Two terms are commonly confused, as they often employ the same methods and overlap significantly. They can be roughly defined as follows:

  • Machine learning focuses on prediction, based on known properties learned from the training data.
  • Data mining (which is the analysis step of Knowledge Discovery in Databases) focuses on the discovery of (previously) unknown properties on the data.

The two areas overlap in many ways: data mining uses many machine learning methods, but often with a slightly different goal in mind. On the other hand, machine learning also employs data mining methods as "unsupervised learning" or as a preprocessing step to improve learner accuracy. Much of the confusion between these two research communities (which do often have separate conferences and separate journals, ECML PKDD being a major exception) comes from the basic assumptions they work with: in machine learning, performance is usually evaluated with respect to the ability to reproduce known knowledge, while in KDD the key task is the discovery of previously unknown knowledge. Evaluated with respect to known knowledge, an uninformed (unsupervised) method will easily be outperformed by supervised methods, while in a typical KDD task, supervised methods cannot be used due to the unavailability of training data.

Read more about this topic:  Machine Learning

Famous quotes containing the words machine, knowledge, discovery, data and/or mining:

    The momentary charge at Balaklava, in obedience to a blundering command, proving what a perfect machine the soldier is, has, properly enough, been celebrated by a poet laureate; but the steady, and for the most part successful, charge of this man, for some years, against the legions of Slavery, in obedience to an infinitely higher command, is as much more memorable than that as an intelligent and conscientious man is superior to a machine. Do you think that that will go unsung?
    Henry David Thoreau (1817–1862)

    The endless cycle of idea and action,
    Endless invention, endless experiment,
    Brings knowledge of motion, but not of stillness;
    Knowledge of speech, but not of silence;
    Knowledge of words, and ignorance of the Word.
    All our knowledge brings us nearer to our ignorance.
    —T.S. (Thomas Stearns)

    That the discovery of this great truth, which lies so near and obvious to the mind, should be attained to by the reason of so very few, is a sad instance of the stupidity and inattention of men, who, though they are surrounded with such clear manifestations of the Deity, are yet so little affected by them, that they seem as it were blinded with excess of light.
    George Berkeley (1685–1753)

    This city is neither a jungle nor the moon.... In long shot: a cosmic smudge, a conglomerate of bleeding energies. Close up, it is a fairly legible printed circuit, a transistorized labyrinth of beastly tracks, a data bank for asthmatic voice-prints.
    Susan Sontag (b. 1933)

    It’s a mining town in lotus land.
    F. Scott Fitzgerald (1896–1940)