Machine Learning - Machine Learning, Knowledge Discovery in Databases (KDD) and Data Mining

Machine learning and data mining are commonly confused, because they often employ the same methods and overlap significantly. They can be roughly distinguished as follows:

  • Machine learning focuses on prediction, based on known properties learned from the training data.
  • Data mining (the analysis step of Knowledge Discovery in Databases) focuses on the discovery of previously unknown properties in the data.

The two areas overlap in many ways: data mining uses many machine learning methods, but often with a slightly different goal in mind. Conversely, machine learning employs data mining methods as "unsupervised learning" or as a preprocessing step to improve learner accuracy. Much of the confusion between the two research communities (which often have separate conferences and separate journals, ECML PKDD being a major exception) comes from the basic assumptions they work with: in machine learning, performance is usually evaluated with respect to the ability to reproduce known knowledge, while in KDD the key task is the discovery of previously unknown knowledge. When evaluated with respect to known knowledge, an uninformed (unsupervised) method will easily be outperformed by supervised methods, while in a typical KDD task, supervised methods cannot be used because no training data is available.
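
The difference in evaluation can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the toy data set, the choice of a nearest-centroid classifier as the supervised method, a hand-written 2-means clustering as the unsupervised method, and all function names. The data is constructed so that the known labels depend on the second coordinate, while the most prominent grouping lies along the first.

```python
from itertools import product

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coord) / len(points) for coord in zip(*points))

def fit_nearest_centroid(points, labels):
    """Supervised: one centroid per known class label."""
    return {c: mean_point([p for p, l in zip(points, labels) if l == c])
            for c in sorted(set(labels))}

def predict(model, point):
    """Return the class whose centroid is closest to the point."""
    return min(model, key=lambda c: sq_dist(point, model[c]))

def two_means(points, iterations=10):
    """Unsupervised: plain 2-means, seeded with the first and last point.
    Assumes neither cluster becomes empty (true for the toy data below)."""
    centroids = [points[0], points[-1]]
    for _ in range(iterations):
        groups = [[], []]
        for p in points:
            groups[0 if sq_dist(p, centroids[0]) <= sq_dist(p, centroids[1]) else 1].append(p)
        centroids = [mean_point(g) for g in groups]
    return [0 if sq_dist(p, centroids[0]) <= sq_dist(p, centroids[1]) else 1
            for p in points]

def accuracy(predicted, actual):
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)

# Hypothetical toy data: x forms two well-separated groups,
# but the known label depends only on y.
points = list(product([0.0, 1.0, 10.0, 11.0], [0.0, 0.5, 3.0, 3.5]))
labels = [1 if y >= 3.0 else 0 for _, y in points]

train_x, train_y = points[0::2], labels[0::2]   # labeled training data
test_x, test_y = points[1::2], labels[1::2]     # held-out points

# Supervised method: learns the known property from the labels.
model = fit_nearest_centroid(train_x, train_y)
supervised_acc = accuracy([predict(model, p) for p in test_x], test_y)

# Unsupervised method: never sees labels; score it under the better of
# the two possible cluster-to-label mappings.
clusters = two_means(test_x)
direct = accuracy(clusters, test_y)
unsupervised_acc = max(direct, 1 - direct)

print("supervised accuracy:  ", supervised_acc)
print("unsupervised accuracy:", unsupervised_acc)
```

On this constructed data the clustering scores no better than chance against the known labels, yet it recovers the grouping along the first coordinate, which the labels say nothing about. That is the sense in which the two evaluation criteria measure different things; real data sets will rarely separate this cleanly.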
