Machine Learning, Knowledge Discovery in Databases (KDD) and Data Mining
Two terms are commonly confused, as they often employ the same methods and overlap significantly. They can be roughly defined as follows:
- Machine learning focuses on prediction, based on known properties learned from the training data.
- Data mining (which is the analysis step of Knowledge Discovery in Databases) focuses on the discovery of (previously) unknown properties on the data.
The two areas overlap in many ways: data mining uses many machine learning methods, but often with a slightly different goal in mind. On the other hand, machine learning also employs data mining methods as "unsupervised learning" or as a preprocessing step to improve learner accuracy. Much of the confusion between these two research communities (which do often have separate conferences and separate journals, ECML PKDD being a major exception) comes from the basic assumptions they work with: in machine learning, performance is usually evaluated with respect to the ability to reproduce known knowledge, while in KDD the key task is the discovery of previously unknown knowledge. Evaluated with respect to known knowledge, an uninformed (unsupervised) method will easily be outperformed by supervised methods, while in a typical KDD task, supervised methods cannot be used due to the unavailability of training data.
Read more about this topic: Machine Learning
Famous quotes containing the words machine, knowledge, discovery, data and/or mining:
“The machine unmakes the man. Now that the machine is perfect, the engineer is nobody. Every new step in improving the engine restricts one more act of the engineer,unteaches him.”
—Ralph Waldo Emerson (18031882)
“Is America a land of God where saints abide for ever? Where golden fields spread fair and broad, where flows the crystal river? Certainly not flush with saints, and a good thing, too, for the saints sent buzzing into mans ken now are but poor- mouthed ecclesiastical film stars and cliché-shouting publicity agents.
Their little knowledge bringing them nearer to their ignorance,
Ignorance bringing them nearer to death,
But nearness to death no nearer to God.”
—Sean OCasey (18841964)
“Your discovery of the contradiction caused me the greatest surprise and, I would almost say, consternation, since it has shaken the basis on which I intended to build my arithmetic.... It is all the more serious since, with the loss of my rule V, not only the foundations of my arithmetic, but also the sole possible foundations of arithmetic seem to vanish.”
—Gottlob Frege (18481925)
“To write it, it took three months; to conceive it three minutes; to collect the data in itall my life.”
—F. Scott Fitzgerald (18961940)
“Any relation to the land, the habit of tilling it, or mining it, or even hunting on it, generates the feeling of patriotism. He who keeps shop on it, or he who merely uses it as a support to his desk and ledger, or to his manufactory, values it less.”
—Ralph Waldo Emerson (18031882)