Machine Learning, Knowledge Discovery in Databases (KDD) and Data Mining
Machine learning and data mining are commonly confused, as they often employ the same methods and overlap significantly. They can be roughly defined as follows:
- Machine learning focuses on prediction, based on known properties learned from the training data.
- Data mining (which is the analysis step of Knowledge Discovery in Databases) focuses on the discovery of previously unknown properties in the data.
The two areas overlap in many ways: data mining uses many machine learning methods, but often with a slightly different goal in mind. Conversely, machine learning employs data mining methods as "unsupervised learning" or as a preprocessing step to improve learner accuracy. Much of the confusion between the two research communities (which often have separate conferences and journals, ECML PKDD being a major exception) comes from the basic assumptions they work with: in machine learning, performance is usually evaluated by the ability to reproduce known knowledge, while in KDD the key task is the discovery of previously unknown knowledge. Evaluated against known knowledge, an uninformed (unsupervised) method will easily be outperformed by supervised methods, while in a typical KDD task supervised methods cannot be used because no training data is available.
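The difference in evaluation can be made concrete with a small sketch. The toy data, the nearest-centroid classifier, the k-means routine with a fixed initialization, and all function names below are assumptions chosen for illustration, not methods prescribed by either community. The points form two tight groups along x, while the known labels depend only on y. Scored against the known labels, the supervised learner reproduces them, whereas the clustering recovers the x grouping, a structure the labels never described.

```python
from itertools import permutations


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def nearest(point, centers):
    """Index of the center closest to the point."""
    return min(range(len(centers)), key=lambda i: sq_dist(point, centers[i]))


def fit_nearest_centroid(points, labels):
    """Supervised: one centroid per known class label."""
    classes = sorted(set(labels))
    centroids = [mean([p for p, l in zip(points, labels) if l == c])
                 for c in classes]
    return classes, centroids


def kmeans(points, k, iters=20):
    """Unsupervised: plain k-means, initialized with the first k points
    (a simplifying assumption that keeps the sketch deterministic)."""
    centers = list(points[:k])
    for _ in range(iters):
        assign = [nearest(p, centers) for p in points]
        new_centers = []
        for i in range(k):
            members = [p for p, a in zip(points, assign) if a == i]
            new_centers.append(mean(members) if members else centers[i])
        if new_centers == centers:
            break
        centers = new_centers
    return [nearest(p, centers) for p in points]


def accuracy(pred, truth):
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)


def best_match_accuracy(clusters, truth, k):
    """Score clusters against labels under the most favorable
    cluster-id -> label mapping."""
    labels = sorted(set(truth))
    return max(accuracy([perm[c] for c in clusters], truth)
               for perm in permutations(labels, k))


# Toy data: the label depends only on y, the visible grouping only on x.
train_x = [(0, 0), (11, 0), (0, 2), (11, 2)]
train_y = [0, 0, 1, 1]
test_x = [(1, 0), (10, 0), (1, 2), (10, 2)]
test_y = [0, 0, 1, 1]

classes, centroids = fit_nearest_centroid(train_x, train_y)
supervised_pred = [classes[nearest(p, centroids)] for p in test_x]
supervised_acc = accuracy(supervised_pred, test_y)

clusters = kmeans(test_x, k=2)
unsupervised_acc = best_match_accuracy(clusters, test_y, k=2)

print("supervised accuracy:", supervised_acc)
print("clusters found:", clusters)
print("unsupervised accuracy vs. known labels:", unsupervised_acc)
```

On this data the clustering is no better than chance when judged by the known labels, yet the grouping it finds (points near x = 1 versus points near x = 10) is real structure in the data. Whether that counts as a failure or a discovery depends on which of the two goals is being pursued.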