Process
The Knowledge Discovery in Databases (KDD) process is commonly defined with the stages:
- (1) Selection
- (2) Pre-processing
- (3) Transformation
- (4) Data Mining
- (5) Interpretation/Evaluation.
It exists, however, in many variations on this theme, such as the Cross Industry Standard Process for Data Mining (CRISP-DM) which defines six phases:
- (1) Business Understanding
- (2) Data Understanding
- (3) Data Preparation
- (4) Modeling
- (5) Evaluation
- (6) Deployment
or a simplified process such as (1) pre-processing, (2) data mining, and (3) results validation.
Polls conducted in 2002, 2004, and 2007 show that the CRISP-DM methodology is the leading methodology used by data miners. The only other data mining standard named in these polls was SEMMA. However, 3-4 times as many people reported using CRISP-DM. Azevedo and Santos conducted a comparison of CRISP-DM and SEMMA in 2008.
Read more about this topic: Data Mining
Famous quotes containing the word process:
“Opinions are formed in a process of open discussion and public debate, and where no opportunity for the forming of opinions exists, there may be moodsmoods of the masses and moods of individuals, the latter no less fickle and unreliable than the formerbut no opinion.”
—Hannah Arendt (19061975)
“The a priori method is distinguished for its comfortable conclusions. It is the nature of the process to adopt whatever belief we are inclined to, and there are certain flatteries to the vanity of man which we all believe by nature, until we are awakened from our pleasing dream by rough facts.”
—Charles Sanders Peirce (18391914)
“The invention of photography provided a radically new picture-making processa process based not on synthesis but on selection. The difference was a basic one. Paintings were madeconstructed from a storehouse of traditional schemes and skills and attitudesbut photographs, as the man on the street put, were taken.”
—Jean Szarkowski (b. 1925)