Information Integration

Information integration (II) (also called deduplication and referential integrity) is the merging of information from disparate sources with differing conceptual, contextual and typographical representations. It is used in data mining and consolidation of data from unstructured or semi-structured resources. Typically, information integration refers to textual representations of knowledge but is sometimes applied to rich-media content. information fusion which is a related term involves the combination of information into a new set of information towards reducing uncertainty.

An example of technologies available to integrate information include string metrics which allow the detection of similar text in different data sources by fuzzy matching. A host of methods for these research areas are available such as those presented in the International Society of Information Fusion.

Famous quotes containing the words information and/or integration:

    Knowledge is of two kinds. We know a subject ourselves, or we know where we can find information upon it.
    Samuel Johnson (1709–1784)

    The more specific idea of evolution now reached is—a change from an indefinite, incoherent homogeneity to a definite, coherent heterogeneity, accompanying the dissipation of motion and integration of matter.
    Herbert Spencer (1820–1903)