Han Unification

Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the so-called CJK languages into a single set of unified characters. Han characters are a common feature of written Chinese (hanzi), Japanese (kanji), Korean (hanja), and—at least historically—other East and Southeast Asian languages. (See Vietnamese Hán Tự and Chữ Nôm.)

Modern Chinese, Japanese and Korean typefaces typically use regional or historical variants of a given Han character. In the formulation of Unicode, an attempt was made to unify these variants by considering them different glyphs representing the same "grapheme", or orthographic unit, hence, "Han unification", with the resulting character repertoire sometimes contracted to Unihan.

Unihan can also refer to the Unihan Database maintained by the Unicode Consortium, which provides information about all of the unified Han characters encoded in the Unicode standard, including mappings to various national and industry standards, indices into standard dictionaries, encoded variants, pronunciations in various languages, and an English definition. The database is available to the public as text files and via an interactive Web site. The latter also includes representative glyphs and definitions for compound words drawn from the free Japanese EDICT and Chinese CEDICT dictionary projects (which are provided for convenience and are not a formal part of the Unicode standard).

Read more about Han Unification:  Rationale and Controversy, Examples of Language Dependent Characters, Examples of Some Non-unified Han Ideographs, Unicode Ranges, Unihan Database Files

Famous quotes containing the word han:

    men may wel often finde
    A lordes sone do shame and vileinye;
    And he that wol han prys of his gentrye
    For he was boren of a gentil hous,
    Geoffrey Chaucer (1340?–1400)