Corpus linguistics is the study of language as expressed in samples (corpora) of "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally done by hand, corpora are now largely derived by an automated process.
Corpus linguistics adherents believe that reliable language analysis best occurs on field-collected samples, in natural contexts and with minimal experimental interference. Within corpus linguistics there are divergent views as to the value of corpus annotation, from John Sinclair advocating minimal annotation and allowing texts to 'speak for themselves', to others, such as the Survey of English Usage team (based in University College, London) advocating annotation as a path to greater linguistic understanding and rigour.
Linguistics |
---|
Theoretical linguistics |
|
Descriptive linguistics |
|
Applied and experimental linguistics |
|
Related articles |
|
Portal |
Famous quotes containing the word corpus:
“By that bedes side ther kneleth a may,
And she wepeth both nyght and day.
And by that beddes side ther stondith a ston,
Corpus Christiwretyn theron.”
—Unknown. Corpus Christi Carol (l. 1114)