Using Corpora for Language Research: Studies in Honour of Geoffrey LeechJenny Thomas, Mick Short Corpus linguistics is a relatively new subject in linguistics, whereby corpora of spoken or written texts are stored on computer and used as a major source for linguistic analysis. This volume discusses ways in which corpora can be used in language research and their range of current applications. |
Contents
From central embedding to corpus linguistics | 14 |
Sipping a cocktail of corpora | 27 |
Corpora databases and the organization of linguistic data | 36 |
Copyright | |
15 other sections not shown
Common terms and phrases
ACAMRIT adverbial algorithm alignment ambiguity annotation approach assigned Atwell automatic British National Corpus Brown Corpus Claws cognates complement clauses conditional clauses construction corpora corpus linguistics corpus resources corpus-based developed Dice's similarity coefficient dictionary disambiguation discourse English language example frequency future time orientation Gale and Church Garside Geoffrey Leech grammar grammatical tagging input Lancaster University language models language testing learners Leech G N lexical lexicography lexicon LOB Corpus Longman machine learning matching mensa million words Mindt modal multiple central embeddings N-gram noun NRSA number of words occur pairs parser parsing part-of-speech part-of-speech tagging patterns phrase possible potential tags probabilistic pronoun punctuation relative clause reporting clause S&TP score sentence sequence of tenses SGML simple present speech presentation speech recognition structure syntactic Table tagger tagset template rules text grammar thought presentation treebank Variant verb wordtag



