annotation (corpus annotation) – the process of adding descriptive and/or linguistic information to corpus data (cf. Baker 2006: 13). In a general sense, corpus annotation includes both linguistic annotation and structural markup (cf. Meyer 2002: 98-99; Teubert 2007: 139). Structural markup provides descriptive information about the corpus texts, such as bibliographical references or ethnographical information about the authors, and indicates text structure (cf. Meyer 2002: 98). In a narrower sense, corpus annotation usually refers only to linguistic annotation (cf. McEnery, Xiao and Tono 2006: 347; Garside 1997: 2), which includes part-of-speech tagging, syntactic parsing and other kinds of linguistic annotation, such as phonological, semantic, or discourse annotation.