corpus (pl. corpora) – a collection of texts stored in an electronic database. Linguistic corpora are usually large bodies of machine-readable text selected to be representative of a particular language variety or genre, etc. Corpora are often annotated with additional information (annotation) and can be used for both quantitative and qualitative analysis with the help of specially designed corpus software and/or advanced database packages (cf. Baker 2006: 48).
Created with the Personal Edition of HelpNDoc: Free help authoring environment