2.3 Corpus manuals

Parent Previous Next


This handbook is a guide to the usage of the BNC with SARA, an earlier version of XAIRA, included in the BNC's official XML edition. As the main options and organization principles of software are the same, the book can be helpful for dealing with XAIRA as well.


This manual is a guide to conducting research on the basis of the POS-tagged 'Brown family' of corpora (including BROWN, LOB, FLOB, FROWN corpora). It includes a description of POS-tagging and post-editing in Frown and F-LOB, a comparative overview of the frequencies of the different word classes in the four corpora of the 'Brown family', information on the composition of the four corpora, the C8 tagset, a complete table of the frequencies of major POS-tags and an overview of original and revised corpus markup codes.

https://webspace.utexas.edu/lh9896/public/hinrichs/Manual_final.pdf


This book is a detailed step-by-step introduction to the exploration of the main methodological issues in corpus linguistics with BNCweb - a web-based interface of the British National Corpus. The tasks and exercises, along with thorough explanations introduce all the main options of BNCweb for quantitative and qualitative analysis of the corpus such as creating subcorpora, reimporting data and covering simple word queries. They also cover automatic options, advanced CQP query syntax as well as statistical and categorizing tools.


This is a brief introduction to the FLOB corpus with basic information about the corpus, its sampling techniques and its mark-up conventions. The manual contains lists of codes and text categories, as well as a catalogue of corpus-related publications. It also serves as a useful source for basic information about the FLOB corpus.

http://khnt.aksis.uib.no/icame/manuals/flob/INDEX.HTM.


This comprehensive guide to ICE-GB introduces the corpus with the ICE corpus utility program (ICECUP). It presents basic and advanced options of query building and language analysis, including searching for syntactic structures with ICECUP's Fuzzy Tree Fragment models. The book contains six case studies illustrating the usage of ICE-GB for linguistic research and valuable references with a reference guide to ICE grammar and additional information on ICE-GB sampling, usage of special symbols and mark-up annotation.


This manual is a thorough description of the rich ICE tagset, which provides full information on its complex structure and explains complex cases of tag assignment with many illustrative examples from the corpus.

www.ice-corpora.net/ice/TaggingManual.doc



Created with the Personal Edition of HelpNDoc: Single source CHM, PDF, DOC and HTML Help creation