This book introduces the reader to freely accessible online corpora and their research possibilities. After discussions of basic search techniques in corpus studies and main issues of linguistic analysis, each chapter provides detailed guidelines on corpus-based research in lexis, grammar and syntax, discourse and phonology. The book explains basic search techniques and main analytical tools with the help of simple exercises in five online corpora, a number of case studies, a helpful glossary and further reading sections. The chapter “Exploring Grammar with Corpora” is of special interest as further reading.
This alphabetic guide provides definitions of and information on key terms used in corpus linguistics, including technical and linguistic terms, statistical measures, main corpora and software. It also offers a list of abbreviations and an extensive bibliography. However, aiming to be useful for a wide range of users, the mixture of complicated specific terms makes it unsuited as an introductory glossary.
The book presents a comprehensive survey of the ways in which corpus linguistics is being used by researchers. The contributors present analyses of recent methods and theories in corpus linguistics, development of new software, and recent applications in translation, stylistics, discourse analysis and language teaching. Each chapter in the book reviews key and current work in a particular field of linguistics, with a recent example or case study of the author's own research in this area.
Introducing the corpus-based approach to the study of language, this book contains chapters on lexicography, grammar, discourse, register variation, language acquisition and historical linguistics. It contains an appendix on corpora and analytical tools, and methodology boxes devoted to characteristics of tagged corpora and the tagging process.
The first part of the book provides descriptions of some corpora, including the BNC and the ICE and discusses the main topics of the discipline of corpus linguistics such as text encoding, text annotating, corpus analysis (with an overview of the main software options) and types of linguistic annotation. The second part is devoted to computational linguistics and its contributions to and benefits from corpus linguistics, especially in the field of computational tools, such as ICECUP, CLAWS and Wordsmith. In conclusion, some recommendations on building a corpus are given. The book contains a glossary of main terms and a collection of useful Internet resources.
Introducing the reader to corpus annotation, the book covers all the main kinds of annotation, including grammatical tagging as well as syntactic, semantic and discourse annotation. The process of tagging and the use of annotation tools are explained by means of easy-to-follow examples. A review of the exploitation of annotated corpora is given. Appendices include a glossary of acronyms and abbreviations, sources of further information and a collection of tagsets. As additional reading for the chapter by Smith and Seoane, a chapter on grammatical tagging can be of special interest, along with discussions on the accuracy of automatic tagging and a comparative overview of different tagsets and developments of tagging software in practical application.
This book is a comprehensive introduction to syntactic wordclass tagging - an essential feature of annotated corpora. Describing the state of the art in syntactic word tagging, the book presents this complicated field from the points of view of both users of tagged texts and creators of tagging software. The book consists of chapters written by experts in specific subfields. As additional reading for the chapter, the chapters on the use of tagging and the introduction to tagsets are recommended.
The first part of the book presents the techniques of corpus-based language studies and demonstrates the application of corpora in various fields, discussing key terms, types of corpora, issues of corpus design, methods of interpreting concordance lines, main analytical options available in corpora, as well as tagging, parsing and other kinds of linguistic annotation. The second part is devoted to applications of corpora in language teaching.
This book covers all the main branches of corpus linguistics, such as the design and development of corpora, corpus-based descriptions of corpora, corpus analysis and its implications and applications. It contains chapters on corpus-based grammatical studies, corpus annotations, basic procedures used in corpus analysis, as well as thorough descriptions of a few corpora including the BNC and the ICE, and software such as WordSmith, ICECUP and CQP.
The book provides an introduction to the use of electronic corpora in the description and analysis of English. After introducing corpora and the basic terminology of corpus linguistics, the author presents a number of case studies providing insights into vocabulary, collocations, phraseology, metaphor and metonymy, syntactic structures, male and female language, and language style. The final chapter discusses the use of the web as a corpus. Each chapter contains study questions, web-based exercises and suggestions for further reading and is illustrated with case studies and step-by-step guidelines for following the original research. There is also an accompanying web page with exercises and updated information about web-based corpora.
http://www.euppublishing.com/series/ETOTELAdvanced/Lindquist
This book includes sections on the history of corpus linguistics, corpus compilation and reprocessing corpora, as well as an extensive overview of existing corpora including thorough descriptions of the BNC, the ICE and the Brown family. Further, it addresses some of the major aspects of use and exploitation of corpora. The section on reprocessing corpora includes articles on corpus annotation, part-of-speech tagging, tokenization, morphological and semantic tagging, as well as searching and concordancing and quality assurance of annotated corpora. There is also an article devoted to corpus-based grammar studies.
This course-book on the corpus-based approach to language provides an introduction to corpus linguistics, existing types of corpora and their main features, the use of corpora in different branches of language studies, including a chapter on part-of-speech analysis and parsing. It contains case studies, a glossary, study questions and practical exercises, as well as a catalogue of corpora and software for corpus research.
The book provides an overview of analytical issues, techniques, and findings, as well as a collection of key articles published up to date, and a set of case studies illustrating the most important analytical and design issues. It includes an overview of available corpora with descriptions and a collection of excerpts from published material covering various research areas in more depth. The book contains exercises and follow-up questions, an appendix of useful Internet links on corpus resources and tools. The companion websites contain a fair-sized catalogue of useful resources, as well as an extensive corpus survey with an overview of all the major English corpora.
http://www.lancs.ac.uk/fass/projects/corpus/cbls/default.asp
http://www.routledge.com/textbooks/0415286239/default.asp
Covering all the main branches of corpus linguistics, the book discusses the role of corpus linguistics in linguistic theory and focuses, in great detail, on the process of creating a corpus, collecting and computerizing data, annotating data and conducting a corpus analysis.It also includes an extensive catalogue of corpus resources and concordancing programs with short descriptions and web-links.
This book introduces the main concepts of corpus linguistics and provides a brief historical overview. Having no chapter devoted to grammar studies in particular, it contains a short glossary of basic terms and references to some corpora which are available online.
Created with the Personal Edition of HelpNDoc: Full-featured Help generator