Corpus linguistic software, corpora and databases for language research
Students in Linguistics and Modern Languages have access to a wide range of electronic resources for language research. These include linguistic corpora (language databases) such as the following:
· British National Corpus (100 million words of written and spoken British English)
· Brown Corpus of 1960s American English
· LOB (Lancaster-Oslo-Bergen) Corpus of 1960s British English
· FLOB (Freiburg-LOB) Corpus of 1990s British English
· Frown (Freiburg-Brown) Corpus of 1990s American English
· Kolhapur Corpus of Indian English
· Australian Corpus of English
· Bergen Corpus of London Teenage Language (COLT)
· Helsinki Corpus of English Texts: Diachronic Part (including Old, Middle and Early Modern English)
· Helsinki Corpus of Older Scots
· Corpus of Early English Correspondence
In addition, we hold numerous specially constructed and annotated corpora compiled in the department, including the Early Modern English Discourse Presentation Corpus and the Early Modern English Modality Corpus.
We also have a range of licensed software available for corpus linguistic analysis. Currently this includes the following:
· Wmatrix
· WordSmith Tools
· Sketch Engine
We encourage students to make use of the wide variety of freeware available for corpus linguistic analysis. Some particularly useful packages include:
· AntConc
· WebCorp
· Multilingual Corpus Toolkit
You can also access many databases designed to support language research, such as Early English Books Online, ProQuest Newsstand (providing access to news archives) and the online Oxford English Dictionary.