Small Business Resources, Business Advice and Forms from AllBusiness.com
Categories New Releases Bestsellers Special Offers Security

Billion-word Corpus opens

The corpus was set up in 2000 as a research tool for the OUP dictionaries programme, collecting examples of English from international books, journals, and internet resources, including transcribed speeches and chatrooms.
It will be made available to academics for research purposes, with a secure login enabling access from any computer.
Judy Pearsall, OUP publishing manager for English dictionaries, said the OEC was the first corpus to examine "worldwide English" from sources that are "carefully selected to be representative across the board". It follows the British National Corpus, launched in 1993, which was set up by a consortium including OUP and now holds 100 million words of British English.
Software on the OEC enables words to be analysed for positive or negative, and male or female, associations. It also allows researchers to examine the frequency of words--and recurring mistakes such as "damp squid" that are filtering into the language.
"It’s about looking at different types of language, how they’re developing and how words are linked," Pearsall said. "It is the gold on which everything is based."

In addition, make sure to read these articles: