Zeitschrift für Anglistik und Amerikanistik

A Quarterly of Language, Literature and Culture

[Journal of English and American Studies]

Ed. by Butter, Michael / Eckstein, Lars / Frenk, Joachim / Georgi-Findlay, Brigitte / Herbst, Thomas / Korte, Barbara / Leypoldt, Günter / Reinfandt, Christoph / Stefanowitsch, Anatol

4 Issues per year

CiteScore 2017: 0.07

SCImago Journal Rank (SJR) 2017: 0.123
Source Normalized Impact per Paper (SNIP) 2017: 0.323

Volume 54, Issue 2


How Random is a Corpus? The Library Metaphor

Stefan Evert


There is a stark contrast between the random sample model underlying the statistical analysis of corpus frequency data and our intuitive knowledge that sentences are more than random bags of words. The 'library metaphor' illustrates how randomness results from the selection of a corpus as the basis for a linguistic study. At the same time, it reveals two reasons why corpus data do not fully meet the assumptions of the random sample model. Finally, practicable methods for identifying and quantifying non-randomness are introduced and demonstrated using the example of passive verb forms.

About the article

Published in Print: 2006-04-01

Citation Information: Zeitschrift für Anglistik und Amerikanistik, Volume 54, Issue 2, Pages 177–190, ISSN (Online) 2196-4726, DOI: https://doi.org/10.1515/zaa-2006-0208.

© 2014 by Walter de Gruyter Berlin/Boston.
