Jump to ContentJump to Main Navigation
Show Summary Details
More options …


International Journal of Theoretical Linguistics

Ed. by Roelcke, Thorsten / Kelih, Emmerich / Köhler, Reinhard

Editorial Board: Altmann, Gabriel / Andreev, Sergey / Bär, Jochen A. / Embleton, Sheila / Gries, Stefan Th. / Grzybek, Peter / Leiss, Elisabeth / Liu, Haitao / Macutek, Ján / Bruch Nemcová, Emilia / Patrás, Vladimír / Sanada, Haruko / Wilson, Andrew

2 Issues per year

CiteScore 2017: 0.21

SCImago Journal Rank (SJR) 2017: 0.126
Source Normalized Impact per Paper (SNIP) 2017: 0.429

See all formats and pricing
More options …

A Corpus-Based Study on English and Chinese Intertextual Vocabulary Growth

Zhao Xiaodong


Based on the Chinese-English Sentence-Aligned Bilingual Corpus constructed by Institute of Automation and Institute of Computing Technology of Chinese Academy of Sciences, this paper investigates the Chinese and English inter-textual type/token relationship and tests the fitness of BRUNET’s model to the vocabulary growth curves of Chinese and English texts, and it also explores the growth patterns of hapax legomena in the two languages. Results of the study show that Chinese and English vocabulary growth displays a similar sharp-slow increasing tendency, but initially with the Chinese types rising more sharply than those of English; and BRUNET’s model is powerful enough to match both Chinese and English inter-textual type/token relationship. This study also finds that there are far fewer hapax legomena in Chinese than in English, and with the increase of tokens, the hapax legomena in the two languages both display a growth pattern similar to that of their type/token relationship. But from the cross point (about 2,500,000 cumulative word tokens) downwards, the cumulative number of Chinese hapax legomena has become much smaller than that of English.

Keywords: corpus; vocabulary growth; Brunet’s model; hapax legomena

About the article

Published in Print: 2013-04-01

Citation Information: Glottotheory International Journal of Theoretical Linguistics, Volume 4, Issue 1, Pages 119–131, ISSN (Print) 1337-7892, DOI: https://doi.org/10.1524/glot.2013.0010.

Export Citation

© by Akademie Verlag, Berlin, Germany.Get Permission

Comments (0)

Please log in or register to comment.
Log in