Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Glottotheory

International Journal of Theoretical Linguistics

Ed. by Roelcke, Thorsten / Kelih, Emmerich / Köhler, Reinhard

Editorial Board Member: Altmann, Gabriel / Andreev, Sergey / Embleton, Sheila / Gries, Stefan Th. / Grzybek, Peter / Leiss, Elisabeth / Liu, Haitao / Macutek, Ján / Bruch Nemcová, Emilia / Patrás, Vladimír / Sanada, Haruko / Wilson, Andrew

2 Issues per year


SCImago Journal Rank (SJR) 2016: 0.125

Online
ISSN
2196-6907
See all formats and pricing
More options …

A Corpus-Based Study on English and Chinese Intertextual Vocabulary Growth

Zhao Xiaodong

Abstract

Based on the Chinese-English Sentence-Aligned Bilingual Corpus constructed by Institute of Automation and Institute of Computing Technology of Chinese Academy of Sciences, this paper investigates the Chinese and English inter-textual type/token relationship and tests the fitness of BRUNET’s model to the vocabulary growth curves of Chinese and English texts, and it also explores the growth patterns of hapax legomena in the two languages. Results of the study show that Chinese and English vocabulary growth displays a similar sharp-slow increasing tendency, but initially with the Chinese types rising more sharply than those of English; and BRUNET’s model is powerful enough to match both Chinese and English inter-textual type/token relationship. This study also finds that there are far fewer hapax legomena in Chinese than in English, and with the increase of tokens, the hapax legomena in the two languages both display a growth pattern similar to that of their type/token relationship. But from the cross point (about 2,500,000 cumulative word tokens) downwards, the cumulative number of Chinese hapax legomena has become much smaller than that of English.

Keywords: corpus; vocabulary growth; Brunet’s model; hapax legomena

About the article

Published in Print: 2013-04-01


Citation Information: Glottotheory International Journal of Theoretical Linguistics, ISSN (Print) 1337-7892, DOI: https://doi.org/10.1524/glot.2013.0010.

Export Citation

© by Akademie Verlag, Berlin, Germany. Copyright Clearance Center

Comments (0)

Please log in or register to comment.
Log in