, cognateness, wordfrequency and length, has been conducted in relation to general English vocabulary learning. Given the lack of such research in relation to English academic vocabulary, the present study examines how predictive cognateness, wordfrequency and length are of EFL learners’ ability to recognise English academic vocabulary. In this study participants are Spanish L1 EFL learners. This L1-L2 pair was chosen due to a pedagogical consideration. Research suggests that Spanish-English bilinguals do not recognise all cognates as such (e. g. Nagy et al. 1993 ; August
Universytet Warszawski, Katedra Lingwistyki Formalnej, Warszawa, 2001.
Pawłowski, Adam (2005): Modelling of sequential structures in Text. In: Köhler,
Reinhard, Altmann, Gabriel - Piotrovskij, Rajmund G. (eds.), Quantitative Lin-
guistik. Ein Internationales Handbuch. Quantitative Linguistics. An Internation
Handbook. Berlin, New York: Mouton de Gruyter, 738-750.
Wimmer, Gejza - Altmann, Gabriel (1999), Thesaurus of univariate discrete probabi-
lity distributions. Essen: Stamm.
The project (2007-2008) is a common
This list is arranged according to the frequency of occurrence of words.
The list consists of four columns:
• Column 1: Reference No.
Gives each word a reference number. This number is used as cross reference in other sections where
the meanings of words are not given.
• Column 2: Syriac Lexical Entry.
Gives the Syriac form of the word in vocalized Serto (Western) script.
• Column 3: Category.
Gives the grammatical (i.e. morphological) category of the lexical entry.
• Column 4: English Meanings.
Can an English WordFrequency Counter Serve the Requirements
of Persian Automatic Indexing?
Mohammad Reza Falahati-Fumani1
Abstract. This article assessed the possible suitability of an English wordfrequency counter for
Persian. Ten Persian articles related to agriculture were input to the software. The error analysis
revealed that it had the potential to be used for Persian but modifications were necessary to reduce
the errors. The main source of the errors lied in Persian morphology and alphabet and the
availability of Persian and
Wordfrequency and word order in freezes1
The theoretical part of this analysis attempts to explain and predict
sequencing phenomena by means of principles of cognitive economy: in
order to avoid peaks of information that are difficult to process (Fenk and
Fenk 1980), and also because of their easier 'lexical' and articulatory
accessibility, the more frequent and therefore informationally poorer ele-
ments tend to occupy initial position (Fenk-Oczlon 1983a). In the context
of 'freezes', this means 'more frequent (high-token frequency
Word Length and WordFrequency in Slovak
Katarína Jedličková, Emília Nemcová1
Abstract. The aim of this article is to test the well-known Zipfian hypotheses concerning the relationships
between length and frequency of words using Slovak data.
In his seminal works G.K. Zipf (1932, 1935/68, 1950) expressed the following
hypothesis concerning wordfrequency: „…the larger a word is in length, the less
likely it is to be used.“ (1935: 22) or „…the magnitude of words tends, on the whole,
to stand in an inverse (not necessarily proportionate) relationship to the
11 Wordfrequency and position in sentence
Word forms represent the elementary concept forming level (empirical con-
cept) which can be abandoned in different more abstract directions. One can
replace a word form by a lemma, by its part of speech, by its denotation and
references, by its different kinds of length-related or other properties, and
finally, by its frequency. Whatever the replacement, the original empirical
“word” will be partially or completely bereft of its original form. We obtain a
series of symbols, which can be considered as
4 The geometry of wordfrequencies
The points h, k, m and n which have been defined in the preceding chapters
can be used not only related to some elementary questions asked in the field
of textology (such as to vocabulary richness, or text coverage), but can also
give us the possibility of characterizing the distribution in a geometric way.
How did the writer manage to place the given restricted number of words (V )
in the texts? Consider first the rank frequency distribution.
In very short texts it is possible to use each word only once. In
replication study. However, as argued below, the present study adopts better-suited analyses to investigate these questions. The factor “wordfrequency” has (to my knowledge) not been investigated in L2 speech before. Additionally, the current study will investigate potential differences in pause durations between L1 and L2 speech, taking pause placement into account. Riazantseva (2001) also investigated differences in pause duration between L1 and L2 speech and between speakers with different levels of proficiency. She found that higher proficient speakers paused, on