Corpus Linguistics and Linguistic Theory

Founded by Gries, Stefan Th. / Stefanowitsch, Anatol

Ed. by Wulff, Stefanie

IMPACT FACTOR 2017: 1.200
5-year IMPACT FACTOR: 1.386

CiteScore 2017: 0.80

SCImago Journal Rank (SJR) 2017: 0.288
Source Normalized Impact per Paper (SNIP) 2017: 0.930

Syntactic annotation in the Reference Corpus for the Processing of Basque (EPEC): Theoretical and practical issues

Izaskun Aldezabal / Maria Jesus Aranzabe / Jose Mari Arriola / Arantza Diaz de Ilarraza
Published Online: 2009-10-16 | DOI: https://doi.org/10.1515/CLLT.2009.010


In this paper, we describe some theoretical and practical issues raised during the construction of the Basque Dependency Treebank (BDT), the syntactic annotation of EPEC (Reference Corpus for the Processing of Basque). EPEC is a 300,000-word corpus of standard written Basque intended as a training corpus for the development and improvement of several NLP (Natural Language Processing) tools for Basque. BDT will be the first corpus for the Basque language tagged at the syntactic level. We also present the dependency-based annotation hierarchy that we have established for the syntactic tagging. Decisions made during the design of the annotation hierarchy are based on the description of Basque grammar made by Euskaltzaindia (Academy for the Basque Language). When describing dependency relations, we consider lexical units as syntactic heads. This will open up a way for us to work with semantics.

Keywords: Syntactic annotation; dependency grammar; treebank

About the article

Published Online: 2009-10-16

Published in Print: 2009-09-01

Citation Information: Corpus Linguistics and Linguistic Theory, Volume 5, Issue 2, Pages 241–269, ISSN (Online) 1613-7035, ISSN (Print) 1613-7027, DOI: https://doi.org/10.1515/CLLT.2009.010.

Citing Articles

Ainara Estarrona, Izaskun Aldezabal, and Arantza Díaz de Ilarraza
Language Resources and Evaluation, 2018
Ainara Estarrona, Izaskun Aldezabal, Arantza Díaz de Ilarraza, and María Jesús Aranzabe
Digital Scholarship in the Humanities, 2016, Volume 31, Number 3, Page 470
