Jump to ContentJump to Main Navigation

Online

49,00 € / $74.00*

* Prices subject to change. Shipping costs will be added if applicable.
Publication Date:
December 2007
ISSN:
1613-3706
DOI:
10.1515/ZFS.2007.016

See all formats and pricing

Online
Individual Subscription Online only
Euro [D] 49.00
RRP for USA, Canada, Mexico
US$ 74.00 *
Print
Individual Subscription Online only
Euro [D] 125.00
RRP for USA, Canada, Mexico
US$ 188.00 *
Print + Online
Individual Subscription Online only
Euro [D] 150.00
RRP for USA, Canada, Mexico
US$ 226.00 *
*Prices subject to change. Shipping costs will be added if applicable.

Managing Editor: Holler, Anke

2 Issues per year

Increased IMPACT FACTOR 2011: 1.273
Rank 23 out of 161 in category Linguistics in the 2011 Thomson Reuters Journal Citation Report/Social Sciences Edition

VolumeIssuePage

Issues

Eigennamenerkennung zwischen morphologischer Analyse und Part-of-Speech Tagging: ein automatentheoriebasierter Ansatz

Jörg Didakowski1 / Alexander Geyken2 / Thomas Hanneforth3

1

2

3

Citation Information: Zeitschrift für Sprachwissenschaft. Volume 26, Issue 2, Pages 157–186, ISSN (Online) 1613-3706, ISSN (Print) 0721-9067, DOI: 10.1515/ZFS.2007.016, December 2007

Publication History:
Received:
2007-02-02
Revised:
2007-06-12
Published Online:
2007-12-04

Abstract

Previous rule-based approaches for Named Entity Recognition (NER) in German base NER on Part-of-Speech tagged texts. We present a new approach where NER is situated between morphological analysis and Part-of-Speech Tagging and model the NER-grammar entirely with weighted finite state transducers (WFST). We show that NER strategies like the resolution of proper noun/common noun or company-name/family-name ambiguities can be formulated as a best path function of a WFST. The frequently used second pass resolution of coreferential Named Entities can be formulated as a re-assignment of appropriate weights. A prototypical NE recognition system built on the basis of WSFT and large lexical resources was tested on a manually annotated corpus of 65,000 tokens. The results show that our system compares in recall and precision to existing rule-based approaches.

Keywords: Named Entity Recognition; weighted finite state transducers; large lexical resources

Comments (0)

Please log in or register to comment.