Accessible Requires Authentication Published by De Gruyter November 24, 2014

Korpusbasierte Exploration der Variation der nominalen Genitivmarkierung

Noah Bubenhofer, Sandra Hansen-Morath and Marek Konopka

Abstract

The variation of the strong genitive marker of the singular noun has been treated by diverse accounts. Still there is a consensus that it is to a large extent systematic but can be approached appropriately only if many heterogeneous factors are taken into account. Over thirty variables influencing this variation have been proposed. However, it is actually unclear how effective they can be, and above all, how they interact. In this paper, the potential influencing variables are evaluated statistically in a machine learning approach and modelled in decision trees in order to predict the genitive marking variants. Working with decision trees based exclusively on statistically significant data enables us to determine what combination of factors is decisive in the choice of a marking variant of a given noun. Consequently the variation factors can be assessed with respect to their explanatory power for corpus data and put in a hierarchized order.

Literatur

Appel, Elsbeth (1941): Vom Fehlen des genitiv-s. (=Arbeiten zur Entwicklungspsychologie 21) München: Beck. Search in Google Scholar

Baayen, R. Harald/Piepenbrock, Richard/Gulikers, Leon (1995): The CELEX Lexical Database (CD-ROM). Philadelphia: Linguistic Data Consortium. Search in Google Scholar

Bubenhofer, Noah/Konopka, Marek/Schneider, Roman et al. (2013): Präliminarien einer Korpusgrammatik. (=Korpuslinguistik und interdisziplinäre Perspektiven auf Sprache, CLIP 4.) Tübingen: Narr. Search in Google Scholar

Bubenhofer, Noah/Brinkmann, Caren/Hein, Katrin (2013): Maschinelles Lernen zur Vorhersage von Fugenelementen in nominalen Komposita. In: Bubenhofer, Noah/Konopka, Marek/Schneider, Roman et al.: Präliminarien einer Korpusgrammatik. (=Korpuslinguistik und interdisziplinäre Perspektiven auf Sprache, CLIP 4.) Tübingen: Narr, S. 183–227. Search in Google Scholar

Drazin, Sam/Montag, Matt: Decision Tree Analysis using Weka. Machine Learning-Project II, University of Miami. http://wwww.samdrazin.com/classes/een548/project2report.pdf, zuletzt eingesehen am 30.06.2014. Search in Google Scholar

Duden (2007): Band 9. Richtiges und gutes Deutsch. Wörterbuch der sprachlichen Zweifelsfälle. 6. vollst. überarb. Aufl. Mannheim/Leipzig/Wien/Zürich: Dudenverlag. Search in Google Scholar

Duden (2009): Band 4. Die Grammatik: Unentbehrlich für richtiges Deutsch. 8. überarb. Auflage. Mannheim: Dudenverlag. Search in Google Scholar

Fehringer, Carol (2011): Allomorphy in the German genitive. A paradigmatic account. In: Zeitschrift für Germanistische Linguistik 39/1, S. 90–112 Search in Google Scholar

Fuß, Eric/Konopka, Marek (2014): Variation der starken Genitivmarkierung. Manuskript. Mannheim: Institut für Deutsche Sprache. Search in Google Scholar

Heister, Julian/Würzner, Kay-Michael/Bubenzer, Johannes/Pohl, Edmund/Hanneforth, Thomas/Geyken, Alexander/Kliegl, Reinhold (2011): dlexDB – eine lexikalische Datenbank für die psychologische und linguistische Forschung. Psychologische Rundschau 62(1), 10–20. Search in Google Scholar

Institut für Deutsche Sprache (2011a): Deutsches Referenzkorpus/Archiv der Korpora geschriebener Gegenwartssprache 2011-I (Release vom 29.03.2011). Mannheim: Institut für Deutsche Sprache. http://www.ids-mannheim.de/DeReKo, zuletzt eingesehen am 30.06.2014. Search in Google Scholar

Kullback, Solomon/Leibler, Richard Arthur (1951): On information and sufficiency. Annals of Mathematical Statistics 22/1, S. 79–86. Search in Google Scholar

Pfeffer, J. Alan/Morrison, Scott E. (1979): The genitive singular with -s and/or -es in spoken and written German. IRAL 17, pp. 303–311. Search in Google Scholar

Pfeffer, J. Alan/Morrison, Scott E. (1984): The genetive singular with -s and/or -es in spoken and written German). In: Pfeffer, J. Alan (Ed.): Studies in Descriptive German Grammar. Heidelberg: Groos, pp. 9–18 (Abdruck v. Pfeffer/Morrison (1979). Search in Google Scholar

Quinlan, J. Ross (1993): C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann. Search in Google Scholar

Schmid, Helmut (1994): Probabilistic Part-of-Speech Tagging Using Decision Trees. Proceedings of International Conference on New Methods in Language Processing. Manchester, UK. Search in Google Scholar

Schneider, Roman (2014): GenitivDB ― a Corpus-Generated Database for German Genitive Classification. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). Reykjavik. Search in Google Scholar

Szczepaniak, Renata (2010): Während des Flug(e)s/des Ausflug(e)s? German Short and Long Genitive Endings between Norm and Variation. In: Lenz, Alexandra N./Plewnia, Albrecht (eds.): Grammar between Norm and Variation. Frankfurt am Main: Peter Lang, S. 103–126. Search in Google Scholar

Wall, Larry/Christiansen, Tom/Orwant, Jon/Schwartz, Randal (2001): Programmieren mit Perl. 2. Aufl. Köln: O’Reilly. Search in Google Scholar

Wiese, Bernd (2009): Variation in der Flexionsmorphologie: Starke und schwache Adjektivflexion nach Pronominaladjektiven. In: Konopka, Marek/Strecker, Bruno (Hgg.): Deutsche Grammatik – Regeln, Normen, Sprachgebrauch. (= Institut für Deutsche Sprache – Jahrbuch 2008). Berlin, New York: de Gruyter, S. 166–194. Search in Google Scholar

Witten, Ian H./Frank, Eibe (2005): Data Mining: Practical Machine Learning Tools and Techniques. 2. Aufl. San Francisco. Search in Google Scholar

Online erschienen: 2014-11-24
Erschienen im Druck: 2014-11-1

© 2012 Walter de Gruyter GmbH & Co. KG, Berlin/Boston