Jump to ContentJump to Main Navigation
Show Summary Details
More options …

Open Linguistics

Editor-in-Chief: Ehrhart, Sabine

1 Issue per year


Covered by:
Clarivate Analytics - Emerging Sources Citation Index
ERIH PLUS

Open Access
Online
ISSN
2300-9969
See all formats and pricing
More options …

LatMor: A Latin Finite-State Morphology Encoding Vowel Quantity

Uwe Springmann
  • Centrum für Informations- und Sprachverarbeitung, Ludwig-Maximilians-Universität München
  • Institut für deutsche Sprache und Linguistik, Humboldt-Universität zu Berlin
  • Other articles by this author:
  • De Gruyter OnlineGoogle Scholar
/ Helmut Schmid / Dietmar Najock
Published Online: 2016-10-14 | DOI: https://doi.org/10.1515/opli-2016-0019

Abstract

We present the first large-coverage finite-state open-source morphology for Latin (called LatMor) which parses as well as generates vowel quantity information. LatMor is based on the Berlin Latin Lexicon comprising about 70,000 lemmata of classical Latin compiled by the group of Dietmar Najock in theirwork on concordances of Latin authors (see Rapsch and Najock, 1991) which was recently updated by us. Compared to the well-known Morpheus system of Crane (1991, 1998), which is written in the C programming language, based on 50,000 lemmata of Lewis and Short (1907), not well documented and therefore not easily extended, our new morphology has a larger vocabulary, is about 60 to 1200 times faster and is built in the form of finite-state transducers which can analyze as well as generate wordforms and represent the state-of-the-art implementation method in computational morphology. The current coverage of LatMor is evaluated against Morpheus and other existing systems (some of which are not openly accessible), and is shown to rank first among all systems together with the Pisa LEMLAT morphology (not yet openly accessible). Recall has been analyzed taking the Latin Dependency Treebank¹ as gold data and the remaining defect classes have been identified. LatMor is available under an open source licence to allow its wide usage by all interested parties.

Keywords: morphology; finite state methods; Latin; historical linguistics

References

  • William Sidney Allen. Vox Latina (2nd edn., corrected reprint). Cambridge University Press, 1989.Google Scholar

  • Kenneth R. Beesley. Arabic finite-state morphological analysis and generation. In Proceedings of the 16th Conference on Computational Linguistics (COLING), volume 1, pages 89–94, 1996.Google Scholar

  • Çağrı Çöltekin. A freely available morphological analyzer for Turkish. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pages 820–827, 2010. URL http://www.lrec-conf.org/proceedings/lrec2010/ summaries/109.html.Google Scholar

  • Gregory Crane. Generating and parsing classical Greek. Literary and Linguistic Computing, 6(4):243–245, 1991.Google Scholar

  • Gregory Crane. New technologies for reading: The lexicon and the digital library. The Classical World, pages 471–501, 1998.Google Scholar

  • Egidio Forcellini, Giuseppe Furlanetto, Francesco Corradini, and Joseph Perin. Lexicon totius Latinitatis, t. V-VI: Onomasticon. Typis Seminarii, Patavii, 1940.Google Scholar

  • Karl Ernst Georges. Ausführliches lateinisch-deutsches Handwörterbuch. Hahnsche Buchhandlung, Hannover und Leipzig, 1913.Google Scholar

  • Dag Trygve Truslew Haug and Marius Jøhndal. Creating a parallel treebank of the old Indo-European Bible translations. In Proceedings of the Language Technology for Cultural Heritage DataWorkshop (LaTeCH 2008),Marrakech,Morocco, 1st June 2008, pages 27–34, 2008.Google Scholar

  • Manu Leumann, Johann Baptist Hofmann, and Anton Szantyr. Lateinische Grammatik: Lateinische Laut- und Formenlehre. CH Beck, 1977.Google Scholar

  • Charlton T Lewis and Charles Short. A New Latin Dictionary. Founded on the Translation of Freund’s Latin German Lexicon Edited By E. A. Andrews, LL D. Clarendon Press, 1907.Google Scholar

  • Hermann Menge, Otto Güthling, and Erich Pertsch. Langenscheidts großes Schulwörterbuch lateinisch-deutsch. Langenscheidt, 1983.Google Scholar

  • Thomas Müller, Helmut Schmid, and Hinrich Schütze. Eflcient higher-order CRFs for morphological tagging. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 322–332, 2013.Google Scholar

  • Marco Carlo Passarotti. Development and perspectives of the Latin morphological analyser LEMLAT. Linguistica computazionale, 20(A):397–414, 2004.Google Scholar

  • Johann Ramminger. Neo-Latin: Character and Development. In Philip Ford, Jan Bloemendal, and Charles Fantazzi, editors, Brill’s Encyclopaedia of the Neo-Latin World, pages 21–36. Brill, 2014.Google Scholar

  • Jürgen Rapsch and Dietmar Najock. Concordantia in Corpus Sallustianum, 2 vols. Olms, Hildesheim, Zürich, New York, 1991.Google Scholar

  • Peter Rosumek and Dietmar Najock. Concordantia in C. Plinii Secundi NaturalemHistoriam. Alpha - Omega / A. Olms-Weidmann, Hildesheim u.a, 1996. ISBN 9783487100166. URL http://isbnplus.org/9783487100166.Google Scholar

  • Hans Rubenbauer, Johann Baptist Hofmann, and Rolf Heine. Lateinische Grammatik, 12. Auflage. Buchner, Lindauer, Oldenbourg, Bamberg-München, 1995.Google Scholar

  • Helmut Schmid. A programming language for finite state transducers. In Anssi Yli-Jyrä, editor, Finite-State Methods and Natural Language Processing: 5th International Workshop (FSMNLP 2005), volume 4002 of Lecture Notes in Artificial Intelligence, pages 308–309. Springer, Heidelberg, Germany, 2006.Google Scholar

  • Helmut Schmid, Arne Fitschen, and Ulrich Heid. SMOR: A German computational morphology covering derivation, composition and inflection. In Proceedings of the 4th International Conference on Language Resources and Evaluation, volume 4, pages 1263–1266, Lisbon, Portugal, 2004. Google Scholar

About the article

Received: 2016-02-29

Accepted: 2016-05-18

Published Online: 2016-10-14


Citation Information: Open Linguistics, ISSN (Online) 2300-9969, DOI: https://doi.org/10.1515/opli-2016-0019.

Export Citation

© 2016 U. Springmann et al.. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License. BY-NC-ND 3.0

Comments (0)

Please log in or register to comment.
Log in