Moisl, Hermann

Cluster Analysis for Corpus Linguistics

Series: Quantitative Linguistics [QL] 66

DE GRUYTER MOUTON

    103,95 € / $119.99 / £93.99*

    eBook (PDF)
    Publication Date:
    February 2015
    Copyright year:
    2015
    ISBN
    978-3-11-036381-4

    Overview

    • Describes a range of clustering methods for analysis of data derived from language corpora.
    • Gives an intuitively accessible account of the mathematical concepts which underlie data creation, data transformation, and cluster analysis.

    Aims and Scope

    The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such, the process of hypothesis generation is central: it involves formulating a research question about a domain of interest and stating a hypothesis relative to it. In corpus linguistics the domain is text, and hypothesis generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered the paper-based approach obsolete, both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can make the data impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is one such method. It is used across the sciences for hypothesis generation by identifying structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.

    Supplementary Information

    Details

    xv, 381 pages
    DE GRUYTER MOUTON
    Language:
    English
    Type of Publication:
    Monograph
    Keyword(s):
    Corpus linguistics; cluster analysis; quantitative linguistics; hypothesis generation

    Hermann Moisl, Newcastle University, Newcastle upon Tyne, UK.
