Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter Mouton December 14, 2020

Using automated methods to explore the social stratification of anglicisms in Spanish

Jacqueline Serigos

Abstract

Traditionally, automated methods for loanword detection have not received an abundance of attention within the field of language contact. However, as research on loanwords has begun utilizing corpora with word counts in the millions, these generous quantities of data pose challenges for traditional methods of linguistic annotation. This paper presents a method for automatically detecting anglicisms within Spanish text and presents a case study, applying this method to explore the social stratification of anglicisms in Argentine media. The findings of the case study suggest that anglicisms may function as prestige markers in Argentina, which may be a logical consequence of the mode of contact: those of upper socio-economic status have greater access to outlets where loanwords seem to emerge, such as the media, Internet, and second language education.


Corresponding author: Jacqueline Serigos, George Mason University, Fairfax, USA, E-mail:

References

Alex, Beatrice. 2006. Integrating language knowledge resources to extend the English inclusion classifier to a new language. In Proceedings of the fifth international conference on language resources and evaluation, 2431–2436. Italy: Genoa.Search in Google Scholar

Alex, Beatrice. 2007. Automatic detection of english inclusions in mixed-lingual data with an application to parsing. Edinburgh: University of Edinburgh dissertation.Search in Google Scholar

Alex, Beatrice. 2008. Comparing corpus-based to web-based lookup techniques for automatic English inclusion detection. In Proceedings of the sixth international language resources and evaluation conference, 2693–2697. Morocco: Marrakech.Search in Google Scholar

Andersen, Gisle. 2005. Assessing algorithms for automatic extraction of anglicisms in Norwegian texts. In Pernilla Danielsson & Martijn Wagenmakers (eds.), Proceedings from the corpus linguistics 2005 conference. UK: Birmingham.Search in Google Scholar

Andersen, Gisle. 2014. Pragmatic borrowing. Journal of Pragmatics 67. 17–33. https://doi.org/10.1016/j.pragma.2014.03.005.Search in Google Scholar

Baayen, R. Harald. 2010. Demythologizing the word frequency effect: A discriminative learning perspective. The Mental Lexicon 5(3). 436–461. https://doi.org/10.1075/ml.5.3.10baa.Search in Google Scholar

Backus, Ad. 2013. A usage-based approach to borrowability. In Zenner Eline & Kristiansen Gitte (eds.), New perspectives on lexical borrowing, 19–39. Boston & Berlin: De Gruyter Mouton.10.1515/9781614514305.19Search in Google Scholar

Balteiro, Isabel. 2011. A reassessment of traditional lexicographical tools in the light of new corpora: Sports anglicisms in Spanish. International Journal of English Studies 11(2). 23–52. https://doi.org/10.6018/ijes/2011/2/149631.Search in Google Scholar

Barrs, Keith. 2014. Lexical semantics of English loanwords in Japanese. In Proceedings of the 47th British association for applied linguistics annual meeting, 29–35. Coventry: University of Warwick.Search in Google Scholar

Biber, Douglas, Randi Reppen, Erin Schnur & Romy Ghanem. 2016. On the (non) utility of Juilland’s D to measure lexical dispersion in large corpora. International Journal of Corpus Linguistics 21(4). 439–464. https://doi.org/10.1075/ijcl.21.4.01bib.Search in Google Scholar

Boczkowski, Pablo J. & Martin de Santos. 2007. When more media equals less news: Patterns of content homogenization in Argentina’s leading print and online newspapers. Political Communication 24(2). 167–180. https://doi.org/10.1080/10584600701313025.Search in Google Scholar

Brysbaert, Marc & New Boris. 2009. Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a New and improved word frequency measure for American English. Behavior Research Methods 41(4). 977–990. https://doi.org/10.3758/brm.41.4.977.Search in Google Scholar

Calude, Andreea Simona, Steven Miller & Mark Pagel. 2020. Modelling loanword success – A sociolinguistic quantitative study of Māori loanwords in New Zealand English. Corpus Linguistics and Linguistic Theory 16(1). 29–66. https://doi.org/10.1515/cllt-2017-0010.Search in Google Scholar

Chan, Tak Wing & John H. Goldthorpe. 2007. Social status and newspaper readership. American Journal of Sociology 112(4). 1095–1134. https://doi.org/10.1086/508792.Search in Google Scholar

Chesley, Paula. 2010. Lexical borrowings in French: Anglicisms as a separate phenomenon. Journal of French Language Studies 20(3). 231–251. https://doi.org/10.1017/s0959269510000049.Search in Google Scholar

Crespo-Fernández, Eliecer & Carmen Luján-García. 2017. Anglicisms and word axiology in homosexual language. Revista Española de Lingüística Aplicada/Spanish Journal of Applied Linguistics 30(1). 74–103. https://doi.org/10.1075/resla.30.1.04cre.Search in Google Scholar

De Smedt, Tom & Daelemans Walter. 2012. Pattern for Python. Journal of Machine Learning Research 13(66). 2063–2067.Search in Google Scholar

Finkel, Jenny Rose, Trond Grenager & Christopher Manning. 2005. Incorporating non-local information into information extraction systems by Gibbs sampling. In Proceedings of the 43rd annual meeting of the association for computational linguistics, 363–370. Michigan: Ann Arbor.10.3115/1219840.1219885Search in Google Scholar

Fisherman, Haya. 1990. Attitudes toward foreign words in contemporary Hebrew. International Journal of the Sociology of Language 86(1). 5–40. https://doi.org/10.1515/ijsl.1990.86.5.Search in Google Scholar

Francom, Jerid, Mans Hulden & Ussishkin Adam. 2014. ACTIV-ES: A comparable, cross-dialect corpus of ‘everyday’ Spanish from Argentina, Mexico, and Spain. In The ninth international conference on language resources and evaluation, 1733–1737. Iceland: Reykjavik.Search in Google Scholar

Gries, Stefan Th. 2008. Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics 13(4). 403–437. https://doi.org/10.1075/ijcl.13.4.02gri.Search in Google Scholar

Guzman, Gualberto A., Jacqueline Serigos, Barbara E. Bullock & Jacqueline Almeida Toribio. 2016. Simple tools for exploring variation in code-switching for linguists. In Proceedings of the second workshop on computational approaches to code switching, 12–20. Austin, Texas: Association for Computational Linguistics.10.18653/v1/W16-5802Search in Google Scholar

Hassall, Tim, Elisabet Titik Murtisari, Christine Donnelly & Jeff Wood. 2008. Attitudes to western loanwords in Indonesian. International Journal of the Sociology of Language 2008(189). 55–84. https://doi.org/10.1515/ijsl.2008.003.Search in Google Scholar

Haugen, Einar. 1950. The analysis of linguistic borrowing. Language 26(2). 210–231. https://doi.org/10.2307/410058.Search in Google Scholar

Hughes, Baden, Timothy Baldwin, Steven Bird, Jeremy Nicholson & Andrew MacKinlay. 2006. Reconsidering language identification for written language resources. In Proceedings of 5th international conference on language resources and evaluation, 485–488. Italy: Genoa.Search in Google Scholar

Koo, Hahn. 2015. An unsupervised method for identifying loanwords in Korean. Language Resources and Evaluation 49(2). 1–19. https://doi.org/10.1007/s10579-015-9296-5.Search in Google Scholar

Leidig, Sebastian, Tim Schlippe & Tanja Schultz. 2014. Automatic detection of anglicisms for the pronunciation dictionary generation: A case study on our German it corpus. In 4th Workshop on spoken language technologies for under-resourced languages. Russia: Petersburg.Search in Google Scholar

Loveday, Leo. 1996. Language contact in Japan: A socio-linguistic history. Oxford studies in language contact. Oxford: Clarendon Press.Search in Google Scholar

Mansikkaniemi, André & Mikko Kurimo. 2012. Unsupervised vocabulary adaptation for morph-based language models. In Proceedings of the NAACL-HLT 2012 workshop: Will we ever really replace the N-Gram model? On the future of language modeling for HLT, 37–40. Montréal, Canada: Association for Computational Linguistics.Search in Google Scholar

Matus-Mendoza, Maríadelaluz. 2002. The English lexical loan: A class marker. Journal of Hispanic Higher Education 1(4). 329–337. https://doi.org/10.1177/153819202236977.Search in Google Scholar

Nair, Vineeth G. 2014. Getting started with beautiful soup. Birmingham, UK: Packt Publishing.Search in Google Scholar

Ngom, Fallou. 2003. The social status of Arabic, French, and English in the Senegalese speech community. Language Variation and Change 15(3). 351–368. https://doi.org/10.1017/s0954394503153045.Search in Google Scholar

Onysko, Alexander & Esme Winter-Froemel. 2011. Necessary loans – luxury loans? Exploring the pragmatic dimension of borrowing. Journal of Pragmatics 43(6). 1550–1567. https://doi.org/10.1016/j.pragma.2010.12.004.Search in Google Scholar

Rosner, Mike & Farrugia Paulseph-John. 2007. A tagging algorithm for mixed language identification in a noisy domain. In Proceedings of the eighth annual conference of the international speech communication association, 190–193. Belgium: Antwerp.10.21437/Interspeech.2007-77Search in Google Scholar

Ruellot, Viviane. 2011. English in French print advertising from 1999 to 2007. World Englishes 30(1). 5–20. https://doi.org/10.1111/j.1467-971x.2010.01654.x.Search in Google Scholar

Sanou, Rosa, Graciela Albiñana, Graciela Galli & Claudia Castañeda. 2017. Análisis Sociolingüístico: Actitudes y Empleo de Anglicismos En San Juan [Sociolinguistic analysis: Attitudes and use of anglicisms in San Juan]. Texturas 16. 19–40. https://doi.org/10.14409/texturas.v0i16.6937.Search in Google Scholar

Serigos, Jacqueline. 2016. Using distributional semantics in loanword research: A concept-based approach to quantifying semantic specificity of anglicisms in Spanish. International Journal of Bilingualism 21(5). 521–540. https://doi.org/10.1177/1367006916635836.Search in Google Scholar

Solorio, Thamar & Yang Liu. 2008. Learning to predict code-switching points. In Proceedings of the conference on empirical methods in natural language processing, 973–981. Hawaii: Honolulu.10.3115/1613715.1613841Search in Google Scholar

Varga, Dražen, Lidija Orešković Dvorski & Saša Bjelobaba. 2012. English loanwords in French and Italian daily newspapers. Studia Romanica et Anglica Zagrabiensia 56. 71–84. UDC 811.133.1’373.45:811.111.Search in Google Scholar

Varra, Rachel Marie. 2013. The social correlates of lexical borrowing in Spanish in New York City. New York: University of New York dissertation.Search in Google Scholar

Winford, Donald. 2003. An introduction to contact linguistics, 1st edn. Oxford: Wiley-Blackwell.Search in Google Scholar

Zenner, Eline, Dirk Speelman & Dirk Geeraerts. 2014. A sociolinguistic analysis of borrowing in weak contact situations: English loanwords and phrases in expressive utterances in a Dutch reality TV show. International Journal of Bilingualism 19(3). 333–346. https://doi.org/10.1177/1367006914521699.Search in Google Scholar

Zenner, Eline, Dirk Speelman & Dirk Geeraerts. 2012. Cognitive sociolinguistics meets loanword research: Measuring variation in the success of anglicisms in Dutch. Cognitive Linguistics 23(4). 749–792. https://doi.org/10.1515/cog-2012-0023.Search in Google Scholar

Zipf, George. 1935. The psychobiology of language: An introduction to dynamic philology. Cambridge, Massachusetts: M.I.T. Press.Search in Google Scholar

Published Online: 2020-12-14
Published in Print: 2022-05-25

© 2020 Walter de Gruyter GmbH, Berlin/Boston

Scroll Up Arrow