Abstract
Although Geographic Information Systems (GIS) have a long history in archaeology, spatial technologies have rarely been used to analyse the content of textual collections. A newly developed approach termed Geographic Text Analysis (GTA) now allows the semi-automated exploration of large corpora by combining Natural Language Processing techniques, Corpus Linguistics, and GIS. In this article we explain the development of GTA, propose possible uses of this methodology in archaeology, and summarise the challenges that emerge from this type of analysis.
© 2015 Patricia Murrieta-Flores, Ian Gregory
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.