Item Details

Principal Components and clustering on lexical distances from standard Basque to local varieties

Issue: Vol 3 No. 1 (2016) Mixed Methods

Journal: Journal of Research Design and Statistics in Linguistics and Communication Science

Subject Areas: Linguistics

DOI: 10.1558/jrds.30201

Abstract:

Dialectology commonly uses linguistic distances to delimit dialects, and modern dialectology has applied a range of statistical techniques to draw dialect boundaries. So far, however, little work has been published on the determinants of language variation. In this contribution, we propose a two-step method, combining Principal Components and Cluster Analysis, to obtain a dialectal classification of localities from the distances between standard Basque and a number of local varieties of the Basque language. In addition, we determine which lexical features account for the greatest variation in the different dialect areas and which localities are the most representative of each dialect area.
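The two-step idea described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data matrix is invented, the locality names are placeholders, the helper names `pca_scores` and `kmeans` are hypothetical, and k-means stands in for whichever clustering algorithm the article actually uses. Rows are localities, columns are lexical features, and each cell is an assumed distance from the standard form (0 = identical, 1 = maximally different).

```python
import numpy as np

# Invented data: 6 placeholder localities x 5 lexical features.
localities = ["loc_1", "loc_2", "loc_3", "loc_4", "loc_5", "loc_6"]
X = np.array([
    [0.1, 0.2, 0.9, 0.8, 0.5],
    [0.2, 0.1, 0.8, 0.9, 0.5],
    [0.1, 0.1, 0.9, 0.9, 0.4],
    [0.9, 0.8, 0.1, 0.2, 0.5],
    [0.8, 0.9, 0.2, 0.1, 0.5],
    [0.9, 0.9, 0.1, 0.1, 0.6],
])

def pca_scores(data, n_components):
    """Step 1: PCA via SVD of the column-centred matrix.

    Returns locality scores and feature loadings for the leading components.
    """
    centred = data - data.mean(axis=0)
    u, s, vt = np.linalg.svd(centred, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    loadings = vt[:n_components]
    return scores, loadings

def kmeans(points, k, n_iter=50):
    """Step 2: plain k-means with a deterministic farthest-point start."""
    centres = [points[0]]
    for _ in range(1, k):
        # Distance of every point to its closest already-chosen centre.
        d = np.min([np.linalg.norm(points - c, axis=1) for c in centres], axis=0)
        centres.append(points[np.argmax(d)])
    centres = np.array(centres, dtype=float)
    for _ in range(n_iter):
        dist = np.linalg.norm(points[:, None, :] - centres[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        updated = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centres[j]
            for j in range(k)
        ])
        if np.allclose(updated, centres):
            break
        centres = updated
    return labels, centres

scores, loadings = pca_scores(X, n_components=2)
labels, centres = kmeans(scores, k=2)

# Feature with the largest absolute loading on the first component.
top_feature = int(np.argmax(np.abs(loadings[0])))
print("feature contributing most to component 1:", top_feature)

# Locality closest to each cluster centre, taken here as "most representative".
for j in range(2):
    members = np.where(labels == j)[0]
    gaps = np.linalg.norm(scores[members] - centres[j], axis=1)
    print("cluster", j, "representative:", localities[members[np.argmin(gaps)]])
```

Clustering on the component scores rather than on the raw feature columns is the point of the two-step design: the loadings indicate which features drive the variation, while the cluster centres give a natural reference for picking representative localities.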

Authors: Gotzon Aurrekoetxea, Karmele Fernandez-Aguirre, Juan Ignacio Modroño, Jesus Rubio

