Uiboaed, Kristel (Tartu)

Collostructional analysis of Estonian dialects

This study gives an overview of the first attempt to apply collostructional analysis (a blend of collocational and constructional analysis; Stefanowitsch, Gries 2003; 2005; Gries et al. 2005) to Estonian dialects. Collostructional analysis focuses on the relationships between words and the constructions they form (Stefanowitsch, Gries 2003) and adopts the terminology of Construction Grammar (Goldberg 1995). The present study examines constructions consisting of a non-finite verb form and a finite verb form. The aim is to determine which verbal constructions are more common in the different dialects, judged by the values of a particular association measure, the Mutual Sensitivity Coefficient (MS; Wiechmann 2008).
MS is used to calculate the collostructional strength between a finite verb and a non-finite verb form in the same clause. Clause boundaries were set automatically using a parser of Estonian that has been adapted for dialect data (Lindström, Müürisep 2009). In addition to MS, Correspondence Analysis is applied to identify which dialects are most similar in terms of the studied constructions.
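The calculation of MS over clause-level co-occurrences can be illustrated with a minimal sketch. It assumes that MS is computed as the smaller of two conditional probabilities: the pair frequency divided by the frequency of the finite verb, and the pair frequency divided by the frequency of the non-finite form. This definition is an assumption for illustration and is not spelled out in the abstract. The function name, the input format, and the toy pairs are hypothetical and are not CED data.

```python
from collections import Counter


def ms_scores(pairs):
    """Score each (finite verb, non-finite form) pair.

    `pairs` holds one tuple per co-occurrence within a clause.
    Assumed definition: MS = min(a / (a + b), a / (a + c)), where
    a = frequency of the pair, a + b = frequency of the finite verb,
    a + c = frequency of the non-finite form.
    """
    pairs = list(pairs)
    pair_freq = Counter(pairs)
    finite_freq = Counter(finite for finite, _ in pairs)
    nonfinite_freq = Counter(nonfinite for _, nonfinite in pairs)

    scores = {}
    for (finite, nonfinite), a in pair_freq.items():
        scores[(finite, nonfinite)] = min(
            a / finite_freq[finite],
            a / nonfinite_freq[nonfinite],
        )
    return scores


# Hypothetical toy co-occurrences, invented for illustration only.
toy_pairs = (
    [("hakkama", "ma-inf")] * 3
    + [("saama", "da-inf")] * 2
    + [("tahtma", "da-inf")] * 2
    + [("hakkama", "da-inf")] * 1
)

scores = ms_scores(toy_pairs)
for pair, value in sorted(scores.items(), key=lambda item: -item[1]):
    print(pair, round(value, 2))
```

Under this assumed definition a pair scores high only when each member predicts the other, so a frequent non-finite form that combines with many finite verbs does not dominate the ranking. Running the same function separately on the pairs from each dialect would give per-dialect rankings that can then be compared.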
The data come from the morphologically annotated Corpus of Estonian Dialects (CED), which contains data from all ten dialects of Estonian (over 550 000 tokens in total).
The presentation gives an overview of the methods used to extract finite + non-finite verb constructions from CED and presents the similarities and differences between dialect groups. Constructions with the highest MS values are studied in more depth to give a short overview of their semantic and morphosyntactic properties.


