Methods
Four different methods have been tested for the establishment of the IIS:
Method 1
Estimate and comparison of the percentage of occurrences of each variable and variants, considered for each pair of analysed speakers/writers. Application value variant. % of variant realization /each variant and calculate difference between the 2 speakers. Sum up all the % differences and calculate the average. Divide this figure by 100. Substract 1 to the final figure.
Method 2
Calculation of the Adjusted Residual Value. Cross-tabulation running with SPSS. Calculation of the Adjusted Residual Values (ARV) for each variable. Assignment of a value to each A range:
- ARV <1 -> 0
- ARV >1&<2 -> 1
- ARV >2&<3 -> 2
- ARV > 5 -> 5
(max. ARV conversion)
The IIS is obtained by the calculation of the following formula:
Method 3
Calculation of the Phi coefficient. Calculation of χ2 and obtention of the Phi coefficient, which is located between 0 and 1, indicating the relation between variables. The square of the Phi coefficient (Phi2) represents the percentage of overall variance. The IIS is obtained through the calculation of the average of Phi:
Corpora
Corpus for the English modules
- Phonological module of English: 6 speakers (all from the Southern British English variety); 14 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
- Morpho-syntactic module of English: 6 speakers (all from the Southern British English variety); 7 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
- Discourse-pragmatic module of English: 6 speakers (all from the Southern British English variety); 9 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
Corpus for the Catalan modules
- Phonological module of Catalan: 6 speakers (4 from La Canonja variety, tarragoní, and 2 from barceloní); 18 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
- Morpho-syntactic module of Catalan: 6 speakers (all from La Canonja variety, tarragoní); 7 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
- Discourse-pragmatic module of Catalan: 6 speakers (all from La Canonja variety, tarragoní), 8 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
Corpora for the Spanish modules
- Phonological module of Spanish: 5 speakers from the same Mexican Spanish variety, 5 in MT1 and 5 in MT2; 13 variables and 3 statistical methods.
- Morpho-syntactic module of Spanish: 6 writers (all from the same dialect of Peninsular Spanish); 10 variables, 2 measurement times (MT1 and MT2) and 3 statistical methods.
- Discourse-pragmatic module of Spanish: 6 speakers (all from the same Peninsular Spanish variety); 10 variables, 1 measurement time only (MT1) and 3 statistical methods.
Modules
For the phonological modules, we examine oral samples by means of the auditory-acoustic approach which combines the use of specially developed techniques ―normally involving specialized computer software aimed at the acoustic analysis of speech― together with the phonetician’s knowledge on physics and the acoustic properties of the speech signal, especially those of the properties of the sounds of the language under analysis.
For the analysis of the morpho-syntactic and discourse-pragmatic modules, we analyse each written sample by identifying and coding the variables under analysis.
Preliminary results and conclusions
- Method 3, based on the Phi Coefficient has turned out to be the most reliable method and has triggered the most robust results for both intra and inter speaker/writer variation, and in particular in the phonological modules, with some exceptions.
- For all modules and languages, hypothesis 1 is confirmed in that there is more variation and thus more idiolectal distance between speakers and writers’ samples than between two samples of the same speaker or writer, which show a quite steady idiolectal similitude throughout time.
- For all modules and languages, hypothesis 2 is confirmed in that an individual’s idiolectal style (spoken or written) doesn’t seem to vary much throughout time.
- Inter-speaker/writer IIS values may seem to be too high, but it has to be borne in mind that, except for the phonological module of Catalan, all the speakers or writers considered are from the same language variety.