Who’s afraid of homophones? A multimethodological approach to homophony avoidance

Isabeau De Smet; Laura Rosseel

doi:10.1017/langcog.2023.50

Published online by Cambridge University Press: 11 December 2023

Isabeau De Smet: KU Leuven and FWO (Research Foundation Flanders), Leuven, Belgium
Laura Rosseel*: Vrije Universiteit Brussel, Brussel, Belgium
*Corresponding author: Laura Rosseel; Email: laura.rosseel@vub.be

Abstract

Homophony avoidance has often been claimed to be a mechanism of language change. We investigate this mechanism in Dutch by applying two strands of research – corpus studies and experimental data – to find support for claims based on earlier historical observations. Throughout the history of Dutch, homophony avoidance has been named as the cause of language change or inhibition of change on several occasions. We build on these historical observations with an experimental study and a corpus study on a synchronic Dutch alternation, where avoidance of homophony between present and past tense can appear. Plurals of verbs with a stem ending in a dental show homophony with the present when they are used in the preterite (compare zetten ‘put’ pst-pl with zetten ‘put’ prs-pl). This homophony can be avoided by using the perfectum (hebben gezet ‘have put’). A wug-style experiment shows that verbs with dental stem are indeed used significantly more in the perfectum in the plural than in the singular, while verbs without dental stem do not show this difference. A corpus study on Dutch further corroborates these results. Combined, these studies make a strong case for homophony avoidance as a plausible mechanism of language change.

Keywords

homophony avoidance; experimental linguistics; corpus linguistics; Dutch; past tense; language variation and change

Type: Article
Information: Language and Cognition, First View, pp. 1–24

DOI: https://doi.org/10.1017/langcog.2023.50
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2023. Published by Cambridge University Press

1. Introduction

Are language users reluctant to use homophonous forms, that is, linguistic items that sound the same but have a different meaning, and can such aversion to homophony facilitate or inhibit language variation and change? Evidence suggesting that there is indeed a case to be made for homophony avoidance as a mechanism of language change stems from different sources. The main body of research consists of diachronic observations of individual cases of change that have been claimed to have or have not taken place due to homophony avoidance (Baerman, 2011; Blevins & Wedel, 2009; Campbell, 1975, p. 390, ibid. 1996, p. 77, ibid. 1998, pp. 288–290; Gilliéron & Roques, 1912; Lloyd, 1987; Martinet, 1955; Samuels, 1987). Probably the most famous example of homophony avoidance is the case of gat in Gascon dialects (Gilliéron & Roques, 1912), where as the result of sound change the form gat could either mean ‘rooster’ (< Latin gallus) or ‘cat’ (< Latin cattus). To resolve this ambiguous situation, gat ‘rooster’ was replaced by words such as faisan ‘pheasant’ or vicaire ‘vicar.’ It is important to add that in most documented cases of homophony avoidance, there is some kind of semantic overlap between the homophones, so that actual ambiguity or confusion arises. Rooster and cat clearly belong to the same semantic field, so the chances of creating ambiguous situations are quite high. Dautriche et al. (2015) show, for example, that French toddlers have difficulties learning phonological neighbors when they belong to the same word class, but not when they belong to different word classes.

Yet, historical evidence for homophony avoidance has often been inconclusive, and critics have argued that if it were to play a role at all in these observed changes, it must have been minor (King, 1967; Lass, 1987, pp. 355–362, 1997a, 1997b; Sampson, 2013). Most notable is the critique of Lass (1987, 1997a, 1997b, pp. 262–355) who reviews and rejects three possible scenarios of how homophony avoidance could take place: (i) language change is blocked because speakers foresee the homophony it would cause (‘prophylaxis’), (ii) language change is reversed after it has taken place because homophony was created (‘therapy’), and (iii) a non-homophonous variant is chosen over a homophonous variant (‘selective variation control’). According to Lass, all three scenarios are implausible, particularly because of the intentionality of the speakers they presume.

In response to Lass’ criticism, it has been argued that homophony avoidance need not be a teleological mechanism (Blevins & Wedel, 2009; De Vogelaer & Coussé, 2011). Labov (1994, pp. 569–599) states that because homophonous variants are more often misunderstood, the frequency of these variants is lower in the language user’s input, which results in a lower frequency of use of these variants. Blevins and Wedel (2009) offer a similar account. Normally, in a situation of (phonological) variation, with all else being equal, there is a balance in the use of the variants. However, when one variant shifts in such an extreme manner to the boundary of its category that it becomes indistinguishable from its adjacent category (i.e., homophony), this variant will, in many cases, no longer be recognized as belonging to its own category and will be stored in the adjacent category. This makes the original balance between the variants shift in favor of the nonambiguous variant. Ambiguity of context plays a role in this as well. Wedel and Fatkullin (2017) show, using a computational model, that when context can disambiguate between the meaning of the homophones, there is no competition between categories and homophony avoidance is less likely to occur.

Empirical evidence in the homophony avoidance debate mainly stems from research in phonology. In a large-scale corpus study, Wedel et al. (2013) showed, for example, that whether or not a merger between phoneme pairs takes place depends partly on the number of lexical contrasts that are expressed by these phoneme pairs. Silverman’s (2010) study found that less homophony was created by Korean neutralizing rules than would be expected purely by chance, which was subsequently backed up by evidence from simulations in Kaplan (2011). Other computational evidence can be found in Flego (2022), Wedel (2012), and Winter and Wedel (2016). These computational models mostly rest on the assumption described above that homophonous variants are embedded less strongly in the language user’s mind because they are often miscategorized. Finally, experimental data can also be found in phonological research. Kaplan and Muratani (2015) show, in an experiment combining existing and artificial verbs, that nasal contraction in Japanese more often fails to be applied to new verbs when this would result in homophony with existing verbs. The results of Seyfarth et al. (2016) indicate that participants enhance disambiguating cues in pronunciation when context is ambiguous. Yin and White (2018) demonstrate, in an artificial language learning experiment, that neutralizing rules were harder to learn for participants when they resulted in higher levels of homophony in the (artificial) language.

Steering away from the phonological domain, we also find a few studies that show an effect of homophony avoidance in morphology. De Vogelaer and Coussé (2011) show, in a corpus study, how homophony avoidance played a crucial role in the evolution of Dutch and English plural pronouns (you guys versus original you 2sg/pl in English and jij lieden ‘you guys’ or jullie ‘you’ 2pl versus original jij or gij ‘you’ 2sg/pl in Dutch). Also in a corpus study, Holtz (2021) shows that when TD deletion (deletion of t/d after a consonant at the end of a word) in words in US English would result in higher levels of homophony (even for words that are not related), this deletion was less likely to apply. Given that homophony avoidance is even more likely to occur in related forms (e.g., present versus past tense), she argues that homophony avoidance is likely to play a role in the smaller degree of TD deletion in regular past tense forms.

Finally, on a broader level, syntactic research has also shown an effect of ambiguity avoidance, especially in phenomena such as differential object marking (inter alia, Levshina, 2020; Tal et al., 2022) and argument structure (inter alia, Zehentner, 2022). For example, Zehentner (2022) has shown that the rise of the prepositional phrase construction in the famous dative alternation (We gave them cake vs. We gave cake to them) was impacted by potentially ambiguous arguments. When agent versus recipient could not be told apart based on their morphological form, prepositional marking aided in the disambiguation.

In this paper, we add to this growing body of research by combining corpus research and experimental research. We apply this multimethodological approach to a case study on the avoidance of homophony between present and past tense in Dutch. In what follows, we first discuss several (often older) historical observations where homophony avoidance between present and past tense has been claimed to have taken place throughout the centuries. Next, we present an experimental study using semi-artificial language (n = 222) in which we test the cognitive plausibility of homophony avoidance as a mechanism driving change in verbal morphology in Dutch. As it is, of course, impossible to test historical cases on present-day participants, we resort to a case of possible homophony avoidance in present-day Dutch similar to the reported historical cases. In Dutch, the past tense is created by adding a dental suffix to the stem. When a Dutch verb stem already ends in a dental, homophony with the present is created in the past plural, e.g. zetten ‘put’ pst-3pl versus zetten ‘put’ prs-3pl. A strategy to avoid this homophony could be the use of the perfectum instead, which is semantically, in many cases, interchangeable, e.g. hebben gezet ‘have put.’ We will also take a possible effect of ambiguity of context into account. We expect the perfectum will more likely be used (and homophony will thus be avoided) in cases where context does not offer any clues with regard to tense. Where contextual clues are given, we expect homophony does not need to be avoided. Finally, we back up our experimental evidence with a synchronic corpus study of this variation.

The observed historical cases of homophony avoidance in Dutch present and past tense are discussed in Section 2. Section 3 then covers the experimental component of the study and Section 4 the corpus part. Finally, in Section 5, all pieces of evidence are brought together.

2. Homophony between present and past tense in Dutch

One specific type of homophony that could cause significant ambiguity and has therefore been claimed to be avoided in several instances is homophony between present and past tense. It is easy to fathom how this type of interparadigmatic homophony could create ambiguous situations: the meaning of the homophones only differs with respect to tense, and both homophones would appear in almost exactly the same grammatical context. Potential examples of the influence of this type of homophony avoidance are plenty in Dutch.¹ For a better understanding of these examples, we first provide some background on the Dutch past tense system. In contemporary Dutch, either a perfectum (formed by an auxiliary zijn ‘be’ or hebben ‘have’ and a past participle) or a preterite can be used to express past tense. Whereas originally the perfectum could only be used to express a resultative aspect, perfectum and preterite have become largely interchangeable in many cases in present-day Dutch. The reference grammar for Dutch (Haeseryn et al., 1997, 2.4.8.7.i) notes that perfectums denote facts whereas preterites denote descriptions, but indicates at the same time that it is hard to distinguish between both categories. Furthermore, it is said that “the differences are sometimes rather subtle and the acceptability of certain sentences is not for all language users the same” (Haeseryn et al., 1997, 2.4.8.4.i, our translation).

As in most Germanic languages, verbs in Dutch can take either the strong or the weak inflection. The strong inflection is characterized by a vowel change (ablaut) in the preterite and past participle and a nasal suffix in the past participle (e.g., rijden-reed-gereden ‘drive-drove-driven’). The different vowel changes can be categorized in seven historical ablaut classes. In the weak inflection, a dental suffix -de or -te is added to the stem (e.g., spelen-speelde-gespeeld ‘play-played-played’ and hopen-hoopte-gehoopt ‘hope-hoped-hoped’). The distribution of the voiced and voiceless dental suffix depends on the final consonant of the stem: the voiceless variant is added when the stem ends in a voiceless obstruent; in all other cases, the voiced variant is added. Verbs generally take either the strong or the weak inflection (with a few exceptions, e.g., waaien-waaide/woei ‘blow-blowed/blew’), but changes from one inflection to the other occur, as well as changes from one ablaut class to another.

Returning to the reported historical cases of alleged homophony avoidance, we find a first example in Early Modern Dutch. When apocope of schwa took place in nearly all words, weak preterites (e.g., hoopte pst-3sg ‘hoped’) were not affected by this sound change as it would render them indistinguishable from the present (hoopt prs-3sg ‘hopes’). It did take place, however, in strong preterites (nam < name pst-3sg ‘took’), which remained distinguishable from their present counterparts (neem prs-3sg ‘takes’) without schwa because of the ablaut (Van Loon, 2014, p. 261). Another example can be found in the ablaut vowel change in the Dutch strong verbs sterven ‘die’, helpen ‘help’, werpen ‘throw,’ and werven ‘acquire.’ Originally, these showed a preterite with [ɑ], that is, starf ‘died’ and halp ‘helped’ pst-1/3sg. As a result of a sound change, however, [ɑ] became [ε] before a liquid followed by a labial or velar consonant, that is, sterf ‘died’ and help ‘helped’ pst-1/3sg. This change rendered these past tense forms indistinguishable from their present counterparts (sterf ‘die’, help ‘help’ prs-1sg) in the first person singular. The literature suggests that these verbs adopted a new ablaut vowel, [i], i.e., stierf ‘died’, hielp ‘helped’ pst-1/3sg, to avoid this homophony (Van Bree, 1987, p. 212).

In an earlier stage of Dutch, we find yet another example in the weak preterite morphology. In Middle Dutch, the weak preterite could be formed either using a -te/-de suffix or an -ede suffix (e.g., claghede ‘complained’ pst-1/3sg). The distribution of these forms was originally based on the syncope law of Sievers: after a heavy syllable (with a long vowel or consonant cluster in the coda) the monosyllabic -te/-de followed, and after a light syllable (short vowel with single consonant), the disyllabic -ede suffix followed. Yet, this conditioning had already disappeared largely in Middle Dutch (Taeldeman, 2011). When syncope of schwa took place (not to be confused with the apocope of schwa discussed earlier), the distinction between past and present tense disappeared for verbs ending in a dental. Compare, for example, wacht(e)de ‘waited’ pst-1/3sg with wachte ‘wait’ (before schwa apocope took place) prs-1/3sg or wacht(e)den ‘waited’ pst-1/3pl with wachten ‘wait’ prs-1/3pl. To avoid this homophony, East-Flemish dialects repaired this syncope, either back to wachtede ‘waited’ pst-1/3sg or to a new form wachtege ‘waited’ pst-1/3sg (Goossens & Verheyden, 1970, p. 138). In the further evolution of this preterite suffix, homophony avoidance comes up again. The preterite suffix -ege became reanalyzed as -tege, and its use was expanded to non-dental stems (e.g., maaktege ‘made’ pst-1/3sg). In contemporary East-Flemish (and southeastern West-Flemish) dialects, both this suffix and the standard Dutch -te/-de suffix can be used. Its distribution seems to be partially conditioned by the phonological context in which the verb appears, especially in the singular. Before vowels, -dege/-tege is more frequent than -te/-de (Vandekerckhove, 2003). In the Flemish dialects, apocope of schwa takes place in auslaut before a vowel. Therefore, the standard Dutch -de/-te causes homophony with the prs-3sg (Vandekerckhove, 2003). Compare, for example, hij pakt(e) ons mee ‘he took us along’ with hij pakt ons mee ‘he takes us along’ (Taeldeman, 2011). When -tege/-dege is used, this homophony can be avoided: hij pakteg(e) ons mee ‘he took us along.’

De Smet (2021) hypothesizes that homophony avoidance between present and preterite can also play a role in the change of inflection, specifically the weakening of strong preterites, which is frequently observed in Dutch. In her corpus study of historical Dutch, she notes that strong preterites that are homophonous to present stems of different verbs (e.g., rook ‘smelled’ and rook ‘smoke’) are more likely to become weak over time (e.g., ruikte ‘smelled’), thus resolving the homophony, than verbs that are not homophonous. Strong verbs that are homophonous with weak verbs in their present stems (compare scheppen ‘create’ with scheppen ‘shovel’), but do not show this ambiguity in their preterites because one of the verbs shows the strong inflection (compare schiep ‘created’ with schepte ‘shovelled’), tend to preserve their strong inflection better, perhaps in order to avoid more homophony. Furthermore, De Smet reports a case of homophony avoidance among plural verbs ending in a dental stem. When those become weak, the preterite plural becomes homophonous to the present plural: compare, for example, vindden ‘found’ pst-pl (instead of originally strong vonden ‘found’) to vinden ‘find’ prs-pl. Indeed, the data show that verbs ending in a dental tend to be better protected from weakening than verbs that end with a different consonant (De Smet, 2021, pp. 135–136). Finally, individual cases of homophony avoidance can also be recognized. Heten ‘to be called’ originally showed a strong preterite hiet ‘was called’ and even though it shows a very high frequency (which usually means the verb is well protected against weakening, see inter alia, De Smet & Van de Velde, 2019), it became weak already in Middle Dutch (heette ‘was called’). What might have played a role is that due to sound changes, the present stem of heten often occurred as hiet (‘is called’) as well. Thus, with the weakening of heten, the verb moved away from this homophony.

While homophony avoidance works as a potential explanation for the observed examples of historical change reported earlier, it is, of course, impossible to back up these claims with experimental data as speakers of previous stages of Dutch are no longer around. That is why, in Sections 3 and 4, we turn to a potential case of language variation driven by homophony avoidance in present-day Dutch. A case study of this type allows us to collect experimental data and directly compare them to contemporary corpus data. This offers the opportunity not only to assess the plausibility of homophony avoidance as a mechanism of language variation and change in the Dutch past tense system but also to contribute to the growing body of evidence documenting the plausibility of homophony avoidance as a mechanism in language change (De Vogelaer & Coussé, 2011; Holtz, 2021; Kaplan & Muratani, 2015; Silverman, 2010; Wedel et al., 2013; Yin & White, 2018).

3. Experiment

In this experiment, we study the variation between the use of the preterite and the perfectum to express past tense. Homophony with the present arises when a verb stem ending in a double dental is used in the preterite plural: compare schudden ‘shook’ pst-pl with schudden ‘shake’ prs-pl. When instead a perfectum is used, homophony is avoided: compare hebben geschud ‘have shaken/shook’ pst-pl with schudden ‘shake’ prs-pl. The same goes for verbs ending in a single dental: compare praatten ‘talked’ pst-pl with praten ‘talk’ prs-pl, though, in this case, there is no homonymy at play, that is, the forms sound identical, but they are spelled differently. Neither homophony nor homonymy is created when the verb stem does not end in a dental: compare werkten ‘worked’ pst-pl with werken ‘work’ prs-pl. There is also neither homophony nor homonymy in the singular in any type of verb: compare schudde ‘shook’ pst-sg with schud ‘shake’ prs-sg, praatte ‘talked’ pst-sg with praat ‘talk’ prs-sg, and werkte ‘worked’ pst-sg with werkt ‘work’ prs-sg. If it is indeed the case that language users avoid homophony, we expect plural verbs ending in a dental to be used more frequently in the perfectum to express past tense than verbs not ending in a dental or singular verbs. As this experiment is based on written language (see Section 3.1), we expect orthography to play a role as well: we hypothesize homonymy, i.e., forms that both sound identical and are spelled identically, to be avoided even more than homophony, i.e., forms that sound identical but have different spellings. This means we expect an even higher preference for the perfectum for verb stems ending in a double dental than for verb stems ending in a single dental. Furthermore, we expect the homonymy/homophony avoidance effect to increase when the context the verb occurs in is more ambiguous with regard to tense, i.e., when it is not explicitly mentioned whether an utterance is set in the past or the present. If there are contextual elements that signal the past/present meaning of the utterance (e.g., tense adverbials), the homonymy/homophony is likely less problematic from a communicative perspective.
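The three-way distinction drawn above (homonymy, homophony only, neither) can be illustrated with a minimal sketch. It is not the authors' procedure: the function names are hypothetical, the suffix choice is approximated on the written stem (final t, k, f, s, ch, p, or x taken as voiceless), and "homophony" is assigned to any dental-final stem following the description in the text rather than derived from a phonological transcription.

```python
# Illustrative sketch only; names and the orthographic shortcut are assumptions.
VOICELESS_FINALS = ("t", "k", "f", "s", "ch", "p", "x")


def weak_preterite_plural(stem: str) -> str:
    """Build the weak preterite plural: stem + -te/-de + -n."""
    suffix = "te" if stem.endswith(VOICELESS_FINALS) else "de"
    return stem + suffix + "n"


def classify(stem: str, present_plural: str) -> tuple[str, str]:
    """Return the preterite plural and its relation to the present plural."""
    past_plural = weak_preterite_plural(stem)
    if past_plural == present_plural:
        # Same spelling (and same sound): double dental stems such as schud-.
        return past_plural, "homonymy"
    if stem[-1] in "td":
        # Different spelling, same sound: single dental stems such as praat-.
        return past_plural, "homophony"
    return past_plural, "none"


for stem, present in [("schud", "schudden"), ("praat", "praten"), ("werk", "werken")]:
    print(stem, classify(stem, present))
```

Run on the three example verbs from the text, the sketch labels schudden as homonymous, praatten as homophonous only, and werkten as neither.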

As mentioned in Section 2, perfectums and preterites are largely interchangeable in Dutch, though in some sentences, one variant may be preferred over the other. Given that preterites are preferred for descriptions, we may expect them to show up more frequently in subclauses, rather than main clauses. For similar reasons, we expect them to show up more often in literary genres (see also De Smet, 2021, p. 142). There is also a regional difference: preterites are slightly more popular in Northern-Dutch than in Southern-Dutch (see De Smet, 2021, p. 143; Grondelaers et al., 2020, p. 88). However, there is no reason to expect the distribution of preterite versus perfectum forms to depend on number or on the final consonant of the stem, unless homophony avoidance plays a role.

To test whether language users indeed avoid homophonous forms in their expression of past tense in Dutch, we designed a forced choice task in which participants were asked to complete a sentence with either the preterite or the perfectum of a nonsense verb. In what follows, we first describe the experimental design and instrumentation in Section 3.1. In Section 3.2, the materials are discussed, followed by the procedure in Section 3.3 and participant sample in Section 3.4. In Section 3.5, the analysis and results are presented, and Section 3.6 brings an intermediary discussion.

3.1. Design and instrumentation

We used a 2 (number: singular [SG] vs. plural [PL]) × 3 (verb stem: single dental [SD] vs. double dental [DD] vs. no dental [ND]) × 2 (context: presence of time adverbial vs. absence of time adverbial) factorial design. Number was manipulated between subjects. The motivation for this choice is that we did not want participants to see the same verb twice because we wanted to avoid any possible priming effects. Verb stem was manipulated within subjects, which meant that every participant was presented with an equal number of verbs ending in a single dental (SD), verbs ending in a double dental (DD), and verb stems without final dental (ND). Additionally, the presence of contextual markers setting the reported action in the past was manipulated within subjects as well: for all participants, half of the target fill-in-the-blank sentences appeared with the time adverb gisteren ‘yesterday’ (WG), half without (NG). To avoid confounding between verb and the presence of a time adverb, two versions of each condition were created, which we label A and B: the verbs that appear in a sentence with gisteren in version A appear in a sentence without the adverb in version B, and vice versa. The initial design contained 24 different target verbs, but because piloting showed this design was too long for participants to stay focused, we split the A and B versions in two, labeling them A1, A2, B1, and B2. Each version consisted of the same number of verbs of each type and of the same number of sentences with and without the time adverb gisteren. In total, this makes 8 versions of the experiment: SG-A1, SG-A2, SG-B1, SG-B2, PL-A1, PL-A2, PL-B1, and PL-B2. Table 1 gives an overview of this design. Participants were randomly assigned to one of the eight versions of the study.
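As a rough illustration of how the eight versions and the A/B counterbalancing fit together, the following sketch enumerates the versions and flips the adverb condition between lists. The helper names and the placeholder verb labels are hypothetical, and the uniform random draw is an assumption about how random assignment might be implemented, not a description of the survey software's behaviour.

```python
# Illustrative sketch only; helper names and verb labels are hypothetical.
import random
from itertools import product

NUMBERS = ("SG", "PL")            # between-subject factor
LISTS = ("A1", "A2", "B1", "B2")  # counterbalancing lists

VERSIONS = [f"{number}-{lst}" for number, lst in product(NUMBERS, LISTS)]


def counterbalance(list_a: dict[str, str]) -> dict[str, str]:
    """Derive the B list: verbs shown with gisteren (WG) in A appear without it (NG) in B."""
    flip = {"WG": "NG", "NG": "WG"}
    return {verb: flip[context] for verb, context in list_a.items()}


def assign_version(rng: random.Random) -> str:
    """Assign a participant to one of the eight versions (assumed uniform draw)."""
    return rng.choice(VERSIONS)


print(VERSIONS)
print(counterbalance({"verb_1": "WG", "verb_2": "NG"}))
print(assign_version(random.Random(2023)))
```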

Table 1. Design experiment (DD, double dental; F, filler; ND, no dental; NG, no gisteren; SD, single dental; WG, with gisteren)

Participants were presented with 28 trials, each containing a sentence with a word blanked out. Of those trials, 12 contained target items and 16 contained fillers. In the target items, participants were presented with a binary choice between the perfectum and the preterite of a non-existing verb (see Fig. 1). The order of the possible answers was randomized. These target items were interspersed with filler items which contained the same question format but presented participants with different cases of variation in Dutch (cf. Section 3.2). The order of the trials was randomized. Every trial came with a time limit of 7 seconds in order to encourage participants not to overthink their responses and to approximate online language processing to some extent. When 7 seconds had passed, the experiment moved on automatically, even if the participant had not selected a response yet. Participants were also able to skip to the following trial by clicking a ‘next’ button.

Figure 1. Example of target item (‘Tom mentioned that they … yesterday.’).

3.2. Materials

The design outlined above requires three types of materials: (1) nonsense verbs, (2) fill-in-the-blank matrix sentences, and (3) filler items. Starting with (1), 48 non-existing verbs were created. We chose to work with nonsense verbs because the choice between a preterite or a perfectum to express past tense can depend on the semantics of the verb. Overall, 16 of the 48 verb stems ended in a double dental, 16 in a single dental, and 16 in a non-dental. The verbs were based on the most frequent monosyllabic Dutch verbs (not taking into account strong or irregular verbs and loanwords), making minimal changes (to the onset, stem vowel, or coda), in order to create plausible but non-existing verbs in Dutch. Non-existing verbs are likely to be associated with existing verbs, each with their own semantics and preferences for preterite or perfectum. In an attempt to control for this, a pretest was conducted. Participants (n = 11) were asked to give all existing words they associated with the non-verbs. Only non-verbs that were associated with the same existing word by fewer than half of the participants were selected. For each stem type, 8 verbs were selected.² The final verbs can be found in Table 2. As an auxiliary to form the perfectum, we always used hebben ‘have’.

Table 2. Final selection of verbs and their 3rd person inflection

For (2), we created 24 fill-in-the-blank sentences. The blank had to be filled in by either the preterite or the perfectum of a non-verb. In order to avoid any bias toward either the preterite or the perfectum, we made sure the main verb always appeared on the first pole. Given that the main verb appears on the second pole when using the perfectum in a main clause, the sentences were constructed so that the preterite or the perfectum always had to be filled in in a subclause, more specifically a complement clause. We alternated between the following verbs for the main clause: vertellen ‘tell’, horen ‘hear’, zeggen ‘say’, beweren ‘claim’, vermelden ‘mention’, verklappen ‘reveal,’ and vernemen ‘find out.’ The subject of the complement clause was always in the third person. Half of the sentences appeared without any other past tense markers and are therefore more ambiguous regarding present/past interpretation. The other half appeared with gisteren ‘yesterday’ and are therefore unambiguously set in the past. Fig. 1 shows an example of a target item. All sentences can be found in Appendix A.

Finally, we created (3) 16 fillers containing different types of morphosyntactic alternations in Dutch: variation in plural marking (-en vs. -s suffix), variation in neuter versus non-neuter definite article (het vs. de), and variation in auxiliary for the future tense (zullen vs. gaan). These can be found in Appendix B. All fillers were of the same fill-in-the-blank format as the target items, where participants could choose between two variants. For one of the filler trials, only one variant was grammatically possible: nakje (a diminutive) can only appear with the neuter article het and not with the non-neuter article de. This filler thus functioned as an attention check to see whether participants were taking the experiment seriously and were sufficiently focused.

3.3. Procedure

The study was distributed among students in non-language-related programs and in the social network of the researchers. The experiment was conducted using the online survey software Qualtrics. Participants were told that the experiment tested how language users dealt with non-existing words. They were first presented with three demographic questions (native language, variant of Dutch, age), after which they received instructions for the actual experiment. First, participants were presented with two practice trials to allow them to get used to the question format and the response window (7 seconds per trial, cf. Section 3.1). Then, the actual experiment began. Afterwards, participants were asked what their strategy was for filling out the experiment and whether they had any further comments about the study. At the end of the study, participants received more information about the aim of the study. The study was approved by the KU Leuven Ethics Committee.

3.4. Participants

The experiment was completed by 232 participants. Non-native speakers (n = 4) and speakers who failed the attention check (n = 4) were excluded. We also excluded two participants who reported in the response strategy and comment field they had dyslexia and felt this may have interfered with their responses. This left a total of 222 participants to be included in the analyses. In total, 221 participants were speakers of the Belgian-Dutch variety, and 1 participant was a speaker of the Netherlandic-Dutch variety.

3.5. Analysis and results

We analyzed our data using a mixed effects logistic regression (using the package lme4 by Bates et al., 2015).³ The dataset and R-code can be found at: https://osf.io/sr87h. In total, we have 2994 attestations. Figs. 2 and 3 summarize the raw data.

Figure 2. Number of perfects and preterites in contexts without gisteren.

Figure 3. Number of perfects and preterites in contexts with gisteren.

The following predictors were added as fixed effects:

- verb stem (double dental, single dental, no dental)
- number (singular or plural)
- context (with temporal adverb gisteren ‘yesterday’ or without gisteren ‘yesterday’)
- priming (no priming, priming of perfectum, priming of preterite). We added for each answer whether the previous answer was a perfectum or a preterite to account for an effect of priming. In case the previous answer was a filler or there was no previous answer, the value for this predictor is ‘no priming.’
- display order (preterite first, perfectum first). This predictor represents the order in which the participants saw the multiple choice options.
- trial number (numerical, scaled and centred). This predictor was included to account for effects of fatigue.
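The coding of the priming predictor and the scaling of trial number can be illustrated with a minimal Python sketch. It is not the authors' R code: the function names, the label strings, and the example sequence are hypothetical, and scaling by the sample standard deviation is an assumption about what "scaled and centred" means here.

```python
from statistics import mean, stdev


def code_priming(responses):
    """Return one priming value per trial, in presentation order.

    `responses` holds "perfectum", "preterite" or "filler" for each trial.
    """
    priming = []
    previous = None  # there is no previous answer before the first trial
    for answer in responses:
        if previous in ("perfectum", "preterite"):
            priming.append("priming of " + previous)
        else:
            # first trial, or the previous trial was a filler
            priming.append("no priming")
        previous = answer
    return priming


def scale_and_centre(values):
    """Subtract the mean and divide by the sample standard deviation."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]


# Hypothetical answer sequence for one participant (illustration only).
answers = ["preterite", "filler", "perfectum", "perfectum"]
print(code_priming(answers))
print(scale_and_centre([1, 2, 3, 4, 5]))
```

In an analysis along these lines, the rows for filler trials would be dropped after coding, so that only target items carry a priming value.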

All categorical variables were dummy-coded. A three-way interaction was added between verb stem, number, and context. Subject (i.e., participant) and item (i.e., verb) were added as random effects. Following Barr et al. (2013), we also added (correlated) random slopes for all factors of interest. A correlated random slope for verb stem in interaction with context was added by subject (number does not differ by subject) and a correlated random slope for number in interaction with context was added by item (verb stem does not differ by item). We started with a maximal model, which did not converge. We then simplified the random structure of the model until we reached convergence: we successively removed interaction effects, correlation parameters, and random slopes one by one, checking each time that the AIC did not significantly increase. This way, we obtained a model with a correlated random slope for context by subject, a correlated random slope for number by item, and a correlated random slope for context by item. When no convergence could be reached after simplifying as far as possible without increasing the AIC, we applied bound optimization by quadratic approximation (bobyqa). The final model contains display order, priming, trial number, and an interaction between verb stem, number, and context as fixed effects and a correlated random slope for context by subject, a correlated random slope for number by item, and a correlated random slope for context by item.⁴ The model was checked for multicollinearity, but no problems arose. The numerical output of the final model can be found in Tables 3–5. Table 6 shows the contrasts between singular and plural for each combination of verb stem and context in a post-hoc Tukey test, and Table 7 the contrasts between different verb stems. Fig. 4 visualizes the three-way interaction effect.
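To make the dummy coding and the three-way interaction concrete, the following Python sketch builds the fixed-effect columns for a single observation. It only illustrates treatment coding in general: the reference levels (no dental, singular, with gisteren) and the column names are assumptions, since the text does not state which levels served as the baseline, and in R these columns are generated automatically from the model formula.

```python
from itertools import product

# Assumed level order; the first level of each factor is the reference level.
LEVELS = {
    "verb_stem": ["no dental", "single dental", "double dental"],
    "number": ["singular", "plural"],
    "context": ["with gisteren", "without gisteren"],
}


def dummy_code(factor, value):
    """Treatment coding: one 0/1 indicator per non-reference level."""
    return {f"{factor}={lvl}": int(value == lvl) for lvl in LEVELS[factor][1:]}


def design_row(verb_stem, number, context):
    """Intercept, main effects, two-way and three-way interaction columns."""
    mains = [
        dummy_code("verb_stem", verb_stem),
        dummy_code("number", number),
        dummy_code("context", context),
    ]
    row = {"(Intercept)": 1}
    for indicators in mains:
        row.update(indicators)
    # Two-way interactions: products of indicators from two different factors.
    for a, b in [(0, 1), (0, 2), (1, 2)]:
        for (ka, va), (kb, vb) in product(mains[a].items(), mains[b].items()):
            row[f"{ka}:{kb}"] = va * vb
    # Three-way interaction: products of one indicator from each factor.
    for (ka, va), (kb, vb), (kc, vc) in product(*(m.items() for m in mains)):
        row[f"{ka}:{kb}:{kc}"] = va * vb * vc
    return row


row = design_row("double dental", "plural", "without gisteren")
print(len(row))  # number of fixed-effect columns for this interaction
print([name for name, value in row.items() if value == 1])
```

Under this coding, the full three-way interaction of a three-level, a two-level, and another two-level factor yields twelve columns including the intercept, and an observation in the assumed reference cell has a 1 only in the intercept column.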

Table 3. Fixed effects for (simple) mixed effects model for the experimental study

Note: C-value: 0.865, marginal R²: 0.130, conditional R²: 0.385.

*** = <0.001; ** = <0.01; * = <0.05.

Table 4. Random effects for mixed effects model experimental study

Table 5. Mixed model ANOVA table experimental study

*** = <0.001; ** = <0.01; * = <0.05.

Table 6. Post-hoc Tukey test⁵: estimated marginal means for contrasts between singular and plural for verbal stem and context

*** = <0.001; ** = <0.01; * = <0.05.

Table 7. Post-hoc Tukey test: estimated marginal means for contrasts between different types of verb stem for number and context

*** = <0.001; ** = <0.01; * = <0.05.

Figure 4. Interaction effect for verb stem, number, and context (error bars represent 95% confidence intervals).

3.6. Discussion

The results confirm our hypotheses. We expected to see a significant difference between singular and plural forms for verbs with a double dental in the stem coda, but not for verbs without a dental in the stem coda. This expectation is borne out (see Table 6). Indeed, plural verbs with a double dental tend to be used in the preterite significantly less often than singular verbs with a double dental. Furthermore, this difference between singular and plural for verbs with a double dental in the stem coda turns out to be larger in the contexts without explicit past marking through the adverb gisteren, which are more ambiguous. Verbs with a single dental take an in-between position: a difference between singular and plural only shows up in the more ambiguous contexts, without gisteren. This tells us that orthography plays a role as well: verbs with a single dental are homophonous but not homonymous; thus, in written data such as these, speakers can still visually differentiate between these forms.

When we look at the contrasts between the different verb stems (Table 7), we again see our hypotheses confirmed. This comparison also shows the effect of context more clearly: only for plurals in the context without gisteren (ambiguous contexts) do we see a significant difference between verbs with a double dental and verbs with a single dental, on the one hand, and verbs without a dental, on the other hand. Again, ambiguity caused by homophony is also avoided for verbs with a single dental, but, to a lesser degree than for verbs with a double dental, where homonymy is at play as well.

4. Corpus study

To complement our experimental data, which allowed carefully controlled manipulations of factors like contextual temporal expression and semantic interference of word meaning, but which can only approximate actual language use at best, we also conducted a corpus study. This way, we were able to investigate whether we can find additional evidence in naturally occurring language production for our hypothesis that the alternation between preterites and perfecta in Dutch is affected by homophony avoidance.

4.1. Data collection and annotation

Our corpus study covers the same alternation as the experiment. From the Spoken Dutch Corpus (covering both Northern- and Southern-Dutch) (Oostdijk et al., 2002), we extracted all preterites and past participles. As past participles that were part of a perfectum needed to be distinguished from other past participles by hand, we only used a subset of all attestations. We selected all attestations of the six most frequent verbs (not taking into account strong or irregular verbs) with a stem ending in a dental (n = 3151). This number of verbs allowed for a balanced dataset with regard to workload, on the one hand, and sufficient attestations, on the other hand. The verbs were heten ‘to be called,’ verplichten ‘to obligate,’ praten ‘to talk,’ verwachten ‘to expect,’ richten ‘to direct’ and zetten ‘to set’. As a control group, we selected six verbs with frequencies closest to the six most frequent verbs with dental stem (n = 3153). Frequency was the only criterion, and we did not look at the semantics (nor possible preferences for preterites versus perfects) for these verbs. The verbs were betalen ‘to pay,’ meemaken ‘to experience,’ draaien ‘to turn,’ missen ‘to miss,’ pakken ‘to take,’ and spelen ‘to play.’ We manually selected all perfecta and distinguished between perfectum singular and perfectum plural. For the preterites, this information was already in the pos-tag. Furthermore, we added whether the verb form was found in a main clause or subclause. The final dataset consists of 3606 attestations, of which 1661 are perfecta and 1945 are preterites. Fig. 5 shows the ratio of preterites versus perfects for each verb. Fig. 6 shows the ratio of preterites versus perfects for each verb stem in the singular and plural.
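One way to select control verbs "with frequencies closest to" a set of target verbs is a greedy nearest-frequency match without replacement, sketched below in Python. This is only one possible procedure and not necessarily the one the authors followed; the verb labels and frequency counts are invented placeholders, not corpus figures.

```python
def match_by_frequency(targets, candidates):
    """Greedy nearest-frequency matching without replacement.

    Both arguments map a verb label to its frequency. Targets are handled
    from most to least frequent; each one receives the still-unused
    candidate whose frequency is closest to its own.
    """
    available = dict(candidates)
    matches = {}
    for verb, freq in sorted(targets.items(), key=lambda kv: -kv[1]):
        best = min(available, key=lambda c: abs(available[c] - freq))
        matches[verb] = best
        del available[best]  # each control verb can be used only once
    return matches


# Invented placeholder labels and counts, for illustration only.
targets = {"dental_1": 900, "dental_2": 700, "dental_3": 400}
candidates = {
    "control_a": 880,
    "control_b": 710,
    "control_c": 390,
    "control_d": 500,
    "control_e": 650,
}
print(match_by_frequency(targets, candidates))
```

A greedy match is simple to reproduce but depends on the order in which targets are processed; with frequency as the only criterion, as in the study, no semantic filter is applied to the candidates.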

Figure 5. Ratio perfects and preterites for each verb lemma.

Figure 6. Ratio perfects and preterites for each verb stem in plural and singular.

4.2. Analysis and results

Again, a mixed effects regression model was used to analyze the data. The dataset and R-code can be found at: https://osf.io/sr87h/. The outcome variable was the variant used to express past tense, that is, preterite or perfectum. The fixed effects were as follows:

- verb stem: double dental, single dental or no dental
- number: singular or plural
- clause type: main clause or subclause
- register: formal or informal
- genre: read aloud literary texts or other genres. Genre was included as a covariate, given that De Smet (2021, p. 141) shows that preterites are more likely to occur in literary genres (see also Section 3).
- region: Northern-Dutch versus Southern-Dutch. Region was included as a covariate, given that De Smet (2021, p. 143) and Grondelaers et al. (2020, p. 88) show that preterites are used more often in northern Dutch (see also Section 3).

All categorical variables were dummy-coded. An interaction effect between verb stem and number was added. Random intercepts were added for verb and speaker. We also included a correlated random slope by number for verb. Theoretically, a correlated random slope for number in interaction with verb stem should also be added by speaker, or even just a correlated random slope for number and verb stem separately by speaker, but there was only very little variation for these variables by speaker. Many speakers only appear one time in this dataset and only use one of the verbs or only use the singular or plural. As a result, there was no need for these random slopes. We thus started with only a correlated random slope by number for verb and a random intercept for speaker. As this model did not converge, we simplified, taking the same steps as outlined in Section 3.5. The final model contained a random intercept for verb and a random intercept for speaker and clause type, register, genre, region, and an interaction between number and verb stem as fixed effects.⁶ There were no problems with multicollinearity. The output of the model can be found in Tables 8–10. Table 11 shows the contrasts between singular and plural for each verbal category. Fig. 7 visualizes these results.
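The argument that by-speaker random slopes are unnecessary rests on how few speakers vary in number or verb stem. A check of this kind could be sketched in Python as follows; the function name, the field names, and the example rows are hypothetical and do not reproduce the corpus data.

```python
from collections import defaultdict


def share_with_variation(rows, speaker_key, variable_key):
    """Proportion of speakers who use more than one value of a variable."""
    seen = defaultdict(set)
    for r in rows:
        seen[r[speaker_key]].add(r[variable_key])
    varying = sum(1 for values in seen.values() if len(values) > 1)
    return varying / len(seen)


# Invented rows for illustration: only speaker S1 uses both numbers.
rows = [
    {"speaker": "S1", "number": "singular"},
    {"speaker": "S1", "number": "plural"},
    {"speaker": "S2", "number": "singular"},
    {"speaker": "S3", "number": "plural"},
    {"speaker": "S3", "number": "plural"},
]
share = share_with_variation(rows, "speaker", "number")
print(share)
```

A low proportion indicates that a by-speaker slope for that variable would be estimated from very few speakers, which is the situation the paragraph above describes.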

Table 8. Fixed effects for (simple) mixed effects model for the corpus study

Note: C-value: 0.966, marginal R²: 0.301, conditional R²: 0.746.

*** = <0.001; ** = <0.01; * = <0.05.

Table 9. Random effects for mixed effects model corpus study

Table 10. Mixed model ANOVA table corpus study

*** = <0.001; ** = <0.01; * = <0.05.

Table 11. Post-hoc Tukey test: estimated marginal means for contrasts between singular and plural for verbal category

*** = <0.001; ** = <0.01; * = <0.05.

Figure 7. Interaction effect verb stem and number (error bars represent 95% confidence intervals).

4.3. Discussion

Again, our hypotheses are confirmed. A significant difference between singular and plural forms can be noted for both verbs with a double dental and a single dental: singular verbs, where no homophony is created, show more preterites. No such difference can be found for verbs without a dental in the stem coda as neither in the singular nor in the plural homophony can appear. There are also substantial differences between the verb stem categories, which is likely due to the fact that each verb stem category only contains a limited number of verbs (there is even only one double dental verb stem, zetten ‘put’). As these verbs each have their own semantics, they each have their own preference for past tense formations (which is also shown by the variance explained by the random intercept for verb in Table 9). In contrast to the experimental results, double dental and single dental verbs show a similar difference between singular and plural in the probability of preterites. This may be explained by the spoken nature of the data in the corpus study, where orthography does not play a role and where double dental and single dental verbs are thus equally ambiguous in the past tense plural.

5. Discussion and conclusion

In this study, we combined experimental research and corpus data in a bid to further understand the role of homophony avoidance in language variation and change. We discussed several historical observations in Dutch, where homophony avoidance is claimed to work as a mechanism of language change. An experimental study and a corpus study showed that language users are indeed prone to avoid homophony between present and past tense, providing a stronger footing for the plausibility of homophony avoidance explanations in the historical observations as well. The question remains how this mechanism works. The teleological explanation where language users somehow (subconsciously) predict the ambiguity a homophone is going to cause is not unproblematic, especially, as Lass (1987, 1997a, 1997b, pp. 355–261) notes, with regard to the presumed intentionality of the speaker. In the experimental study particularly, this explanation does not sit well as there is no actual communication going on and an addressee is lacking. In that case, why would the language user care whether or not the language utterance could be ambiguous?

In the input-based explanation of Blevins and Wedel (2009) and Labov (1994), the much debated intentionality of the speaker is put aside. However, this explanation does not immediately match up with our results either. A first problem lies with the artificial verbs that were used in the experiment. Participants had never seen any of these verbs before, so the frequency of the non-homophonous variant could not have been higher than the frequency of the homophonous variant in their input. Yet, this explanation could work when we assume that the effect of homophony surpasses the level of the individual verb and instead works at a higher, more abstract level of ‘verbs ending in a dental stem.’ For this more abstract level, language users have received real-life input where the homophonous preterite could be (‘wrongly’) assigned to the category of the present, resulting in a lower input of preterite plural forms for verbs with a dental stem, which could then perhaps have affected the choice for the non-homophonous variant in these new, non-existing verbs ending in a dental stem. A second issue lies with the difference we found between verbs in a more ambiguous context and verbs in a less ambiguous context (with gisteren). If it were strictly a case of frequency of input, whether or not the context is ambiguous would only matter in perception, but not in production.

One step that could be taken to further investigate how homophony avoidance works is to take a closer look at perception, instead of production. So far, researchers have mainly looked at the production side of homophony avoidance. Yet, for the frequency of input argument to make sense, we need to establish that language users indeed frequently misunderstand the homophonous variant and ‘miscategorise’ it as belonging to the adjacent category. Though frequent misunderstanding of the homophonous variant would not rule out the possibility of a more intentional mechanism behind homophony avoidance, the opposite – that is, homophonous variants not causing ambiguity, which is not unlikely given that most utterances are embedded in disambiguating context – should rule out the frequency of input argument. A more teleological explanation could still stand as simply the assumption that an utterance is ambiguous to the addressee could perhaps be enough for the speaker to shift away from homophony.

Despite the converging evidence emanating from our studies, some limitations should be noted. A drawback of the experimental study is the limited ecological validity. Not only does it contain non-existing verbs, but the context and task themselves are also quite far removed from natural language production. In response to these limitations, a path for future research could be to replicate this experiment in a discourse completion task prioritizing online language production in a more communicative setting. Of course, the drawbacks of our experimental set-up are mitigated by complementing that study with a corpus study, where spontaneous spoken data and existing verbs are used. The drawback of the use of existing verbs is that each verb has its own semantics, which may be associated with a certain strategy to express past tense. We tackled that limitation both by taking individual behavior of verbs into account by adding a random intercept for verb to our corpus model and by combining the corpus results with the more tightly controlled experimental study. A next step forward would also be a larger scale corpus study, taking into account a wider variety of verbs. A second limitation of the corpus study is that we did not control for the ambiguity of the context in which the preterites and perfecta appeared, as we did in the experimental study. Ideally, an adequate measure of how ambiguous a sentence is with regard to past tense should be added to the analysis. However, this is not as straightforwardly implemented as in the experimental study, where language users only received limited and tightly controlled context. Adverbial markers or other past tense forms often make the sentence unambiguous, but the context in previous sentences or even hand gestures made by the speaker can help as well.

The combination, however, of both corpus and experimental research, with each solving possible limitations of the other and at the same time supporting earlier historical observations, makes a strong case for homophony avoidance as a plausible mechanism of language change, even though the exact cognitive workings of this mechanism are still unclear.

Data availability statement

The datasets and R-code can be found at: https://osf.io/sr87h/.

Acknowledgements

We want to thank Freek Van de Velde for sharing his insights on the design of the experiment and Andy Wedel, Bodo Winter, and an anonymous reviewer for their helpful comments and feedback. We would also like to express our sincerest gratitude to all colleagues who filled out the pretest and who helped distribute the final experiment, as well as to all participants who took part in the study.

Competing interest

The authors declare none.

A. Appendix 1: sentences used in stimuli (both singular and plural variant)

A.1. Sentences without gisteren ‘yesterday’

De man vertelde dat hij/ze … ‘The man said that he/they …’.

Je hoorde dat ze/ze … ‘You heard that she/they …’.

De vader zei dat hij/ze … ‘The father said that he/they …’.

Je vernam dat hij/ze … ‘You found out that he/they …’.

Hij beweerde dat hij/ze … ‘He claimed that he/they …’.

Ze verklapte dat haar zoon/zonen … ‘She revealed that her son/sons …’.

Haar zus vermeldde dat ze/ze … ‘Her sister mentioned that she/they …’.

De buurvrouw vertelde dat ze/ze … ‘The neighbour said that she/they …’.

De advocaat hoorde dat hij/ze … ‘The lawyer heard that he/they …’.

De leraar zei dat het kind/de kinderen … ‘The teacher said that the child/the children …’.

De baas vernam dat ze/ze … ‘The boss found out that she/they …’.

De moeder beweerde dat ze/ze … ‘The mother claimed that she/they …’.

A.2. Sentences with gisteren ‘yesterday’

De bankier beweerde dat hij/ze gisteren … ‘The banker claimed that he/they … yesterday’.

Hij verklapte dat ze/ze gisteren … ‘He revealed that she/they … yesterday’.

De verkoper vermeldde dat hij/ze gisteren … ‘The seller mentioned that he/they … yesterday’.

De leerling vertelde dat hij/ze gisteren … ‘The pupil said that he/they … yesterday’.

Hij hoorde dat de directeur/directeurs gisteren … ‘He heard that the principal/principals … yesterday’.

Het meisje zei dat ze/ze gisteren … ‘The girl said that she/they … yesterday’.

Toon verklapte dat hij/ze gisteren … ‘Toon revealed that he/they … yesterday’.

Tom vermeldde dat hij/ze gisteren … ‘Tom mentioned that he/they … yesterday’.

Ze vertelde dat haar dochter/dochters gisteren … ‘She said that her daughter/daughters … yesterday’.

De agent hoorde dat de man/mannen gisteren … ‘The police officer heard that the man/men … yesterday’.

De familie zei dat ze/ze gisteren … ‘The family said that she/they … yesterday’.

Mijn collega vernam dat hij/ze gisteren … ‘My colleague found out that he/they … yesterday’.

B. Appendix 2: fillers

Op de markt kocht ik twee … ‘On the market I bought two …’ (meppels-meppelen).

Hij doet elke dag meerdere … ‘Every day he does multiple …’ (fose-fosen).

Zijn ouders gaan elke dag naar drie verschillende … ‘His parents go to three different … every day’ (lagerieën-lageries).

De … worden verkocht tegen een hoge prijs ‘The … are sold at a high price’ (banaren-banaars).

De kinderen leren alles over de … ‘The children learn everything about the …’ (demen-demes).

Vandaag worden de … onderzocht ‘Today the … get examined’ (ratoren-rators).

De … doen het goed voor de tijd van het jaar. ‘The … are doing well for the time of the year’ (oengelen-oengels).

Het kind houdt … vast ‘The child is holding …’ (het nakje-de nakje).

Ik ga graag naar … ‘I like to go to …’ (de pars-het pars).

… staat in de garage ‘… is standing in the garage’ (Het blet-De blet).

De jongen roept dat hij/ze morgen … ‘The boy shouts that he/they … tomorrow’ (gaat/gaan waven-zal/zullen waven).

Het meisje fluistert dat ze/ze morgen … ‘The girl whispers that she/they … tomorrow’ (gaat/gaan greffen-zal/zullen greffen).

Jo zegt dat hij/ze morgen … ‘Jo says that he/they … tomorrow’ (gaat/gaan goeven-zal/zullen goeven).

Mijn grootvader vertelt dat hij morgen … ‘My grandfather says that he … tomorrow’ (gaat/gaan truizen-zal/zullen truizen).

… is veel te groot ‘… is way too big’ (De naster-Het naster).

Elk jaar gaan we naar … ‘Every year we go to …’ (de ost-het ost).

Footnotes

¹ Homophony avoidance between present and past tense is by no means specific to Dutch. Examples can be found in English as well. De Clerck and Vanopstal (2015, p. 364) suggest that the verb lean prefers the regular preterite leaned over leant to avoid homophony with lent, the preterite of lend. Bybee & Moder (1983, p. 259) show in a Wug-experiment that language users avoid producing past tense forms that are identical to present tense forms, even though this is a grammatical possibility in English (e.g., hit-hit, but see Cuskley et al., 2015 for evidence of L2 speakers gravitating towards these forms in a Wug-experiment and Fertig, 2013 for a discussion of verbs that have changed to the level inflection, e.g., wet-wet and fit-fit in American English as opposed to respectively wetted and fitted). Furthermore, homophony avoidance has also been suggested as a reason why TD-deletion (deletion of t/d after a consonant at the end of a word) takes place more often in monomorphemic words than in regular past tense forms (Guy, 1991; Holtz, 2021).

² Of course this cannot entirely ensure that participants will not link the non-existing verbs to existing verbs, which is something we will have to keep in mind when interpreting the results. Yet, the observation that participants varied quite a lot in the verbs they associated with the non-existing verbs in the pretest strengthens our belief that results will not be skewed too much by this.

³ Other packages we used are: dplyr version 0.8.3 (Wickham et al., 2019a,b), tidyverse version 1.2.1 (Wickham et al., 2019a,b), reshape2 version 1.4.3 (Wickham, 2007), effects version 4.1 (Fox, 2003), ggplot2 version 3.2.1 (Wickham, 2016), ModelMetrics version 1.2.2 (Hunt, 2018), MuMIn version 1.43.6 (Barton, 2019), gridExtra version 2.3 (Auguie & Antonov, 2017), emmeans version 1.4.6 (Lenth et al., 2020), afex version 1.3–0 (Singmann et al., 2023).

⁴ The model formula is: fit <- glmer(tense ~ verb stem * number * context + display order + priming + trial number + (1 + Context|ResponseId) + (1 + number + context|Verb), family = binomial, data = d, control = glmerControl(optimizer = "bobyqa")).

⁵ We used the emmeans package (Lenth et al., 2020).

⁶ The model formula is: fit <- glmer(tense ~ verb stem * number + clause type + register + genre + region + (1|verb), data = d, family = binomial).

References

Auguie, B., & Antonov, A. (2017). Miscellaneous Functions for “Grid” Graphics. R package version 2.3. https://CRAN.R-project.org/package=gridExtra

Baerman, M. (2011). Defectiveness and homophony avoidance. Journal of Linguistics, 47, 1–29.

Barr, D.J., Levy, R., Scheepers, C. & Tily, H.J. (2013). Random effects structure for confirmatory hypothesis testing: keep it maximal. Journal of Memory and Language 68(3), 255–278.

Barton, K. (2019). MuMIn: Multi-Model Inference. R package version 1.43.6. https://CRAN.R-project.org/package=MuMIn

Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1), 1–48.

Blevins, J., & Wedel, A. (2009). Inhibited sound change. Diachronica, 26(2), 143–183.

Bybee, J., & Moder, C. L. (1983). Morphological classes as natural categories. Language, 59(2), 251–270.

Campbell, L. (1975). Constraints on sound change. In Dahlstedt, K. (Ed.), The Nordic languages and modern linguistics (Vol. II, pp. 388–406). Almqvist & Wiksell.

Campbell, L. (1996). On sound change and challenges to regularity. In Durie, M. & Ross, M. (Eds.), The comparative method reviewed: Regularity and irregularity in language change (pp. 72–89). Oxford University Press.

Campbell, L. (1998). Historical linguistics: An introduction. University of Edinburgh Press.

Cuskley, C., Colaiori, F., Castellano, C., Loreto, V., Pugliese, M., & Tria, F. (2015). The adoption of linguistic rules in native and non-native speakers: Evidence from a Wug task. Journal of Memory and Language, 84, 205–223.

Dautriche, I., Swingley, D., & Christophe, A. (2015). Learning novel phonological neighbors: Syntactic category matters. Cognition, 143, 77–86.

De Clerck, B., & Vanopstal, K. (2015). Patterns of regularisation in British, American and Indian English: A closer look at irregular verbs with t/ed variation. In Collins, P. (Ed.), Grammatical change in English world-wide (pp. 335–372). John Benjamins.

De Smet, I. (2021). De sterke werkwoorden in het Nederlands. Een diachroon, kwantitatief onderzoek. [Doctoral dissertation, KU Leuven].

De Smet, I., & Van de Velde, F. (2019). Reassessing the evolution of West-Germanic preterite inflection. Diachronica, 36(2), 139–179.

De Vogelaer, G., & Coussé, E. (2011). The functional nature of pronominal change: Innovative plural pronouns in English and Dutch. Neophilologus, 95, 1–26.

Fertig, D. (2013). Analogy and morphological change. Edinburgh: Edinburgh University Press.

Flego, S. (2022). The emergence of vowel quality mutation in German and Dinka-Nuer. Modeling the role of information-theoretic factors using agent-based simulation [Doctoral dissertation, Indiana University].

Fox, J. (2003). Effect displays in R for generalised linear models. Journal of Statistical Software, 8(15), 1–27.

Gilliéron, J., & Roques, M. (1912). Etudes de géographie linguistique d’après l’Atlas linguistique de la France. Champion.

Goossens, J., & Verheyden, J. (1970). De preteritum-vormen van de zwakke werkwoorden in het zuiden van het Nederlandse taalgebied. In Heeroma, K., Meertens, P. J., & Den Besten, A. (Eds.), Zijn akker is de taal (pp. 133–147). Bakker.

Grondelaers, S., De Troij, R., Speelman, D., & van den Bosch, A. (2020). Vissen naar variatie. Digitaal op zoek naar onbekende Noord/Zuid-verschillen in de grammatica van het Nederlands. Nederlandse Taalkunde, 25(1), 73–99.

Guy, G. R. (1991). Explanation in variable phonology: An exponential model of morphological constraints. Language Variation and Change, 3(1), 1–22.

Haeseryn, W., Romijn, K., Geerts, G., de Rooij, J., & van den Toorn, M. C. (1997). Algemene Nederlandse Spraakkunst (2nd edition). Martinus Nijhoff uitgevers/Wolters Plantyn.

Holtz, A. (2021). Ambiguity and variable phonological rules. The challenge of TD Deletion in US English. In Holtz, A., Kovač, I., Puggaard-Rode, R. & Wall, J. (Eds.), Proceedings of the 29th Conference of the Student Organization of Linguistics in Europe, 26–28 January 2021, Leiden University (pp. 98–115). Leiden University Centre for Linguistics.

Hunt, T. (2018). ModelMetrics: Rapid Calculation of Model Metrics. R package version 1.2.2. https://cran.r-project.org/web/package=ModelMetrics

Kaplan, A. (2011). How much homophony is normal? Journal of Linguistics, 47(3), 631–671.

Kaplan, A., & Muratani, Y. (2015). Categorical and gradient homophony avoidance: Evidence from Japanese. Laboratory Phonology, 6(2), 167–195.CrossRef Google Scholar

King, R. D. (1967). Functional load and sound change. Language, 43(4), 831–852.CrossRef Google Scholar

Labov, W. (1994). Principles of linguistic change: Internal factors (Vol. 1). Blackwell.Google Scholar

Lass, R. (1987). The shape of English. Dent.Google Scholar

Lass, R. (1997a). Arse longa, vita brevis: last words on ‘harmful homophony’. Studia Anglica Posnaniensia, XXXII, 21–31.Google Scholar

Lass, R. (1997b). Historical linguistics and language change. Cambridge University Press.CrossRef Google Scholar

Lenth, R., Singmann, H., Love, J., Buerkner, P., & Herve, M. (2020). Emmeans: Estimated Marginal Means, aka Least-Squares Means. R Package version 1.4.7. https://cran.r-project.org/web/packages/emmeans/index.html Google Scholar

Levshina, N. (2020). Communicative efficiency and differential case marking: A reverse engineering approach. Linguistics Vanguard, 7(s3), 20190087.CrossRef Google Scholar

Lloyd, P. M. (1987). From Latin to Spanish: Historical phonology and morphology of the Spanish language. American Philosophical Society.Google Scholar

Martinet, A. (1955). Economie des changements phonétiques. Francke.

Oostdijk, N., Goedertier, W., Van Eynde, F., Boves, L., Martens, J.-P., Moortgat, M., & Baayen, H. (2002). Experiences from the Spoken Dutch corpus project. In Proceedings of the third international conference on language resources and evaluation (LREC’02). European Language Resources Association.

Sampson, G. (2013). A counterexample to homophony avoidance. Diachronica, 30(4), 579–591.

Samuels, M. L. (1987). A brief rejoinder to Professor Lass. In Koopman, W., van der Leek, F., Fischer, O., & Eaton, R. (Eds.), Explanation and linguistic change (pp. 257–258). John Benjamins.

Seyfarth, S., Buz, E., & Jaeger, F. (2016). Dynamic hyperarticulation of coda voicing contrasts. The Journal of the Acoustical Society of America, 139(2), EL31–EL37.

Silverman, D. (2010). Neutralization and anti-homophony in Korean. Journal of Linguistics, 46(2), 453–482.

Singmann, H., Bolker, B., Westfall, J., Aust, F., & Ben-Shachar, M. (2023). afex: Analysis of Factorial Experiments. R package version 1.3-0. https://CRAN.R-project.org/package=afex

Taeldeman, J. (2011). De vorming van het ‘zwakke’ preteritum in de zuidelijke Nederlandse dialecten: een tentatieve benadering van twee aspecten. Verslagen en Mededelingen van de KANTL, 121(2), 183–204.

Tal, S., Smith, K., Culbertson, J., Grossman, E., & Arnon, I. (2022). The impact of information structure on the emergence of differential object marking: An experimental study. Cognitive Science, 46(3), e13119.

Vandekerckhove, R. (2003). Microvariatie in de preteritumvorming van zwakke werkwoorden. Leuvense Bijdragen, 92, 31–41.

Van Bree, C. (1987). Historische grammatica van het Nederlands. Foris.

Van Loon, J. (2014). Historische fonologie van het Nederlands. Universitas.

Wedel, A. (2012). Lexical contrast maintenance and the organization of sublexical contrast systems. Language and Cognition, 4(4), 319–355.

Wedel, A., Kaplan, A., & Jackson, S. (2013). High functional load inhibits phonological contrast loss: A corpus study. Cognition, 128(2), 179–186.

Wedel, A., & Fatkullin, I. (2017). Category competition as a driver of category contrast. Journal of Language Evolution, 2(1), 77–93.

Wickham, H. (2007). Reshaping data with the reshape package. Journal of Statistical Software, 21(12), 1–20. http://www.jstatsoft.org/v21/i12/

Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer-Verlag.

Wickham, H., François, R., Henry, L., & Müller, K. (2019a). dplyr: A Grammar of Data Manipulation. R package version 0.8.3. https://CRAN.R-project.org/package=dplyr

Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E., Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., … Yutani, H. (2019b). Welcome to the tidyverse. Journal of Open Source Software, 4(43), 1686.

Winter, B., & Wedel, A. (2016). The co-evolution of speech and the lexicon: The interaction of functional pressures, redundancy, and category variation. Topics in Cognitive Science, 8(2), 503–513.

Yin, S. H., & White, J. (2018). Neutralization and homophony avoidance in phonological learning. Cognition, 179, 89–101.

Zehentner, E. (2022). Ambiguity avoidance as a factor in the rise of the English dative alternation. Cognitive Linguistics, 33(1), 3–33.