Abstract
Japanese has a rich verbal conjugation system. For example, the lemma miru may be realized as the non-past indicative form miru, the past indicative form mita, or the imperative form miro. A verb is conjugated depending on a variety of factors such as aspect and mood. That is, a speaker has to choose an appropriate conjugated form in speech production. The aim of this study is to explore whether the production of a verb is influenced by the probability of choosing a conjugated form given the verbal lemma. Our corpus-based research demonstrates that, if a verb has been produced as a non-past indicative form frequently, the verb is produced with shorter duration when produced as the non-past indicative form. This result can be neatly captured by positing that a speaker stores both episodic memories and abstract categories (i.e., lemma and conjugation form), and that the ease in production target creation is a function of the resting activation level of an abstract category.
Similar content being viewed by others
Notes
I deeply thank an anonymous reviewer for bringing this problem to my attention.
There is an alternative view called a listener-oriented approach (Pate & Goldwater, 2015; Rose, 2017; Hall et al., 2018), which can also put forward a similar prediction. This type of account hypothesizes that a speaker balances two competing biases: a pressure to minimize articulatory effort and a pressure to transmit a message accurately (Jaeger & Buz, 2017; Rose, 2017). When a speaker thinks that a listener can infer what kind of message the speaker conveys, the speaker is supposed to minimize articulatory effort, and thereby the speaker produces a reduced speech signal (e.g., a signal with shorter duration and that with centralized formant values). This is because, if the message is inferable in speech settings, the communication is potentially successful without clear acoustic signals. It would be easy for a listener to recognize a non-past indicative form with higher conjugation predictability. The reason is that they are exposed to the conjugation form frequently in speech, and they store a large number of exemplars encoding the conjugation form, which allow a listener to access the conjugation node with ease. Consequently, a speaker does not have to improve the intelligibility of the speech signal, with the result that the duration becomes shorter.
Bell et al. (2021) put forward a similar hypothesis. They posit that a linguistic unit with a greater activation level receives more enhanced articulation.
The so-called paradigmatic enhancement effects can be interpreted in another way. Cohen (2015) accounts for the signal enhancement by positing that all the exemplars in a lexical category are averaged together to create a production target. Consequently, the pronunciation of a conjugation form shifts towards the more frequent conjugation form within the paradigm. Tomaschek et al. (2021) posit that a higher paradigmatic probability increases the learnability of a word form, and the greater learnability is somehow associated with clear speech articulation.
So far, we have mainly discussed the probabilistic reduction by positing that the phonetic redundancy in a speech signal reflects the speed in which a speaker accesses a conjugation node when producing a verb (Gahl, 2008; Bell et al., 2009; Gahl et al., 2012). As discussed in Sect. 2, there is an alternative view called a listener-oriented approach (Pate & Goldwater, 2015; Rose, 2017; Hall et al., 2018), which can also account for the findings in the current study. According to the listener-oriented account, a speaker is supposed to minimize articulatory effort (i.e., produce a speech signal with shorter duration), when the speaker thinks that the intended message is easy for a listener to retrieve. In the case of the current study, a non-past indicative form with higher conjugation predictability is produced with a reduced speech signal because the speaker thinks that the non-past indicative form would be easy for a listener to recognize. This communicative ease account should be tested further using a perceptual experiment in future work (see Allen et al., 2005; Cohen & Kang, 2018). According to our corpus-based data, the verb kotonaru is frequently used as a non-past indicative form, while matomeru is not frequently used as a non-past indicative form. (Recall that these two lemmas have similar token frequency according to our corpus-based research.) According to the listener-oriented approach, it is expected that the non-past indicative form kotonaru may be recognized more accurately and faster than the non-past indicative form matomeru.
References
Allen, P. A., Smith, A. F., Lien, M., Grabbe, J., & Murphy, M. D. (2005). Evidence for an activation locus of the word-frequency effect in lexical decision. Journal of Experimental Psychology, 31, 713–721. https://psycnet.apa.org/doi/10.1037/0096-1523.31.4.713.
Baker, R., & Bradlow, A. (2009). Variability in word duration as a function of probability, speech style, and prosody. Language and Speech, 52, 391–413. https://doi.org/10.1177/0023830909336575.
Barton, K. (2020). MuMIn: multi-model inference. R package version 1.43.17. https://CRAN.R-project.org/package=MuMIn.
Bell, A., Brenier, J., Gregory, M., Girand, C., & Jurafsky, D. (2009). Predictability effects on durations of content and function words in conversational English. Journal of Memory and Language, 60, 92–111. https://doi.org/10.1016/j.jml.2008.06.003.
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67, 1–38. arXiv:1406.5823.
Bell, M. J., Ben Hedia, S., & Plag, I. (2021). How morphological structure affects phonetic realisation in English compound nouns. Morphology, 31(2), 87–120. https://doi.org/10.1007/s11525-020-09346-6.
Bloch, B. (1946). Studies in colloquial Japanese I inflection. Journal of the American Oriental Society, 66(2), 97–109. https://doi.org/10.2307/596327.
Boersma, P., & Weenink, D. (2019). Praat version 6.0.49. http://www.fon.hum.uva.nl/praat/.
Bürki, A., Ernestus, M., Gendrot, C., Fougeron, C., & Frauenfelder, U. H. (2011). What affects the presence versus absence of schwa and its duration: a corpus analysis of French connected speech. The Journal of the Acoustical Society of America, 130, 3980–3991. https://doi.org/10.1121/1.3658386.
Bybee, J. (2010). Language, usage and cognition. Cambridge: Cambridge University Press.
Cohen, C. (2014). Probabilistic reduction and probabilistic enhancement: contextual and paradigmatic effects on morpheme pronunciation. Morphology, 24, 291–323. https://doi.org/10.1007/s11525-014-9243-y.
Cohen, C. (2015). Context and paradigms: two patterns of probabilistic pronunciation variation in Russian agreement suffixes. The Mental Lexicon, 10(3), 313–338. https://doi.org/10.1075/ml.10.3.01coh.
Cohen, C., & Kang, S. (2018). Flexible perceptual sensitivity to acoustic and distributional cues. The Mental Lexicon, 13(1), 38–73. https://doi.org/10.1075/ml.16029.coh.
Dell, G. S. (1986). A spreading-activation theory of retrieval in sentence production. Psychological Review, 93(3), 283–321. https://psycnet.apa.org/doi/10.1037/0033-295X.93.3.283.
Forster, K. I., & Chambers, S. M. (1973). Lexical access and naming time. Journal of Verbal Learning and Verbal Behaviour, 12, 627–635. https://doi.org/10.1016/S0022-5371(73)80042-8.
Foulkes, P., & Docherty, G. (2006). The social life of phonetics and phonology. Journal of Phonetics, 34, 409–438. https://doi.org/10.1016/j.wocn.2005.08.002.
Fowler, C. A., & Housum, J. (1987). Talkers’ signaling of “new” and “old” words in speech and listeners’ perception and use of the distinction. Journal of Memory and Language, 26, 489–504. https://doi.org/10.1016/0749-596X(87)90136-7.
Friedman, L., & Wall, M. (2005). Graphical views of suppression and multicollinearity in multiple linear regression. American Statistician, 59, 127–136. https://doi.org/10.1198/000313005X41337.
Gahl, S. (2008). Time and thyme are not homophones: the effect of lemma frequency on word durations in spontaneous speech. Language, 84, 474–496. https://doi.org/10.1353/lan.0.0035.
Gahl, S., Yao, Y., & Johnson, K. (2012). Why reduce? Phonological neighborhood density and phonetic reduction in spontaneous speech. Journal of Memory and Language, 66, 789–806. https://doi.org/10.1016/j.jml.2011.11.006.
Hall, K., Hume, E., Jaeger, T. F., & Wedel, A. (2018). The role of predictability in shaping phonological patterns. Linguistics Vanguard, 4, 20170027. https://doi.org/10.1515/lingvan-2017-0027.
Hashimoto, D. (2020). Probabilistic reduction in relation to social message predictability. Linguistics Vanguard, 6(1), 20190052 https://doi.org/10.1515/lingvan-2019-0052.
Hashimoto, D. (2021). Probabilistic reduction and mental accumulation in Japanese: frequency, contextual predictability, and average predictability. Journal of Phonetics, 87(101061), 1–17. https://doi.org/10.1016/j.wocn.2021.101061.
Hay, J. (2003). Causes and consequences of word structure. New York: Routledge.
Iori, I. (2012). Atarashī Nihon-go-gaku Nyūmon. Tokyo: 3A Network.
Irwin, M., & Zirk, M. (2019). Japanese linguistics. Tokyo: Asakura.
Jaeger, T. F., & Buz, E. (2017). Signal reduction and linguistic encoding. In E. M. Fernández & H. S. Cairns (Eds.), The handbook of psycholinguistics (pp. 38–81). Hoboken: Wiley-Blackwell.
Jescheniak, J. D., & Levelt, W. (1994). Word frequency effects in speech production: retrieval of syntactic information and of phonological form. Journal of Experimental Psychology. Learning, Memory, and Cognition, 20, 824–843. https://psycnet.apa.org/doi/10.1037/0278-7393.20.4.824.
Johnson, K. (2006). Resonance in an exemplar-based lexicon: the emergence of social identity and phonology. Journal of Phonetics, 34, 485–499. https://doi.org/10.1016/j.wocn.2005.08.004.
Jurafsky, D., Bell, A., Gregory, M., & Raymond, W. (2001). Probabilistic relations between words: evidence from reduction in lexical production. In J. Bybee & P. Hopper (Eds.), Frequency and the emergence of linguistic structure (pp. 229–254). Amsterdam: Benjamin.
Kuperman, V., Pluymaekers, M., Ernestus, M., & Baayen, H. (2007). Morphological predictability and acoustic duration of interfixes in Dutch compounds. The Journal of the Acoustical Society of America, 121(4), 2261–2271. https://doi.org/10.1121/1.2537393.
Levelt, W. J. M., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences, 22, 1–38. https://doi.org/10.1017/S0140525X99001776.
Maekawa, K., Kikuchi, K., & Tsukahara, W. (2004). Corpus of spontaneous Japanese: design, annotation and XML representation. In Proceedings of the international symposium on large-scale knowledge resources 2004 (pp. 19–24).
Nash, J. C. (2014). On best practice optimization methods in R. Journal of Statistical Software, 60(2), 1–14. http://www.jstatsoft.org/v60/i02/.
Ogura, H. (2006). Morphological information. In National Institute of Japanese Language and Linguistics (Eds.), Construction of the corpus of spontaneous Japanese (pp. 133–180). Tokyo: NINJAL.
Pandey, S., & Elliott, W. (2010). Suppressor variable in social network research: ways to identify in multiple regression models. Journal of the Society for Social Work and Research, 1, 28–40. https://doi.org/10.5243/jsswr.2010.2.
Pate, J. K., & Goldwater, S. (2015). Talkers account for listener and channel characteristics to communicate efficiently. Journal of Memory and Language, 78, 1–17. https://doi.org/10.1016/j.jml.2014.10.003.
Pierrehumbert, J. B. (2001). Exemplar dynamics, word frequency, lenition, and contrast. In J. Bybee & P. Hopper (Eds.), Frequency effects and the emergence of linguistic structure (pp. 135–157). Amsterdam: Benjamins.
R Core Team (2020). R version 3.6.3. https://www.r-project.org/.
Rose, D. (2017). Predicting plurality: an examination of the effects of morphological predictability on the learning and realization of bound morphemes. Doctoral thesis, University of Canterbury. https://doi.org/10.26021/4881.
Schuppler, B., van Dommelen, W. A., Koreman, J., & Ernestus, M. (2012). How linguistic and probabilistic properties of a word affect the realization of its final /t/: Studies at the phonemic and sub-phonemic level. Journal of Phonetics, 40(4), 595–607. https://doi.org/10.1016/j.wocn.2012.05.004.
Seyfarth, S. (2014). Word informativity influences acoustic duration: effects of contextual predictability on lexical representation. Cognition, 133, 140–155. https://doi.org/10.1016/j.cognition.2014.06.013.
Sóskuthy, M., & Hay, J. (2017). Changing word usage predicts changing word durations in New Zealand English. Cognition, 166, 298–313. https://doi.org/10.1016/j.cognition.2017.05.032.
Takezawa, K. (2019). Inflection. In T. Kageyama & H. Kishimoto (Eds.), Handbook of Japanese Lexicon and word formation (pp. 459–488). Berlin: de Gruyter.
Tang, K., & Bennett, R. (2018). Contextual predictability influences word and morpheme duration in a morphologically complex language (Kaqchikel Mayan). The Journal of the Acoustical Society of America, 144(2), 997–1017. https://doi.org/10.1121/1.5046095.
Tang, K., & Shaw, J. (2021). Prosody leaks into the memories of words. Cognition, 210, 104601. https://doi.org/10.1016/j.cognition.2021.104601.
Tomaschek, F., Hendrix, P., & Baayen, H. (2018). Strategies for addressing collinearity in multivariate linguistic data. Journal of Phonetics, 71, 249–267. https://doi.org/10.1016/j.wocn.2018.09.004.
Tomaschek, F., Tucker, B. V., Ramscar, M., & Harald Baayen, R. (2021). Paradigmatic enhancement of stem vowels in regular English inflected verb forms. Morphology, 31(2), 171–199. https://doi.org/10.1007/s11525-021-09374-w.
Wang, S. F. (2022). The interaction between predictability and pre-boundary lengthening on syllable duration in Taiwan Southern Min. Phonetica, 79(4), 315–352. https://doi.org/10.1515/phon-2022-0009.
Wheeldon, L. R., & Konopka, A. E. (2018). Spoken word production. In S. Rueschemeyer & M. G. Gaskell (Eds.), The Oxford handbook of psycholinguistics, Oxford: Oxford University Press.
Wickham, H. (2016). ggplot2: elegant graphics for data analysis. New York: Springer.
Wurm, L. H., & Fisicaro, S. A. (2014). What residualizing predictors in regression analyses does (and what it does not do). Journal of Memory and Language, 72, 37–48. https://doi.org/10.1016/j.jml.2013.12.003.
Zee, T., Ten Bosch, L., Plag, I., & Ernestus, M. (2021). Paradigmatic relations interact during the production of complex words: evidence from variable plurals in Dutch. Frontiers in Psychology, 12, 720017. https://doi.org/10.3389/fpsyg.2021.720017.
Acknowledgements
The earlier version of this study was presented at the 29th Japanese/Korean Linguistics Conference (Nagoya University & the National Institute for Japanese Language and Linguistics), the Socio meeting (University of Canterbury), and the Tokyo Circle of Phonologists. I thank the audiences for their comments at these events. I am especially grateful to Jen Hay, Donald Derrick, Ayaki Kujirai, Clemens Poppe, Shin-ichi Tanaka, Manami Hirayama, Marco Fonseca, Yusuke Kubota, and Dani Watson. I also thank the journal editor Sabine Arndt-Lappe for providing me with many insightful comments, and Clara Cohen for sharing her knowledge about paradigm predictability effects. I greatly appreciate the two anonymous reviewers for providing valuable comments and suggestions to improve the quality of this study.
Funding
This study is supported by Japan Society for the Promotion of Science (JSPS) KAKENHI grant number 20K13000 (https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-20K13000/).
Author information
Authors and Affiliations
Contributions
Daiki Hashimoto confirms sole responsibilities for conceptualization, scripting, data retrieval, data processing, statistical analyses, writing, and funding acquisition. As explained in Sect. 3, this study employs the CSJ corpus, and the corpus was constructed by the CSJ development team. I am permitted to use the corpus for academic research, but I am not allowed to share the data with someone outside my research institution. See the following link for how to obtain the permission to use the corpus data (https://clrd.ninjal.ac.jp/csj/en/index.html).
Corresponding author
Ethics declarations
Competing Interests
The author has no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hashimoto, D. The effect of verbal conjugation predictability on speech signal. Morphology 33, 41–63 (2023). https://doi.org/10.1007/s11525-023-09404-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11525-023-09404-9