The effect of verbal conjugation predictability on speech signal

Hashimoto, Daiki

doi:10.1007/s11525-023-09404-9

The effect of verbal conjugation predictability on speech signal

Published: 10 March 2023

Volume 33, pages 41–63, (2023)
Cite this article

Morphology Aims and scope Submit manuscript

Daiki Hashimoto ORCID: orcid.org/0000-0001-5007-046X¹

182 Accesses
1 Altmetric
Explore all metrics

Abstract

Japanese has a rich verbal conjugation system. For example, the lemma miru may be realized as the non-past indicative form miru, the past indicative form mita, or the imperative form miro. A verb is conjugated depending on a variety of factors such as aspect and mood. That is, a speaker has to choose an appropriate conjugated form in speech production. The aim of this study is to explore whether the production of a verb is influenced by the probability of choosing a conjugated form given the verbal lemma. Our corpus-based research demonstrates that, if a verb has been produced as a non-past indicative form frequently, the verb is produced with shorter duration when produced as the non-past indicative form. This result can be neatly captured by positing that a speaker stores both episodic memories and abstract categories (i.e., lemma and conjugation form), and that the ease in production target creation is a function of the resting activation level of an abstract category.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Serial verb constructions and the syntax-prosody interface

Article Open access 10 February 2021

Morphological embedding and phonetic reduction: the case of triconstituent compounds

Article 07 March 2016

*Daroo ka↑: the interplay of deictic modality, sentence type, prosody and tier of meaning

Article 18 July 2023

Notes

I deeply thank an anonymous reviewer for bringing this problem to my attention.
There is an alternative view called a listener-oriented approach (Pate & Goldwater, 2015; Rose, 2017; Hall et al., 2018), which can also put forward a similar prediction. This type of account hypothesizes that a speaker balances two competing biases: a pressure to minimize articulatory effort and a pressure to transmit a message accurately (Jaeger & Buz, 2017; Rose, 2017). When a speaker thinks that a listener can infer what kind of message the speaker conveys, the speaker is supposed to minimize articulatory effort, and thereby the speaker produces a reduced speech signal (e.g., a signal with shorter duration and that with centralized formant values). This is because, if the message is inferable in speech settings, the communication is potentially successful without clear acoustic signals. It would be easy for a listener to recognize a non-past indicative form with higher conjugation predictability. The reason is that they are exposed to the conjugation form frequently in speech, and they store a large number of exemplars encoding the conjugation form, which allow a listener to access the conjugation node with ease. Consequently, a speaker does not have to improve the intelligibility of the speech signal, with the result that the duration becomes shorter.
Bell et al. (2021) put forward a similar hypothesis. They posit that a linguistic unit with a greater activation level receives more enhanced articulation.
The so-called paradigmatic enhancement effects can be interpreted in another way. Cohen (2015) accounts for the signal enhancement by positing that all the exemplars in a lexical category are averaged together to create a production target. Consequently, the pronunciation of a conjugation form shifts towards the more frequent conjugation form within the paradigm. Tomaschek et al. (2021) posit that a higher paradigmatic probability increases the learnability of a word form, and the greater learnability is somehow associated with clear speech articulation.
So far, we have mainly discussed the probabilistic reduction by positing that the phonetic redundancy in a speech signal reflects the speed in which a speaker accesses a conjugation node when producing a verb (Gahl, 2008; Bell et al., 2009; Gahl et al., 2012). As discussed in Sect. 2, there is an alternative view called a listener-oriented approach (Pate & Goldwater, 2015; Rose, 2017; Hall et al., 2018), which can also account for the findings in the current study. According to the listener-oriented account, a speaker is supposed to minimize articulatory effort (i.e., produce a speech signal with shorter duration), when the speaker thinks that the intended message is easy for a listener to retrieve. In the case of the current study, a non-past indicative form with higher conjugation predictability is produced with a reduced speech signal because the speaker thinks that the non-past indicative form would be easy for a listener to recognize. This communicative ease account should be tested further using a perceptual experiment in future work (see Allen et al., 2005; Cohen & Kang, 2018). According to our corpus-based data, the verb kotonaru is frequently used as a non-past indicative form, while matomeru is not frequently used as a non-past indicative form. (Recall that these two lemmas have similar token frequency according to our corpus-based research.) According to the listener-oriented approach, it is expected that the non-past indicative form kotonaru may be recognized more accurately and faster than the non-past indicative form matomeru.

References

Allen, P. A., Smith, A. F., Lien, M., Grabbe, J., & Murphy, M. D. (2005). Evidence for an activation locus of the word-frequency effect in lexical decision. Journal of Experimental Psychology, 31, 713–721. https://psycnet.apa.org/doi/10.1037/0096-1523.31.4.713.
Google Scholar
Baker, R., & Bradlow, A. (2009). Variability in word duration as a function of probability, speech style, and prosody. Language and Speech, 52, 391–413. https://doi.org/10.1177/0023830909336575.
Article Google Scholar
Barton, K. (2020). MuMIn: multi-model inference. R package version 1.43.17. https://CRAN.R-project.org/package=MuMIn.
Bell, A., Brenier, J., Gregory, M., Girand, C., & Jurafsky, D. (2009). Predictability effects on durations of content and function words in conversational English. Journal of Memory and Language, 60, 92–111. https://doi.org/10.1016/j.jml.2008.06.003.
Article Google Scholar
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67, 1–38. arXiv:1406.5823.
Article Google Scholar
Bell, M. J., Ben Hedia, S., & Plag, I. (2021). How morphological structure affects phonetic realisation in English compound nouns. Morphology, 31(2), 87–120. https://doi.org/10.1007/s11525-020-09346-6.
Article Google Scholar
Bloch, B. (1946). Studies in colloquial Japanese I inflection. Journal of the American Oriental Society, 66(2), 97–109. https://doi.org/10.2307/596327.
Article Google Scholar
Boersma, P., & Weenink, D. (2019). Praat version 6.0.49. http://www.fon.hum.uva.nl/praat/.
Bürki, A., Ernestus, M., Gendrot, C., Fougeron, C., & Frauenfelder, U. H. (2011). What affects the presence versus absence of schwa and its duration: a corpus analysis of French connected speech. The Journal of the Acoustical Society of America, 130, 3980–3991. https://doi.org/10.1121/1.3658386.
Article Google Scholar
Bybee, J. (2010). Language, usage and cognition. Cambridge: Cambridge University Press.
Book Google Scholar
Cohen, C. (2014). Probabilistic reduction and probabilistic enhancement: contextual and paradigmatic effects on morpheme pronunciation. Morphology, 24, 291–323. https://doi.org/10.1007/s11525-014-9243-y.
Article Google Scholar
Cohen, C. (2015). Context and paradigms: two patterns of probabilistic pronunciation variation in Russian agreement suffixes. The Mental Lexicon, 10(3), 313–338. https://doi.org/10.1075/ml.10.3.01coh.
Article Google Scholar
Cohen, C., & Kang, S. (2018). Flexible perceptual sensitivity to acoustic and distributional cues. The Mental Lexicon, 13(1), 38–73. https://doi.org/10.1075/ml.16029.coh.
Article Google Scholar
Dell, G. S. (1986). A spreading-activation theory of retrieval in sentence production. Psychological Review, 93(3), 283–321. https://psycnet.apa.org/doi/10.1037/0033-295X.93.3.283.
Article Google Scholar
Forster, K. I., & Chambers, S. M. (1973). Lexical access and naming time. Journal of Verbal Learning and Verbal Behaviour, 12, 627–635. https://doi.org/10.1016/S0022-5371(73)80042-8.
Article Google Scholar
Foulkes, P., & Docherty, G. (2006). The social life of phonetics and phonology. Journal of Phonetics, 34, 409–438. https://doi.org/10.1016/j.wocn.2005.08.002.
Article Google Scholar
Fowler, C. A., & Housum, J. (1987). Talkers’ signaling of “new” and “old” words in speech and listeners’ perception and use of the distinction. Journal of Memory and Language, 26, 489–504. https://doi.org/10.1016/0749-596X(87)90136-7.
Article Google Scholar
Friedman, L., & Wall, M. (2005). Graphical views of suppression and multicollinearity in multiple linear regression. American Statistician, 59, 127–136. https://doi.org/10.1198/000313005X41337.
Article Google Scholar
Gahl, S. (2008). Time and thyme are not homophones: the effect of lemma frequency on word durations in spontaneous speech. Language, 84, 474–496. https://doi.org/10.1353/lan.0.0035.
Article Google Scholar
Gahl, S., Yao, Y., & Johnson, K. (2012). Why reduce? Phonological neighborhood density and phonetic reduction in spontaneous speech. Journal of Memory and Language, 66, 789–806. https://doi.org/10.1016/j.jml.2011.11.006.
Article Google Scholar
Hall, K., Hume, E., Jaeger, T. F., & Wedel, A. (2018). The role of predictability in shaping phonological patterns. Linguistics Vanguard, 4, 20170027. https://doi.org/10.1515/lingvan-2017-0027.
Article Google Scholar
Hashimoto, D. (2020). Probabilistic reduction in relation to social message predictability. Linguistics Vanguard, 6(1), 20190052 https://doi.org/10.1515/lingvan-2019-0052.
Article Google Scholar
Hashimoto, D. (2021). Probabilistic reduction and mental accumulation in Japanese: frequency, contextual predictability, and average predictability. Journal of Phonetics, 87(101061), 1–17. https://doi.org/10.1016/j.wocn.2021.101061.
Article Google Scholar
Hay, J. (2003). Causes and consequences of word structure. New York: Routledge.
Google Scholar
Iori, I. (2012). Atarashī Nihon-go-gaku Nyūmon. Tokyo: 3A Network.
Google Scholar
Irwin, M., & Zirk, M. (2019). Japanese linguistics. Tokyo: Asakura.
Google Scholar
Jaeger, T. F., & Buz, E. (2017). Signal reduction and linguistic encoding. In E. M. Fernández & H. S. Cairns (Eds.), The handbook of psycholinguistics (pp. 38–81). Hoboken: Wiley-Blackwell.
Chapter Google Scholar
Jescheniak, J. D., & Levelt, W. (1994). Word frequency effects in speech production: retrieval of syntactic information and of phonological form. Journal of Experimental Psychology. Learning, Memory, and Cognition, 20, 824–843. https://psycnet.apa.org/doi/10.1037/0278-7393.20.4.824.
Article Google Scholar
Johnson, K. (2006). Resonance in an exemplar-based lexicon: the emergence of social identity and phonology. Journal of Phonetics, 34, 485–499. https://doi.org/10.1016/j.wocn.2005.08.004.
Article Google Scholar
Jurafsky, D., Bell, A., Gregory, M., & Raymond, W. (2001). Probabilistic relations between words: evidence from reduction in lexical production. In J. Bybee & P. Hopper (Eds.), Frequency and the emergence of linguistic structure (pp. 229–254). Amsterdam: Benjamin.
Chapter Google Scholar
Kuperman, V., Pluymaekers, M., Ernestus, M., & Baayen, H. (2007). Morphological predictability and acoustic duration of interfixes in Dutch compounds. The Journal of the Acoustical Society of America, 121(4), 2261–2271. https://doi.org/10.1121/1.2537393.
Article Google Scholar
Levelt, W. J. M., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences, 22, 1–38. https://doi.org/10.1017/S0140525X99001776.
Article Google Scholar
Maekawa, K., Kikuchi, K., & Tsukahara, W. (2004). Corpus of spontaneous Japanese: design, annotation and XML representation. In Proceedings of the international symposium on large-scale knowledge resources 2004 (pp. 19–24).
Google Scholar
Nash, J. C. (2014). On best practice optimization methods in R. Journal of Statistical Software, 60(2), 1–14. http://www.jstatsoft.org/v60/i02/.
Article Google Scholar
Ogura, H. (2006). Morphological information. In National Institute of Japanese Language and Linguistics (Eds.), Construction of the corpus of spontaneous Japanese (pp. 133–180). Tokyo: NINJAL.
Google Scholar
Pandey, S., & Elliott, W. (2010). Suppressor variable in social network research: ways to identify in multiple regression models. Journal of the Society for Social Work and Research, 1, 28–40. https://doi.org/10.5243/jsswr.2010.2.
Article Google Scholar
Pate, J. K., & Goldwater, S. (2015). Talkers account for listener and channel characteristics to communicate efficiently. Journal of Memory and Language, 78, 1–17. https://doi.org/10.1016/j.jml.2014.10.003.
Article Google Scholar
Pierrehumbert, J. B. (2001). Exemplar dynamics, word frequency, lenition, and contrast. In J. Bybee & P. Hopper (Eds.), Frequency effects and the emergence of linguistic structure (pp. 135–157). Amsterdam: Benjamins.
Google Scholar
R Core Team (2020). R version 3.6.3. https://www.r-project.org/.
Rose, D. (2017). Predicting plurality: an examination of the effects of morphological predictability on the learning and realization of bound morphemes. Doctoral thesis, University of Canterbury. https://doi.org/10.26021/4881.
Schuppler, B., van Dommelen, W. A., Koreman, J., & Ernestus, M. (2012). How linguistic and probabilistic properties of a word affect the realization of its final /t/: Studies at the phonemic and sub-phonemic level. Journal of Phonetics, 40(4), 595–607. https://doi.org/10.1016/j.wocn.2012.05.004.
Article Google Scholar
Seyfarth, S. (2014). Word informativity influences acoustic duration: effects of contextual predictability on lexical representation. Cognition, 133, 140–155. https://doi.org/10.1016/j.cognition.2014.06.013.
Article Google Scholar
Sóskuthy, M., & Hay, J. (2017). Changing word usage predicts changing word durations in New Zealand English. Cognition, 166, 298–313. https://doi.org/10.1016/j.cognition.2017.05.032.
Article Google Scholar
Takezawa, K. (2019). Inflection. In T. Kageyama & H. Kishimoto (Eds.), Handbook of Japanese Lexicon and word formation (pp. 459–488). Berlin: de Gruyter.
Google Scholar
Tang, K., & Bennett, R. (2018). Contextual predictability influences word and morpheme duration in a morphologically complex language (Kaqchikel Mayan). The Journal of the Acoustical Society of America, 144(2), 997–1017. https://doi.org/10.1121/1.5046095.
Article Google Scholar
Tang, K., & Shaw, J. (2021). Prosody leaks into the memories of words. Cognition, 210, 104601. https://doi.org/10.1016/j.cognition.2021.104601.
Article Google Scholar
Tomaschek, F., Hendrix, P., & Baayen, H. (2018). Strategies for addressing collinearity in multivariate linguistic data. Journal of Phonetics, 71, 249–267. https://doi.org/10.1016/j.wocn.2018.09.004.
Article Google Scholar
Tomaschek, F., Tucker, B. V., Ramscar, M., & Harald Baayen, R. (2021). Paradigmatic enhancement of stem vowels in regular English inflected verb forms. Morphology, 31(2), 171–199. https://doi.org/10.1007/s11525-021-09374-w.
Article Google Scholar
Wang, S. F. (2022). The interaction between predictability and pre-boundary lengthening on syllable duration in Taiwan Southern Min. Phonetica, 79(4), 315–352. https://doi.org/10.1515/phon-2022-0009.
Article Google Scholar
Wheeldon, L. R., & Konopka, A. E. (2018). Spoken word production. In S. Rueschemeyer & M. G. Gaskell (Eds.), The Oxford handbook of psycholinguistics, Oxford: Oxford University Press.
Google Scholar
Wickham, H. (2016). ggplot2: elegant graphics for data analysis. New York: Springer.
Book Google Scholar
Wurm, L. H., & Fisicaro, S. A. (2014). What residualizing predictors in regression analyses does (and what it does not do). Journal of Memory and Language, 72, 37–48. https://doi.org/10.1016/j.jml.2013.12.003.
Article Google Scholar
Zee, T., Ten Bosch, L., Plag, I., & Ernestus, M. (2021). Paradigmatic relations interact during the production of complex words: evidence from variable plurals in Dutch. Frontiers in Psychology, 12, 720017. https://doi.org/10.3389/fpsyg.2021.720017.
Article Google Scholar

Download references

Acknowledgements

The earlier version of this study was presented at the 29th Japanese/Korean Linguistics Conference (Nagoya University & the National Institute for Japanese Language and Linguistics), the Socio meeting (University of Canterbury), and the Tokyo Circle of Phonologists. I thank the audiences for their comments at these events. I am especially grateful to Jen Hay, Donald Derrick, Ayaki Kujirai, Clemens Poppe, Shin-ichi Tanaka, Manami Hirayama, Marco Fonseca, Yusuke Kubota, and Dani Watson. I also thank the journal editor Sabine Arndt-Lappe for providing me with many insightful comments, and Clara Cohen for sharing her knowledge about paradigm predictability effects. I greatly appreciate the two anonymous reviewers for providing valuable comments and suggestions to improve the quality of this study.

Funding

This study is supported by Japan Society for the Promotion of Science (JSPS) KAKENHI grant number 20K13000 (https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-20K13000/).

Author information

Authors and Affiliations

Joetsu University of Education, Niigata, Japan
Daiki Hashimoto

Authors

Daiki Hashimoto
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Daiki Hashimoto confirms sole responsibilities for conceptualization, scripting, data retrieval, data processing, statistical analyses, writing, and funding acquisition. As explained in Sect. 3, this study employs the CSJ corpus, and the corpus was constructed by the CSJ development team. I am permitted to use the corpus for academic research, but I am not allowed to share the data with someone outside my research institution. See the following link for how to obtain the permission to use the corpus data (https://clrd.ninjal.ac.jp/csj/en/index.html).

Corresponding author

Correspondence to Daiki Hashimoto.

Ethics declarations

Competing Interests

The author has no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Hashimoto, D. The effect of verbal conjugation predictability on speech signal. Morphology 33, 41–63 (2023). https://doi.org/10.1007/s11525-023-09404-9

Download citation

Received: 31 January 2022
Accepted: 27 February 2023
Published: 10 March 2023
Issue Date: March 2023
DOI: https://doi.org/10.1007/s11525-023-09404-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The effect of verbal conjugation predictability on speech signal

Abstract

Access this article

Similar content being viewed by others

Serial verb constructions and the syntax-prosody interface

Morphological embedding and phonetic reduction: the case of triconstituent compounds

*Daroo ka↑: the interplay of deictic modality, sentence type, prosody and tier of meaning

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The effect of verbal conjugation predictability on speech signal

Abstract

Access this article

Similar content being viewed by others

Serial verb constructions and the syntax-prosody interface

Morphological embedding and phonetic reduction: the case of triconstituent compounds

*Daroo ka↑: the interplay of deictic modality, sentence type, prosody and tier of meaning

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation