Abstract
It is well known that contextual predictability facilitates word identification, but it is less clear whether the uncertainty associated with the current context (i.e., its lexical entropy) influences sentence processing. On the one hand, high entropy contexts may lead to interference due to greater number of lexical competitors. On the other hand, predicting multiple lexical competitors may facilitate processing through the preactivation of shared semantic features. In this study, we examined whether entropy measured at the trial level (i.e., for each participant, for each item) corresponds to facilitatory or inhibitory effects. Trial-level entropy captures each individual’s knowledge about specific contexts and is therefore a more valid and sensitive measure of entropy (relative to the commonly employed item-level entropy). Participants (N = 112) completed two experimental sessions (with counterbalanced orders) that were separated by a 3- to 14-day interval. In one session, they produced up to 10 completions for sentence fragments (N = 647). In another session, they read the same sentences including a target word (whose entropy value was calculated based on the produced completions) while reading times were measured. We observed a facilitatory (not inhibitory) effect of trial-level entropy on lexical processing over and above item-level measures of lexical predictability (including cloze probability, surprisal, and semantic constraint). Extra analyses revealed that greater semantic overlap between the target and the produced responses facilitated target processing. Thus, the results lend support to theories of lexical prediction maintaining that prediction involves broad activation of semantic features rather than activation of full lexical forms.
Similar content being viewed by others
Data Availability
All the data, materials and analysis codes for this article are available at https://osf.io/buwxs/. This study was not pre-registered.
Notes
Note that greater entropy reduction would result in a slower (not faster) reading times on the target word (bowl).
Note that in addition to semantic features, a cohort of activated lexical items may also lead to activation of form-related features such as phonological and/or orthographic information. Importantly, form-related features have been shown to exert both facilitatory and inhibitory effects on target processing depending on the frequency of the target word relative to the activated cohort (e.g., Carreiras et al., 1997; Karimi & Diaz, 2020; Vergara-Martinez & Swaab, 2012). Although the potential effects of such form-related factors would constitute a valid and interesting research question, this was not the aim of the current research. Given the exponential growth of interaction terms with the addition of these factors and their interactions, we believe a proper investigation of the potential effects of such variables requires a considerably larger dataset with proper controls implemented in the experimental design. Despite this, we still examined the potential effect of orthographic similarity on reading times. This analysis revealed an inhibitory (not facilitatory) main effect of orthographic overlap. Moreover, this variable did not reliably interact with trial entropy (see section Investigating the Mechanism Underlying the Effect of Trial Entropy).
Due to dropouts and experimenter errors, the distribution of participants was not exactly balanced across sessions. Fifty-four participants completed the cloze task before the reading task, and 58 participants did the reading task before the cloze task.
We thank an anonymous reviewer for suggesting these analyses.
However, we believe that more direct comparisons of trial-level, item-level, and participant-level entropy (subject-specific prediction fluency) will be an important goal for future research.
We thank Kiel Christianson for bringing this discussion point to our attention.
References
Altmann, G. T. M., & Kamide, Y. (1999). Incremental interpretation at verbs: Restricting the domain of subsequent reference. Cognition, 73(3), 247–264. https://doi.org/10.1016/S0010-0277(99)00059-1
Ankener, C. S., Sekicki, M., & Staudte, M. (2018). The influence of visual uncertainty on word surprisal and processing effort. Frontiers in Psychology, 9. https://doi.org/10.3389/fpsyg.2018.02387. Article 2387
Apfelbaum, K. S., Klein-Packard, J., & McMurray, B. (2021). The pictures who shall not be named: Empirical support for benefits of preview in the Visual World Paradigm. Journal of Memory and Language, 121, 104279. https://doi.org/10.1016/j.jml.2021.104279
Armeni, K., Willems, R. M., Van den Bosch, A., & Schoffelen, J. M. (2019). Frequency-specific brain dynamics related to prediction during language comprehension. NeuroImage, 198, 283–295.
Arnold, J. E., & Griffin, Z. M. (2007). The effect of additional characters on choice of referring expression: Everyone counts. Journal of Memory and Language, 56(4), 521–536. https://doi.org/10.1016/j.jml.2006.09.007
Aurnhammer, C., & Frank, S. L. (2019). Evaluating information-theoretic measures of word prediction in naturalistic sentence reading. Neuropsychologia, 134, Article 107198.
Baayen, R. H., Davidson, D. J., & Bates, D. M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language, 59(4), 390–412. https://doi.org/10.1016/j.jml.2007.12.005
Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J. (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68(3), 255–278. https://doi.org/10.1016/j.jml.2012.11.001
Brothers, T., & Kuperberg, G. R. (2021). Word predictability effects are linear, not logarithmic: Implications for probabilistic models of sentence comprehension. Journal of Memory and Language, 116, 104174. https://doi.org/10.1016/j.jml.2020.104174
Brothers, T., Swaab, T. Y., & Traxler, M. J. (2015). Effects of prediction and contextual support on lexical processing: Prediction takes precedence. Cognition, 136, 135–149. https://doi.org/10.1016/J.COGNITION.2014.10.017
Brothers, T., Swaab, T. Y., & Traxler, M. J. (2017). Goals and strategies influence lexical prediction during sentence comprehension. Journal of Memory and Language, 93, 203–216. https://doi.org/10.1016/J.JML.2016.10.002
Brown-Schmidt, S., & Tanenhaus, M. K. (2008). Real-time investigation of referential domains in unscripted conversation: A targeted language game approach. Cognitive Science, 32(4), 643–684. https://doi.org/10.1080/03640210802066816
Carpenter, P. A., & Just, M. A. (1983). What your eyes do while your mind is reading. In K. Rayner (Ed.), Eye movements in reading: Perceptual and language processes (pp. 275–307). New York, NY: Academic Press.
Carreiras, M., Perea, M., & Grainger, J. (1997). Effects of the orthographic neighborhood in visual word recognition: Cross-task comparisons. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23(4), 857.
Carter, B. T., Foster, B., Muncy, N. M., & Luke, S. G. (2019). Linguistic networks associated with lexical, semantic and syntactic predictability in reading: A fixation-related fMRI study. NeuroImage, 189, 224–240. https://doi.org/10.1016/j.neuroimage.2019.01.018
Cevoli, B., Watkins, C., & Rastle, K. (2022). Prediction as a basis for skilled reading: Insights from modern language models. Royal Society Open Science, 9(6), Article 211837.
Chen, Q., & Mirman, D. (2012). Competition and cooperation among similar representations: Toward a unified account of facilitative and inhibitory effects of lexical neighbors. Psychological Review, 119(2), 417–430. https://doi.org/10.1037/A0027175
Dahan, D., Magnuson, J. S., Tanenhaus, M. K., & Hogan, E. M. (2001). Subcategorical mismatches and the time course of lexical access: Evidence for lexical competition. Language and Cognitive Processes, 16(5/6), 507–534. https://doi.org/10.1080/01690960143000074
Duffy, S. A., Morris, R. K., & Rayner, K. (1988). Lexical ambiguity and fixation times in reading. Journal of Memory and Language, 27(4), 429–446. https://doi.org/10.1016/0749-596X(88)90066-6
Duffy, S. A., Kambe, G., & Rayner, K. (2001). The effect of prior disambiguating context on the comprehension of ambiguous words: Evidence from eye movements. In D. S. Gorfein (Ed.), On the consequences of meaning selection: Perspectives on resolving lexical ambiguity (pp. 27–43). American Psychological Association.
Federmeier, K. D., & Kutas, M. (1999). A rose by any other name: Long-term memory structure and sentence processing. Journal of Memory and Language, 41(4), 469–495. https://doi.org/10.1006/jmla.1999.2660
Fernandez, L. B., Engelhardt, P. E., Patarroyo, A. G., & Allen, S. E. M. (2020). Effects of speech rate on anticipatory eye movements in the visual world paradigm: Evidence from aging, native, and non-native language processing. Quarterly Journal of Experimental Psychology, 73(12), 2348–2361. https://doi.org/10.1177/1747021820948019
Fleur, D. S., Flecken, M., Rommers, J., & Nieuwland, M. S. (2020). Definitely saw it coming? The dual nature of the pre-nominal prediction effect. Cognition, 204, 104335. https://doi.org/10.1016/j.cognition.2020.104335
Frank, S. L. (2013). Uncertainty reduction as a measure of cognitive load in sentence comprehension. Topics in Cognitive Science, 5(3), 475–494. https://doi.org/10.1111/TOPS.12025
Frank, S. L., Otten, L. J., Galli, G., & Vigliocco, G. (2013). Word surprisal predicts N400 amplitude during reading. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (pp. 878–883). Sofia, Bulgaria: Association for Computational Linguistics.
Frisson, S., Rayner, K., & Pickering, M. J. (2005). Effects of contextual predictability and transitional probability on eye movements during reading. Journal of Experimental Psychology: Learning Memory and Cognition, 31(5), 862–877. https://doi.org/10.1037/0278-7393.31.5.862
Frisson, S., Harvey, D. R., & Staub, A. (2017). No prediction error cost in reading: Evidence from eye movements. Journal of Memory and Language, 95, 200–214. https://doi.org/10.1016/j.jml.2017.04.007
Hale, J. (2001). A probabilistic Earley parser as a psycholinguistic model. In Proceedings of NAACL (Vol. 2, pp. 159–166).
Hale, J. (2003). The information conveyed by words in sentences. Journal of Psycholinguistic Research, 32(2), 101–123. https://doi.org/10.1023/A:1022492123056
Hale, J. (2006). Uncertainty about the rest of the sentence. Cognitive Science, 30(4), 643–672. https://doi.org/10.1207/S15516709COG0000_64
Hale, J. T. (2011). What a rational parser would do. Cognitive Science, 35(3), 399–443. https://doi.org/10.1111/j.1551-6709.2010.01145.x
Hale, J. (2016). Information-theoretical complexity metrics. Language and Linguistics Compass, 10(9), 397–412. https://doi.org/10.1111/lnc3.12196
Hofmeister, P. (2011). Representational complexity and memory retrieval in language comprehension. Language and Cognitive Processes, 26(3), 376–405.
Huettig, F. (2015). Four central questions about prediction in language processing. Brain Research, 1626, 118–135. https://doi.org/10.1016/j.brainres.2015.02.014
Huettig, F., & Altmann, G. T. M. (2005). Word meaning and the control of eye fixation: Semantic competitor effects and the visual world paradigm. Cognition, 96(1), B23–B32. https://doi.org/10.1016/j.cognition.2004.10.003
Huettig, F., & Janse, E. (2016). Individual differences in working memory and processing speed predict anticipatory spoken language processing in the visual world. Language, Cognition and Neuroscience, 31(1), 80–93. https://doi.org/10.1080/23273798.2015.1047459
Ito, A., & Husband, E. M. (2017). How robust are effects of semantic and phonological prediction during language comprehension? A visual world eye-tracking study. IEICE Technical Report, 117(149).
Jegerski, J. (2014). Self-paced reading. Research methods in second language psycholinguistics (pp. 20–49).
Kahneman, D. (2011). Thinking fast and slow. New York, NY: Farrar, Straus and Giroux.
Kambe, G., Rayner, K., & Duffy, S. A. (2001). Global context effects on processing lexically ambiguous words: Evidence from eye fixations. Memory & Cognition, 29, 363–372.
Karimi, H. (2022). Greater entropy leads to more explicit referential forms during language production. Cognition, 225, 105093. https://doi.org/10.1016/j.cognition.2022.105093
Karimi, H., & Diaz, M. (2020). When phonological neighborhood density both facilitates and impedes: Age of acquisition and name agreement interact with phonological neighborhood during word production. Memory & Cognition, 48, 1061–1072.
Karimi, H., Brothers, T., & Ferreira, F. (2019). Phonological versus semantic prediction in focus and repair constructions: No evidence for differential predictions. Cognitive Psychology, 112, 25–47. https://doi.org/10.1016/j.cogpsych.2019.04.001
Kukona, A., Fang, S. Y., Aicher, K. A., Chen, H., & Magnuson, J. S. (2011). The time course of anticipatory constraint integration. Cognition, 119(1), 23–42. https://doi.org/10.1016/j.cognition.2010.12.002
Kuperberg, G. R. (2007). Neural mechanisms of language comprehension: Challenges to syntax. Brain Research, 1146(1), 23–49. https://doi.org/10.1016/j.brainres.2006.12.063
Kuperberg, G. R., & Jaeger, T. F. (2016). What do we mean by prediction in language comprehension? Language, Cognition and Neuroscience, 31(1), 32. https://doi.org/10.1080/23273798.2015.1102299
Kutas, M., & Hillyard, S. A. (1984). Brain potentials during reading reflect word expectancy and semantic association. Nature, 307(5947), 161–163. https://doi.org/10.1038/307161a0
Lau, E. F., & Namyst, A. (2019). fMRI evidence that left posterior temporal cortex contributes to N400 effects of predictability independent of congruity. Brain and Language, 199, 104697. https://doi.org/10.1016/j.bandl.2019.104697
Lau, E. F., Holcomb, P. J., & Kuperberg, G. R. (2013). Dissociating N400 effects of prediction from association in single-word contexts. Journal of Cognitive Neuroscience, 25(3), 484–502. https://doi.org/10.1162/jocn_a_00328
Levy, R. (2008). Expectation-based syntactic comprehension. Cognition, 106(3), 1126–1177. https://doi.org/10.1016/J.COGNITION.2007.05.006
Lowder, M. W., & Ferreira, F. (2016). Prediction in the processing of repair disfluencies: Evidence from the visual-world paradigm. Journal of Experimental Psychology: Learning Memory and Cognition, 42(9), 1400–1416. https://doi.org/10.1037/xlm0000256
Lowder, M. W., Choi, W., Ferreira, F., & Henderson, J. M. (2018). Lexical predictability during natural reading: Effects of surprisal and entropy reduction. Cognitive Science, 42, 1166–1183. https://doi.org/10.1111/COGS.12597
Lucas, M. M. (1987). Frequency effects on the processing of ambiguous words in sentence contexts. Language and Speech, 30(1), 25–46. https://doi.org/10.1177/002383098703000103
Luke, S. G., & Christianson, K. (2016). Limits on lexical prediction during reading. Cognitive Psychology, 88, 22–60. https://doi.org/10.1016/J.COGPSYCH.2016.06.002
Mani, N., & Huettig, F. (2012). Prediction during language processing is a piece of cake-But only for skilled producers. Journal of Experimental Psychology: Human Perception and Performance, 38(4), 843–847. https://doi.org/10.1037/a0029284
Marslen-Wilson, W. D. (1987). Functional parallelism in spoken word-recognition. Cognition, 25(1/2), 71–102. https://doi.org/10.1016/0010-0277(87)90005-9
Marslen-Wilson, W. D., & Welsh, A. (1978). Processing interactions and lexical access during word recognition in continuous speech. Cognitive Psychology, 10(1), 29–63. https://doi.org/10.1016/0010-0285(78)90018-X
Mirman, D., & Magnuson, J. S. (2008). Attractor dynamics and semantic neighborhood density: Processing is slowed by near neighbors and speeded by distant neighbors. Journal of Experimental Psychology: Learning Memory and Cognition, 34(1), 65–79. https://doi.org/10.1037/0278-7393.34.1.65
Mirman, D., & Magnuson, J. S. (2009). Dynamics of activation of semantically similar concepts during spoken word recognition. Memory & Cognition, 37(7), 1026–1039. https://doi.org/10.3758/MC.37.7.1026
Nelson, M. J., Dehaene, S., Pallier, C., & Hale, J. T. (2017). Entropy Reduction correlates with temporal lobe activity. CMCL 2017—Cognitive Modeling and Computational Linguistics at the 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 (pp. 1–10). https://doi.org/10.18653/V1/W17-0701
Nieuwland, M. S., Barr, D. J., Bartolozzi, F., Busch-Moreno, S., Darley, E., Donaldson, D. I., Ferguson, H. J., Fu, X., Heyselaar, E., Huettig, F., Husband, E. M., Ito, A., Kazanina, N., Kogan, V., Kohút, Z., Kulakova, E., Mézière, D., Politzer-Ahles, S., Rousselet, G., … von Grebmer Zu Wolfsthurn, S. (2020). Dissociable effects of prediction and integration during language comprehension: Evidence from a large-scale study using brain potentials. Philosophical Transactions of the Royal Society B, 375(1791), 20180522. https://doi.org/10.1098/RSTB.2018.0522
Pennington, J., Socher, R., Manning, C. (2014). GloVe: Global vectors for word representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 1532–1543). Association for Computational Linguistics, Doha, Qatar. https://doi.org/10.3115/v1/D14-1162. https://www.aclweb.org/anthology/D14-1162
Perfetti, C. (2007). Reading ability: Lexical quality to comprehension. Scientific studies of reading, 11(4), 357–383.
Perfetti, C. A., & Hart, L. (2002). The lexical quality hypothesis. Precursors of Functional Literacy, 11, 67–86.
Pickering, M. J., & Gambi, C. (2018). Predicting while comprehending language: A theory and review. Psychological Bulletin, 144(10), 1002–1044. https://doi.org/10.1037/bul0000158
Rabovsky, M., Schad, D. J., & Abdel Rahman, R. (2016). Language production is facilitated by semantic richness but inhibited by semantic density: Evidence from picture naming. Cognition, 146, 240–244. https://doi.org/10.1016/j.cognition.2015.09.016
Rayner, K. (1998). Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, 124(3), 372.
Rayner, K., & Duffy, S. A. (1988). On-line comprehension processes and eye movements in reading. In M. Daneman, G. E. MacKinnon, & T. G. Waller (Eds.), Reading research: Advances in theory and practice (pp. 13–66). Academic Press.
Rayner, K., Slattery, T. J., Drieghe, D., & Liversedge, S. P. (2011). Eye movements and word skipping during reading: Effects of word length and predictability. Journal of Experimental Psychology: Human Perception and Performance, 37(2), 514–528. https://doi.org/10.1037/a0020990
Reichle, E. D., Rayner, K., & Pollatsek, A. (2003). The EZ Reader model of eye-movement control in reading: Comparisons to other models. Behavioral and Brain Sciences, 26(4), 445–476.
Roark, B., Bachrach, A., Cardenas, C., & Pallier, C. (2009). Deriving lexical and syntactic expectation based measures for psycholinguistic modeling via incremental top-down parsing. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (pp. 324–333). Association for Computational Linguistics.
Rodd, J., Gaskell, G., & Marslen-Wilson, W. (2002). Making sense of semantic ambiguity: Semantic competition in lexical access. Journal of Memory and Language, 46(2), 245–266. https://doi.org/10.1006/jmla.2001.2810
Roland, D., Yun, H., Koenig, J. P., & Mauner, G. (2012). Semantic similarity, predictability, and models of sentence processing. Cognition, 122(3), 267–279. https://doi.org/10.1016/J.COGNITION.2011.11.011
Rommers, J., & Federmeier, K. D. (2018). Predictability’s aftermath: Downstream consequences of word predictability as revealed by repetition effects. Cortex, 101, 16–30. https://doi.org/10.1016/J.CORTEX.2017.12.018
Saint-Aubin, J., & Klein, R. M. (2001). Influence of parafoveal processing on the missing-letter effect. Journal of Experimental Psychology: Human Perception and Performance, 27(2), 318.
Smith, N. J., & Levy, R. (2013). The effect of word predictability on reading time is logarithmic. Cognition, 128(3), 302–319. https://doi.org/10.1016/j.cognition.2013.02.013
Staub, A., Grant, M., Astheimer, L., & Cohen, A. (2015). The influence of cloze probability and item constraint on cloze task response time. Journal of Memory and Language, 82, 1–17. https://doi.org/10.1016/J.JML.2015.02.004
Stone, K., Vasishth, S., & von der Malsburg, T. (2022). Does entropy modulate the prediction of German long-distance verb particles? PLOS ONE, 17(8), Article e0267813.
Swaab, T., Brown, C., & Hagoort, P. (2003). Understanding words in sentence contexts: The time course of ambiguity resolution. Brain and Language, 86(2), 326–343. https://doi.org/10.1016/S0093-934X(02)00547-3
Taylor, W. L. (1953). “Cloze procedure”: A new tool for measuring readability. Journalism Quarterly, 30(4), 415–433. https://doi.org/10.1177/107769905303000401
Traxler, M. J., & Foss, D. J. (2000). Effects of sentence constraint on priming in natural language comprehension. Journal of Experimental Psychology: Learning Memory and Cognition, 26(5), 1266–1282. https://doi.org/10.1037/0278-7393.26.5.1266
van Petten, C., & Luka, B. J. (2012). Prediction during language comprehension: Benefits, costs, and ERP components. International Journal of Psychophysiology, 83(2), 176–190. https://doi.org/10.1016/J.IJPSYCHO.2011.09.015
van Schijndel, M., Linzen, T. (2018). Can entropy explain successor surprisal effects in reading? In Proceedings of the society for computation in linguistics (SCiL).
van Berkum, J. J. A., Brown, C. M., Zwitserlood, P., Kooijman, V., & Hagoort, P. (2005). Anticipating upcoming words in discourse: Evidence from ERPs and reading times. Journal of Experimental Psychology: Learning Memory and Cognition, 31(3), 443–467. https://doi.org/10.1037/0278-7393.31.3.443
Vergara-Martínez, M., & Swaab, T. Y. (2012). Orthographic neighborhood effects as a function of word frequency: An event-related potential study. Psychophysiology, 49(9), 1277–1289.
Weissbart, H., Kandylaki, K. D., & Reichenbach, T. (2020). Cortical tracking of surprisal during continuous speech comprehension. Journal of Cognitive Neuroscience, 32(1), 155–166. https://doi.org/10.1162/JOCN_A_01467
Wong, R., Veldre, A., & Andrews, S. (2022). Are there independent effects of constraint and predictability on eye movements during reading? Journal of Experimental Psychology: Learning, Memory, and Cognition. https://doi.org/10.1037/xlm0001206
Zhuang, J., Tyler, L. K., Randall, B., Stamatakis, E. A., & Marslen-Wilson, W. D. (2014). Optimally efficient neural systems for processing spoken language. Cerebral Cortex, 24(4), 908–918. https://doi.org/10.1093/cercor/bhs366
Acknowledgements
We thank Trevor Brothers and Martin Pickering for providing feedback on this manuscript. We also thank Soran Malaie and Kathryn Walters for their help with data collection.
H.K. was supported by National Institutes on Aging of the National Institutes of Health under award number 1R15AG073945-01A1. The data, materials, and analysis scripts for all experiments are available at: https://osf.io/buwxs/.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix. Extra confirmatory analyses
Appendix. Extra confirmatory analyses
As mentioned above, in addition to the main analyses, we also performed extra analyses on the target region where the effect of entropy was significant to corroborate the main findings. The results of these analyses are briefly outlined in the main text. Here, we present these analyses in more detail. The statistical models corresponding to these analyses can be found in the supplementary materials.
-
1. Memory (session order) did not affect reading times on any of the critical regions.
Because participants who took part in the SPR session first were necessarily exposed to the target word, and because our main measure was reading times on the target word (and the following regions), we checked the potential effect of session order (SPR-first vs. Cloze-first; a proxy for memory carryover effects) on reading times as well as on the probability of guessing the target word during the cloze session. Importantly, session order did not have a significant effect on RTs (the primary dependent variable) on any of the critical regions (“target-1”: t = −.32, “target”: t = −.38, “target+1”: t = −.28, “target+2”: t = −.38, “target+3”: t = .07). However, session order did have a significant effect on the predictability of the target word, such that the probability of the target word being included in the response set was higher in the SPR-first relative to the Cloze-first session (46.4% vs. 44.1%, respectively; t = 2.45). The full results for these analyses are reported in the supplementary materials.
-
2. The effect of entropy was still significant when the probability values were ignored and only the number of produced responses was the predictor.
Because human participants may not be good at judging probabilities, we replaced entropy values with number of responses produced per trial. The results showed that the more responses were produced in a trial, the faster the reading times were on the target word on that trial (t = −2.84), indicating that the observed effect of entropy does not depend on the given probability values. Note that this result also indicates that the way the original probabilities were normalized was not critical for obtaining this effect.
-
3. The sole effect of entropy was significant (and facilitatory), ruling out the possibility that the observed effect of entropy may arise from multicollinearity between the predictors.
Because entropy and cloze probability were moderately correlated (−.41), including them in the same regression model could have caused issues due to multicollinearity. Thus, we ran a separate model predicting reading times on the target word with only entropy as the predictor. This model still showed a significant facilitatory effect of entropy (t = −2.54).
-
4. The entropy effect was independent of surprisal.
In another analysis, we replaced cloze probability with surprisal in our main original analyses to ensure that the effect of entropy is independent of surprisal. This model also still showed a significant facilitatory effect of entropy (t = −2.77).
-
5. The entropy effect was independent of semantic constraint.
We also replaced cloze probability with sentence constraint in our main models. This model also still showed a significant facilitatory effect of entropy (t = −3.04) on the target word, indicating that this effect is independent of sentential constraint too.
-
6. Entropy had similar effects when the target word was produced in the response set vs. when it was not.
Another concern was that participants who provided their cloze responses second, after the self-paced reading task, were slightly more likely to provide the target word in their response set (see above). To determine if a memory effect was driving our entropy results, we re-ran our original analyses on subsets of trials in which the critical word was (or was not) included in the response set. When the target was included within the responses, entropy had a non-significant effect on reading times on the target word (t = −1.68). However, when the target word was not present in the response set, the effect of trial entropy was significant (t = −2.46). Note that these results are obviously underpowered because they are based on less data points, and therefore the results should not be overinterpreted. In fact, the number of trials in which the target word was predicted was smaller than those in which the target was not predicted (10,600 vs. 12,816, respectively), which may explain the non-significant but trending effect of entropy within target-present trials. In any case, it appears that the facilitatory effect of trial-level entropy was not a result of our within-subject design. If anything, the observed effect of entropy seems to be slightly greater when participants excluded the target word in their response set during the cloze norming task, suggesting that memory retrieval did not drive the observed facilitatory effects.
-
7. The effect of entropy was independent of the length and frequency of the target word, as well as the words preceding the target, ruling out the possibility that the observed effect depends on target frequency or that it may reflect spillover effects from earlier regions.
Although the target words were identical across the three levels of semantic constraint, it is possible that the target word’s frequency might have still influenced reading times. We therefore performed two separate analyses in which the length and frequency of target words were added to trial entropy as predictors of reading times. These models showed that the effect of trial entropy remained significant (t = −2.53, t = −2.48, respectively). Moreover, even though the word immediately before the target (i.e., “target −1”) was always constant across all conditions, the words further preceding the target varied across constraint conditions (although an average of two words were constant across all items), it was still possible that the observed effect of entropy on the target could have originated in earlier regions and spilled over to the target region. As expected, the results revealed that both the length and frequency of these two words were significant predictors of reading times on the target. However, entropy remained a significant predictor both in the model including the length, as well as in the model including the frequency of the preceding words (t = −2.81, t = −2.49, respectively), demonstrating its independent influence on reading times on the target word. For all analyses reported in this section, we gathered data on the length and log-transformed subtitle frequency (“SUBTLWF”) of the relevant words from the English Lexicon Project (https://elexicon.wustl.edu/), and included them as predictors in the regression models. Because length and frequency are highly correlated, we conducted two separate analyses, one including the length of the relevant words and the other including their frequency.
-
8. The effect of trial entropy was in the same direction across the Cloze-first and SPR-first sessions.
Although the interaction between trial entropy and session order was not significant (see Table 4), we still further examined the effect of trial entropy across the two sessions. The results revealed that the effect of trial entropy was similar in both sessions. Specifically, the t values associated with the effect of trial entropy across the Cloze-first and SPR-first sessions were −1.76 and −2.44, respectively. Note that the nonsignificant effect of trial entropy in the Cloze-first sessions could simply be due to lack of power (recall that we had 54 participants in the Cloze-first order vs. 58 in the SPR-first order). Interestingly, the effect of cloze probability was not significant in SPR-first sessions. This could be because the range of cloze values obtained in the cloze sessions were more limited due to exposure to the target word in the SPR sessions.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Karimi, H., Weber, P. & Zinn, J. Information entropy facilitates (not impedes) lexical processing during language comprehension. Psychon Bull Rev (2024). https://doi.org/10.3758/s13423-024-02463-x
Accepted:
Published:
DOI: https://doi.org/10.3758/s13423-024-02463-x