当前位置: X-MOL 学术Speech Commun. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Comparing Levenshtein distance and dynamic time warping in predicting listeners’ judgments of accent distance
Speech Communication ( IF 3.2 ) Pub Date : 2023-09-21 , DOI: 10.1016/j.specom.2023.102987
Holly C. Lind-Combs , Tessa Bent , Rachael F. Holt , Cynthia G. Clopper , Emma Brown

Listeners attend to variation in segmental and prosodic cues when judging accent strength. The relative contributions of these cues to perceptions of accentedness in English remains open for investigation, although objective accent distance measures (such as Levenshtein distance) appear to be reliable tools for predicting perceptual distance. Levenshtein distance, however, only accounts for phonemic information in the signal. The purpose of the current study was to examine the relative contributions of phonemic (Levenshtein) and holistic acoustic (dynamic time warping) distances from the local accent to listeners’ accent rankings for nine non-local native and nonnative accents. Listeners (n = 52) ranked talkers on perceived distance from the local accent (Midland American English) using a ladder task for three sentence-length stimuli. Phonemic and holistic acoustic distances between Midland American English and the other accents were quantified using both weighted and unweighted Levenshtein distance measures, and dynamic time warping (DTW). Results reveal that all three metrics contribute to perceived accent distance, with the weighted Levenshtein slightly outperforming the other measures. Moreover, the relative contribution of phonemic and holistic acoustic cues was driven by the speaker's accent. Both nonnative and non-local native accents were included in this study, and the benefits of considering both of these accent groups in studying phonemic and acoustic cues used by listeners is discussed.



中文翻译:

比较编辑距离和动态时间扭曲在预测听众对口音距离的判断中的作用

听众在判断口音强度时会注意片段和韵律线索的变化。尽管客观的口音距离测量(例如编辑距离)似乎是预测感知距离的可靠工具,但这些线索对英语口音感知的相对贡献仍有待研究。然而,编辑距离仅考虑信号中的音素信息。当前研究的目的是检查音素(Levenshtein)和整体声学(动态时间扭曲)距离从当地口音到九种非本地本地口音和非本地口音的听众口音排名的相对贡献。听众 (n = 52) 使用梯子任务对三个句子长度的刺激,根据与当地口音(米德兰美式英语)的感知距离对说话者进行排名。使用加权和未加权的 Levenshtein 距离测量以及动态时间扭曲 (DTW) 来量化米德兰美式英语和其他口音之间的音位和整体声学距离。结果显示,所有三个指标都有助于感知口音距离,其中加权 Levenshtein 的表现略优于其他指标。此外,音素和整体声学线索的相对贡献是由说话者的口音驱动的。本研究包括非本地口音和非本地本地口音,并讨论了在研究听众使用的音素和声学线索时考虑这两个口音群体的好处。和动态时间扭曲(DTW)。结果显示,所有三个指标都有助于感知口音距离,其中加权 Levenshtein 的表现略优于其他指标。此外,音素和整体声学线索的相对贡献是由说话者的口音驱动的。本研究包括非本地口音和非本地本地口音,并讨论了在研究听众使用的音素和声学线索时考虑这两个口音群体的好处。和动态时间扭曲(DTW)。结果显示,所有三个指标都有助于感知口音距离,其中加权 Levenshtein 的表现略优于其他指标。此外,音素和整体声学线索的相对贡献是由说话者的口音驱动的。本研究包括非本地口音和非本地本地口音,并讨论了在研究听众使用的音素和声学线索时考虑这两个口音群体的好处。

更新日期:2023-09-22
down
wechat
bug