当前位置: X-MOL 学术arXiv.cs.SD › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Auralization based on multi-perspective ambisonic room impulse responses
arXiv - CS - Sound Pub Date : 2023-12-05 , DOI: arxiv-2312.02581
Kaspar Müller, Franz Zotter

Most often, virtual acoustic rendering employs real-time updated room acoustic simulations to accomplish auralization for a variable listener perspective. As an alternative, we propose and test a technique to interpolate room impulse responses, specifically Ambisonic room impulse responses (ARIRs) available at a grid of spatially distributed receiver perspectives, measured or simulated in a desired acoustic environment. In particular, we extrapolate a triplet of neighboring ARIRs to the variable listener perspective, preceding their linear interpolation. The extrapolation is achieved by decomposing each ARIR into localized sound events and re-assigning their direction, time, and level to what could be observed at the listener perspective, with as much temporal, directional, and perspective context as possible. We propose to undertake this decomposition in two levels: Peaks in the early ARIRs are decomposed into jointly localized sound events, based on time differences of arrival observed in either an ARIR triplet, or all ARIRs observing the direct sound. Sound events that could not be jointly localized are treated as residuals whose less precise localization utilizes direction-of-arrival detection and the estimated time of arrival. For the interpolated rendering, suitable parameter settings are found by evaluating the proposed method in a listening experiment, using both measured and simulated ARIR data sets, under static and time-varying conditions.

中文翻译:

基于多视角立体混响房间脉冲响应的可听化

最常见的是,虚拟声学渲染采用实时更新的房间声学模拟来实现可变听众视角的可听化。作为替代方案,我们提出并测试了一种内插房间脉冲响应的技术,特别是在空间分布的接收器视角网格上可用的高保真度立体声响复制房间脉冲响应(ARIR),在所需的声学环境中测量或模拟。特别是,我们在线性插值之前将相邻 ARIR 的三元组外推到可变收听者视角。外推是通过将每个 ARIR 分解为局部声音事件并将其方向、时间和电平重新分配到听众视角下可以观察到的内容来实现的,并具有尽可能多的时间、方向和视角上下文。我们建议在两个层面上进行这种分解:根据在 ARIR 三元组或观察直达声音的所有 ARIR 中观察到的到达时间差,将早期 ARIR 中的峰值分解为联合定位的声音事件。无法联合定位的声音事件被视为残差,其不太精确的定位利用到达方向检测和估计到达时间。对于插值渲染,通过在静态和时变条件下使用测量的和模拟的 ARIR 数据集在听音实验中评估所提出的方法来找到合适的参数设置。
更新日期:2023-12-06
down
wechat
bug