论文标题
基于双重RTF-vector的基于相干的频率子集选择基于多扬声器的到达估算方向
Coherence-Based Frequency Subset Selection For Binaural RTF-Vector-Based Direction of Arrival Estimation for Multiple Speakers
论文作者
论文摘要
最近,已经提出了一种方法来估计单个扬声器的到达方向(DOA),方法是通过最大程度地减少估计的相对传递函数(RTF)向量(RTF)矢量和原型室内rtf矢量数据库之间的频率平均均值。在本文中,我们通过引入频率平均的Hermitian角度光谱并选择该空间频谱的峰来扩展到多演讲者的定位。为了构建hermitian角度频谱,我们仅考虑一部分频率,其中一个说话者可能是主导的。我们将广义幅度平方相干性和两个相干与扩散比(CDR)估计器作为频率选择标准的有效性。使用双耳听力设备在混响环境中估算带有分散的Babble噪声的两个扬声器DOA的仿真结果表明,使用基于双耳有效固定的CDR估计作为频率选择标准,可产生最佳性能。
Recently, a method has been proposed to estimate the direction of arrival (DOA) of a single speaker by minimizing the frequency-averaged Hermitian angle between an estimated relative transfer function (RTF) vector and a database of prototype anechoic RTF vectors. In this paper, we extend this method to multi-speaker localization by introducing the frequency-averaged Hermitian angle spectrum and selecting peaks of this spatial spectrum. To construct the Hermitian angle spectrum, we consider only a subset of frequencies, where it is likely that one speaker is dominant. We compare the effectiveness of the generalized magnitude squared coherence and two coherent-to-diffuse ratio (CDR) estimators as frequency selection criteria. Simulation results for estimating the DOAs of two speakers in a reverberant environment with diffuse-like babble noise using binaural hearing devices show that using the binaural effective-coherence-based CDR estimate as a frequency selection criterion yields the best performance.
