J4 ›› 2015, Vol. 42 ›› Issue (1): 168-173.doi: 10.3969/j.issn.1001-2400.2015.01.027

• 研究论文 • 上一篇    下一篇

利用多点互相关值均值的实时声源定位算法

张一闻1;刘建平2;张世全2   

  1. (1. 武警工程大学 信息工程系,陕西 西安  710086;
    2. 武警工程大学 理学院,陕西 西安  710086)
  • 收稿日期:2013-10-07 出版日期:2015-02-20 发布日期:2015-04-14
  • 作者简介:张一闻(1978-),男,副教授,博士,E-mail:zevens@163.com .
  • 基金资助:

    国家自然科学基金资助项目(60940007,61072034)

Real-time acoustic source localization algorithm based on the average of multiple cross correlation values

ZHANG Yiwen1;LIU Jianping2;ZHANG Shiquan2   

  1. (1. Dept. of Information Engineering, Engineering Univ. of CAPF, Xi'an  710086, China;
    2. School of Science, Engineering Univ. of CAPF, Xi'an  710086, China)
  • Received:2013-10-07 Online:2015-02-20 Published:2015-04-14

摘要:

基于可控响应功率的麦克风阵列声源定位算法在噪声和混响环境中具有较好的鲁棒性,但是利用网格搜索方法寻找可控响应功率的最大值计算复杂度很高.故提出一种新的实时声源定位算法,使用网格中多互相关值均值代替单点的可控响应功率值作为代价函数,通过多级区域收缩方法实现了代价函数最大值的空间搜索,并给出一种不显著增加计算量的互相关值求和范围的计算方式.由于利用了网格中更多的空间采样信息,因此可以使用更大尺度的搜索网格,从而降低了整个搜索过程的计算复杂度.

关键词: 可控响应功率, 麦克风阵列, 声源定位, 互相关, 多级区域收缩

Abstract:

The Steered Response Power(SRP)-based acoustic source localization algorithm with microphone arrays has been shown to be robust in noisy and reverberant environments. However, grid-search methods used to find a global maximum of the SRP image are computationally-intensive. In this paper, we propose a different strategy where, instead of the SRP at discrete spatial positions, the average of multiple Cross-Correlation(CC) values is evaluated as a cost function. The multi-stage region contraction is then used to find the maximum of the cost function to reduce the computational burden. The determination of CC accumulation limits does not increase the amount of calculation significantly. For incorporating additional spatial knowledge at each search grid, the proposed algorithm allows for a coarser spatial grid and reduces the computational cost with almost no loss of accuracy. Experiments carried out under different acoustic conditions confirm the validity of the proposed approach.

Key words: steered response power, microphone arrays, acoustic source localization, cross correlation, multi-stage region contraction