Document Type

Article

Publication Date

11-10-2014

Department

Engineering

Keywords

teleconferencing, acoustics, MATLAB, acoustic arrays, estimation, vectors, transforms

Abstract

Efficient sound source detection and localization with microphone arrays is important for many applications, including teleconferencing, surveillance, and smart rooms. While steered response power algorithms exhibit robust performance relative to other approaches, their application is limited by their high computational load. In dynamic auditory scenes, moving sound sources switch between active and inactive states, so the entire space must be scanned at regular intervals. This paper introduces a time segmentation and parallelization strategy to speed up the steered response power algorithm for dynamic auditory scenes with multiple speech sources. The primary applications targeted by this work are immersive arrays and off-line auditory scene analysis with beamforming for speaker separation in cocktail party environments. Results from a Monte Carlo simulation with 6 speech sources in a mildly reverberant environment demonstrate a speed-up factor of 45, with a modest loss in the number of detections and a significant reduction in anomalous detections. Experimental results with real recordings demonstrate performance consistent with that of the simulation.

Comments

© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Source Publication Title

IEEE SOUTHEASTCON 2014

Publisher

IEEE

First Page

1

DOI

10.1109/SECON.2014.6950750
