TY - JOUR
T1 - Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms
AU - Lockwood, Michael E.
AU - Jones, Douglas L.
AU - Bilger, Robert C.
AU - Lansing, Charissa R.
AU - O'Brien, William D.
AU - Wheeler, Bruce C.
AU - Feng, Albert S.
PY - 2004/1
Y1 - 2004/1
N2 - Extraction of a target sound source amidst multiple interfering sound sources is difficult when there are fewer sensors than sources, as is the case for human listeners in the classic cocktail-party situation. This study compares the signal extraction performance of five algorithms using recordings of speech sources made with three different two-microphone arrays in three rooms of varying reverberation time. Test signals, consisting of two to five speech sources, were constructed for each room and array. The signals were processed with each algorithm, and the signal extraction performance was quantified by calculating the signal-to-noise ratio of the output. A frequency-domain minimum-variance distortionless-response beamformer outperformed the time-domain based Frost beamformer and generalized sidelobe canceler for all tests with two or more interfering sound sources, and performed comparably or better than the time-domain algorithms for tests with one interfering sound source. The frequency-domain minimum-variance algorithm offered performance comparable to that of the Peissig-Kollmeier binaural frequency-domain algorithm, but with much less distortion of the target signal. Comparisons were also made to a simple beamformer. In addition, computer simulations illustrate that, when processing speech signals, the chosen implementation of the frequency-domain minimum-variance technique adapts more quickly and accurately than time-domain techniques.
AB - Extraction of a target sound source amidst multiple interfering sound sources is difficult when there are fewer sensors than sources, as is the case for human listeners in the classic cocktail-party situation. This study compares the signal extraction performance of five algorithms using recordings of speech sources made with three different two-microphone arrays in three rooms of varying reverberation time. Test signals, consisting of two to five speech sources, were constructed for each room and array. The signals were processed with each algorithm, and the signal extraction performance was quantified by calculating the signal-to-noise ratio of the output. A frequency-domain minimum-variance distortionless-response beamformer outperformed the time-domain based Frost beamformer and generalized sidelobe canceler for all tests with two or more interfering sound sources, and performed comparably or better than the time-domain algorithms for tests with one interfering sound source. The frequency-domain minimum-variance algorithm offered performance comparable to that of the Peissig-Kollmeier binaural frequency-domain algorithm, but with much less distortion of the target signal. Comparisons were also made to a simple beamformer. In addition, computer simulations illustrate that, when processing speech signals, the chosen implementation of the frequency-domain minimum-variance technique adapts more quickly and accurately than time-domain techniques.
UR - http://www.scopus.com/inward/record.url?scp=1642580515&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=1642580515&partnerID=8YFLogxK
U2 - 10.1121/1.1624064
DO - 10.1121/1.1624064
M3 - Article
C2 - 14759029
AN - SCOPUS:1642580515
SN - 0001-4966
VL - 115
SP - 379
EP - 391
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 1
ER -