TY - JOUR
T1 - Factors influencing the minimum audible change in talker head orientation cues using diotic stimuli
AU - Moriarty, Brendan T.
AU - Ananthanarayana, Rohit M.
AU - Monson, Brian B.
N1 - Publisher Copyright:
© 2024 Acoustical Society of America.
PY - 2024/8/1
Y1 - 2024/8/1
N2 - The perception of a talker's head orientation is an ecologically relevant task. Humans are able to discriminate changes in talker head orientation using acoustic cues. Factors that may influence measures of this ability have not been well characterized. Here, we examined the minimum audible change in head orientation cues (MACHO) using diotic stimuli. The effects of several factors were tested: talker and gender, stimulus bandwidth (full-band vs low-pass filtered at 8 or 10 kHz), transducer (loudspeaker vs headphone), stimulus uncertainty (interleaved vs blocked presentation of four talkers), and vocal production mode (speech vs singing). The best performance of ∼41° was achieved for full-band, blocked presentation of speech over a loudspeaker. Greater stimulus uncertainty (interleaved presentation) worsened the MACHO by 26%. Bandlimiting at 8 and 10 kHz worsened performance by an additional 22% and 14%, respectively. At equivalent overall sound levels, performance was better for speech than for singing. There was some limited evidence for the transducer influencing the MACHO. These findings suggest the MACHO relies on multiple factors manipulated here. One of the largest, consistent effects was that of talker, suggesting head orientation cues are highly dependent on individual talker characteristics. This may be due to individual variability in speech directivity patterns.
AB - The perception of a talker's head orientation is an ecologically relevant task. Humans are able to discriminate changes in talker head orientation using acoustic cues. Factors that may influence measures of this ability have not been well characterized. Here, we examined the minimum audible change in head orientation cues (MACHO) using diotic stimuli. The effects of several factors were tested: talker and gender, stimulus bandwidth (full-band vs low-pass filtered at 8 or 10 kHz), transducer (loudspeaker vs headphone), stimulus uncertainty (interleaved vs blocked presentation of four talkers), and vocal production mode (speech vs singing). The best performance of ∼41° was achieved for full-band, blocked presentation of speech over a loudspeaker. Greater stimulus uncertainty (interleaved presentation) worsened the MACHO by 26%. Bandlimiting at 8 and 10 kHz worsened performance by an additional 22% and 14%, respectively. At equivalent overall sound levels, performance was better for speech than for singing. There was some limited evidence for the transducer influencing the MACHO. These findings suggest the MACHO relies on multiple factors manipulated here. One of the largest, consistent effects was that of talker, suggesting head orientation cues are highly dependent on individual talker characteristics. This may be due to individual variability in speech directivity patterns.
UR - http://www.scopus.com/inward/record.url?scp=85200708857&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85200708857&partnerID=8YFLogxK
U2 - 10.1121/10.0028119
DO - 10.1121/10.0028119
M3 - Article
C2 - 39105574
AN - SCOPUS:85200708857
SN - 0001-4966
VL - 156
SP - 763
EP - 773
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 2
ER -