TY - GEN
T1 - A spectro-temporal glimpsing index (STGI) for speech intelligibility prediction
AU - Edraki, Amin
AU - Chan, Wai Yip
AU - Jensen, Jesper
AU - Fogerty, Daniel
N1 - Publisher Copyright:
Copyright © 2021 ISCA.
PY - 2021
Y1 - 2021
N2 - We propose a monaural intrusive speech intelligibility prediction (SIP) algorithm called STGI based on detecting glimpses in short-time segments in a spectro-temporal modulation decomposition of the input speech signals. Unlike existing glimpse-based SIP methods, the application of STGI is not limited to additive uncorrelated noise; STGI can be employed in a broad range of degradation conditions. Our results show that STGI performs consistently well across 15 datasets covering degradation conditions including modulated noise, noise reduction processing, reverberation, near-end listening enhancement, checkerboard noise, and gated noise.
AB - We propose a monaural intrusive speech intelligibility prediction (SIP) algorithm called STGI based on detecting glimpses in short-time segments in a spectro-temporal modulation decomposition of the input speech signals. Unlike existing glimpse-based SIP methods, the application of STGI is not limited to additive uncorrelated noise; STGI can be employed in a broad range of degradation conditions. Our results show that STGI performs consistently well across 15 datasets covering degradation conditions including modulated noise, noise reduction processing, reverberation, near-end listening enhancement, checkerboard noise, and gated noise.
KW - Glimpsing
KW - Spectro-temporal modulation
KW - Speech intelligibility
KW - Speech quality model
UR - http://www.scopus.com/inward/record.url?scp=85119186065&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119186065&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2021-605
DO - 10.21437/Interspeech.2021-605
M3 - Conference contribution
AN - SCOPUS:85119186065
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 2738
EP - 2742
BT - 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
PB - International Speech Communication Association
T2 - 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
Y2 - 30 August 2021 through 3 September 2021
ER -