TY - GEN
T1 - Sudo RM -RF
T2 - 30th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2020
AU - Tzinis, Efthymios
AU - Wang, Zhepei
AU - Smaragdis, Paris
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/9
Y1 - 2020/9
N2 - In this paper, we present an efficient neural network for end-to-end general purpose audio source separation. Specifically, the backbone structure of this convolutional network is the SUccessive DOwnsampling and Resampling of Multi-Resolution Features (SuDoRM-RF) as well as their aggregation which is performed through simple one-dimensional convolutions. In this way, we are able to obtain high quality audio source separation with limited number of floating point operations, memory requirements, number of parameters and latency. Our experiments on both speech and environmental sound separation datasets show that SuDoRM - RF performs comparably and even surpasses various state-of-the-art approaches with significantly higher computational resource requirements.
AB - In this paper, we present an efficient neural network for end-to-end general purpose audio source separation. Specifically, the backbone structure of this convolutional network is the SUccessive DOwnsampling and Resampling of Multi-Resolution Features (SuDoRM-RF) as well as their aggregation which is performed through simple one-dimensional convolutions. In this way, we are able to obtain high quality audio source separation with limited number of floating point operations, memory requirements, number of parameters and latency. Our experiments on both speech and environmental sound separation datasets show that SuDoRM - RF performs comparably and even surpasses various state-of-the-art approaches with significantly higher computational resource requirements.
KW - Audio source separation
KW - Deep learning
KW - Low-cost neural networks
UR - http://www.scopus.com/inward/record.url?scp=85096506405&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096506405&partnerID=8YFLogxK
U2 - 10.1109/MLSP49062.2020.9231900
DO - 10.1109/MLSP49062.2020.9231900
M3 - Conference contribution
AN - SCOPUS:85096506405
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - Proceedings of the 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing, MLSP 2020
PB - IEEE Computer Society
Y2 - 21 September 2020 through 24 September 2020
ER -