LIMMITS'24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning

Abhayjeet Singh, Amala Nagireddi, G. Deekshitha, Jesuraja Bandekar, R. Roopa, Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The Multi-speaker, Multi-lingual Indic TTS with voice cloning (LIMMITS'24) challenge is organized as part of the ICASSP 2024 signal processing grand challenge. LIMMITS'24 aims at the development of voice cloning for multi-speaker, multi-lingual Text-to-Speech (TTS) model. Towards this, 80 hours of TTS data has been released in each of Bengali, Chhattisgarhi, English (Indian), and Kannada languages. This is in addition to Telugu, Hindi, and Marathi data released in the LIMMITS'23. The challenge encourages the advancement of TTS in Indian Languages as well as the development of multi-speaker voice cloning techniques for TTS. The three tracks of LIMMITS'24 have provided an opportunity for various researchers and practitioners around the world to explore the state of the art in TTS research.

Original languageEnglish (US)
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages61-62
Number of pages2
ISBN (Electronic)9798350374513
DOIs
StatePublished - 2024
Event49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, Korea, Republic of
Duration: Apr 14 2024Apr 19 2024

Publication series

Name2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings

Conference

Conference49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024
Country/TerritoryKorea, Republic of
CitySeoul
Period4/14/244/19/24

Keywords

  • cross-lingual synthesis
  • multi-lingual TTS
  • multi-speaker
  • speech synthesis
  • voice cloning

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Signal Processing
  • Media Technology
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'LIMMITS'24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning'. Together they form a unique fingerprint.

Cite this