LIMMITS'24: Multi-Speaker, Multi-Lingual INDIC TTS With Voice Cloning

Sathvik Udupa, Jesuraja Bandekar, Abhayjeet Singh, G. Deekshitha, Saurabh Kumar, Sandhya Badiger, Amala Nagireddi, R. Roopa, Prasanta Kumar Ghosh, Hema A. Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich

Research output: Contribution to journalArticlepeer-review

Abstract

The Multi-speaker, Multi-lingual Indic Text to Speech (TTS) with voice cloning (LIMMITS'24) challenge is organized as part of the ICASSP 2024 signal processing grand challenge. LIMMITS'24 aims at the development of voice cloning for the multi-speaker, multi-lingual Text-to-Speech (TTS) model. Towards this, 80 hours of TTS data has been released in each of Bengali, Chhattisgarhi, English (Indian), and Kannada languages. This is in addition to Telugu, Hindi, and Marathi data released during the LIMMITS'23 challenge. The challenge encourages the advancement of TTS in Indian Languages as well as the development of multi-speaker voice cloning techniques for TTS. The three tracks of LIMMITS'24 have provided an opportunity for various researchers and practitioners around the world to explore the state of the art in research for voice cloning with TTS.

Original languageEnglish (US)
Pages (from-to)293-302
Number of pages10
JournalIEEE Open Journal of Signal Processing
Volume6
DOIs
StatePublished - 2025

Keywords

  • cross-lingual synthesis
  • multi-lingual TTS
  • multi-speaker
  • speech synthesis
  • voice cloning
  • Speech synthesis

ASJC Scopus subject areas

  • Signal Processing

Fingerprint

Dive into the research topics of 'LIMMITS'24: Multi-Speaker, Multi-Lingual INDIC TTS With Voice Cloning'. Together they form a unique fingerprint.

Cite this