TY - GEN
T1 - Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages
T2 - 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018
AU - Scharenborg, Odette
AU - Besacier, Laurent
AU - Black, Alan
AU - Hasegawa-Johnson, Mark
AU - Metze, Florian
AU - Neubig, Graham
AU - Stuker, Sebastian
AU - Godard, Pierre
AU - Muller, Markus
AU - Ondel, Lucas
AU - Palaskar, Shruti
AU - Arthur, Philip
AU - Ciannella, Francesco
AU - Du, Mingxing
AU - Larsen, Elin
AU - Merkx, Danny
AU - Riad, Rachid
AU - Wang, Liming
AU - Dupoux, Emmanuel
N1 - \u2217Corresponding author: [email protected] \u2020The work reported here was started at JSALT 2017 in CMU, Pittsburgh, and was supported by JHU and CMU via grants from Google, Microsoft, Amazon, Facebook, Apple. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by NSF grant number OCI-1053575. Specifically, it used the Bridges system, which is supported by NSF award number ACI-1445606, at the Pittsburgh Supercomputing Center (PSC). OS was partially supported by a Vidi-grant from NWO (276-89-003). PG was funded by the French ANR and the German DFG under grant ANR-14-CE35-0002 (BULB project). MD, EL, RR and ED were funded by the European Research Council (ERC-2011-AdG-295810 BOOT-PHON), and ANR-10-LABX-0087 IEC and ANR-10-IDEX-0001-02 PSL*.
PY - 2018/9/10
Y1 - 2018/9/10
N2 - We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.
AB - We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.
KW - Image retrieval
KW - Machine translation
KW - Multi-modal data
KW - Unsupervised unit discovery
KW - Unwritten languages
UR - http://www.scopus.com/inward/record.url?scp=85054264895&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054264895&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2018.8461761
DO - 10.1109/ICASSP.2018.8461761
M3 - Conference contribution
AN - SCOPUS:85054264895
SN - 9781538646588
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4979
EP - 4983
BT - 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 April 2018 through 20 April 2018
ER -