Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi

Research output: Contribution to journal, conference article, peer-reviewed
