Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

https://unified-io-2.allenai.org/

Feb 15, 2024

Unified Frameworks, Vision and Language,

arXiv (2023)

概要

新規性・差分

アイデア

結果

一覧へ戻る