A survey of multimodal deep generative models

Suzuki Masahiro; Matsuo Yutaka

Abstract

Multimodal learning is a framework for building models that make predictions from multiple types of modalities. Two important challenges in multimodal learning are the inference of shared representations from arbitrary modalities and cross-modal generation via these representations; meeting both requires taking the heterogeneous nature of multimodal data into account. In recent years, deep generative models, i.e., generative models whose distributions are parameterized by deep neural networks, have attracted much attention. Variational autoencoders in particular are well suited to these challenges because they can account for heterogeneity and infer good representations of data. Consequently, various multimodal generative models based on variational autoencoders, called multimodal deep generative models, have been proposed. In this paper, we provide a categorized survey of studies on multimodal deep generative models.

Bibliographic details

Source
Advanced Robotics: The International Journal of the Robotics Society of Japan, 2022, No. 6, pp. 261-278 (18 pages)
Authors
Suzuki Masahiro; Matsuo Yutaka
Affiliation
Univ Tokyo
Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Robotics
Keywords
Deep generative models; multimodal learning; FUSION; REPRESENTATION
