IEEE/CVF Conference on Computer Vision and Pattern Recognition

Towards Unsupervised Learning of Generative Models for 3D Controllable Image Synthesis



Abstract

In recent years, Generative Adversarial Networks have achieved impressive results in photorealistic image synthesis. This progress nurtures hopes that one day the classical rendering pipeline can be replaced by efficient models that are learned directly from images. However, current image synthesis models operate in the 2D domain, where disentangling 3D properties such as camera viewpoint or object pose is challenging. Furthermore, they lack an interpretable and controllable representation. Our key hypothesis is that the image generation process should be modeled in 3D space, as the physical world surrounding us is intrinsically three-dimensional. We define the new task of 3D controllable image synthesis and propose an approach for solving it by reasoning both in 3D space and in the 2D image domain. We demonstrate that our model is able to disentangle the latent 3D factors of simple multi-object scenes in an unsupervised fashion from raw images. Compared to pure 2D baselines, it allows for synthesizing scenes that are consistent with respect to changes in viewpoint or object pose. We further evaluate various 3D representations in terms of their usefulness for this challenging task.
