Faces à la Carte: Text-to-Face Generation via Attribute Disentanglement

Abstract

Text-to-Face (TTF) synthesis is a challenging task with great potential for diverse computer vision applications. Compared to Text-to-Image (TTI) synthesis tasks, the textual description of faces can be much more complicated and detailed, owing to the variety of facial attributes and the difficulty of parsing high-dimensional, abstract natural language. In this paper, we propose a Text-to-Face model that not only produces high-resolution (1024×1024) images consistent with the input text, but also outputs multiple diverse faces that cover, in a natural way, a wide range of facial features the text leaves unspecified. By fine-tuning a multi-label classifier and an image encoder, our model obtains adjustment vectors and image embeddings, which are used to transform an input noise vector sampled from the normal distribution. The transformed noise vector is then fed into a pre-trained high-resolution image generator to produce a set of faces with the desired facial attributes. We refer to our model as TTF-HD. Experimental results show that TTF-HD generates high-quality synthesised faces from free-form text descriptions with state-of-the-art performance.
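
The abstract does not say how the noise vector is transformed, so the following is only a toy sketch of the general idea, not the authors' implementation. It assumes, purely for illustration, that each attribute corresponds to one latent axis, that a "classifier" scores an attribute by projecting onto that axis, and that an adjustment shifts the latent along the axis until the requested attribute is satisfied. The names `ATTRIBUTES`, `DIRECTIONS`, `classify`, `adjust`, and `sample_latents` are hypothetical, and the pre-trained image generator is not modelled.

```python
import random

# Hypothetical attribute set and latent size, chosen only for this toy example.
ATTRIBUTES = ["smiling", "eyeglasses", "beard"]
LATENT_DIM = 8
MARGIN = 1.0  # how strongly a requested attribute must be expressed

# Toy assumption: attribute i is controlled by latent axis i (a unit vector).
DIRECTIONS = {
    name: [1.0 if j == i else 0.0 for j in range(LATENT_DIM)]
    for i, name in enumerate(ATTRIBUTES)
}


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def classify(z):
    """Stand-in for a multi-label classifier: one signed score per attribute."""
    return {name: dot(DIRECTIONS[name], z) for name in ATTRIBUTES}


def adjust(z, targets):
    """Shift z along each requested attribute's axis until it meets the margin.

    Attributes missing from `targets` are left exactly as sampled.
    """
    z = list(z)
    for name, want in targets.items():
        sign = 1.0 if want else -1.0
        score = dot(DIRECTIONS[name], z)
        if score * sign < MARGIN:
            step = sign * MARGIN - score  # the adjustment along this axis
            z = [zi + step * di for zi, di in zip(z, DIRECTIONS[name])]
    return z


def sample_latents(targets, count, seed=0):
    """Draw `count` normal noise vectors and adjust each one to match `targets`."""
    rng = random.Random(seed)
    return [
        adjust([rng.gauss(0.0, 1.0) for _ in range(LATENT_DIM)], targets)
        for _ in range(count)
    ]


if __name__ == "__main__":
    # "A smiling person without a beard"; eyeglasses are left unspecified.
    for z in sample_latents({"smiling": True, "beard": False}, count=3):
        print({k: round(v, 2) for k, v in classify(z).items()})
```

In this sketch every sample satisfies the requested attributes, while the unspecified attribute (`eyeglasses`) keeps whatever value the random noise gave it. This mirrors, in a much simplified form, the abstract's point about producing several diverse faces for one description.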

Bibliographic details

Source
IEEE Winter Conference on Applications of Computer Vision | 2021 | pp. 3379-3387 | 9 pages
Authors
Tianren Wang; Teng Zhang; Brian Lovell
Original format: PDF
Keywords
Computer vision; Image resolution; Text categorization; Natural languages; Training data; Transforms; Generators
