
Video modeling and learning on Riemannian manifold for emotion recognition in the wild


Abstract

In this paper, we present the method behind our submission to the Emotion Recognition in the Wild challenge (EmotiW). The challenge is to automatically classify the emotions acted out by human subjects in video clips recorded in real-world environments. In our method, each video clip is represented by three types of image set models (i.e. linear subspace, covariance matrix, and Gaussian distribution), each of which can be viewed as a point residing on some Riemannian manifold. Different Riemannian kernels are then employed on these set models for similarity/distance measurement. For classification, three types of classifiers, i.e. kernel SVM, logistic regression, and partial least squares, are investigated for comparison. Finally, an optimal fusion of classifiers learned from different kernels and different modalities (video and audio) is conducted at the decision level to further boost performance. We perform extensive evaluations on the EmotiW 2014 challenge data (including the validation set and the blind test set) and assess the effects of the different components in our pipeline. Our method achieves the best performance reported so far. To further evaluate generalization ability, we also perform experiments on the EmotiW 2013 data and on two well-known lab-controlled databases, CK+ and MMI. The results show that the proposed framework significantly outperforms state-of-the-art methods.
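To make the pipeline described in the abstract concrete, the following is a minimal sketch (not the authors' code) of one of its building blocks: representing a clip as a covariance-matrix set model and classifying with a Log-Euclidean RBF kernel SVM, one common choice of Riemannian kernel on SPD matrices. The feature dimensions, the regularization term, and the kernel parameter gamma are illustrative assumptions.

```python
import numpy as np
from scipy.linalg import logm
from sklearn.svm import SVC

def set_to_spd(frame_features, eps=1e-3):
    """Covariance-matrix set model of one clip: (frames x feature_dim) -> SPD matrix."""
    cov = np.cov(frame_features, rowvar=False)
    return cov + eps * np.eye(cov.shape[0])  # small ridge keeps the matrix strictly SPD

def log_euclidean_kernel(spd_list, gamma=0.1):
    """Log-Euclidean RBF kernel: flatten the matrix logarithms, then apply a Gaussian kernel."""
    logs = np.stack([logm(S).real.ravel() for S in spd_list])
    sq_dists = ((logs[:, None, :] - logs[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * sq_dists)

# Toy usage: 10 clips, 30 frames each, 20-dim per-frame features, binary labels.
rng = np.random.default_rng(0)
clips = [rng.normal(size=(30, 20)) for _ in range(10)]
labels = np.array([0, 1] * 5)

K = log_euclidean_kernel([set_to_spd(c) for c in clips])
clf = SVC(kernel="precomputed").fit(K, labels)
print(clf.predict(K))
```

In the paper's setting, the per-frame features would come from detected faces rather than random data, and analogous kernels for the linear-subspace and Gaussian set models, plus the other classifiers and the decision-level fusion, would sit on top of this step.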

Bibliographic record

  • Source
    Journal on multimodal user interfaces | 2016, Issue 2 | pp. 113-124 | 12 pages
  • Author affiliation

    Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Proc, 6 Kexueyuan South Rd, Beijing 100190, Peoples R China

  • Indexing information
  • Original format: PDF
  • Language of text: eng
  • CLC classification
  • Keywords

    Emotion recognition; Video modeling; Riemannian manifold; EmotiW challenge;

