Unsupervised speaker identification for TV news

机译：电视新闻的无监督说话人识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cable, satellite, and broadcast television (TV) networks produce a tremendous amount of information every day. Identifying the speaker throughout a video at specific times would be useful. Previous research has identified speakers on pre-trained faces for TV shows and movies. News videos are challenging because new faces often appear. By using an unsupervised clustering algorithm, this paper proposes to label speakers using just the available information in the news video without external information. Our proposed framework segments the audio by speaker, parses closed captions to identify possible names of speakers, identifies talking persons, performs optical character recognition on text that appears while a person speaks, and checks if a name appears on screen during a speaker's audio segments. Our framework utilizes face detection, face recognition, face clustering, face landmarking, natural language processing tools, parsing rules, and speaker diarization. Our results indicate 63.6% accuracy for identifying speakers for CNN news.

机译：有线，卫星和广播电视（TV）网络每天都会产生大量信息。在特定时间确定整个视频中的讲话者会很有用。先前的研究已经确定了在预训练过的电视节目和电影中面部的说话者。新闻视频具有挑战性，因为经常会出现新面孔。通过使用一种无监督的聚类算法，本文建议仅使用新闻视频中的可用信息来标记发言人，而无需外部信息。我们提议的框架按讲话者对音频进行细分，解析隐藏式字幕以识别讲话者的可能姓名，识别讲话者，对讲话者说话时出现的文本进行视觉字符识别以及检查在讲话者的音频片段中屏幕上是否出现名字。我们的框架利用人脸检测，人脸识别，人脸聚类，人脸地标，自然语言处理工具，解析规则和说话人区分。我们的结果表明，识别CNN新闻发言人的准确性为63.6％。

著录项

作者
Woo, Daniel N.;
展开▼
作者单位

The University of Alabama in Huntsville.;

展开▼
授予单位 The University of Alabama in Huntsville.;
学科 Computer science.;Computer engineering.;Mass communication.
学位 M.S.
年度 2014
页码 69 p.
总页数 69
原文格式 PDF
正文语种 eng
中图分类 TS97-4;
关键词

相似文献

外文文献
中文文献
专利

1. Unsupervised Speaker Identification for TV News [J] . Daniel N. Woo, Ramazan S. Aygün IEEE multimedia . 2016,第4期

机译：电视新闻的无监督说话人识别
2. Unsupervised Speaker Identification in TV Broadcast Based on Written Names [J] . Poignant J., Besacier L., Quenot G. Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2015,第1期

机译：基于书面姓名的电视广播中无监督说话人识别
3. Lexical speaker identification in TV shows [J] . Anindya Roy, Herve Bredin, William Hartmann, Multimedia Tools and Applications . 2015,第4期

机译：电视节目中的词汇说话者识别
4. Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast [C] . Johann Poignant, Hervi Bredin, Viet Bac Le, Annual conference of the International Speech Communication Association . 2012

机译：在电视广播中使用叠加文本进行无监督的说话人识别
5. The role of television news leads in learning from television news: The effect of anxiety-inducing leads and the lead as advance organizer on attention and memory for the news. [D] . Kelley, Mark Alan. 2004

机译：电视新闻线索在电视新闻学习中的作用：引起焦虑的线索以及作为高级组织者的线索对新闻的关注和记忆的作用。
6. Comparing Local TV News with National TV News in Cancer Coverage: An Exploratory Content Analysis [O] . Chul-joo Lee, Marilee Long, Michael D. Slater, -1

机译：地方电视新闻与国家电视新闻在癌症报道方面的比较：探索性内容分析
7. Unsupervised Speaker Identification in TV Broadcast Based on Written Names [O] . Johann Poignant, Laurent Besacier, Georges Quenot 2014

机译：基于书面名称的电视广播中无监督的扬声器识别
8. Supervised and Unsupervised Speaker Adaptation in the NIST 2005 Speaker Recognition Evaluation [R] . Hansen, E. G. , Slyh, R. E. , Anderson, T. R. 2006

机译：NIsT 2005演讲者识别评估中的监督和无监督演讲者适应

Unsupervised speaker identification for TV news

摘要

著录项

相似文献

相关主题

期刊订阅