【24h】

Topic Indexing of TV Broadcast News Programs

机译:电视广播新闻节目的主题索引

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a topic segmentation and indexation system for TV broadcast news programs spoken in European Portuguese. The system is integrated in an alert system for selective dissemination of multimedia information developed in the scope of an European Project. The goal of this work is to enhance the retrieval of specific spoken documents that have been automatically transcribed, using speech recognition. Our segmentation algorithm is based on simple heuristics related with anchor detection. The indexation is based on hierarchical concept trees (thesaurus), containing 22 main thematic domains, for which Hidden Markov models and topic language models were created. On-going experiments related to multiple topic indexing are also described, where a confidence measure based on the likelihood ratio test is used as the hypothesis test.
机译:本文介绍了以欧洲葡萄牙语说的电视广播新闻节目的主题细分和索引系统。该系统集成在一个警报系统中,用于选择性地传播在欧洲项目范围内开发的多媒体信息。这项工作的目的是使用语音识别来增强对已自动转录的特定语音文档的检索。我们的分割算法基于与锚点检测相关的简单启发式算法。该索引基于分层概念树(同义词库),包含22个主要主题领域,为此创建了隐马尔可夫模型和主题语言模型。还描述了与多个主题索引相关的正在进行的实验,其中基于似然比检验的置信度度量用作假设检验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号