Rank-Predicted Pseudo-Greedy Approach to Efficient Text Selection From Large-Scale Corpus For Maximum Coverage of Target Units

Abstract

Efficiently selecting a minimum amount of text from a large-scale text corpus to achieve maximum coverage of certain units is an important problem in spoken language processing. In this paper, this text selection problem is first formulated as a maximum coverage problem with a knapsack constraint (MCK). An efficient rank-predicted pseudo-greedy approach is then proposed to solve it. Experiments on a Chinese text selection task are conducted to verify the efficiency of the proposed approach. The results show that the approach significantly improves text selection speed without sacrificing the coverage score, compared with the traditional greedy approach.
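For illustration only, the traditional greedy baseline that the abstract compares against can be sketched as a cost-benefit greedy loop for maximum coverage under a knapsack budget. This is a minimal sketch under stated assumptions, not the authors' implementation: the abstract gives no details of the rank-prediction step, so that step is not attempted here. The names `greedy_mck`, `sentences`, `costs`, and `budget` are hypothetical, and treating a sentence's cost as something like its length is an assumption.

```python
from typing import Dict, List, Set


def greedy_mck(sentences: Dict[str, Set[str]],
               costs: Dict[str, float],
               budget: float) -> List[str]:
    """Cost-benefit greedy for maximum coverage with a knapsack budget.

    sentences: hypothetical mapping from sentence id to the target units it contains.
    costs:     hypothetical positive cost per sentence (e.g. its length).
    budget:    total cost allowed for the selected text.
    """
    covered: Set[str] = set()
    chosen: List[str] = []
    remaining = budget
    candidates = set(sentences)
    while candidates:
        best, best_ratio = None, 0.0
        # Sorted iteration keeps tie-breaking deterministic.
        for sid in sorted(candidates):
            if costs[sid] > remaining:
                continue  # does not fit in the remaining budget
            gain = len(sentences[sid] - covered)  # newly covered units
            ratio = gain / costs[sid]
            if ratio > best_ratio:
                best, best_ratio = sid, ratio
        if best is None:
            break  # nothing affordable adds new coverage
        chosen.append(best)
        covered |= sentences[best]
        remaining -= costs[best]
        candidates.remove(best)
    return chosen
```

Each round rescans every remaining candidate, so the cost grows with the corpus size times the number of selected sentences. That repeated full scan is the kind of overhead a faster pseudo-greedy variant would aim to reduce.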

Bibliographic details

Source: International Speech Communication Association, 2008, 4 pages
Authors: Wei LI; Qiang HUO
Original format: PDF
CLC classification: TN912.3-532
Keywords: text selection; greedy approach; pseudo-greedy approach; maximum coverage problem

Similar documents

1. Extracting salient sublexical units from written texts: 'Emophon,' a corpus-based approach to phonological iconicity [J]. Arash Aryani, Arthur M. Jacobs, Markus Conrad. Frontiers in Psychology. 2013, No. 4
2. Building Text Corpus for Unit Selection Synthesis [J]. Pijus KASPARAITIS, Tomas ANBINDERIS. Informatica. 2014, No. 4
3. Gradient-Descent Based Unit-Selection Optimization Algorithm Used for Corpus-Based Text-to-Speech Synthesis [J]. Matej Rojc, Zdravko Kacic. Applied Artificial Intelligence. 2011, No. 5a7
4. Rank-Predicted Pseudo-Greedy Approach to Efficient Text Selection From Large-Scale Corpus For Maximum Coverage of Target Units [C]. Wei LI, Qiang HUO. International Speech Communication Association. 2008
5. Efficient camera selection for maximized target coverage in underwater acoustic sensor networks [D]. Albuali, Abdullah. 2014
6. Extracting salient sublexical units from written texts: Emophon, a corpus-based approach to phonological iconicity [O]. Arash Aryani, Arthur M. Jacobs, Markus Conrad. 2013
7. Extracting salient sublexical units from written texts: 'Emophon', a corpus-based approach to phonological iconicity [O]. Arash Aryani, Markus Conrad, Arthur M. Jacobs. 2013
8. Optimal Constellation Design for Maximum Continuous Coverage of Targets Against a Space Background [R]. Marchand, B., Takano, A. 2012
