Pacific Asia Conference on Language, Information and Computation

Relating Keywords to the 'Top Ten News of the Year' in Korean Newspapers

Abstract

This paper takes an in-depth look at the relationship between mechanically extracted keywords and the 'Top Ten News of the Year' compiled by news editors. A previous study that briefly touched on the topic concluded that there does not seem to be any meaningful connection between the two. In this paper, we set up a more elaborate way of comparing and connecting the two, and argue that they do converge to a reasonable degree. The corpus used for our experiment is a subset of the Trend 21 corpus, a collection of major Korean newspapers (2000-2013). Keywords were extracted using the log-likelihood ratio. Collocations for each keyword were also needed, and these were extracted using a version of Mutual Information. Finally, the top ten news items were compared in detail with the top 100 keywords from several points of view.
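
The two scoring steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which log-likelihood formulation or which "version of Mutual Information" was used, so the code below assumes the common two-corpus log-likelihood keyness score and plain pointwise mutual information over a fixed token window. All function names, the window size, and the frequency threshold are hypothetical.

```python
import math
from collections import Counter


def log_likelihood(a, b, c, d):
    """Keyness score for a word seen a times in a target corpus of c tokens
    and b times in a reference corpus of d tokens (assumed formulation)."""
    e1 = c * (a + b) / (c + d)  # expected count in the target corpus
    e2 = d * (a + b) / (c + d)  # expected count in the reference corpus
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / e1)
    if b > 0:
        g2 += b * math.log(b / e2)
    return 2 * g2


def top_keywords(target_tokens, reference_tokens, n=100):
    """Rank words that are overrepresented in the target corpus."""
    t, r = Counter(target_tokens), Counter(reference_tokens)
    c, d = len(target_tokens), len(reference_tokens)
    scored = []
    for word, a in t.items():
        b = r[word]
        # Keep only words relatively more frequent in the target corpus.
        if a / c > b / d:
            scored.append((word, log_likelihood(a, b, c, d)))
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:n]


def pmi(pair_count, x_count, y_count, total):
    """Pointwise mutual information in bits (assumed MI variant)."""
    return math.log2(pair_count * total / (x_count * y_count))


def collocates(tokens, keyword, window=2, min_count=2):
    """Score words co-occurring with keyword within +/- window tokens.
    The PMI here is a rough estimate: window counts are not renormalized."""
    unigrams = Counter(tokens)
    pairs = Counter()
    for i, tok in enumerate(tokens):
        if tok != keyword:
            continue
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs[tokens[j]] += 1
    total = len(tokens)
    scored = [
        (w, pmi(k, unigrams[keyword], unigrams[w], total))
        for w, k in pairs.items()
        if k >= min_count
    ]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored
```

Under these assumptions, `top_keywords(tokens_for_one_year, tokens_for_other_years, n=100)` would give a yearly keyword list of the size the abstract mentions, and `collocates(tokens_for_one_year, keyword)` would give the context words used to relate each keyword to a news item. Tokenization of Korean text is outside the scope of this sketch.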
