首页> 外文期刊>Malaysian Journal of Computer Science >Linguistic Feature Classifying and Tracing
【24h】

Linguistic Feature Classifying and Tracing

机译:语言特征分类与追踪

获取原文
           

摘要

We investigate the identification and analysis of linguistic (lexico-grammatical) features that are characteristically used by articles of a specific year of publication. Linguistic features differ from shallow features because they represent authors?lexico-grammatical writing styles and do not consider well-known bag-of-words model. Current literature focusses on shallow features rather than on linguistic features and existing methods for identifying linguistic features use well-known knowledge-structure based approaches. In contrast to this, we advance these existing methods by applying semantic clustering instead of using knowledge-structure based approaches. For evaluation purpose, a linguistic feature-based prediction model is built to enable an automated assignment of articles to their years of publication. In a case study, the proposed methodology is applied to articles of the Springer book series 'Communications in Computer and Information Science' published from 2009 to 2013. The Case study results show the feasibility of the proposed approach as compared to frequently used baseline.
机译:我们调查特定出版年份的文章所特有的语言(词汇语法)特征的识别和分析。语言特征不同于浅层特征,因为它们代表着作者的词汇语法写作风格,并且没有考虑众所周知的词袋模型。当前的文献关注于浅层特征而不是语言特征,并且用于识别语言特征的现有方法使用众所周知的基于知识结构的方法。与此相反,我们通过应用语义聚类而不是使用基于知识结构的方法来推进这些现有方法。为了进行评估,建立了基于语言特征的预测模型,以使文章能够自动分配到其发表年份。在一个案例研究中,该提议的方法被应用于2009年至2013年出版的Springer丛书“计算机与信息科学中的通信”的文章中。案例研究结果表明,与常用基线相比,该提议方法的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号