Conference: Consumer Electronics, Communications and Networks (CECNet), 2012 2nd International Conference on

Study of the word segmentation algorithm based on Hash dictionary mechanism


Abstract

Machine learning applied to the analysis and processing of data on the Internet lets users access information quickly and conveniently. Because most of that information is text, automatic word segmentation is of great significance. The segmentation dictionary is an important component of a Chinese automatic word segmentation system, and the speed of dictionary loading and lookup directly affects the speed of the whole system. This paper proposes an improved word segmentation mechanism based on a double-word hash. Test results show that the improved algorithm increases query speed and the efficiency of term matching.
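
The abstract does not describe the mechanism in detail, so the following is only a minimal sketch of one common reading of a double-word hash dictionary, not the paper's implementation. It assumes that each dictionary word is indexed by a hash of its first two characters and that segmentation uses forward maximum matching. The names `DoubleCharHashDictionary`, `longest_match`, and `segment` are hypothetical.

```python
from collections import defaultdict


class DoubleCharHashDictionary:
    """Toy dictionary keyed on the first two characters of each word.

    Assumption: this is an illustrative layout, not the structure
    proposed in the paper.
    """

    def __init__(self, words):
        self.single = set()              # one-character words
        self.buckets = defaultdict(set)  # first two chars -> remaining suffixes
        self.max_len = 1                 # longest word seen, bounds the search
        for word in words:
            if len(word) == 1:
                self.single.add(word)
            elif len(word) >= 2:
                self.buckets[word[:2]].add(word[2:])
            self.max_len = max(self.max_len, len(word))

    def contains(self, word):
        """Return True if the word is in the dictionary."""
        if len(word) == 1:
            return word in self.single
        return word[2:] in self.buckets.get(word[:2], ())

    def longest_match(self, text, start):
        """Length of the longest dictionary word starting at `start`.

        Falls back to 1 (a single character) when nothing matches.
        """
        limit = min(self.max_len, len(text) - start)
        if limit >= 2:
            # One hash lookup on the two-character prefix narrows the search.
            suffixes = self.buckets.get(text[start:start + 2])
            if suffixes is not None:
                for length in range(limit, 1, -1):
                    if text[start + 2:start + length] in suffixes:
                        return length
        return 1


def segment(text, dictionary):
    """Forward maximum matching over the text using the dictionary."""
    tokens = []
    i = 0
    while i < len(text):
        n = dictionary.longest_match(text, i)
        tokens.append(text[i:i + n])
        i += n
    return tokens


if __name__ == "__main__":
    d = DoubleCharHashDictionary(["分词", "词典", "分词词典", "哈希"])
    print(segment("哈希分词词典", d))
```

In this sketch a single lookup on the two-character prefix rules out most dictionary entries before any longer candidate is compared, which is one plausible way such an index could speed up matching. Whether the paper uses this layout is not stated in the abstract.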