FULL-TEXT FUZZY RETRIEVAL METHOD FOR SIMILAR CHINESE CHARACTERS IN CIPHERTEXT DOMAIN

Abstract

A full-text fuzzy retrieval method for similar Chinese characters in a ciphertext domain. Built on a symmetric searchable encryption scheme and an inverted index structure, the method performs fuzzy search over encrypted Chinese text: it matches Chinese characters with similar glyphs while the data remains encrypted, returns ordered search results, and supports fuzzy queries that combine multiple keywords with logical connectives. The search engine Lucene and the Chinese word segmenter IKAnalyzer perform full-text word segmentation on each document, and a plaintext inverted index that includes similar Chinese characters is constructed from a purpose-built similar-character library covering 3,755 commonly used Chinese characters. To protect the inverted index, every keyword in the plaintext inverted index and its corresponding document numbers are stored in an encrypted chain, and a B+ tree structure accelerates searching. The method achieves full-text fuzzy search over a Chinese ciphertext domain on a semi-trusted cloud server without false positives or missed matches.
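The core idea of the abstract (expand each keyword with its similar-glyph variants at indexing time, then look entries up by a keyed token rather than by plaintext) can be illustrated with a minimal Python sketch. This is not the patented construction. It assumes documents are already segmented into keywords (the abstract uses IKAnalyzer for that step), uses a tiny hypothetical `SIMILAR` table in place of the 3,755-character library, and replaces the encrypted chain and B+ tree with a plain dictionary keyed by HMAC-SHA256 tokens. Document numbers are left unencrypted and result ordering is omitted for brevity. All function names are illustrative.

```python
import hashlib
import hmac
from collections import defaultdict
from itertools import product

# Hypothetical similar-glyph table (assumption); the abstract's library
# covers 3,755 commonly used Chinese characters.
SIMILAR = {
    "未": {"末"},
    "末": {"未"},
    "己": {"已"},
    "已": {"己"},
}


def variants(keyword):
    """Return every string formed by swapping characters for similar glyphs."""
    choices = [sorted({ch} | SIMILAR.get(ch, set())) for ch in keyword]
    return {"".join(combo) for combo in product(*choices)}


def token(key, keyword):
    """Deterministic search token: HMAC-SHA256 of the keyword under a secret key."""
    return hmac.new(key, keyword.encode("utf-8"), hashlib.sha256).hexdigest()


def build_index(key, docs):
    """Build a token -> document-number index.

    `docs` maps a document number to its list of segmented keywords.
    Each keyword is indexed under the tokens of all its glyph variants,
    so a query containing a look-alike character still hits the entry.
    """
    index = defaultdict(set)
    for doc_id, words in docs.items():
        for word in words:
            for v in variants(word):
                index[token(key, v)].add(doc_id)
    return index


def search(key, index, query):
    """Look up one query keyword; the index holder only ever sees the token."""
    return sorted(index.get(token(key, query), set()))


if __name__ == "__main__":
    key = b"demo-key"  # illustrative only; use a randomly generated key in practice
    docs = {1: ["未来", "检索"], 2: ["期末", "检索"]}
    index = build_index(key, docs)
    print(search(key, index, "末来"))  # look-alike of 未来 -> [1]
    print(search(key, index, "检索"))  # exact keyword -> [1, 2]
```

Expanding variants at indexing time keeps each query to a single token lookup, at the cost of a larger index. A multi-keyword query with logical connectives could then be evaluated as set intersections or unions over the per-keyword results.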
