Written text editing method for correcting spelling errors, involving calculation of the difference between the occurrence frequency of an n-gram in the text and in a language, n-gram by n-gram
The method involves establishing a text distribution and a language distribution of the frequencies with which n-grams occur in a text and in a language, respectively, where the n-grams can be groups of characters. The distributions are compared with each other. A language (18) whose language distribution shows a large similarity to the text distribution is determined to be the language of the written text (15). The text is then processed according to this language determination. The difference between the occurrence frequency of an n-gram in the text and in the language is calculated n-gram by n-gram.
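The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`ngram_distribution`, `distance`, `identify_language`), the choice of character trigrams, the use of a summed absolute difference as the dissimilarity measure, and the tiny sample sentences used as language profiles are all assumptions made for illustration.

```python
from collections import Counter


def ngram_distribution(text, n=3):
    """Return the relative frequency of each character n-gram in `text`."""
    text = text.lower()
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()} if total else {}


def distance(text_dist, lang_dist):
    """Sum, n-gram by n-gram, the absolute frequency differences.

    An n-gram missing from one distribution counts as frequency 0 there.
    (Assumed measure; the abstract only says a difference is calculated.)
    """
    grams = set(text_dist) | set(lang_dist)
    return sum(abs(text_dist.get(g, 0.0) - lang_dist.get(g, 0.0)) for g in grams)


def identify_language(text, lang_dists, n=3):
    """Pick the language whose distribution is closest to the text's."""
    text_dist = ngram_distribution(text, n)
    return min(lang_dists, key=lambda lang: distance(text_dist, lang_dists[lang]))


# Hypothetical, very small reference samples; real profiles would be
# built from large corpora for each language.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and the cat sat on the mat",
    "fr": "le renard brun rapide saute par dessus le chien paresseux et le chat dort",
}
LANG_DISTS = {lang: ngram_distribution(s) for lang, s in SAMPLES.items()}

if __name__ == "__main__":
    print(identify_language("the cat and the dog", LANG_DISTS))
```

Once a language is selected this way, later processing such as spelling correction could use language-specific resources; that step is outside the scope of this sketch.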