UPDATING FIELD ASSOCIATION WORD DICTIONARY USING WORD ATTRIBUTES, MORPHOLOGICAL ANALYSIS, AND COMPOUND WORDS

El-Sayed Atlam

首页> 外文期刊>International Journal of Innovative Computing Information and Control >UPDATING FIELD ASSOCIATION WORD DICTIONARY USING WORD ATTRIBUTES, MORPHOLOGICAL ANALYSIS, AND COMPOUND WORDS

【24h】

UPDATING FIELD ASSOCIATION WORD DICTIONARY USING WORD ATTRIBUTES, MORPHOLOGICAL ANALYSIS, AND COMPOUND WORDS

机译：使用词属性，词法分析和复合词来更新字段关联词词典

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document classification and summarization are certainly important for document text retrieval. There are some pioneer researches using Field Association (FA) words to identify the subject of a text (document field) when extracting specific words in that text. However, these works have disadvantages by extracting irrelevant FA words selection and therefore, results giving huge amount of unwanted texts. To treat these disadvantages in text retrieval, two techniques are used: the first technique is using attributes for extracting FA words and classifying the texts in the document proposed. The key point of this technique is to use attributes to recognize specific field information as well as extracting relevant FA words. The second technique proposes a method for filtering automatically the FA words dictionary by deleting irrelevant word using morphological analysis and words that has no more information than single FA. From experimental results Precision and Recall are improved by 11-18% and 15-28% respectively using word attribute (first technique) than traditional method. Moreover, the second technique could delete around 15% of irrelevant FA word from word candidates using morphological analysis and words that has no more information than single FA. Furthermore, Precision and Recall increases by 18-25% after using the second technique as the dictionary words become clear and specific. Finally, the New_m (new method) gains higher classification accuracy over all models by 10-15%. This model achieves high classification accuracy because it gains the advantage of the FA words classification using extraction attributes.

机译：文档分类和摘要对于文档文本检索当然很重要。有一些先驱研究在提取文本中的特定单词时使用字段关联（FA）单词来识别文本（文档字段）的主题。但是，这些工作由于提取了不相关的FA词选择而具有缺点，因此，结果会产生大量不需要的文本。为了解决文本检索中的这些缺点，使用了两种技术：第一种技术是使用属性来提取FA单词并将分类的文本在建议的文档中。该技术的关键是使用属性来识别特定的字段信息以及提取相关的FA字。第二种技术提出了一种方法，该方法通过使用形态学分析删除不相关的单词和信息量不超过单个FA的单词来自动过滤FA单词词典。从实验结果来看，与传统方法相比，使用单词属性（第一种技术）可以将精度和召回率分别提高11-18％和15-28％。此外，第二种技术可以使用词法分析和单词信息不超过单个FA的单词从候选单词中删除大约15％的无关FA单词。此外，使用第二种技术后，由于字典单词变得清晰而具体，精度和查全率提高了18-25％。最后，New_m（新方法）在所有模型上的分类精度都提高了10-15％。该模型获得了高分类精度，因为它获得了使用提取属性进行FA词分类的优势。

著录项

来源
《International Journal of Innovative Computing Information and Control》 |2014年第6期|2097-2111|共15页
作者
El-Sayed Atlam;
展开▼
作者单位

Department of Information Science and Intelligent System University of Tokushima 2-24, Shinkura-cho, Tokushima 770-8501, Japan,Computer Science Division Department of Mathematics Faculty of Science Tanta University General Administration of Tanta University, El-Geish St., Tanta, El-Gharbia Governorate, Egypt;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Field Association words; FA word dictionary; Morphological analysis; Irrelevant FA word;

机译：田野协会的话;FA单词词典;形态分析;不相关的FA字;

相似文献

外文文献
中文文献
专利

1. Visual words dictionaries and fusion techniques for searching people through textual and visual attributes [J] . Junior Fabian, Ramon Pires, Anderson Rocha Pattern recognition letters . 2014,第APRa1期

机译：视觉单词词典和融合技术，用于通过文本和视觉属性搜索人
2. An automatic filtering method for field association words by deleting unnecessary words [J] . E. GHADA, E.-S. ATLAM, M. FUKETA, International journal of computer mathematics . 2006,第3期

机译：通过删除不必要的单词来自动过滤字段关联单词的方法
3. COMPILING A DICTIONARY OF LOAN WORDS IN BALINESE: THE EVALUATION RESULT OF EFFECTIVENESS TESTING IN THE FIELD AIDED BY MOBILE TECHNOLOGY [J] . I NENGAH SUANDI, IDA BAGUS PUTRAYASA, DEWA GEDE HENDRA DIVAYANA Journal of Theoretical and Applied Information Technology . 2017,第14期

机译：编制巴厘岛的贷款词词典：通过移动技术辅助实地有效测试的评估结果
4. DETERMINATION OF FIELD ASSOCIATION WORDS USING WORD ATTRIBUTES [C] . Uddin M Sharif, E-S Atlam, W Hiraishi, IASTED(International Association of Science and Technology for Development) International Conference on Knowledge Sharing and Collaborative Engineering; 20061129-1201; St.Thomas(US) . 2006

机译：使用词属性确定字段关联词
5. The analysis of title words as document contents indicators: Development of an informetrica method and application to the field of Drawing and Art Education. [D] . Maaswinkel, Antonius Peter. 1999

机译：标题词作为文档内容指标的分析：一种信息计量方法的开发及其在绘画艺术教育领域的应用。
6. Character Decomposition and Transposition of Chinese Compound Words in the Right and Left Visual Fields [O] . Hong-Wen Cao, Kai-Fu Yang, Hong-Mei Yan 2016

机译：左右视野中汉语复合词的字符分解与移位
7. The analysis of the exocentric compounding from the new entry words of Oxford Dictionary from May 2014 to May 2015 [O] . Leihitu Stefanie Naomi 2016

机译：2014年5月至2015年5月《牛津词典》新增词对中心外复合词的分析

UPDATING FIELD ASSOCIATION WORD DICTIONARY USING WORD ATTRIBUTES, MORPHOLOGICAL ANALYSIS, AND COMPOUND WORDS

摘要

著录项

相似文献

相关主题

期刊订阅