Web taxonomy integration with hierarchical shrinkage algorithm and fine-grained relations

Chia-Wei Wu; Richard Tzong-Han Tsai; Cheng-Wei Lee; Wen-Lian Hsu

首页> 外文期刊>Expert systems with applications >Web taxonomy integration with hierarchical shrinkage algorithm and fine-grained relations

【24h】

Web taxonomy integration with hierarchical shrinkage algorithm and fine-grained relations

机译：具有分类收缩算法和细粒度关系的Web分类法集成

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We address the problem of integrating web taxonomies from different real Internet applications. Integrating web taxonomies is to transfer instances from a source to target taxonomy. Unlike the conventional text categorization problem, in taxonomy integration, the source taxonomy contains extra information that can be used to improve the categorization. The major existing methods can be divided in two types: those that use neighboring categories to smooth the document term vector and those that consider the semantic relationship between corresponding categories of the target and source taxonomies to facilitate categorization. In contrast to the first type of approach, which only uses a flattened hierarchy for smoothing, we apply a hierarchy shrinkage algorithm to smooth child documents by their parents. We also discuss the effect of using different hierarchical levels for smoothing. To extend the second type of approach, we extract fine-grain semantic relationships, which consider the relationships between lower-level categories. In addition, we use the cosine similarity to measure the semantic relationships, which achieves better performance than existing methods. Finally, we integrate the existing approaches and the proposed methods into one machine learning model to find the best feature configuration. The results of experiments on real Internet data demonstrate that our system outperforms standard text classifiers by about 10%.

机译：我们解决了集成来自不同实际Internet应用程序的Web分类法的问题。集成Web分类法是将实例从源转移到目标分类法。与常规的文本分类问题不同，在分类法集成中，源分类法包含可用于改进分类的额外信息。现有的主要方法可以分为两种：使用相邻类别平滑文档术语向量的方法以及考虑目标分类法和源分类法的相应类别之间的语义关系以促进分类的方法。与仅使用扁平化的层次结构进行平滑处理的第一种方法相反，我们采用层次结构收缩算法来通过其父级对子文档进行平滑处理。我们还将讨论使用不同的层次级别进行平滑的效果。为了扩展第二种方法，我们提取了细粒度的语义关系，该关系考虑了较低级别类别之间的关系。另外，我们使用余弦相似度来度量语义关系，这比现有方法具有更好的性能。最后，我们将现有方法和提出的方法集成到一个机器学习模型中，以找到最佳的特征配置。对真实Internet数据进行的实验结果表明，我们的系统比标准文本分类器的性能高出约10％。

著录项

来源
《Expert systems with applications》 |2008年第4期|2123-2131|共9页
作者
Chia-Wei Wu; Richard Tzong-Han Tsai; Cheng-Wei Lee; Wen-Lian Hsu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
web taxonomy integration; shrinkage algorithm; text categorization;

机译：网络分类法集成;收缩算法;文字分类;

相似文献

外文文献
中文文献
专利

1. Fine-grained I/O Complexity via Reductions: New Lower Bounds, Faster Algorithms, and a Time Hierarchy [J] . Erik D. Demaine, Andrea Lincoln, Quanquan C. Liu, LIPIcs : Leibniz International Proceedings in Informatics . 2018,第30期

机译：通过减少细粒度I / O复杂性：新的下限，更快的算法和时间层次结构
2. Relation Enhanced Neural Model for Type Classification of Entity Mentions with a Fine-Grained Taxonomy [J] . Kai-Yuan Cui, Peng-Jie Ren, Zhu-Min Chen, 计算机科学技术学报（英文版） . 2017,第004期

机译：精细分类法中实体提及类型分类的关系增强神经模型
3. Taxonomy of Manufacturing Flexibility at Manufacturing Companies Using Imperialist Competitive Algorithms, Support Vector Machines and Hierarchical Cluster Analysis [J] . M. Khoobiyan, A. Pooya, A. Tavakkoli, Engineering Technology and Applied Science Research . 2017,第2期

机译：使用帝国主义竞争算法，支持向量机和层次聚类分析的制造公司制造灵活性分类
4. Learning to Integrate Web Taxonomies with Fine-Grained Relations: A Case Study Using Maximum Entropy Model [C] . Chia-Wei Wu, Tzong-Han Tsai, Wen-Lian Hsu Asia Information Retrieval Symposium(AIRS 2005); 20051013-15; Jeju Island(KR) . 2005

机译：学习将网络分类法与细粒度关系集成在一起：使用最大熵模型的案例研究
5. Leveraging Human Perception and Computer Vision Algorithms for Interactive Fine-Grained Visual Categorization [D] . Wah, Catherine Lih-Lian 2014

机译：利用人的感知和计算机视觉算法进行交互式细粒度视觉分类
6. HPEPDOCK: a web server for blind peptide–protein docking based on a hierarchical algorithm [O] . Pei Zhou, Bowen Jin, Hao Li, 2018

机译：HPEPDOCK：基于分层算法的盲肽-蛋白对接的Web服务器
7. Learning to Integrate Web Taxonomies with Fine-Grained Relations: A Case Study Using Maximum Entropy Model [O] . Chia-wei Wua, Tzong-han Tsaiab, Wen-lian Hsuac 2015

机译：学习将网络分类与细粒度关系整合：使用最大熵模型的案例研究

Web taxonomy integration with hierarchical shrinkage algorithm and fine-grained relations

摘要

著录项

相似文献

相关主题

期刊订阅