Extracting Local Web Communities Using Lexical Similarity

Abstract

The World Wide Web contains rich textual content interconnected by complex hyperlinks. Most studies on web community extraction focus only on graph structure. Consequently, web communities are discovered purely from explicit link information, without considering the textual properties of web pages. This paper proposes an improved algorithm based on Flake's method, which uses the maximum flow algorithm. The improved algorithm accounts for differences in importance between edges, and assigns each edge a capacity derived from the lexical similarity of the web pages it connects. Given a specific query, it also lends itself to a new and efficient ranking scheme for the members of the extracted community. The experimental results indicate that the approach handles a variety of data sets efficiently, by means of a novel optimization strategy for similarity computation.
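
The abstract does not state the capacity formula, so the following is only a minimal sketch of the general idea. It assumes cosine similarity over raw term counts as the lexical similarity and a hypothetical `scale` factor that turns similarity into an edge capacity. The virtual source and sink, the unit sink capacities, and the names `cosine`, `max_flow_source_side`, and `extract_community` are illustrative choices in the spirit of a max-flow formulation such as Flake's, not the authors' implementation.

```python
import math
from collections import Counter, defaultdict, deque


def cosine(text_a, text_b):
    """Cosine similarity of two texts over raw term-frequency vectors."""
    va, vb = Counter(text_a.lower().split()), Counter(text_b.lower().split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def max_flow_source_side(cap, s, t, eps=1e-9):
    """Edmonds-Karp max flow; returns nodes reachable from s in the final residual graph."""
    while True:
        # Breadth-first search over edges with remaining capacity.
        parent = {s: None}
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > eps and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            return set(parent)  # source side of a minimum cut
        # Walk back from the sink to collect the augmenting path.
        path, v = [], t
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        push = min(cap[u][v] for u, v in path)
        for u, v in path:
            cap[u][v] -= push
            cap[v][u] += push


def extract_community(pages, links, seeds, scale=4.0):
    """Pages on the source side of the min cut; `scale` is an assumed tuning knob."""
    cap = defaultdict(lambda: defaultdict(float))
    s, t = "_source", "_sink"  # virtual nodes, assumed not to clash with page ids
    for u, v in links:
        w = scale * cosine(pages[u], pages[v])  # similarity-weighted capacity
        cap[u][v] += w
        cap[v][u] += w
    for p in pages:
        if p in seeds:
            cap[s][p] = float("inf")  # seeds are tied to the source
        else:
            cap[p][t] = 1.0           # every other page drains to the sink
    return max_flow_source_side(cap, s, t) - {s}


# Toy data (invented for illustration): three related pages and two off-topic ones.
pages = {
    "a": "graph flow community web",
    "b": "graph flow community cut",
    "c": "web community graph link",
    "x": "cooking recipes pasta sauce",
    "y": "pasta sauce tomato basil",
}
links = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "x"), ("x", "y")]
community = extract_community(pages, links, seeds={"a"})
print(sorted(community))  # ['a', 'b', 'c']
```

In this toy graph the link from `c` to `x` joins pages with no shared terms, so its capacity is zero and the cut falls there even though a hyperlink exists. A link-only formulation with uniform capacities would have no way to distinguish that edge from the others.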

Bibliographic details

Source
DASFAA 2010, International Conference on Database Systems for Advanced Applications; International Workshop on Graph Data Management: Techniques and Applications (GDM 2010); International Workshop on Benchmarking of Database Management Systems and Data-Oriented Web Technologies (BenchmarX 2010); International Workshop on Managing Data Quality in Collaborative Information Systems (MCIS 2010); Workshop on Social Networks and Social Media Mining on the Web (SNSMW 2010); Data-Intensive eScience Workshop (DIEW 2010); International Workshop on Ubiquitous Data Management (UDM 2010). 2010, pp. 327-337 (11 pages)
Conference location: Tsukuba (JP)
Authors
Xianchao Zhang; Wen Xu; Wenxin Liang
Author affiliation

School of Software, Dalian University of Technology, China

Original format: PDF
Keywords
community extraction; maximum flow algorithm; lexical similarity;

Similar literature

1. Sentiment classification: a lexical similarity based approach for extracting subjectivity in documents [J]. Kiran Sarvabhotla, Prasad Pingali, Vasudeva Varma. Information Retrieval. 2011, No. 3
2. Two Phase Non-Rigid Multi-Modal Image Registration Using Weber Local Descriptor-Based Similarity Metrics and Normalized Mutual Information [J]. Feng Yang, Jiani Hu, Mingyue Ding. Sensors. 2013, No. 6
3. A two-stage BFS local community detection algorithm based on node transfer similarity and Local Clustering Coefficient [J]. Liu Saisai, Xia Zhengyou. Physica A: Statistical Mechanics and its Applications. 2020
4. Extracting Local Web Communities Using Lexical Similarity [C]. Xianchao Zhang, Wen Xu, Wenxin Liang. International Conference on Database Systems for Advanced Applications. 2010
5. Website interface design: Similarity and differences between Saudi Arabian and United States university websites [D]. Alyahya, Dalia Mohammed. 2011
6. Two Phase Non-Rigid Multi-Modal Image Registration Using Weber Local Descriptor-Based Similarity Metrics and Normalized Mutual Information [O]. Feng Yang, Mingyue Ding, Xuming Zhang. 2013
7. A Relational Model of Semantic Similarity between Words using Automatically Extracted Lexical Pattern Clusters from the Web [O]. Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka. 2009
