A Methodology for Mining Document-Enriched Heterogeneous Information Networks

Abstract

The paper presents a new methodology for mining heterogeneous information networks, motivated by the fact that, in many real-life scenarios, documents are available in heterogeneous information networks, such as interlinked multimedia objects containing titles, descriptions, and subtitles. The methodology consists of transforming documents into bag-of-words vectors, decomposing the corresponding heterogeneous network into separate graphs and computing structural-context feature vectors with PageRank, and finally constructing a common feature vector space in which knowledge discovery is performed. We exploit this feature vector construction process to devise an efficient classification algorithm. We demonstrate the approach by applying it to the task of categorizing video lectures. We show that our approach exhibits low time and space complexity without compromising classification accuracy.
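The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the toy lectures, the two relation graphs (`same_author`, `co_viewed`), the equal block weights, the square-root weighting, the dangling-node handling, and all function names are assumptions made for the example. It builds unit-length bag-of-words vectors, computes one personalized PageRank vector per node and per graph as a structural-context vector, concatenates the blocks into one feature space, and classifies with a centroid-based classifier using dot products.

```python
import math
from collections import Counter

def unit(vec):
    """Scale a vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)

def bag_of_words(docs):
    """Map each node's text to a unit-length term-frequency vector."""
    vocab = sorted({w for text in docs.values() for w in text.lower().split()})
    out = {}
    for node, text in docs.items():
        counts = Counter(text.lower().split())
        out[node] = unit([counts[w] for w in vocab])
    return out

def personalized_pagerank(graph, source, nodes, damping=0.85, iters=50):
    """Power iteration for PageRank with all restart mass placed on `source`."""
    rank = {n: 1.0 if n == source else 0.0 for n in nodes}
    for _ in range(iters):
        nxt = {n: 0.0 for n in nodes}
        for n in nodes:
            nbrs = graph.get(n, [])
            if nbrs:
                share = damping * rank[n] / len(nbrs)
                for m in nbrs:
                    nxt[m] += share
            else:
                # Assumption: a node without out-links returns its mass to the source.
                nxt[source] += damping * rank[n]
        nxt[source] += 1.0 - damping
        rank = nxt
    return unit([rank[n] for n in nodes])

def build_features(docs, graphs, weights):
    """Concatenate the text block and one PageRank block per graph.

    Assumption: each block is scaled by sqrt(weight), so a dot product of two
    feature vectors is a weighted sum of the per-block cosine similarities.
    """
    nodes = sorted(docs)
    blocks = [bag_of_words(docs)]
    for g in graphs:
        blocks.append({n: personalized_pagerank(g, n, nodes) for n in nodes})
    feats = {}
    for n in nodes:
        vec = []
        for w, block in zip(weights, blocks):
            vec.extend(math.sqrt(w) * x for x in block[n])
        feats[n] = vec
    return feats

def train_centroids(feats, labels):
    """Sum the feature vectors of each class and normalize the result."""
    sums = {}
    for node, label in labels.items():
        vec = feats[node]
        acc = sums.setdefault(label, [0.0] * len(vec))
        sums[label] = [a + b for a, b in zip(acc, vec)]
    return {label: unit(s) for label, s in sums.items()}

def classify(vec, centroids):
    """Return the label whose centroid has the largest dot product with `vec`."""
    return max(centroids, key=lambda y: sum(a * b for a, b in zip(vec, centroids[y])))

# Hypothetical toy data: four video lectures with short descriptions.
docs = {
    "lec1": "kernel methods for text classification",
    "lec2": "support vector kernel learning",
    "lec3": "gene expression in cells",
    "lec4": "protein networks in cells",
}
# The heterogeneous network decomposed into one graph per relation type.
same_author = {"lec1": ["lec2"], "lec2": ["lec1"], "lec3": ["lec4"], "lec4": ["lec3"]}
co_viewed = {"lec1": ["lec2"], "lec2": ["lec1", "lec3"],
             "lec3": ["lec2", "lec4"], "lec4": ["lec3"]}

feats = build_features(docs, [same_author, co_viewed], weights=[1 / 3, 1 / 3, 1 / 3])
centroids = train_centroids(feats, {"lec1": "ml", "lec3": "bio"})
print(classify(feats["lec2"], centroids), classify(feats["lec4"], centroids))
```

On this toy data the unlabeled lecture `lec2` is assigned to `ml` and `lec4` to `bio`. Because every block has unit length and the weights sum to 1, each concatenated vector also has unit length, so the dot product used in `classify` behaves like a cosine similarity.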

Bibliographic record

Source: Discovery Science, 2011, pp. 107-121 (15 pages)
Conference location: Espoo (FI)
Authors: Miha Grcar; Nada Lavrac
Author affiliation: Jozef Stefan Institute, Dept. of Knowledge Technologies, Jamova cesta 39, 1000 Ljubljana, Slovenia (both authors)
Original format: PDF
Language of text: English
Chinese Library Classification: artificial intelligence theory
Keywords: text mining; heterogeneous information networks; data fusion; classification; centroid-based classifier; diffusion kernels

Similar documents

1. A Methodology for Mining Document-Enriched Heterogeneous Information Networks [J] . Miha Grcar, Nejc Trdin, Nada Lavrac The Computer Journal . 2013, No. 3
2. A negotiation-based networking methodology to enable cooperation across heterogeneous co-located networks [J] . Eli De Poorter, Pieter Becue, Milos Rovcanin, Ad Hoc Networks . 2012, No. 6
3. Predicting lncRNA-disease associations using network topological similarity based on deep mining heterogeneous networks [J] . Zhang Hui, Liang Yanchun, Peng Cheng, Mathematical Biosciences: An International Journal . 2019 (issue number not given)
4. A Methodology for Mining Document-Enriched Heterogeneous Information Networks [C] . Miha Grcar, Nada Lavrac International Conference on Discovery Science . 2011
5. Models of EEG data mining and classification in temporal lobe epilepsy: Wavelet-chaos-neural network methodology and spiking neural networks. [D] . Ghosh Dastidar, Samanwoy. 2007
6. GraphWeb: mining heterogeneous biological networks for gene modules with functional significance [O] . Jüri Reimand, Laur Tooming, Hedi Peterson, 2008
7. A Methodology for Mining Document-Enriched Heterogeneous Information Networks [O] . Miha Grčar, Nada Lavrač 2014
