Regularized bi-directional co-clustering

Affeldt Severine; Labiod Lazhar; Nadif Mohamed

首页> 外文期刊>Statistics and computing >Regularized bi-directional co-clustering

【24h】

Regularized bi-directional co-clustering

机译：正则化双向共聚

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The simultaneous clustering of documents and words, known as co-clustering, has proved to be more effective than one-sided clustering in dealing with sparse high-dimensional datasets. By their nature, text data are also generally unbalanced and directional. Recently, the von Mises-Fisher (vMF) mixture model was proposed to handle unbalanced data while harnessing the directional nature of text. In this paper, we propose a general co-clustering framework based on a matrix formulation of vMF model-based co-clustering. This formulation leads to a flexible framework for text co-clustering that can easily incorporate both word-word semantic relationships and document-document similarities. By contrast with existing methods, which generally use an additive incorporation of similarities, we propose a bi-directional multiplicative regularization that better encapsulates the underlying text data structure. Extensive evaluations on various real-world text datasets demonstrate the superior performance of our proposed approach over baseline and competitive methods, both in terms of clustering results and co-cluster topic coherence.

机译：在处理稀疏高维数据集时，已经证明，已被证明在处理稀疏高维数据集中的单面聚类，同时群集文档和单词。通过他们的性质，文本数据通常也是不平衡和方向的。最近，提出了Von Mises-Fisher（VMF）混合模型来处理不平衡数据，同时利用文本的定向性质。在本文中，我们提出了一种基于VMF模型的共聚类矩阵制定的一般共聚类框架。该配方导致文本共簇的灵活框架，可以轻松地包含单词语义关系和文档文件的相似性。相反，与现有方法相比，通常使用相似性的添加剂掺入，我们提出了一种双向乘法正则化，从而更好地封装了底层文本数据结构。关于各种现实世界文本数据集的广泛评估展示了我们提出的方法对基线和竞争方法的卓越性能，无论是在聚类结果和共簇主题连贯性方面。

著录项

来源
《Statistics and computing》 |2021年第3期|32.1-32.17|共17页
作者
Affeldt Severine; Labiod Lazhar; Nadif Mohamed;
展开▼
作者单位

Univ Paris Ctr Borelli CNRS F-75006 Paris France;

Univ Paris Ctr Borelli CNRS F-75006 Paris France;

Univ Paris Ctr Borelli CNRS F-75006 Paris France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Co-clustering; Regularization; Information retrieval; Text mining;

机译：共聚类;正规化;信息检索;文本挖掘;

相似文献

外文文献
中文文献
专利

1. Tri-regularized nonnegative matrix tri-factorization for co-clustering [J] . Deng Ping, Li Tianrui, Wang Hongjun, Knowledge-Based Systems . 2021,第Auga17期

机译：共簇的三正常化非负矩阵三分化
2. Orthogonal Dual Graph-Regularized Nonnegative Matrix Factorization for Co-Clustering [J] . Tang Jiayi, Wan Zhong Journal of Scientific Computing . 2021,第3期

机译：协同聚类的正交双图 - 正则矩阵分解
3. Generalized Co-clustering Analysis via Regularized Alternating Least Squares [J] . Li Gen Computational statistics & data analysis . 2020,第期

机译：广义共聚类通过正则化交替最小二乘法分析
4. Tucker-Regularized Tensor Bregman Co-clustering [C] . Pedro A. Forero, Paul A. Baxley European Signal Processing Conference . 2020

机译：Tucker-Rengearzed Tensor Bregman Co-Clustering
5. Discovering spatial co-clustering patterns in collision data. [D] . Li, Dapeng. 2013

机译：在碰撞数据中发现空间共聚模式。
6. Spatial Co-Clustering of Cardiovascular Diseases and Select Risk Factors among Adults in South Africa [O] . Timotheus B. Darikwa, Samuel O. Manda 2020

机译：南非成年人心血管疾病的空间共聚和选择危险因素
7. Constrained Dual Graph Regularized Orthogonal Nonnegative Matrix Tri-Factorization for Co-Clustering [O] . Shaodi Ge, Hongjun Li, Liuhong Luo 2019

机译：约束双图正常化正交非环境矩阵三分化用于共聚类

Regularized bi-directional co-clustering

摘要

著录项

相似文献

相关主题

期刊订阅