首页> 外文会议>Workshop on noisy user-generated text >No, you're not alone: A better way to find people with similar experiences on Reddit

【24h】

No, you're not alone: A better way to find people with similar experiences on Reddit

机译：不，您并不孤单：在Reddit上找到具有类似经验的人的更好方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a probabilistic clustering algorithm that can help Reddit users to find posts that discuss experiences similar to their own. This model is built upon the BF.RT Next Sentence Prediction model and reduces the time complexity for clustering all posts in a corpus from O(n~2) to O(n) with respect to the number of posts. We demonstrate that such probabilistic clustering can yield a performance better than baseline clustering methods based on Latent Dirichlet Allocation (Blei et al.. 2003) and Word2Vec (Mikolov et al., 2013). Furthermore, there is a high degree of coherence between our probabilistic clustering and the exhaustive comparison O(n~2) algorithm in which the similarity between every pair of posts is found. This makes the use of the BERT Next Sentence Prediction model more practical for unsupervised clustering tasks due to the high runtime overhead of each BERT computation.

机译：我们提出了一种概率聚类算法，可以帮助Reddit用户找到讨论与自己的经历类似的帖子。该模型建立在BF.RT下一句预测模型的基础上，并降低了将语料库中所有帖子相对于帖子数量从O（n〜2）聚集到O（n）的时间复杂度。我们证明，与基于潜在狄利克雷分配（Blei et al。2003）和Word2Vec（Mikolov et al。，2013）的基线聚类方法相比，这种概率聚类可以产生更好的性能。此外，在我们的概率聚类和穷举比较O（n〜2）算法之间存在高度的一致性，其中发现了每对帖子之间的相似性。由于每个BERT计算的运行时开销都很高，因此对于无人监督的群集任务，使用BERT下一句预测模型更加实用。

著录项

来源
《Workshop on noisy user-generated text》|2019年|307-315|共9页
会议地点
作者
Zhilin Wang; Elena Rastorgueva; Weizhe Lin; Xiaodong Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. I leaked, then I Reddit: experiences and insight shared on urinary incontinence by Reddit users [J] . Du Chris, Lee Wai, Moskowitz Dena, International urogynecology journal and pelvic floor dysfunction . 2020,第2期

机译：我泄露了，然后我reddit：reddit用户对尿失禁共享的经历和洞察力
2. I leaked, then I reddit: experiences and insight shared on urinary incontinence by reddit users [J] . Du Chris, Lee Wai, Moskowitz Dena, Neurourology and urodynamics. . 2019,第Suppla1期

机译：我泄露了，然后我reddit：reddit用户对尿失禁共享的经历和洞察力
3. Comparison of Facebook, Google Ads, and Reddit for the Recruitment of People Who Considered but Did Not Obtain Abortion Care in the United States: Cross-sectional Survey [J] . Heidi Moseson, Alexandra Wollum, Jane W Seymour, JMIR formative research. . 2021,第2期

机译：Facebook，Google广告和Reddit对招聘招募的人，但在美国未获得堕胎护理：横断面调查
4. No, you're not alone: A better way to find people with similar experiences on Reddit [C] . Zhilin Wang, Elena Rastorgueva, Weizhe Lin, Workshop on noisy user-generated text . 2019

机译：不，你并不孤单：找到在Reddit上找到类似经验的人
5. The invisible people of the invisible coast: The resilience of people experiencing homelessness to disasters on the Alabama, Louisiana, and Mississippi Gulf Coasts. [D] . Callais, Nicole Elizabeth. 2016

机译：无形海岸的无形人民：在阿拉巴马州，路易斯安那州和密西西比州墨西哥湾沿岸遭受灾难的无家可归者的抵御能力。
6. Comparison of Facebook Google Ads and Reddit for the Recruitment of People Who Considered but Did Not Obtain Abortion Care in the United States: Cross-sectional Survey [O] . Heidi Moseson, Alexandra Wollum, Jane W Seymour, 2021

机译：FacebookGoogle广告和Reddit的比较为招募被考虑但未获得美国堕胎护理的人：横断面调查
7. No, you’re not alone: A better way to find people with similar experiences on Reddit [O] . Zhilin Wang, Elena Rastorgueva, Weizhe Lin, 2019

机译：不，你并不孤单：找到在Reddit上找到类似经验的人

No, you're not alone: A better way to find people with similar experiences on Reddit

摘要

著录项

相似文献

相关主题

期刊订阅