Detecting and modeling local text reuse

机译：检测和建模本地文本重用

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Texts propagate through many social networks and provide evidence for their structure. We describe and evaluate efficient algorithms for detecting clusters of reused passages embedded within longer documents in large collections. We apply these techniques to two case studies: analyzing the culture of free reprinting in the nineteenth-century United States and the development of bills into legislation in the U.S. Congress. Using these divergent case studies, we evaluate both the efficiency of the approximate local text reuse detection methods and the accuracy of the results. These techniques allow us to explore how ideas spread, which ideas spread, and which subgroups shared ideas.

机译：文本通过许多社交网络传播，并为其结构提供证据。我们描述和评估有效的算法，以检测嵌入在较大集合中的较长文档中的重复使用段落的簇。我们将这些技术应用于两个案例研究：分析19世纪美国的免费转载文化以及美国国会将法案发展为立法。使用这些不同的案例研究，我们评估了近似本地文本重用检测方法的效率和结果的准确性。这些技术使我们能够探索思想如何传播，哪些思想传播以及哪些小组共享思想。

著录项

来源
《2014 IEEE/ACM Joint Conference on Digital Libraries》|2014年|183-192|共10页
会议地点 London(GB)
作者
Smith D.A.; Cordel R.; Dillon E.M.; Stramp N.; Wilkerson J.;
展开▼
作者单位

Coll. of Comput. Inf. Sci., Northeastern Univ., Boston, MA, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
text analysis; documents; local text reuse detection methods; modeling; social networks; text propagation; Abstracts; Irrigation; Logic gates;

机译：文本分析;文档;本地文本重用检测方法;建模;社交网络;文本传播;摘要;灌溉;逻辑门;;

相似文献

外文文献
中文文献
专利

1. A TEXT-BASED IMPLEMENTATION MODEL FOR REUSABLE ASPECT MODELS [J] . ABID MEHMOOD, DAYANG N.A. JAWAWI Journal of Theoretical and Applied Information Technology . 2013,第2期

机译：用于可重用方面模型的基于文本的实现模型
2. Evaluation of Fingerprint Selection Algorithms for Local Text Reuse Detection [J] . Zeitschrift fur Arznei- und Gewurzpflanzen . 2020,第1期

机译：用于本地文本重用检测的指纹选择算法的评估
3. Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach [J] . Honghan Wu, Karen Hodgson, Sue Dyson, JMIR Medical Informatics . 2019,第4期

机译：在自由文本电子医疗记录中有效地重用自然语言处理模型的表型提及识别：嵌入方法的表型
4. Detecting and modeling local text reuse [C] . Smith D.A., Cordel R., Dillon E.M., IEEE/ACM Joint Conference on Digital Libraries . 2014

机译：检测和建模本地文本重用
5. Detecting and Analyzing Cybercrime in Text-based Communication of Cybercriminal Networks Through Computational Linguistic and Psycholinguistic Feature Modeling. [D] . Mbaziira, Alex Vincent. 2017

机译：通过计算语言和心理语言特征建模，在基于文本的网络犯罪网络通信中检测和分析网络犯罪。
6. Detecting concept mentions in biomedical text using hidden Markov model: multiple concept types at once or one at a time? [O] . Manabu Torii, Kavishwar Wagholikar, Hongfang Liu 2014

机译：使用隐藏的马尔可夫模型检测生物医学文本中的概念提及：一次还是一次选择多个概念类型？
7. Detecting and Modeling Local Text Reuse [O] . David A. Smith, Ryan Cordell, Elizabeth Maddock Dillon, 2015

机译：检测和建模本地文本重用

Detecting and modeling local text reuse

摘要

著录项

相似文献

相关主题

期刊订阅