From tree to network: reordering an archival catalogue

Mark Bell

首页> 外文期刊>Records management journal >From tree to network: reordering an archival catalogue

【24h】

From tree to network: reordering an archival catalogue

机译：从树到网络：重新排序档案目录

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Purpose - This paper presents the results of a number of experiments performed at the National Archives, all related to the theme of linking collections of records. This paper aims to present a methodology for translating a hierarchy into a network structure using a number of methods for deriving statistical distributions from records metadata or content and then aggregating them. Simple similarity metrics are then used to compare and link, collections of records with similar characteristics. Design/methodology/approach - The approach taken is to consider a record at any level of the catalogue hierarchy as a summary of its children. A distribution for each child record is created (e.g. word counts and date distribution) and averaged/summed with the other children. This process is repeated up the hierarchy to find a representative distribution of the whole series. By doing this the authors can compare record series together and create a similarity network. Findings - The summarising method was found to be applicable not only to a hierarchical catalogue but also to web archive data, which is by nature stored in a hierarchical folder structure. The case studies raised many questions worthy of further exploration such as how to present distributions and uncertainty to users and how to compare methods, which produce similarity scores on different scales. Originality/value - Although the techniques used to create distributions such as topic modelling and word frequency counts, are not new and have been used to compare documents, to the best of the knowledge applying the averaging approach to the archival catalogue is new. This provides an interesting method for zooming in and out of a collection, creating networks at different levels of granularity according to user needs.

机译：目的 - 本文介绍了在国家档案中进行了许多实验的结果，均与联系记录收藏的主题有关。本文旨在介绍一种使用许多方法将层次结构转换为网络结构的方法，用于从记录元数据或内容中派生统计分布，然后聚合它们。然后使用简单的相似度指标来比较和链接，具有相似特征的记录集合。设计/方法/方法 - 采取的方法是考虑任何级别的目录层次结构的记录作为其子女的摘要。创建每个子程度记录的分发（例如，单词计数和日期分发），并与其他子项平均/汇总。该过程重复了层次结构以找到整个系列的代表性分布。通过这样做，作者可以将记录系列组合在一起并创建相似网络。调查结果 - 发现总结方法不仅适用于分层目录，还可以应用于Web归档数据，这是由存储在分层文件夹结构中的自然。案例研究提出了许多值得进一步的探索的问题，例如如何向用户呈现分布和不确定性以及如何比较的方法，这些方法在不同的尺度上产生相似性分数。原创性/值 - 尽管用于创建产品如主题建模和字频计数等的技术并不是新的，并且已被用于比较文档，以最佳应用程序应用于档案目录的平均方法是新的。这提供了一种有趣的方法，用于根据用户需求在不同粒度级别创建网络的有趣方法。

著录项

来源
《Records management journal》 |2020年第3期|379-394|共16页
作者
Mark Bell;
展开▼
作者单位

The National Archives Kew UK;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Archives; Network analysis; Record linkage; Topic modelling;

机译：档案;网络分析;记录联动;主题建模;

相似文献

外文文献
中文文献
专利

1. Archives for Administrators or Archives for Antiquarians? A History of Archive Cataloguing in Four Oxford Colleges [J] . Journal of the Society of Archivists . 2009,第1期

机译：管理员档案还是古人档案？牛津四所学院的档案编目史
2. Archives for Administrators or Archives for Antiquarians? A History of Archive Cataloguing in Four Oxford Colleges [J] . Robin Darwall-Smith, Michael Riordan Journal of the society of archivists . 2009,第1期

机译：管理员档案还是古人档案？牛津四所学院的档案编目史
3. SECURING THE BRITISH RECORDS ASSOCIATION'S LEGACY: CATALOGUING THE ASSOCIATION'S ARCHIVES AT THE LONDON METROPOLITAN ARCHIVES [J] . PENELOPE BAKER Archives . 2020,第1期

机译：确保英国记录协会的遗产：在伦敦大都会档案馆编制协会的档案
4. Comparison of Minimum Spanning Tree Reordering with Bias-Adjusted Reordering for Lossless Compression of 3D Ultraspectral Sounder Data [C] . Alok Ahuja, Bormin Huang, Mitchell D. Goldberg Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XII pt.2 . 2006

机译：最小生成树重排序与偏置调整后的重排序对3D超光谱测深仪数据的无损压缩的比较
5. Assessment of the Suitability of Tree Rings as Archives of Atmospheric Mercury Pollution Using Tree Cores and Results of a Controlled Field Experiment to Assess the Use of Tree Tissue Concentrations as Bioindicators of Air Hg [D] . Peckham, Matthew A. 2018

机译：评估树圈作为大气汞污染的档案的适用性使用树核和受控场实验的结果评估树组织浓度作为空气HG的生物inderators的使用
6. Reordering Hierarchical Tree Based on Bilateral Symmetric Distance [O] . Minho Chae, James J. Chen 2008

机译：基于双边对称距离的层次树重排序
7. Public Archives of Canada. Catalogue of the National Map Collection. Boston: G. K. Hall & Co., 1976. 16 vols. ISBN 0-1861-1215-0 Archives publiques du Canada. Catalogue de la Collection nationale de cartes et plans. Boston: G. K. Hall & Co., 1976. 16 vols. ISBN 0-1861-1215-0 [O] . Dahl, Edward H. 1977

机译：加拿大公共档案馆。国家地图集目录。波士顿：G。K. Hall＆Co.，1976。16卷。国际标准书号0-1861-1215-0加拿大档案馆。 Catalogue de la Collection nationale de cartes et plans。波士顿：G。K. Hall＆Co.，1976。16卷。国际标准书号0-1861-1215-0
8. Bayes Tree: Enabling Incremental Reordering and Fluid Relinearization for Online Mapping. [R] . F. Dellaert M. Kaess R. Roberts V. Ila 2010

机译：贝叶斯树：为在线映射启用增量重新排序和流体重新定位。

From tree to network: reordering an archival catalogue

摘要

著录项

相似文献

相关主题

期刊订阅