StructuralLM: Structural Pre-training for Form Understanding

Abstract

Large pre-trained language models achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, they almost exclusively focus on text-only representation, while neglecting cell-level layout information that is important for form image understanding. In this paper, we propose a new pre-training approach, StructuralLM, to jointly leverage cell and layout information from scanned documents. Specifically, we pre-train StructuralLM with two new designs to make the most of the interactions of cell and layout information: 1) each cell as a semantic unit; 2) classification of cell positions. The pre-trained StructuralLM achieves new state-of-the-art results in different types of downstream tasks, including form understanding (from 78.95 to 85.14), document visual question answering (from 72.59 to 83.94), and document image classification (from 94.43 to 96.08).
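
The two designs can be pictured concretely. Below is a minimal, hypothetical sketch (not the authors' released code) of both ideas in PyTorch: every token in a cell receives the same cell-level 2D position embedding, and a pre-training head classifies which region of an N x N grid over the page each cell falls into. All names (`CellLayoutEmbedding`, `coord_bins`, `grid_size`, etc.) are illustrative assumptions, not taken from the paper.

```python
import torch
import torch.nn as nn

class CellLayoutEmbedding(nn.Module):
    """Add one shared 2D cell-position embedding to every token in a cell."""
    def __init__(self, hidden=768, coord_bins=1024):
        super().__init__()
        self.x_emb = nn.Embedding(coord_bins, hidden)
        self.y_emb = nn.Embedding(coord_bins, hidden)

    def forward(self, token_emb, cell_boxes, token2cell):
        # token_emb:  (seq_len, hidden) word embeddings
        # cell_boxes: (num_cells, 4) integer (x0, y0, x1, y1) in [0, coord_bins)
        # token2cell: (seq_len,) index of the cell each token belongs to
        boxes = cell_boxes[token2cell]  # broadcast cell boxes to tokens
        layout = (self.x_emb(boxes[:, 0]) + self.y_emb(boxes[:, 1])
                  + self.x_emb(boxes[:, 2]) + self.y_emb(boxes[:, 3]))
        # Tokens sharing a cell get identical layout embeddings,
        # so the cell behaves as one semantic unit.
        return token_emb + layout

class CellPositionClassifier(nn.Module):
    """Pre-training head: predict which grid region a cell lies in."""
    def __init__(self, hidden=768, grid_size=4):
        super().__init__()
        self.grid_size = grid_size
        self.head = nn.Linear(hidden, grid_size * grid_size)

    def region_labels(self, cell_boxes, page_w=1024, page_h=1024):
        # Label = index of the grid region containing the cell center.
        cx = (cell_boxes[:, 0] + cell_boxes[:, 2]) // 2
        cy = (cell_boxes[:, 1] + cell_boxes[:, 3]) // 2
        col = cx * self.grid_size // page_w
        row = cy * self.grid_size // page_h
        return row * self.grid_size + col  # in [0, grid_size**2)

    def forward(self, cell_states):
        # cell_states: (num_cells, hidden) pooled encoder outputs per cell
        return self.head(cell_states)  # logits over grid regions

# Toy usage: three tokens, the first two in cell 0, the third in cell 1.
emb = CellLayoutEmbedding()
tokens = torch.zeros(3, 768)
boxes = torch.tensor([[10, 10, 200, 40], [10, 50, 200, 80]])
out = emb(tokens, boxes, torch.tensor([0, 0, 1]))
```

In this sketch the cell-position classification task would be trained with a standard cross-entropy loss between `CellPositionClassifier` logits and `region_labels`, which is one plausible reading of "classification of cell positions" in the abstract.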
