首页> 外文学位 >Features and Algorithms for Visual Parsing of Handwritten Mathematical Expressions.
【24h】

Features and Algorithms for Visual Parsing of Handwritten Mathematical Expressions.

机译:可视化手写数学表达式的功能和算法。

获取原文
获取原文并翻译 | 示例

摘要

Math expressions are an essential part of scientific documents. Handwritten math expressions recognition can benefit human-computer interaction especially in the education domain and is a critical part of document recognition and analysis.;Parsing the spatial arrangement of symbols is an essential part of math expression recognition. A variety of parsing techniques have been developed during the past three decades, and fall into two groups. The first group is graph-based parsing. It selects a path or sub-graph which obeys some rule to form a possible interpretation for the given expression. The second group is grammar driven parsing. Grammars and related parameters are defined manually for different tasks. The time complexity of these two groups parsing is high, and they often impose some strict constraints to reduce the computation.;The aim of this thesis is working towards building a straightforward and effective parser with as few constraints as possible. First, we propose using a line of sight graph for representing the layout of strokes and symbols in math expressions. It achieves higher F-score than other graph representations and reduces search space for parsing. Second, we modify the shape context feature with Parzen window density estimation. This feature set works well for symbol segmentation, symbol classification and symbol layout analysis. We get a higher symbol segmentation F-score than other systems on CROHME 2014 dataset. Finally, we develop a Maximum Spanning Tree (MST) based parser using Edmonds' algorithm, which extracts an MST from the directed line of sight graph in two passes: first symbols are segmented, and then symbols and spatial relationship are labeled. The time complexity of our MST-based parsing is lower than the time complexity of CYK parsing with context-free grammars. Also, our MST-based parsing obtains higher structure rate and expression rate than CYK parsing when symbol segmentation is accurate. Correct structure means we get the structure of the symbol layout tree correct, even though the label of the edge in the symbol layout tree might be wrong. The performance of our math expression recognition system with MST-based parsing is competitive on CROHME 2012 and 2014 datasets.;For future work, how to incorporate symbol classifier result and correct segmentation error in MST-based parsing needs more research.
机译:数学表达式是科学文献的重要组成部分。手写数学表达式识别可以使人机交互受益,特别是在教育领域,它是文档识别和分析的关键部分。解析符号的空间排列是数学表达式识别的重要部分。在过去的三十年中,已经开发出各种解析技术,并且将其分为两组。第一组是基于图的解析。它选择一条路径或子图,该路径或子图应遵循某些规则以形成给定表达式的可能解释。第二组是语法驱动的解析。语法和相关参数是针对不同任务手动定义的。这两类分析的时间复杂度很高,并且经常施加一些严格的约束条件以减少计算量。本论文的目的是努力建立一种尽可能少的约束条件的简单有效的解析器。首先,我们建议使用视线图来表示数学表达式中笔画和符号的布局。与其他图形表示相比,它可获得更高的F分数,并减少了用于分析的搜索空间。其次,我们使用Parzen窗口密度估计来修改形状上下文特征。该功能集非常适合符号分割,符号分类和符号布局分析。与CROHME 2014数据集上的其他系统相比,我们获得了更高的符号分割F分数。最后,我们使用埃德蒙兹(Edmonds)算法开发了基于最大生成树(MST)的解析器,该解析器分两次通过从有向视线图中提取MST:首先对符号进行分割,然后对符号和空间关系进行标记。基于MST的解析的时间复杂度低于使用上下文无关语法的CYK解析的时间复杂度。同样,当符号分割准确时,基于MST的解析比CYK解析获得更高的结构速率和表达速率。正确的结构意味着即使符号布局树中的边缘标签可能是错误的,我们也可以使符号布局树的结构正确。我们的基于MST的解析的数学表达式识别系统的性能在CROHME 2012和2014数据集上具有竞争力。;为将来的工作,如何在基于MST的解析中合并符号分类器结果和纠正分段错误需要更多的研究。

著录项

  • 作者

    Hu, Lei.;

  • 作者单位

    Rochester Institute of Technology.;

  • 授予单位 Rochester Institute of Technology.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2016
  • 页码 191 p.
  • 总页数 191
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 公共建筑;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号