A methodology to improve the performance of extracting information from financial documents.


Abstract

Information Extraction (IE) technology retrieves the most relevant, context-sensitive, and specific pieces of information from unstructured documents and presents them in a structured format. The IE problem is difficult for several reasons. First, the items to be retrieved have no clear boundaries. Second, information retrieval techniques that rely on a bag of words and word statistics may not suffice to retrieve most of the relevant information, because they miss context. Third, statistical techniques such as the Naive Bayes classifier or Average Mutual Information perform well on document retrieval tasks but are not directly applicable to IE tasks.

This study proposes an IE methodology that aims to extract financial information about various NASDAQ-listed companies with high precision and recall. Performance is improved partly by using a rule-based symbolic-learning model. A set of rules is learned with the simplest form of the Tabu search algorithm. The results show that applying the Tabu search algorithm with part-of-speech tags improves precision and recall over other methods and resources. The output of the learned model is further analyzed by a statistical method called "Max-Strength" to improve the precision of the items extracted by the symbolic-learning model. The strength of the methodology is evidenced by its performance on the "Seminar Announcement" corpus, which several well-known systems have used.
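As a rough illustration of the precision and recall measures named in the abstract, the sketch below scores a set of extracted items against a hand-labeled reference set. The function name, the (field, value) representation, and the sample data are hypothetical and are not taken from the dissertation, which does not describe its scoring procedure here.

```python
def precision_recall(extracted, gold):
    """Score extracted items against a reference (gold) set.

    precision = correct extractions / all extractions
    recall    = correct extractions / all reference items
    """
    extracted, gold = set(extracted), set(gold)
    correct = len(extracted & gold)
    precision = correct / len(extracted) if extracted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical (field, value) pairs for one financial document.
gold = {
    ("company", "Acme Corp"),
    ("revenue", "12.4M"),
    ("quarter", "Q2"),
}
extracted = {
    ("company", "Acme Corp"),   # correct
    ("revenue", "12.4M"),       # correct
    ("net_income", "1.1M"),     # not in the reference set
    ("quarter", "Q3"),          # wrong value
}

p, r = precision_recall(extracted, gold)
print(f"precision={p:.2f} recall={r:.2f}")  # precision=0.50 recall=0.67
```

Here two of the four extracted items are correct (precision 0.50) and two of the three reference items are found (recall 0.67), which shows why a post-filtering step that discards weak extractions can raise precision without necessarily changing recall.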


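Bibliographic record

Author: Sheikh, Mahmudul Islam
Author affiliation: The University of Mississippi
Degree-granting institution: The University of Mississippi
Subjects: Business Administration Management; Computer Science
Degree: Ph.D.
Year: 2009
Pages: 149 p.
Total pages: 149
Original format: PDF
Language of text: eng
Chinese Library Classification: Trade economics; Automation and computer technology
Keywords: none listed
Date indexed: 2022-08-17 11:38:31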

Similar literature

1. Do network capabilities improve corporate financial performance? Evidence from financial supply chains [J]. Wang Liukai, Yan Ji, Chen Xiaohong. International Journal of Operations & Production Management. 2021, No. 4
2. Improving Small and Medium Enterprises Financing for Stronger Financial and Non-Financial Performance [J]. Advanced Science Letters. 2017, No. 4
3. Do Financial Reforms Improve the Performance of Financial Holding Companies? The Case of Taiwan [J]. Meng-Chun Kao, Chien-Ting Lin, Lei Xu. International Review of Finance. 2012, No. 4
4. Polarity Assignment to Causal Information Extracted from Financial Articles Concerning Business Performance of Companies [C]. Hiroyuki Sakai, Shigeru Masuyama. SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence. 2009
5. Sensitivity analysis in performance modeling of multicomputer networks: A methodology to improve the simulation efficiency while maintaining the modeling accuracy in performance modeling of complex multicomputer systems [D]. Han, Gang. 2000
6. 'We pledge to improve the health of our entire community': Improving health worker motivation and performance in Bihar, India through teamwork, recognition, and non-financial incentives [O]. Carolyn Grant, Dipty Nawal, Sai Mala Guntur, 2012
7. Performance Financeira Corporativa e Performance Social Corporativa: desenvolvimento metodológico e contribuição teórica dos estudos empíricos / Corporate Financial Performance and Corporate Social Performance: methodological development and the theoretical contribution of empirical studies [O]. João Maurício Gama Boaventura, Ralph Santos da Silva, Rodrigo Bandeira-de-Mello. 2012
