The Lixto Project: Exploring New Frontiers of Web Data Extraction

机译：Lixto项目：探索Web数据提取的新领域

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Lixto project is an ongoing research effort in the area of Web data extraction. Whereas the project originally started out with the idea to develop a logic-based extraction language and a tool to visually define extraction programs from sample Web pages, the scope of the project has been extended over time. Today, new issues such as employing learning algorithms for the definition of extraction programs, automatically extracting data from Web pages featuring a table-centric visual appearance, and extracting from alternative document formats such as PDF are being investigated.

机译：Lixto项目是Web数据提取领域中一项正在进行的研究工作。尽管该项目最初的想法是开发一种基于逻辑的提取语言和一种工具，以可视方式从示例Web页面中定义提取程序，但随着时间的推移，该项目的范围得到了扩展。如今，正在研究新问题，例如采用学习算法定义提取程序，从具有以表格为中心的视觉外观的网页自动提取数据以及从其他文档格式（例如PDF）提取数据。

著录项

来源
《British National Conference on Databases(BNCOD 23); 20060718-23; Belfast(GB)》|2006年|P.1-15|共15页
会议地点 Belfast(GB)
作者
Julien Carme; Michal Ceresna; Oliver Froelich; Georg Gottlob; Tamir Hassan; Marcus Herzog; Wolfgang Holzinger; Bernhard Kruepl;
展开▼
作者单位

Vienna University of Technology, Database and Artificial Intelligence Group, Favoritenstrasse 9-11, A-1040 Wien, Austria;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词

相似文献

外文文献
中文文献
专利

1. Are there any frontiers of research performance? Efficiency measurement of funded research projects with the Bayesian stochastic frontier analysis for count data [J] . Mutz Rudiger, Bornmann Lutz, Daniel Hans-Dieter Journal of informetrics . 2017,第3期

机译：研究绩效有哪些前沿领域？贝叶斯随机前沿分析对计数数据的资助研究项目效率评估
2. LINKING OPEN DATA WIKI OF THE WORLD WIDE WEB CONSORTIUM (W3C) SEMANTIC WEB EDUCATION AND OUTREACH (SWEO) COMMUNITY PROJECT http://esw.w3.org/SweoIG/TaskForces/CommunityProjects/LinkingOpenData [J] . Mary Mallery Technical services quarterly . 2011,第1期

机译：链接全球Web站点（W3C）的语义Web教育和推广（SWEO）社区项目的开放数据维基百科。
3. Exploring frontier areas using 2D seismic and 3D CSEM data, as exemplified by multi-client data over the skrugard and havis discoveries in the barents sea [J] . Gabrielsen P.T., Abrahamson P., Panzner M., First Break . 2013,第1期

机译：使用2D地震和3D CSEM数据探索前沿地区，例如在巴伦支海溜冰和哈维斯发现中的多客户数据举例说明
4. The Lixto Project: Exploring New Frontiers of Web Data Extraction [C] . Julien Carme, Michal Ceresna, Oliver Frolich, British National Conference on Databases(BNCOD 23) . 2006

机译：LIXTO项目：探索网络数据提取的新边界
5. Design and Development of Intelligent Web Mining System for Extraction of Information from Web Databases [D] . Sharma, Sanjeev Kumar. 2010

机译：Web数据库提取信息的智能网络挖掘系统的设计与开发
6. Cancer diagnosis marker extraction for soft tissue sarcomas based on gene expression profiling data by using projective adaptive resonance theory (PART) filtering method [O] . Hiro Takahashi, Takeshi Nemoto, Teruhiko Yoshida, 2006

机译：基于投射自适应共振理论（PART）过滤的基因表达谱数据提取软组织肉瘤癌症诊断标志
7. The Lixto Data Extraction Project − Back and Forth between Theory and Practice [O] . Georg Gottlob, Christoph Koch, Robert Baumgartner, 2004

机译：Lixto数据提取项目-理论与实践之间的往返

The Lixto Project: Exploring New Frontiers of Web Data Extraction

摘要

著录项

相似文献

相关主题

期刊订阅