...
首页> 外文期刊>Transactions in GIS: TG >Reference data enhancement for geographic information retrieval using linked data
【24h】

Reference data enhancement for geographic information retrieval using linked data

机译:使用链接数据的地理信息检索参考数据增强

获取原文
获取原文并翻译 | 示例
           

摘要

Gazetteers are instrumental in recognizing place names in documents such as Web pages, news, and social media messages. However, creating and maintaining gazetteers is still a complex task. Even though some online gazetteers provide rich sets of geographic names in planetary scale (e.g. GeoNames), other sources must be used to recognize references to urban locations, such as street names, neighborhood names or landmarks. We propose integrating Linked Data sources to create a gazetteer that combines a broad coverage of places with urban detail, including content on geographic and semantic relationships involving places, their multiple names and related non-geographic entities. Our final goal is to expand the possibilities for recognizing, disambiguating and filtering references to places in texts for geographic information retrieval (GIR) and related applications. The resulting ontological gazetteer, named LoG (Linked OntoGazetteer), is accessible through Web services by applications and research initiatives on GIR, text processing, named entity recognition and others. The gazetteer currently contains over 13 million places, 140 million attributes and relationships, and 4.5 million non-geographic entities. Data sources include GeoNames, Freebase, DBPedia and LinkedGeoData, which is based on OpenStreetMap data. An analysis on how these datasets overlap and complement one another is also presented.
机译:公鸡是识别在网页,新闻和社交媒体消息等文件中的地方名称的工具。但是,创建和维护公鸡仍然是一个复杂的任务。尽管一些在线公布者在行星规模(例如Geonames)中提供丰富的地理名称,但必须使用其他来源来识别对城市地点的引用,例如街道名称,邻居名称或地标。我们建议集成链接的数据来源,以创建一个宪录,这些宪录将具有城市细节的广泛覆盖范围,包括涉及地理和语义关系的内容,涉及地点,他们的多个名称和相关非地理实体。我们的最终目标是扩大识别,消除和过滤对地理信息检索(GIR)和相关申请的文本中的参考的可能性。由GIR,文本处理,命名实体识别等的应用程序和研究举措,通过Web服务访问所产生的Ontological Gazeteer,命名日志(链接的Ontogazeere)可通过Web服务访问。瞪羚目前含有超过1300万场,1.4亿个属性和关系,450万非地理实体。数据源包括地理游览,自由贝级,DBPedia和LinkedgeData,它基于OpenStreetMap数据。还呈现了这些数据集如何重叠和补充的分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号