首页> 外文学位 >Learning embeddings for indexing, retrieval, and classification, with applications to object and shape recognition in image databases.
【24h】

Learning embeddings for indexing, retrieval, and classification, with applications to object and shape recognition in image databases.

机译:学习用于索引,检索和分类的嵌入,并将其应用于图像数据库中的对象和形状识别。

获取原文
获取原文并翻译 | 示例

摘要

Nearest neighbor retrieval is the task of identifying, given a database of objects and a query object, the objects in the database that are the most similar to the query. Retrieving nearest neighbors is a necessary component of many practical applications, in fields as diverse as computer vision, pattern recognition, multimedia databases, bioinformatics, and computer networks. This thesis proposes new methods for improving the efficiency and accuracy of nearest neighbor retrieval and classification in spaces with computationally expensive distance measures.; The first contribution of this thesis is the BoostMap algorithm for embedding arbitrary spaces into a vector space with a computationally efficient distance measure. Using this approach, an approximate set of nearest neighbors can be retrieved efficiently---often orders of magnitude faster than retrieval using the exact distance measure in the original space. In BoostMap, embedding construction is treated as a machine learning problem. The learning-based formulation leads to an algorithm that directly maximizes the amount of nearest neighbor structure preserved by the embedding, without making any assumptions about the underlying geometry of the original space. The second contribution consists of extending BoostMap to produce, together with the embedding, a query-sensitive distance measure for the target space of the embedding. In high-dimensional spaces, query-sensitive distance measures allow for automatic selection of the dimensions that are the most informative for each specific query object. The third contribution is a method for speeding up nearest neighbor classification by combining multiple embedding-based nearest neighbor classifiers in a cascade structure. In cascade-based classification, computationally efficient classifiers are used to quickly classify easy cases, and classifiers that are more computationally expensive and also more accurate are only applied to objects that are harder to classify.; The proposed methods are evaluated experimentally in several different applications: hand shape recognition, off-line character recognition, online character recognition, and efficient retrieval of time series. In all datasets, the proposed methods lead to significant improvements in accuracy and efficiency compared to existing state-of-the-art methods.
机译:在给定对象数据库和查询对象的情况下,最近邻居检索是确定数据库中与查询最相似的对象的任务。在计算机视觉,模式识别,多媒体数据库,生物信息学和计算机网络等众多领域中,检索最近的邻居是许多实际应用的必要组成部分。本文提出了一种新的方法,利用计算量大的距离度量来提高空间中最近邻检索和分类的效率和准确性。本文的第一个贡献是BoostMap算法,该算法利用计算效率高的距离度量将任意空间嵌入向量空间。使用这种方法,可以有效地检索到一组近似的最近邻居-通常比在原始空间中使用精确距离度量进行检索要快几个数量级。在BoostMap中,将嵌入构造视为机器学习问题。基于学习的公式化导致一种算法,该算法可直接最大化通过嵌入保留的最近邻居结构的数量,而无需对原始空间的基础几何结构进行任何假设。第二个贡献包括扩展BoostMap以与嵌入一起生成针对嵌入目标空间的查询敏感距离度量。在高维空间中,对查询敏感的距离度量允许自动选择对每个特定查询对象最有用的维。第三贡献是一种通过在级联结构中组合多个基于嵌入的最近邻居分类器来加速最近邻居分类的方法。在基于级联的分类中,计算效率高的分类器用于快速分类容易的情况,而计算量更大,准确性也更高的分类器仅应用于较难分类的对象。在几种不同的应用中对提出的方法进行了实验评估:手形识别,离线字符识别,在线字符识别以及有效的时间序列检索。与现有的最新方法相比,在所有数据集中,所提出的方法均可以显着提高准确性和效率。

著录项

  • 作者

    Athitsos, Vassilis.;

  • 作者单位

    Boston University.;

  • 授予单位 Boston University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2006
  • 页码 160 p.
  • 总页数 160
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号