A POI Categorization by Composition of Onomastic and Contextual Information

机译：通过本体信息和上下文信息的组合进行POI分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Point of interest (POI) categorization is the task of finding of categories of POIs within a document. Because the documents that possess POIs have clue words for identifying POI categories, the task can be solved as document classification. However, this approach misses two crucial factors for identifying the category of a POI. First, the approach pays no attention to onomastic information, even though POI names reveal much categorical information in many cases. Second, the approach ignores the fact that most clue words for identifying a POI category are located near the POI name. This paper proposes a novel method that incorporates both onomastic and local contextual information in POI categorization. The proposed method uses support vector machines (SVMs) to categorize POIs. In order to utilize the onomastic information of POIs, The proposed method adopts the string kernel that manages variations of the POI names efficiently at the character level. The method also proposes a Gaussian weighting to content words in a document. By setting the mean of a Gaussian weighting at the position of a POI name, the method imposes higher weights to the words near the POI name and lower weights to the words far from the name. Then, these two types of information are combined by a composite kernel of the string kernel and a linear kernel with the Gaussian weighting. A series of experiments prove that SVMs with the combined information outperforms those with single information.

机译：兴趣点（POI）归类是在文档中查找POI类别的任务。因为拥有POI的文档具有用于标识POI类别的线索，所以可以将任务作为文档分类来解决。但是，此方法缺少识别POI类别的两个关键因素。首先，即使POI名称在许多情况下都揭示了很多分类信息，该方法也不关注正则信息。其次，该方法忽略了以下事实：大多数用于标识POI类别的线索词都位于POI名称附近。本文提出了一种新颖的方法，该方法在POI分类中结合了异常信息和局部上下文信息。所提出的方法使用支持向量机（SVM）对POI进行分类。为了利用POI的本体信息，该方法采用了字符串核，该字符串核在字符级别上有效地管理POI名称的变化。该方法还提出了对文档中的内容词的高斯加权。通过在POI名称的位置处设置高斯加权的平均值，该方法对POI名称附近的单词施加较高的权重，而对远离名称的单词施加较高的权重。然后，这两种类型的信息由具有高斯加权的字符串核和线性核的复合核组合而成。一系列实验证明，具有组合信息的SVM优于具有单一信息的SVM。

著录项

来源
《IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies》|2014年|38-45|共8页
会议地点
作者
Su Jeong Choi; Hyun Je Song; Seong Bae Park; Sang Jo Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Gaussian processes; document handling; pattern classification; support vector machines; Gaussian weighting; POI categorization; SVM; composite kernel; contextual information composition; document classification; linear kernel; onomastic information composition; point of interest categorization; string kernel; support vector machines; Context; Equations; Kernel; Marine animals; Support vector machines; Vectors; Contextual information; Kernel composition; Onomastic information; POI categorization;

机译：高斯过程;文档处理;模式分类;支持向量机;高斯加权; POI分类; SVM;复合核;上下文信息组合;文档分类;线性核;正反信息构成;兴趣点分类;字符串核;支持向量机;上下文;方程;内核;海洋动物;支持向量机;向量;上下文信息;内核组成;本体信息; POI分类;

相似文献

外文文献
中文文献
专利

1. Contextual Text Categorization: An Improved Stemming Algorithm to Increase the Quality of Categorization in Arabic Text [J] . Gadri Said, Moussaoui Abdelouahab The international arab journal of information technology . 2017,第6期

机译：上下文文本分类：一种改进的词干算法，可提高阿拉伯文本分类的质量
2. Visual Object Categorization Based on Hierarchical Shape Motifs Learned From Noisy Point Cloud Decompositions [J] . Mueller Christian A., Birk Andreas Journal of Intelligent & Robotic Systems: Theory & Application . 2020,第2期

机译：基于分层形状图案的视觉对象分类从嘈杂的点云分解中学到的
3. Contextual Influences on Phonetic Categorization in School-Aged Children [J] . Campbell Jean A., McSherry Heather L., Theodore Rachel M. Frontiers in Communication . 2018,第3期

机译：语境对学龄儿童语音分类的影响
4. A POI Categorization by Composition of Onomastic and Contextual Information [C] . Su Jeong Choi, Hyun Je Song, Seong Bae Park, IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies . 2014

机译：由泛色和上下文信息的组成的POI分类
5. Perceiving speech in context: Compensation for contextual variability during acoustic cue encoding and categorization. [D] . Toscano, Joseph Christopher. 2011

机译：在上下文中感知语音：在声学提示编码和分类过程中补偿上下文变化。
6. Contextual control of stimulus generalization and stimulus equivalence in hierarchical categorization. [O] . Karen Griffee, Michael J Dougher 2002

机译：层次化分类中刺激泛化和等同刺激的上下文控制。
7. Contextual Homogeneity-Based Patch Decomposition Method for Higher Point Cloud Compression [O] . Sungryeul Rhyu, Junsik Kim, Jiheon Im, 2020

机译：基于语境的同质性的贴片分解方法，用于较高点云压缩
8. Methods and Compositions for Categorizing Patients. [R] . Markowitz, S. D. 2003

机译：用于分类患者的方法和组合物。

A POI Categorization by Composition of Onomastic and Contextual Information

摘要

著录项

相似文献

相关主题

期刊订阅