首页> 外文期刊>Information Processing & Management >A discrete mixture-based kernel for SVMs: Application to spam and image categorization
【24h】

A discrete mixture-based kernel for SVMs: Application to spam and image categorization

机译:用于SVM的基于混合的离散内核:在垃圾邮件和图像分类中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we investigate the problem of training support vector machines (SVMs) on count data. Multinomial Dirichlet mixture models allow us to model efficiently count data. On the other hand, SVMs permit good discrimination. We propose, then, a hybrid model that appropriately combines their advantages. Finite mixture models are introduced, as an SVM kernel, to incorporate prior knowledge about the nature of data involved in the problem at hand. For the learning of our mixture model, we propose a deterministic annealing component-wise EM algorithm mixed with a minimum description length type criterion. In the context of this model, we compare different kernels. Through some applications involving spam and image database categorization, we find that our data-driven kernel performs better.
机译:在本文中,我们研究了在计数数据上训练支持向量机(SVM)的问题。多项式Dirichlet混合模型使我们能够对有效的计数数据进行建模。另一方面,SVM允许很好的区分。然后,我们提出了一种混合模型,可以适当地结合它们的优势。引入了有限的混合模型作为SVM内核,以结合有关手头问题所涉及数据性质的先验知识。为了学习我们的混合模型,我们提出了一种混合确定性退火成分的EM算法,并结合了最小描述长度类型准则。在此模型的上下文中,我们比较了不同的内核。通过一些涉及垃圾邮件和图像数据库分类的应用程序,我们发现数据驱动的内核性能更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号