Unsupervised Compound Splitting With Distributional Semantics Rivals Supervised Methods

Abstract

In this paper we present a word decompounding method based on distributional semantics. Our method does not require any linguistic knowledge and is initialized using a large monolingual corpus. The core idea of our approach is that parts of compounds (like "candle" and "stick") are semantically similar to the entire compound, which helps to exclude spurious splits (like "candles" and "tick"). We report results for German and Dutch. For German, our unsupervised method performs on par with a rule-based and a supervised method and significantly outperforms two unsupervised baselines. For Dutch, our method performs only slightly below a rule-based optimized compound splitter.
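
The core idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the similarity scores are invented for illustration (a real system would derive them from a distributional model trained on a large monolingual corpus), and the names `TOY_SIMILARITY`, `candidate_splits`, `score_split`, and `best_split`, as well as the choice to score a split by its weakest part, are assumptions made here.

```python
from typing import Dict, List, Tuple

# Hypothetical similarity scores between a compound and a candidate part.
# The numbers are invented; they stand in for scores from a distributional
# model trained on a large monolingual corpus.
TOY_SIMILARITY: Dict[Tuple[str, str], float] = {
    ("candlestick", "candle"): 0.62,
    ("candlestick", "stick"): 0.41,
    ("candlestick", "candles"): 0.58,
    ("candlestick", "tick"): 0.03,
}


def similarity(compound: str, part: str) -> float:
    """Look up the toy similarity; unknown pairs count as unrelated."""
    return TOY_SIMILARITY.get((compound, part), 0.0)


def candidate_splits(word: str, min_len: int = 3) -> List[Tuple[str, str]]:
    """Enumerate every two-part split whose parts have at least min_len characters."""
    return [(word[:i], word[i:]) for i in range(min_len, len(word) - min_len + 1)]


def score_split(word: str, parts: Tuple[str, ...]) -> float:
    """Score a split by its least similar part (an assumption of this sketch)."""
    return min(similarity(word, part) for part in parts)


def best_split(word: str, threshold: float = 0.1) -> Tuple[str, ...]:
    """Return the best-scoring split, or the unsplit word if nothing clears the threshold."""
    scored = [(score_split(word, split), split) for split in candidate_splits(word)]
    if not scored:
        return (word,)
    score, split = max(scored)
    return split if score >= threshold else (word,)


print(best_split("candlestick"))  # ('candle', 'stick')
```

With these toy scores, the split "candles" + "tick" is rejected because "tick" is barely related to "candlestick", while "candle" + "stick" is kept because both parts are related to the whole word.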

Bibliographic details

Source
Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies | 2016 | pp. 617-622 | 6 pages
Authors
Martin Riedl; Chris Biemann
Original format: PDF

Similar literature

1. Use Supervise - unsupervised methods for tweet level Contextual semantics [J]. Shilpi Goyal, Nirupama Tiwari. International Journal of Engineering Trends and Technology. 2017, No. 4
2. Unsupervised and Semi-Supervised Image Classification With Weak Semantic Consistency [J]. Zhang Chunjie, Cheng Jian, Tian Qi. IEEE Transactions on Multimedia. 2019, No. 10
3. Unsupervised and supervised exploitation of semantic domains in lexical disambiguation [J]. Alfio Gliozzo, Carlo Strapparava, Ido Dagan. Computer Speech and Language. 2004, No. 3
4. Unsupervised Compound Splitting With Distributional Semantics Rivals Supervised Methods [C]. Martin Riedl, Chris Biemann. Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016
5. Supervised and Unsupervised Learning for Semantics Distillation in Multimedia Processing [D]. Liu, Yu. 2018
6. Comparison of Supervised and Unsupervised Deep Learning Methods for Medical Image Synthesis between Computed Tomography and Magnetic Resonance Images [O]. Yafen Li, Wen Li, Jing Xiong. 2020
7. Unsupervised Compound Splitting With Distributional Semantics Rivals Supervised Methods [O]. Martin Riedl, Chris Biemann. 2016
