Annotated Corpora for Word Alignment Between Japanese and English and its Evaluation with MAP-based Word Aligner


Abstract

This paper presents two annotated corpora for word alignment between Japanese and English, built on top of the IWSLT-2006 and NTCIR-8 corpora. The IWSLT-2006 corpus covers travel conversation, while the NTCIR-8 corpus covers patents. We annotated the first 500 sentence pairs of the IWSLT-2006 corpus and the first 100 sentence pairs of the NTCIR-8 corpus. After describing the annotation guidelines, we present two evaluation algorithms that use such hand-annotated corpora: one is well known to word alignment researchers, and the other is novel and is intended to evaluate the MAP-based word aligner of Okita et al. (2010b).
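The abstract does not name the well-known evaluation algorithm. Assuming it is the alignment error rate (AER) commonly used in word alignment research, a minimal sketch of how a hand-annotated corpus is scored might look like the following. The function name `alignment_scores`, the link representation as index pairs, and the split of gold links into "sure" and "possible" sets are assumptions made for illustration; they are not taken from the paper, and the novel MAP-oriented evaluation is not sketched here because the abstract gives no details of it.

```python
from typing import Set, Tuple

# A link pairs a source-word index with a target-word index (hypothetical representation).
Link = Tuple[int, int]


def alignment_scores(
    predicted: Set[Link], sure: Set[Link], possible: Set[Link]
) -> Tuple[float, float, float]:
    """Return (precision, recall, AER) for predicted links against gold links.

    Assumes the standard AER definition:
        precision = |A & P| / |A|
        recall    = |A & S| / |S|
        AER       = 1 - (|A & S| + |A & P|) / (|A| + |S|)
    where A is the predicted set, S the sure links, and P the possible links.
    """
    if not predicted or not sure:
        raise ValueError("predicted and sure link sets must be non-empty")

    # Sure links count as possible links by definition.
    possible = possible | sure

    hits_sure = len(predicted & sure)
    hits_possible = len(predicted & possible)

    precision = hits_possible / len(predicted)
    recall = hits_sure / len(sure)
    aer = 1.0 - (hits_sure + hits_possible) / (len(predicted) + len(sure))
    return precision, recall, aer


# Toy example with made-up links, not data from the annotated corpora.
gold_sure = {(0, 0), (1, 2)}
gold_possible = {(2, 1)}
system_links = {(0, 0), (2, 1), (3, 3)}

precision, recall, aer = alignment_scores(system_links, gold_sure, gold_possible)
print(f"precision={precision:.3f} recall={recall:.3f} AER={aer:.3f}")
```

Lower AER indicates closer agreement with the hand annotation. In practice the link sets would be pooled over all annotated sentence pairs before the ratios are computed.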


Bibliographic record

Source
International conference on language resources and evaluation | 2012 | pp. 3241-3248 | 8 pages
Author
Tsuyoshi Okita
Original format
PDF
Keywords
Annotated Corpus for Word Alignment; Statistical Machine Translation; Evaluation


