Latin word stemming using Wiktionary

Khoury Richard; Sapsford Francesca


Abstract

This article demonstrates how to automatically build a Latin word stemmer to transform words into their grammatical roots. By using the Wiktionary database as source data, it becomes possible to build such a tool with several hundreds of thousands of words. Our experiments demonstrate that it can then be used to correctly find the root of 78% of the words of Martial's Epigrams, and can be combined with other linguistic tools such as the Latin WordNet to greatly enhance their language coverage. While our research focuses on the Latin language, the same methodology could be used to build stemmers and other linguistic tools for many other ancient languages represented in Wiktionary, such as Ancient Greek or Old Armenian.
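The abstract does not spell out the implementation, so the following is only a minimal sketch of the general idea: a lookup-based stemmer built from (inflected form, root) pairs, plus a simple coverage measure over a text. The pairs in `FORM_LEMMA_PAIRS` are hypothetical stand-ins for data that would be extracted from Wiktionary, the sample sentence is made up rather than taken from Martial, and the function names `build_stem_table`, `stem`, and `coverage` are illustrative assumptions, not the authors' code.

```python
import string

# Hypothetical (inflected form, root) pairs, standing in for entries
# that would be extracted from Wiktionary's Latin inflection tables.
FORM_LEMMA_PAIRS = [
    ("amo", "amo"),
    ("amas", "amo"),
    ("amat", "amo"),
    ("amant", "amo"),
    ("puella", "puella"),
    ("puellae", "puella"),
    ("puellam", "puella"),
    ("rosa", "rosa"),
    ("rosam", "rosa"),
    ("rosas", "rosa"),
]


def build_stem_table(pairs):
    """Map each inflected form to the set of roots it may belong to."""
    table = {}
    for form, lemma in pairs:
        table.setdefault(form.lower(), set()).add(lemma)
    return table


def normalize(token):
    """Lowercase a token and strip surrounding ASCII punctuation."""
    return token.strip(string.punctuation).lower()


def stem(token, table):
    """Return the sorted candidate roots, or an empty list if unknown."""
    return sorted(table.get(normalize(token), ()))


def coverage(tokens, table):
    """Fraction of non-empty tokens for which at least one root is found."""
    words = [t for t in tokens if normalize(t)]
    if not words:
        return 0.0
    known = sum(1 for t in words if stem(t, table))
    return known / len(words)


TABLE = build_stem_table(FORM_LEMMA_PAIRS)

# Made-up sample sentence, used only to exercise the functions.
SAMPLE = "Puellae rosas amant, poeta ridet.".split()

if __name__ == "__main__":
    for word in SAMPLE:
        print(word, "->", stem(word, TABLE))
    print("coverage:", coverage(SAMPLE, TABLE))
```

Note that this coverage figure only counts words that appear in the table; the 78% reported in the abstract refers to words whose root was found correctly, which would additionally require comparing each result against a reference root.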


Bibliographic details

Source
Literary & Linguistic Computing | 2016, Issue 2 | pp. 368-373 | 6 pages
Authors
Khoury Richard; Sapsford Francesca
Affiliation

Lakehead Univ, Thunder Bay, ON, Canada

Original format: PDF
Language: English

