Complex Word Identification Based on Frequency in a Learner Corpus

机译：学习者语料库中基于频率的复杂词识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce the TMU systems for the complex word identification (CWI) shared task 2018. TMU systems use random forest classifiers and regressors whose features are the number of characters and words and the frequency of target words in various corpora. Our simple systems performed best on 5 of the 12 tracks. Ablation analysis confirmed the usefulness of a learner corpus for a CWI task.

机译：我们介绍了用于复杂单词识别（CWI）共享任务2018的TMU系统。TMU系统使用随机森林分类器和回归器，其特征是各种语料库中字符和单词的数量以及目标单词的频率。我们的简单系统在12条轨道中的5条上表现最佳。消融分析证实了学习者语料库对CWI任务的有用性。

著录项

来源
《Thirteenth workshop on innovative use of NLP for building educational applications 2018》|2018年|195-199|共5页
会议地点 New Orleans(US)
作者
Tomoyuki Kajiwara; Mamoru Komachi;
展开▼
作者单位

Institute for Datability Science Osaka University Osaka, Japan,Graduate School of Systems Design Tokyo Metropolitan University Tokyo, Japan;

Graduate School of Systems Design Tokyo Metropolitan University Tokyo, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Multi-Word Expressions in Second Language Writing: A Large-Scale Longitudinal Learner Corpus Study [J] . Czechoslovak Mathematical Journal . 2020,第2期

机译：以第二语言写作的多字表达：大规模的纵向学习者语料库研究
2. L2 English Learners' Performance in Persuasion Role-Plays: A Learner-Corpus-Based Study [J] . Shinichiro Ishikawa International Journal of Computer-Assisted Language Learning and Teaching . 2021,第2期

机译：L2英语学习者在说服角色扮演中的表现：基于学习者的学习者的研究
3. Noun phrase complexity in young Spanish EFL learners' writing Complementing syntactic complexity indices with corpus-driven analyses [J] . Zeitschrift fur Arznei- und Gewurzpflanzen . 2020,第1期

机译：NUN短语在年轻西班牙语EFL学习者中与语料库分析的句法复杂性指数的文字复杂性
4. Complex Word Identification Based on Frequency in a Learner Corpus [C] . Tomoyuki Kajiwara, Mamoru Komachi Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 2018

机译：基于学习者语料库中频率的复杂词识别
5. Exploring recurrent word combinations in a business English learner corpus: A parallel corpus analysis and its curricular implications. [D] . Lopez Rodriguez, Jesus. 2006

机译：探索商务英语学习者语料库中的重复单词组合：并行语料库分析及其课程含义。
6. Manipulations of word frequency reveal differences in the processing of morphologically complex and simple words in German [O] . Maria Bronk, Pienie Zwitserlood, Jens Bölte 2013

机译：单词频率的处理揭示了德语中形态复杂和简单单词的处理差异
7. Complex Word Identification Based on Frequency in a Learner Corpus [O] . Tomoyuki Kajiwara, Mamoru Komachi 2018

机译：基于学习者语料库中频率的复杂词识别

Complex Word Identification Based on Frequency in a Learner Corpus

摘要

著录项

相似文献

相关主题

期刊订阅