Hardtoparse: POS Tagging and Parsing the Twitter-verse

Abstract

We evaluate the statistical dependency parser, Malt, on a new dataset of sentences taken from tweets. We use a version of Malt which is trained on gold standard phrase structure Wall Street Journal (WSJ) trees converted to Stanford labeled dependencies. We observe a drastic drop in performance moving from our in-domain WSJ test set to the new Twitter dataset, much of which has to do with the propagation of part-of-speech tagging errors. Retraining Malt on dependency trees produced by a state-of-the-art phrase structure parser, which has itself been self-trained on Twitter material, results in a significant improvement. We analyse this improvement by examining in detail the effect of the retraining on individual dependency types.

Bibliographic details

Source
AAAI Workshop on Analyzing Microtext | 2011 | 6 pages
Authors
Jennifer Foster; Oezlem Cetinoglu; Joachim Wagner; Joseph Le Roux; Stephen Hogan; Joakim Nivre; Deirdre Hogan; Josef van Genabith
Original format: PDF
Chinese Library Classification: TP18-53

Similar literature

1. From Genesis to Creole Language: Transfer Learning for Singlish Universal Dependencies Parsing and POS Tagging [J]. Wang Hongmin, Yang Jie, Zhang Yue. ACM transactions on Asian language information processing. 2020, Issue 1
2. From POS tagging to dependency parsing for biomedical event extraction [J]. Dat Quoc Nguyen, Karin Verspoor. BMC Bioinformatics. 2019, Issue 1
3. Character-Level Dependency Model for Joint Word Segmentation, POS Tagging, and Dependency Parsing in Chinese [J]. Zhen GUO, Yujie ZHANG, Chen SU. IEICE transactions on information and systems. 2016, Issue 1
4. Hardtoparse: POS Tagging and Parsing the Twitter-verse [C]. Jennifer Foster, Oezlem Cetinoglu, Joachim Wagner. AAAI Workshop on Analyzing Microtext. 2011
5. Using a named entity tagger and a syntactic parser to improve Web-based answer extraction [D]. Kamel, Yasser. 2004
6. From POS tagging to dependency parsing for biomedical event extraction [O]. Dat Quoc Nguyen, Karin Verspoor. 2019
7. A Chinese Efficient Analyser Integrating Word Segmentation, Part-Of-Speech Tagging, Partial Parsing and Full Parsing [O]. Guodong Zhou. 2008
