首页> 外文期刊>Journal of Discrete Mathematical Sciences and Cryptography >Identification of sarcasm using word embeddings and hyperparameters tuning
【24h】

Identification of sarcasm using word embeddings and hyperparameters tuning

机译:使用Word Embeddings和HyperParameter调整识别讽刺

获取原文
获取原文并翻译 | 示例
           

摘要

Around the world, most of the proposed techniques for the identification of sarcasm either take the utterance in isolation or these methods only perform the categorization of the textual data. Very limited work has been done on how to train or manipulate the various parameters related to textual data so that to improve on the accuracy of the classification method. In this article, we are trying to identify the sarcasm in the textual data using neural networks. We have tried to classify the data using convolutional neural networks (CNN), recurrent neural networks (RNN) and a blend of these techniques to improve accuracy. Our work is not limited to the classification of the sarcastic text, we have also tried to measure the impact of the training data, number of epochs and amount of dropout in the network. The paper also discusses the impact of various embedding on the dataset when converting the same dataset into vectors via different word embeddings. We measured the influence of various parameters on the very large-scale Reddit1 corpus.
机译:在世界各地,大多数提出的技术用于识别讽刺的孤立或这些方法只能遵守文本数据的分类。如何在如何培训或操纵与文本数据相关的各种参数时完成了非常有限的工作,以便提高分类方法的准确性。在本文中,我们正在尝试使用神经网络识别文本数据中的讽刺​​。我们尝试使用卷积神经网络(CNN),经常性神经网络(RNN)和这些技术的混合来对数据进行分类,以提高精度。我们的工作不仅限于讽刺案文的分类,我们还试图衡量培训数据,时期数量和辍学量的影响。本文还讨论了通过不同的单词嵌入式将相同数据集转换为向量时各种嵌入对数据集的影响。我们测量了各种参数对非常大的Reddit1语料库的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号