Automatic bug labeling using semantic information from LSI

Abstract

Most open source projects provide a defect tracking system in which users, developers, and testers can report problems directly. The fields of a bug report help the triager and the debugger understand the problem better. They also support other tasks, such as accurately assessing the priority and severity of a bug and identifying an appropriate developer to resolve it. The label field is one such field. In many bug repositories, however, the label field is either absent or incorrectly assigned, so automatic bug labeling is needed to make bug reports more informative. This paper presents an automated technique for bug labeling using TF-IDF and LSI (Latent Semantic Indexing). An experimental study shows that results improve when semantically similar words obtained from LSI are added to the terms extracted using TF-IDF. Using LSI together with TF-IDF, we achieved 61.5% accuracy for polish bug reports and 62.8% accuracy for security bug reports, compared with 53.8% for polish and 61% for security bug reports using TF-IDF alone (gains of 7.7 and 1.8 percentage points, respectively).
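The abstract names the two ingredients (TF-IDF term extraction and LSI-based expansion with semantically similar words) but not the details of the pipeline. The following is a minimal sketch of one way the idea could be realized, not the authors' implementation. The sample reports, the stop-word list, the rank of 2, the overlap-based `predict` step, and every function name are assumptions made for illustration. To stay dependency-free, the truncated SVD behind LSI is approximated with power iteration rather than a library routine.

```python
import math
from collections import Counter

# Toy bug-report summaries; texts and labels are invented for illustration.
REPORTS = [
    ("security", "buffer overflow crash in url parser"),
    ("security", "buffer overflow exploit in image decoder crash"),
    ("security", "exploit allows script injection in url bar"),
    ("polish", "toolbar icon misaligned in settings dialog"),
    ("polish", "settings dialog font too small crash on resize"),
    ("polish", "toolbar icon blurry on resize"),
]
LABELS = ["security", "polish"]
STOP = {"in", "on", "too"}  # assumed, deliberately tiny stop-word list

def tokenize(text):
    return [w for w in text.lower().split() if w not in STOP]

docs = [tokenize(text) for _, text in REPORTS]
vocab = sorted({w for d in docs for w in d})
df = Counter(w for d in docs for w in set(d))
# Smoothed IDF so that no weight is zero or negative.
idf = {w: math.log((1 + len(docs)) / (1 + df[w])) + 1 for w in vocab}
# Term-document TF-IDF matrix A: one row per term, one column per report.
A = [[d.count(w) * idf[w] for d in docs] for w in vocab]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def top_tfidf_terms(label, n=3):
    """Terms with the highest summed TF-IDF weight over reports with `label`."""
    cols = [j for j, (lab, _) in enumerate(REPORTS) if lab == label]
    score = {w: sum(A[i][j] for j in cols) for i, w in enumerate(vocab)}
    return sorted(vocab, key=lambda w: -score[w])[:n]

def right_singular_vectors(k, iters=200):
    """Top-k right singular vectors of A: power iteration on A^T A with deflation."""
    cols = [list(c) for c in zip(*A)]
    gram = [[dot(ci, cj) for cj in cols] for ci in cols]  # A^T A (reports x reports)
    basis = []
    for r in range(k):
        x = [1.0 + r * j for j in range(len(gram))]  # deterministic start vector
        for _ in range(iters):
            x = [dot(row, x) for row in gram]
            for b in basis:  # remove components along vectors already found
                p = dot(x, b)
                x = [xi - p * bi for xi, bi in zip(x, b)]
            norm = math.sqrt(dot(x, x))
            x = [xi / norm for xi in x]
        basis.append(x)
    return basis

# Rank-2 LSI term vectors: each row of A projected onto the top right
# singular vectors (A V_k = U_k S_k).
V = right_singular_vectors(k=2)
TERM_VECS = {w: [dot(A[i], v) for v in V] for i, w in enumerate(vocab)}

def cosine(u, v):
    den = math.sqrt(dot(u, u)) * math.sqrt(dot(v, v))
    return dot(u, v) / den if den else 0.0

def lsi_neighbours(term, n=2):
    """The n vocabulary terms closest to `term` in the LSI space."""
    others = [w for w in vocab if w != term]
    return sorted(others, key=lambda w: -cosine(TERM_VECS[term], TERM_VECS[w]))[:n]

def label_terms(label, n_seed=3, n_extra=2):
    """TF-IDF seed terms for a label plus their LSI neighbours."""
    seeds = top_tfidf_terms(label, n_seed)
    expanded = set(seeds)
    for s in seeds:
        expanded.update(lsi_neighbours(s, n_extra))
    return expanded

def predict(text):
    """Assign the label whose expanded term set overlaps the report most."""
    words = set(tokenize(text))
    return max(LABELS, key=lambda lab: len(words & label_terms(lab)))

print(top_tfidf_terms("security"))
print(sorted(label_terms("security")))
print(predict("buffer overflow when opening a crafted image"))
```

In this sketch, terms that always occur together (here "buffer" and "overflow") receive identical LSI vectors, which is the kind of semantic relatedness the expansion step relies on. On a real bug repository a library SVD and a proper train/test split would likely replace the toy pieces.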

Bibliographic details

Source
International Conference on Contemporary Computing, 2014, pp. 376-381 (6 pages)
Authors
Chawla Indu; Singh Sandeep K
Original format: PDF
Keywords
Accuracy; Browsers; Computer bugs; Labeling; Large scale integration; Security; Training; Bug categorization; Bug labeling; LSI; Latent Semantic Indexing

Similar literature

1. MUVI: Automatically Inferring Multi-Variable Access Correlations and Detecting Related Semantic and Concurrency Bugs [J]. Shan Lu, Soyeon Park, Chongfeng Hu. Operating systems review, 2007, no. 6.
2. Unsupervised Learning Method for Sorting Positive and Negative Reviews Using LSI (Latent Semantic Indexing) with Automatic Generated Queries [J]. Sheikh Muhammad Saqib, Fazal Masud Kundi, Shakeel Ahmad. International journal of computer science and network security, 2018, no. 1.
3. Automatic Question Generation Using Semantic Role Labeling for Morphologically Rich Languages [J]. Vasić Daniel, Žitko Branko, Ljubić Hrvoje. Technical Gazette, 2021, no. 3.
4. Automatic bug labeling using semantic information from LSI [C]. Chawla Indu, Singh Sandeep K. International Conference on Contemporary Computing, 2014.
5. Detecting Semantic Bugs in Autopilot Software by Classifying Anomalous Variables [D]. Huang, Hu. 2019.
6. BIOSMILE: A semantic role labeling system for biomedical verbs using a maximum-entropy model with automatically generated template features [O]. Richard Tzong-Han Tsai, Wen-Chi Chou, Ying-Shan Su. 2007.
7. Generalization of Semantic Roles in Automatic Semantic Role Labeling [O]. Yuichiroh Matsubayashi, Naoaki Okazaki, Jun’ichi Tsujii. 2010.
