Are Bug Reports Enough for Text Retrieval-Based Bug Localization?

机译：错误报告足以用于基于文本检索的错误本地化？

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text Retrieval (TR) has been widely used to support many software engineering tasks, including bug localization (i.e., the activity of localizing buggy code starting from a bug report). Many studies show TR's effectiveness in lowering the manual effort required to perform this maintenance task; however, the actual usefulness of TR-based bug localization has been questioned in recent studies. These studies discuss (i) potential biases in the experimental design usually adopted to evaluate TRbased bug localization techniques and (ii) their poor performance in the scenario when they are needed most: when the bug report, which serves as the de facto query in most studies, does not contain localization hints (e.g., code snippets, method names, etc.) Fundamentally, these studies raise the question: do bug reports provide sufficient information to perform TR-based localization? In this work, we approach that question from two perspectives. First, we investigate potential biases in the evaluation of TR-based approaches which artificially boost the performance of these techniques, making them appear more successful than they are. Second, we analyze bug report text with and without localization hints using a genetic algorithm to derive a near-optimal query that provides insight into the potential of that bug report for use in TR-based localization. Through this analysis we show that in most cases the bug report vocabulary (i.e., the terms contained in the bug title and description) is all we need to formulate effective queries, making TR-based bug localization successful without supplementary query expansion. Most notably, this also holds when localization hints are completely absent from the bug report. In fact, our results suggest that the next major step in improving TR-based bug localization is the ability to formulate these near-optimal queries.

机译：文本检索（TR）已被广泛用于支持许多软件工程任务，包括错误本地化（即，从错误报告开始定位错误代码的活动）。许多研究表明TR在降低执行此维护任务所需的手动努力方面的有效性;然而，在最近的研究中，基于TR基本错误定位的实际有用性受到质疑。这些研究讨论（i）实验设计中的潜在偏见通常采用来评估TRBASed Bug定位技术和（ii）当您最需要的情况下它们在方案中的性能很差：当错误报告时，它是最重要的查询研究，不包含本地化提示（例如，代码片段，方法名称等），这些研究提出了问题：DO错误报告提供足够的信息来执行基于TR的本地化吗？在这项工作中，我们从两个角度来看这个问题。首先，我们调查评估TR基方法中的潜在偏见，这是人为地提高了这些技术的性能，使它们看起来比它们更成功。其次，我们使用遗传算法分析错误报告文本，并使用遗传算法导出近最优查询，为该错误报告提供了洞察，以便在基于TR的本地化中使用。通过此分析，我们显示在大多数情况下，错误报告词汇（即，错误标题和描述中包含的术语）是我们需要制定有效查询，使基于TR的BUG本地化成功，没有补充查询扩展。最值得注意的是，当来自错误报告完全没有本地化提示时，这也持有。事实上，我们的结果表明，改善基于TR的错误本地化的下一个重大步骤是能够制定这些近最佳查询。

著录项

来源
《IEEE International Conference on Software Maintenance and Evolution》|2018年|xxii 764 p. :|共12页
会议地点
作者
Chris Mills; Jevgenija Pantiuchina; Esteban Parra; Gabriele Bavota; Sonia Haiduc;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-532;
关键词
Computer bugs; Vocabulary; Task analysis; Genetic algorithms; Software engineering; Manuals; Software;

机译：计算机虫;词汇;任务分析;遗传算法;软件工程;手册;软件;

相似文献

外文文献
中文文献
专利

1. On the relationship between bug reports and queries for text retrieval-based bug localization [J] . Chris Mills, Esteban Parra, Jevgenija Pantiuchina, Empirical Software Engineering . 2020,第5期

机译：关于基于文本检索的错误本地化的错误报告与查询的关系
2. Are datasets for information retrieval-based bug localization techniques trustworthy? Impact analysis of bug types on IRBL [J] . Kim Misoo, Lee Eunseok Empirical Software Engineering . 2021,第3期

机译：数据集是基于信息检索的错误本地化技术可信赖吗？ IRBL上的错误类型的影响分析
3. Using bug descriptions to reformulate queries during text-retrieval-based bug localization [J] . Chaparro Oscar, Florez Juan Manuel, Marcus Andrian Empirical Software Engineering . 2019,第5期

机译：在基于文本检索的错误本地化过程中使用错误描述重新构造查询
4. Are Bug Reports Enough for Text Retrieval-Based Bug Localization? [C] . Chris Mills, Jevgenija Pantiuchina, Esteban Parra, IEEE International Conference on Software Maintenance and Evolution . 2018

机译：错误报告是否足以进行基于文本检索的错误本地化？
5. Automated Analysis of Bug Descriptions to Support Bug Reporting and Resolution [D] . Chaparro Arenas, Oscar Javier. 2019

机译：对支持Bug报告和解决方案的错误描述自动分析
6. Case Report: A Case of Cardboard Boxes Likely Facilitating the Biting of a Patient by Trypanosoma cruzi-Infected Triatomine Bugs [O] . Eduardo P. Dolhun, Andrew W. Antes 2016

机译：病例报告：一例纸板箱可能有助于克鲁斯锥虫感染的三角藻虫咬伤患者
7. Where should the bugs be fixed? More accurate information retrieval-based bug localization based on bug reports [O] . Jian Zhou, Hongyu Zhang, David Lo 2012

机译：错误应该在哪里修复？基于错误报告的基于Revical的基于错误定位更准确的信息

Are Bug Reports Enough for Text Retrieval-Based Bug Localization?

摘要

著录项

相似文献

相关主题

期刊订阅