Combining Probabilistic Ranking and Latent Semantic Indexing for Feature Identification
Citations Over TimeTop 1% of 2006 papers
Abstract
The paper recasts the problem of feature location in source code as a decision-making problem in the presence of uncertainty. The main contribution consists in the combination of two existing techniques for feature location in source code. Both techniques provide a set of ranked facts from the software, as result to the feature identification problem. One of the techniques is based on a scenario based probabilistic ranking of events observed while executing a program under given scenarios. The other technique is defined as an information retrieval task, based on the latent semantic indexing of the source code. We show the viability and effectiveness of the combined technique with two case studies. A first case study is a replication of feature identification in Mozilla, which allows us to directly compare the results with previously published data. The other case study is a bug location problem in Mozilla. The results show that the combined technique improves feature identification significantly with respect to each technique used independently
Related Papers
- → Analysis in indexing: document and domain centered approaches(2004)100 cited
- → A Review on Indexing Techniques and its application in Multilingual Information Retrieval System(2021)3 cited
- → Discussion on the Accurate Modeling of Cylindrical Indexing Cam(2013)
- The problems of Chinese Library Classification(CLC) in indexing of scientific and technical journals and the indexing principles(2008)
- A closer look on indexing and indexing parameters(2020)