원문정보
초록
영어
Open-source projects continue to evolve resulted in so many versions. Managing, visualizing and understanding an evolving software system are challenging tasks. In this work, we apply Latent Dirichlet Allocation (LDA) to discover source code topics and study their evolution over multiple software versions. We apply LDA to all versions of the system together and then determine how the assignment metric evolves over time. We study the software evolution of two large open-source projects, JEdit and JHotDraw, over twelve versions. The results show that changes in topics across versions are due to actual software changes such as adding, updating, and removing features. Our work suggests that using LDA can open many paths in software evolution research.
목차
1. Introduction
2. Background
3. Related Work
4. Approach and Case Study
4.1. Terminology
4.2. Systems Under Study
4.3. Preprocessing
4.4. Selecting the number of topics
4.5. The Generated Topics
4.6. JEdit Heatmap
4.7. JHotDraw Heatmap
5. Threats to Validity
6. Conclusion
References