DOI: 10.1145/1370750.1370777
Research article

Correctness of data mined from CVS

Published: 10 May 2008

Abstract

Source code repositories managed by the popular CVS are frequently mined by researchers to validate hypotheses about how the development of a software product progressed. This paper presents a study that followed the development process of 17 student teams. It shows that, in an environment where the teams were permitted to manage their own CVS repositories, several errors emerged that could lead to a hypothesis being misinterpreted. These errors were classified into three types: type one errors, which relate to non-use of the system; type two errors, which arise from direct manipulation of the repository; and type three errors, which stem from the inability of CVS to record file name changes.
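To make the type three error concrete: because CVS does not record a rename, a renamed file appears in the history as one file being removed and an unrelated file being added. The following is a minimal sketch of one way a mining script might flag such pairs, assuming the history has already been parsed into simple records. The `Event` record, its field names, the example paths, and the five-minute window are all hypothetical choices for illustration; this is not the method used in the paper.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Event:
    """One parsed history entry (hypothetical shape, not a CVS format)."""
    path: str
    action: str  # "added" or "removed"
    author: str
    when: datetime


def rename_candidates(events, window=timedelta(minutes=5)):
    """Pair each removal with additions by the same author close in time.

    The result is only a list of candidates: a removal followed by an
    unrelated addition would be flagged as well.
    """
    removed = [e for e in events if e.action == "removed"]
    added = [e for e in events if e.action == "added"]
    pairs = []
    for r in removed:
        for a in added:
            if a.author == r.author and abs(a.when - r.when) <= window:
                pairs.append((r.path, a.path))
    return pairs


# Assumed example data: one likely rename and one unrelated addition.
EXAMPLE_EVENTS = [
    Event("src/Main.java", "removed", "alice", datetime(2008, 3, 1, 10, 0)),
    Event("src/App.java", "added", "alice", datetime(2008, 3, 1, 10, 2)),
    Event("doc/notes.txt", "added", "bob", datetime(2008, 3, 1, 10, 1)),
]

if __name__ == "__main__":
    for old, new in rename_candidates(EXAMPLE_EVENTS):
        print(f"possible rename: {old} -> {new}")
```

A heuristic like this can only suggest candidates. Comparing file contents or manually inspecting the flagged pairs would still be needed before treating a pair as a genuine rename.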

Published In

MSR '08: Proceedings of the 2008 international working conference on Mining software repositories
May 2008
162 pages
ISBN: 9781605580241
DOI: 10.1145/1370750
Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. CVS
  2. correctness
  3. data mining

Qualifiers

  • Research-article

Conference

ICSE '08