dc.contributor.author | Rainer, A. | |
dc.contributor.author | Lane, P.C.R. | |
dc.contributor.author | Malcolm, J. | |
dc.contributor.author | Scholz, S. | |
dc.date.accessioned | 2013-01-15T14:29:05Z | |
dc.date.available | 2013-01-15T14:29:05Z | |
dc.date.issued | 2008 | |
dc.identifier.citation | Rainer, A, Lane, P C R, Malcolm, J & Scholz, S 2008, Using n-grams to rapidly characterise the evolution of software code. in Procs 23rd IEEE/ACM Int Conf on Automated Software Engineering: ASE Workshops 2008. Institute of Electrical and Electronics Engineers (IEEE), pp. 43-52. https://doi.org/10.1109/ASEW.2008.4686320 | |
dc.identifier.isbn | 978-1-4244-2776-5 | |
dc.identifier.other | dspace: 2299/3034 | |
dc.identifier.uri | http://hdl.handle.net/2299/9658 | |
dc.description | “This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.” “Copyright IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.” DOI: 10.1109/ASEW.2008.4686320 | |
dc.description.abstract | Text-based approaches to the analysis of software evolution are attractive because of the fine-grained, token-level comparisons they can generate. The use of such approaches has, however, been constrained by the lack of an efficient implementation. In this paper we demonstrate the ability of Ferret, which uses n-grams of 3 tokens, to characterise the evolution of software code. Ferret’s implementation operates in almost linear time and is at least an order of magnitude faster than the diff tool. Ferret’s output can be analysed to reveal several characteristics of software evolution, such as the lifecycle of a single file, the degree of change between two files, and possible regression. In addition, the similarity scores produced by Ferret can be aggregated to measure larger parts of the system being analysed. | en |
dc.format.extent | 10 | |
dc.format.extent | 454010 | |
dc.language.iso | eng | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.relation.ispartof | Procs 23rd IEEE/ACM Int Conf on Automated Software Engineering | |
dc.title | Using n-grams to rapidly characterise the evolution of software code | en |
dc.contributor.institution | School of Computer Science | |
dc.contributor.institution | Science & Technology Research Institute | |
dc.contributor.institution | Centre for Computer Science and Informatics Research | |
dc.contributor.institution | Department of Computer Science | |
dc.contributor.institution | School of Physics, Engineering & Computer Science | |
dc.identifier.url | http://www.scopus.com/inward/record.url?scp=58049141129&partnerID=8YFLogxK | |
rioxxterms.versionofrecord | 10.1109/ASEW.2008.4686320 | |
rioxxterms.type | Other | |
herts.preservation.rarelyaccessed | true | |
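
The abstract describes comparing files through n-grams of 3 tokens and deriving a similarity score. The following is a minimal sketch of that idea, not Ferret's actual implementation: the tokeniser, the function names, and the choice of a Jaccard-style score (shared trigrams divided by all distinct trigrams) are assumptions made for illustration only.

```python
import re


def tokenise(source: str) -> list[str]:
    """Hypothetical tokeniser: identifiers, integer literals, or single symbols."""
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", source)


def trigrams(tokens: list[str]) -> set[tuple[str, ...]]:
    """Return the set of distinct 3-token sequences in the token list."""
    return {tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)}


def similarity(a: str, b: str) -> float:
    """Assumed score: shared trigrams / all distinct trigrams (Jaccard)."""
    ta, tb = trigrams(tokenise(a)), trigrams(tokenise(b))
    union = ta | tb
    if not union:
        # Neither input is long enough to form a trigram.
        return 0.0
    return len(ta & tb) / len(union)


if __name__ == "__main__":
    old_version = "a = b + c;"
    new_version = "a = b - c;"
    print(similarity(old_version, new_version))
```

Because the trigrams are held in hash-based sets, building and intersecting them takes time roughly proportional to the number of tokens, which is one plausible way to approach the near-linear behaviour the abstract mentions. Scores for successive versions of a file could then be compared or aggregated across files, as the abstract suggests.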