Show simple item record

dc.contributor.authorRainer, A.
dc.contributor.authorLane, P.C.R.
dc.contributor.authorMalcolm, J.
dc.contributor.authorScholz, S.
dc.date.accessioned2013-01-15T14:29:05Z
dc.date.available2013-01-15T14:29:05Z
dc.date.issued2008
dc.identifier.citationRainer , A , Lane , P C R , Malcolm , J & Scholz , S 2008 , Using n-grams to rapidly characterise the evolution of software code . in Procs 23rd IEEE/ACM Int Conf on Automated Software Engineering : ASE Workshops 2008 . Institute of Electrical and Electronics Engineers (IEEE) , pp. 43-52 . https://doi.org/10.1109/ASEW.2008.4686320
dc.identifier.isbn978-1-4244-2776-5
dc.identifier.otherdspace: 2299/3034
dc.identifier.urihttp://hdl.handle.net/2299/9658
dc.description“This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder." “Copyright IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.” DOI: 10.1109/ASEW.2008.4686320
dc.description.abstractText-based approaches to the analysis of software evolution are attractive because of the fine-grained, token-level comparisons they can generate. The use of such approaches has, however, been constrained by the lack of an efficient implementation. In this paper we demonstrate the ability of Ferret, which uses ngrams of 3 tokens, to characterise the evolution of software code. Ferret’s implementation operates in almost linear time and is at least an order of magnitude faster than the diff tool. Ferret’s output can be analysed to reveal several characteristics of software evolution, such as: the lifecycle of a single file, the degree of change between two files, and possible regression. In addition, the similarity scores produced by Ferret can be aggregated to measure larger parts of the system being analysed.en
dc.format.extent10
dc.format.extent454010
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartofProcs 23rd IEEE/ACM Int Conf on Automated Software Engineering
dc.titleUsing n-grams to rapidly characterise the evolution of software codeen
dc.contributor.institutionSchool of Computer Science
dc.contributor.institutionScience & Technology Research Institute
dc.identifier.urlhttp://www.scopus.com/inward/record.url?scp=58049141129&partnerID=8YFLogxK
rioxxterms.versionofrecord10.1109/ASEW.2008.4686320
rioxxterms.typeOther
herts.preservation.rarelyaccessedtrue


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record