Show simple item record

dc.contributor.authorRainer, A.
dc.contributor.authorLane, P.C.R.
dc.contributor.authorMalcolm, J.
dc.contributor.authorScholz, S.
dc.date.accessioned2013-01-15T14:29:05Z
dc.date.available2013-01-15T14:29:05Z
dc.date.issued2008
dc.identifier.citationRainer , A , Lane , P C R , Malcolm , J & Scholz , S 2008 , Using n-grams to rapidly characterise the evolution of software code . in Procs 23rd IEEE/ACM Int Conf on Automated Software Engineering : ASE Workshops 2008 . IEEE , pp. 43-52 . https://doi.org/10.1109/ASEW.2008.4686320
dc.identifier.isbn978-1-4244-2776-5
dc.identifier.otherPURE: 1421885
dc.identifier.otherPURE UUID: 11dfa9d2-fc9b-4c9c-b496-7965d24f8cfb
dc.identifier.otherdspace: 2299/3034
dc.identifier.otherScopus: 58049141129
dc.identifier.urihttp://hdl.handle.net/2299/9658
dc.description“This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder." “Copyright IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.” DOI: 10.1109/ASEW.2008.4686320
dc.description.abstractText-based approaches to the analysis of software evolution are attractive because of the fine-grained, token-level comparisons they can generate. The use of such approaches has, however, been constrained by the lack of an efficient implementation. In this paper we demonstrate the ability of Ferret, which uses ngrams of 3 tokens, to characterise the evolution of software code. Ferret’s implementation operates in almost linear time and is at least an order of magnitude faster than the diff tool. Ferret’s output can be analysed to reveal several characteristics of software evolution, such as: the lifecycle of a single file, the degree of change between two files, and possible regression. In addition, the similarity scores produced by Ferret can be aggregated to measure larger parts of the system being analysed.en
dc.format.extent10
dc.language.isoeng
dc.publisherIEEE
dc.relation.ispartofProcs 23rd IEEE/ACM Int Conf on Automated Software Engineering
dc.titleUsing n-grams to rapidly characterise the evolution of software codeen
dc.contributor.institutionSchool of Computer Science
dc.contributor.institutionCentre for Computer Science and Informatics Research
dc.contributor.institutionScience & Technology Research Institute
dc.identifier.urlhttp://www.scopus.com/inward/record.url?scp=58049141129&partnerID=8YFLogxK
dc.relation.schoolSchool of Computer Science
rioxxterms.versionVoR
rioxxterms.versionofrecordhttps://doi.org/10.1109/ASEW.2008.4686320
rioxxterms.typeOther
herts.preservation.rarelyaccessedtrue
herts.rights.accesstypeclosedAccess


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record