Show simple item record

dc.contributor.authorLane, Peter
dc.contributor.authorClarke, Daoud
dc.contributor.authorHender, Paul
dc.date.accessioned2012-09-18T12:01:00Z
dc.date.available2012-09-18T12:01:00Z
dc.date.issued2012
dc.identifier.citationLane , P , Clarke , D & Hender , P 2012 , ' On developing robust models for favourability analysis : Model choice, feature sets and imbalanced data ' , Decision Support Systems , vol. 53 , no. 4 , pp. 712-718 . https://doi.org/10.1016/j.dss.2012.05.028
dc.identifier.urihttp://hdl.handle.net/2299/9017
dc.description.abstractLocating documents carrying positive or negative favourability is an important application within media analysis. This article presents some empirical results on the challenges facing a machine-learning approach to this kind of opinion mining. Some of the challenges include the often considerable imbalance in the distribution of positive and negative samples, changes in the documents over time, and effective training and evaluation procedures for the models. This article presents results on three data sets generated by a media-analysis company, classifying documents in two ways: detecting the presence of favourability, and assessing negative vs. positive favourability. We describe our experiments in developing a machine-learning approach to automate the classification process. We explore the effect of using five different types of features, the robustness of the models when tested on data taken from a later time period, and the effect of balancing the input data by undersampling. We find varying choices for the optimum classifier, feature set and training strategy depending on the task and data set.en
dc.format.extent175113
dc.language.isoeng
dc.relation.ispartofDecision Support Systems
dc.titleOn developing robust models for favourability analysis : Model choice, feature sets and imbalanced dataen
dc.contributor.institutionSchool of Computer Science
dc.contributor.institutionScience & Technology Research Institute
dc.contributor.institutionCentre for Computer Science and Informatics Research
dc.description.statusPeer reviewed
rioxxterms.versionofrecord10.1016/j.dss.2012.05.028
rioxxterms.typeJournal Article/Review
herts.preservation.rarelyaccessedtrue


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record