What accuracy statistics really measure
aut.researcher | MacDonell, Stephen Gerard | |
dc.contributor.author | Kitchenham, B | |
dc.contributor.author | Pickard, L | |
dc.contributor.author | MacDonell, SG | |
dc.contributor.author | Shepperd, MJ | |
dc.date.accessioned | 2011-10-01T06:45:42Z | |
dc.date.available | 2011-10-01T06:45:42Z | |
dc.date.copyright | 2001-06-01 | |
dc.date.issued | 2001-06-01 | |
dc.description.abstract | Provides the software estimation research community with a better understanding of the meaning of, and relationship between, two statistics that are often used to assess the accuracy of predictive models: the mean magnitude relative error (MMRE) and the number of predictions within 25% of the actual, pred(25). It is demonstrated that MMRE and pred(25) are, respectively, measures of the spread and the kurtosis of the variable z, where z=estimate/actual. Thus, z is considered to be a measure of accuracy, and statistics such as MMRE and pred(25) to be measures of properties of the distribution of z. It is suggested that measures of the central location and skewness of z, as well as measures of spread and kurtosis, are necessary. Furthermore, since the distribution of z is non-normal, non-parametric measures of these properties may be needed. For this reason, box-plots of z are useful alternatives to simple summary metrics. It is also noted that the simple residuals are better behaved than the z variable, and could also be used as the basis for comparing prediction systems | |
dc.identifier.citation | IEE Proceedings: Software, vol.148(3), pp.81-85 | |
dc.identifier.doi | 10.1049/ip-sen:20010506 | |
dc.identifier.issn | 1462-5970 | |
dc.identifier.uri | https://hdl.handle.net/10292/2179 | |
dc.publisher | IEEE | |
dc.relation.uri | http://dx.doi.org/10.1049/ip-sen:20010506 | |
dc.rights | Copyright © 2001 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
dc.rights.accessrights | OpenAccess | |
dc.subject | Accuracy measures | |
dc.subject | Accuracy statistics | |
dc.subject | Box-plots | |
dc.subject | Central location measure | |
dc.subject | Kurtosis measure | |
dc.subject | Mean magnitude relative error | |
dc.subject | Nonnormal distribution | |
dc.subject | Nonparametric measures | |
dc.subject | Prediction number | |
dc.subject | Prediction systems comparison | |
dc.subject | Predictive models | |
dc.subject | Residuals | |
dc.subject | Skewness measure | |
dc.subject | Software estimation | |
dc.subject | Spread measure | |
dc.subject | Statistical distribution properties | |
dc.subject | Summary metrics | |
dc.title | What accuracy statistics really measure | |
dc.type | Journal Article | |
pubs.organisational-data | /AUT | |
pubs.organisational-data | /AUT/Design & Creative Technologies | |
pubs.organisational-data | /AUT/PBRF Researchers | |
pubs.organisational-data | /AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers | |
pubs.organisational-data | /AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers/DCT C & M Computing |