We propose a novel method for scoring the accuracy of\ud protein binding site predictions – the Binding-site Distance Test\ud (BDT) score. Recently, the Matthews Correlation Coefficient (MCC)\ud has been used to evaluate binding site predictions, both by developers\ud of new methods and by the assessors for the community wide\ud prediction experiment – CASP8. Whilst being a rigorous scoring\ud method, the MCC does not take into account the actual 3D location\ud of the predicted residues from the observed binding site. Thus, an\ud incorrectly predicted site that is nevertheless close to the observed\ud binding site will obtain an identical score to the same number of nonbinding\ud residues predicted at random. The MCC is somewhat affected\ud by the subjectivity of determining observed binding residues\ud and the ambiguity of choosing distance cutoffs. By contrast the BDT\ud method produces continuous scores ranging between 0 and 1, relating\ud to the distance between the predicted and observed residues.\ud Residues predicted close to the binding site will score higher than\ud those more distant, providing a better reflection of the true accuracy\ud of predictions. The CASP8 function predictions were evaluated using\ud both the MCC and BDT methods and the scores were compared.\ud The BDT was found to strongly correlate with the MCC\ud scores whilst also being less susceptible to the subjectivity of defining\ud binding residues. We therefore suggest that this new simple\ud score is a potentially more robust method for future evaluations of\ud protein-ligand binding site predictions
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.