6 research outputs found

    Interobserver reliability of classification and characterization of proximal humeral fractures: a comparison of two and three-dimensional CT

    Full text link
    Interobserver reliability for the classification of proximal humeral fractures is limited. The aim of this study was to test the null hypothesis that interobserver reliability of the AO classification of proximal humeral fractures, the preferred treatment, and fracture characteristics is the same for two-dimensional (2-D) and three-dimensional (3-D) computed tomography (CT). Members of the Science of Variation Group--fully trained practicing orthopaedic and trauma surgeons from around the world--were randomized to evaluate radiographs and either 2-D CT or 3-D CT images of fifteen proximal humeral fractures via a web-based survey and respond to the following four questions: (1) Is the greater tuberosity displaced? (2) Is the humeral head split? (3) Is the arterial supply compromised? (4) Is the glenohumeral joint dislocated? They also classified the fracture according to the AO system and indicated their preferred treatment of the fracture (operative or nonoperative). Agreement among observers was assessed with use of the multirater kappa (κ) measure. Interobserver reliability of the AO classification, fracture characteristics, and preferred treatment generally ranged from "slight" to "fair." A few small but statistically significant differences were found. Observers randomized to the 2-D CT group had slightly but significantly better agreement on displacement of the greater tuberosity (κ = 0.35 compared with 0.30, p < 0.001) and on the AO classification (κ = 0.18 compared with 0.17, p = 0.018). A subgroup analysis of the AO classification results revealed that shoulder and elbow surgeons, orthopaedic trauma surgeons, and surgeons in the United States had slightly greater reliability on 2-D CT, whereas surgeons in practice for ten years or less and surgeons from other subspecialties had slightly greater reliability on 3-D CT. Proximal humeral fracture classifications may be helpful conceptually, but they have poor interobserver reliability even when 3-D rather than 2-D CT is utilized. This may contribute to the similarly poor interobserver reliability that was observed for selection of the treatment for proximal humeral fractures. The lack of a reliable classification confounds efforts to compare the outcomes of treatment methods among different clinical trials and reports

    Scapula fractures: interobserver reliability of classification and treatment

    Full text link
    OBJECTIVES:There is substantial variation in the classification and the management of scapula fractures. The first purpose of this study was to analyze the interobserver reliability of the OTA/AO and the New International Classification of scapula fractures. The second purpose was to assess the proportion of agreement among orthopaedic surgeons on operative or nonoperative treatment. DESIGN:: Web-based reliability study SETTING:: Independent orthopaedic surgeons from several countries were invited to classify scapular fractures in an online survey. PARTICIPANTS:One-hundred and three orthopaedic surgeons evaluated 35 movies of 3DCT-reconstruction of selected scapular fractures, representing a full spectrum of fracture patterns. MAIN OUTCOME MEASUREMENTS:Fleiss' kappa (κ) was used to assess the reliability of agreement between the surgeons. RESULTS:: The overall agreement on the OTA/AO Classification was moderate for the types (A, B, and C, κ = 0.54) with a 71% proportion of rater agreement (PA) as well as for the nine groups (A1 to C3, κ = 0.47) with a 57% PA. For the New International Classification, the agreement about the intra-articular extension of the fracture (Fossa (F), κ = 0.79) was substantial, the agreement about a fractured body (Body (B), κ = 0.57) or process was moderate (Process (P), κ = 0.53), however PAs were more than 81%. The agreement on the treatment recommendation was moderate (κ = 0.57) with a 73% PA. CONCLUSIONS:The New International Classification was more reliable. Body and process fractures generated more disagreement than intra-articular fractures and need further clear definitions
    corecore