1,960 research outputs found

    Structural Features for Predicting the Linguistic Quality of Text: Applications to Machine Translation, Automatic Summarization and Human-Authored Text

    Sentence structure is considered an important component of the overall linguistic quality of text, yet few empirical studies have sought to characterize how and to what extent structural features determine fluency and linguistic quality. We report the results of experiments on the predictive power of syntactic phrasing statistics and other structural features for these aspects of text. Manual assessments of sentence fluency for machine translation evaluation and of text quality for summarization evaluation are used as the gold standard. We find that many structural features related to phrase length are weakly but significantly correlated with fluency, and that classifiers based on the entire suite of structural features can achieve high accuracy in pairwise comparison of sentence fluency and in distinguishing machine translations from human translations. We also test the hypothesis that the learned models capture general fluency properties applicable to human-authored text. The results from our experiments do not support the hypothesis. At the same time, structural features and models based on them prove to be robust for automatic evaluation of the linguistic quality of multi-document summaries.

    Predicting the Fluency of Text with Shallow Structural Features: Case Studies of Machine Translation and Human-Written Text

    Sentence fluency is an important component of overall text readability, but few studies in natural language processing have sought to understand the factors that define it. We report the results of an initial study into the predictive power of surface syntactic statistics for the task; we use fluency assessments done for the purpose of evaluating machine translation. We find that these features are weakly but significantly correlated with fluency. Machine and human translations can be distinguished with accuracy over 80%. The performance of pairwise comparison of fluency is also very high: over 90% for a multi-layer perceptron classifier. We also test the hypothesis that the learned models capture general fluency properties applicable to human-written text. The results do not support this hypothesis: prediction accuracy on the new data is only 57%. This finding suggests that developing a dedicated, task-independent corpus of fluency judgments will be beneficial for further investigations of the problem.

    Deep Learning Model for Predicting Difficult Laryngoscopy in Thyroid Surgery Patients Based on Cervical Spine Lateral X-ray Images

    Thesis (Master's) -- Seoul National University Graduate School: College of Medicine, Department of Medicine, 2022. 8. 정철우.
    An unanticipated difficult laryngoscopy is associated with serious airway-related complications. We developed and validated a deep learning-based model that predicts a difficult laryngoscopy (Cormack–Lehane grade 3–4) from a cervical spine lateral X-ray, using retrospectively collected data from 14,135 patients undergoing thyroid surgery. The performance of our model was compared with six representative deep learning architectures. A class activation map was created to elucidate the anatomical features associated with difficult laryngoscopy. Our model showed 95.6% sensitivity and 91.2% specificity for predicting difficult laryngoscopy. The area under the receiver operating characteristic curve of our model was 0.972 (0.955–0.988), which was higher than that of the other models (VGG-Net: 0.842, ResNet: 0.841, Xception: 0.863, ResNext: 0.825, DenseNet: 0.889, and SENet: 0.875; all P < 0.001). The class activation map demonstrated clear differences around the hyoid bone, pharynx, and cervical spine. The model showed excellent performance for predicting difficult laryngoscopy using a cervical spine lateral X-ray image.
    Contents: 1. Introduction; 2. Materials and Methods (2.1 Inclusion and Exclusion Criteria; 2.2 Anesthesia Management; 2.3 Data Collection and Preprocessing; 2.4 Model Building; 2.5 Model Validation; 2.6 Sensitivity Analysis; 2.7 Statistical Analysis); 3. Results (3.1 Dataset Construction; 3.2 Performance of the Models; 3.3 Sensitivity Analysis); 4. Discussion; 5. Conclusions; References; Abstract; Tables 1-3; Figures 1-5; Supplementary Materials.
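The sensitivity and specificity figures quoted in this abstract are ratios taken from a confusion matrix. The following is a minimal sketch of that arithmetic, not the thesis's evaluation code; the function name and the toy labels are hypothetical and chosen only for illustration.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = difficult, 0 = not)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    # Sensitivity: share of true positives among all actual positives.
    # Specificity: share of true negatives among all actual negatives.
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data (not from the thesis): 4 positives, 6 negatives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

Reporting both values matters when difficult cases are rare, because overall accuracy alone can look high even if most positives are missed.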

    Improving the in silico assessment of pathogenicity for compensated variants

    Understanding the functional sequelae of amino-acid replacements is of fundamental importance in medical genetics. Perhaps the most intuitive way to assess the potential pathogenicity of a given human missense variant is to measure the degree of evolutionary conservation of the substituted amino-acid residue, a feature that generally serves as a good proxy metric for the functional or structural importance of that residue. However, the presence of putatively compensated variants as the wild-type alleles in orthologous proteins of other mammalian species not only challenges this classical view of amino-acid essentiality but also precludes the accurate evaluation of the functional impact of this type of missense variant using currently available bioinformatic prediction tools. Compensated variants constitute at least 4% of all known missense variants causing human inherited disease and hence represent an important potential source of error, in that they are likely to be disproportionately misclassified as benign variants. The consequent under-reporting of compensated variants is exacerbated in the context of next-generation sequencing, where their inappropriate exclusion is an unfortunate natural consequence of the filtering and prioritization of the very large number of variants generated. Here we demonstrate the reduced performance of currently available pathogenicity prediction tools when applied to compensated variants and propose an alternative machine-learning approach to assess likely pathogenicity for this particular type of variant.

    Concurrent Validity of the Child Behavior Checklist DSM-Oriented Scales: Correspondence with DSM Diagnoses and Comparison to Syndrome Scales

    This study used receiver operating characteristic (ROC) methodology and discriminative analyses to examine the correspondence of the Child Behavior Checklist (CBCL) rationally-derived DSM-oriented scales and empirically-derived syndrome scales with clinical diagnoses in a clinic-referred sample of children and adolescents (N = 476). Although results demonstrated that the CBCL Anxiety, Affective, Attention Deficit/Hyperactivity, Oppositional and Conduct Problems DSM-oriented scales corresponded significantly with related clinical diagnoses derived from parent-based structured interviews, these DSM-oriented scales did not evidence significantly greater correspondence with clinical diagnoses than the syndrome scales in all cases but one. The DSM-oriented Anxiety Problems scale was the only scale that evidenced significantly greater correspondence with diagnoses than its syndrome scale counterpart, the Anxious/Depressed scale. The recently developed and rationally-derived DSM-oriented scales thus generally do not add incremental clinical utility above that already afforded by the syndrome scales with respect to corresponding with diagnoses. Implications of these findings are discussed.
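ROC methodology summarizes how well a scale score separates diagnosed from non-diagnosed cases by the area under the ROC curve. That area equals the probability that a randomly chosen positive case scores higher than a randomly chosen negative one, with ties counted as half. The sketch below illustrates that definition only; the function name and the toy scores are hypothetical and are not data from the study.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count as 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical scale scores: first three cases diagnosed, last three not.
scores = [70, 65, 58, 62, 55, 50]
labels = [1, 1, 1, 0, 0, 0]
auc = roc_auc(scores, labels)
print(f"AUC = {auc:.3f}")
```

Comparing two scales then amounts to comparing their AUCs on the same sample; a formal significance test for that difference is a separate step not shown here.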