714 research outputs found

    AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

    In this paper, we propose an Attentional Generative Adversarial Network (AttnGAN) that allows attention-driven, multi-stage refinement for fine-grained text-to-image generation. With a novel attentional generative network, the AttnGAN can synthesize fine-grained details at different subregions of the image by paying attention to the relevant words in the natural language description. In addition, a deep attentional multimodal similarity model is proposed to compute a fine-grained image-text matching loss for training the generator. The proposed AttnGAN significantly outperforms the previous state of the art, boosting the best reported inception score by 14.14% on the CUB dataset and 170.25% on the more challenging COCO dataset. A detailed analysis is also performed by visualizing the attention layers of the AttnGAN. It shows for the first time that the layered attentional GAN is able to automatically select the condition at the word level for generating different parts of the image.

    Token Imbalance Adaptation for Radiology Report Generation

    Imbalanced token distributions naturally exist in text documents, leading neural language models to overfit on frequent tokens. The token imbalance may dampen the robustness of radiology report generators, as complex medical terms appear less frequently but carry more medical information. In this study, we demonstrate how current state-of-the-art models fail to generate infrequent tokens on two standard benchmark datasets (IU X-RAY and MIMIC-CXR) for radiology report generation. However, no prior study has proposed methods to adapt infrequent tokens for text generators fed with medical images. To address this challenge, we propose the Token Imbalance Adapter (TIMER), which aims to improve generation robustness on infrequent tokens. The model automatically leverages token imbalance through an unlikelihood loss and dynamically optimizes the generation process to augment infrequent tokens. We compare our approach with multiple state-of-the-art methods on the two benchmarks. Experiments demonstrate the effectiveness of our approach in enhancing model robustness both overall and on infrequent tokens. Our ablation analysis shows that our reinforcement learning method has a major effect in adapting to token imbalance for radiology report generation. Comment: Accepted by CHIL202

    Enriching Unsupervised User Embedding via Medical Concepts

    Clinical notes in Electronic Health Records (EHR) contain rich documented patient information that can be used to infer phenotypes for disease diagnosis and to study patient characteristics for cohort selection. Unsupervised user embedding aims to encode patients into fixed-length vectors without human supervision. Medical concepts extracted from the clinical notes contain rich connections between patients and their clinical categories. However, existing unsupervised approaches to user embedding from clinical notes do not explicitly incorporate medical concepts. In this study, we propose a concept-aware unsupervised user embedding that jointly leverages text documents and medical concepts from two clinical corpora, MIMIC-III and Diabetes. We evaluate user embeddings on both extrinsic and intrinsic tasks, including phenotype classification, in-hospital mortality prediction, patient retrieval, and patient relatedness. Experiments on the two clinical corpora show that our approach exceeds unsupervised baselines and that incorporating medical concepts can significantly improve the baseline performance. Comment: accepted at ACM CHIL 2022. a revision for section reforma

    Research on Safety Investment Decision Evaluation and Optimization of Network Booking Taxi Platform Enterprise based on Subjective-Objective Assessment Method

    This study addresses the current imbalance between investment in and return on safe operation at Network Booking Taxi Platform Enterprises (NBTPE). It selects representative NBTPEs in the domestic travel sector and applies the System Dynamics method to analyze the impact of internal and external safety inputs, from which it derives a graph of the patterns governing safety input. Combining subjective weighting, represented by the analytic hierarchy process, with objective weighting, represented by the entropy weight method, the study proposes a comprehensive Subjective-Objective Assessment Method for determining a reasonable proportion for each safety input cost, and evaluates the feasibility and reasonableness of the method using linear regularization. The study concludes that, within safety investment, enterprises need to increase investment in equipment and facilities. It also measures the proportion of investment in different links and makes suggestions for optimizing the current proportions of safety investment in NBTPE. This study provides support for optimizing the safety investment ratio of platform companies and improving the efficiency of safety management.