137 research outputs found

    Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation

    Full text link
    We propose a hierarchically structured reinforcement learning approach to address the challenges of planning for generating coherent multi-sentence stories for the visual storytelling task. Within our framework, the task of generating a story given a sequence of images is divided across a two-level hierarchical decoder. The high-level decoder constructs a plan by generating a semantic concept (i.e., topic) for each image in sequence. The low-level decoder generates a sentence for each image using a semantic compositional network, which effectively grounds the sentence generation conditioned on the topic. The two decoders are jointly trained end-to-end using reinforcement learning. We evaluate our model on the visual storytelling (VIST) dataset. Empirical results from both automatic and human evaluations demonstrate that the proposed hierarchically structured reinforced training achieves significantly better performance compared to a strong flat deep reinforcement learning baseline.Comment: Accepted to AAAI 201

    Investigation of increased casualties resulting from mindless rescue in a confined space with anoxic asphyxia

    Get PDF

    Improving Pneumonia Classification and Lesion Detection Using Spatial Attention Superposition and Multilayer Feature Fusion

    Get PDF
    Pneumonia is a severe inflammation of the lung that could cause serious complications. Chest X-rays (CXRs) are commonly used to make a diagnosis of pneumonia. In this paper, we propose a deep-learning-based method with spatial attention superposition (SAS) and multilayer feature fusion (MFF) to facilitate pneumonia diagnosis based on CXRs. Specifically, an SAS module, which takes advantage of the channel and spatial attention mechanisms, was designed to identify intrinsic imaging features of pneumonia-related lesions and their locations, and an MFF module was designed to harmonize disparate features from different channels and emphasize important information. These two modules were concatenated to extract critical image features serving as the basis for pneumonia diagnosis. We further embedded the proposed modules into a baseline neural network and developed a model called SAS-MFF-YOLO to diagnose pneumonia. To validate the effectiveness of our model, extensive experiments were conducted on two CXR datasets provided by the Radiological Society of North America (RSNA) and the AI Research Institute. SAS-MFF-YOLO achieved a precision of 88.1%, a recall of 98.2% for pneumonia classification and an AP50 of 99% for lesion detection on the AI Research Institute dataset. The visualization of intermediate feature maps showed that our method could facilitate uncovering pneumonia-related lesions in CXRs. Our results demonstrated that our approach could be used to enhance the performance of the overall pneumonia detection on CXR imaging

    Case report: Reoperative parathyroidectomy for large ectopic hyperplastic parathyroid in the mediastinum of a patient with recurrent secondary hyperparathyroidism

    Get PDF
    IntroductionSecondary hyperparathyroidism (SHPT) is a common complication in hemodialysis patients with chronic renal failure uremia. For severe SHPT, parathyroidectomy is effective. Owing to the variability in parathyroid anatomy, surgical parathyroidectomy can be complex and many patients experience recurrent SHPT, which may require repeated surgery. These cases pose significant challenges to surgeons.Case descriptionAn elderly woman with recurrent severe SHPT was admitted to our hospital. Preoperative methoxyisobutylisonitrile (MIBI) examination found a large ectopic parathyroid gland in the superior mediastinum, and she underwent reoperative parathyroidectomy. A large parathyroid gland in the right anterior mediastinum and another parathyroid gland in the left lingual lobe of the thymus were removed. The patient had postoperative hypocalcemia that was successfully corrected with calcium supplementation via femoral vein catheterization. During the 1-year postoperative follow-up, the patient's iPTH was well controlled and her blood calcium was within the normal range.ConclusionWe report a case of parathyroidectomy to remove multifocal ectopic hyperplastic parathyroid tissue in the mediastinum. Preoperative MIBI accurately detected the lesions. Calcium supplementation via femoral vein catheterization successfully corrected postoperative hypocalcemia. Postoperative follow-up for 1 year indicated that the surgery was successful

    KAT: A Knowledge Augmented Transformer for Vision-and-Language

    Full text link
    The primary focus of recent work with largescale transformers has been on optimizing the amount of information packed into the model's parameters. In this work, we ask a different question: Can multimodal transformers leverage explicit knowledge in their reasoning? Existing, primarily unimodal, methods have explored approaches under the paradigm of knowledge retrieval followed by answer prediction, but leave open questions about the quality and relevance of the retrieved knowledge used, and how the reasoning processes over implicit and explicit knowledge should be integrated. To address these challenges, we propose a novel model - Knowledge Augmented Transformer (KAT) - which achieves a strong state-of-the-art result (+6 points absolute) on the open-domain multimodal task of OK-VQA. Our approach integrates implicit and explicit knowledge in an end to end encoder-decoder architecture, while still jointly reasoning over both knowledge sources during answer generation. An additional benefit of explicit knowledge integration is seen in improved interpretability of model predictions in our analysis.Comment: Accepted by NAACL 202

    Triplet-radical spin entanglement: potential of molecular materials for high-temperature quantum information processing

    Get PDF
    Recently, spin-bearing molecules have been experimentally demonstrated to have great potential as building blocks for quantum information processing due to their substantial advantages including tunability, portability, and scalability. Here, we propose a theoretical model based on the theory of open quantum systems for spin dynamics in a molecule containing one radical, which can interact with the triplet state arising from another part of the molecule owing to optical excitation and intersystem crossing. With the initial state being a classical mixture of a radical 1/2-spin, the exchange interaction between the radical and the triplet produces a spin coherent state, which could potentially be used for a qubit-qutrit quantum entangling gate. Our calculations for the time-resolved electron paramagnetic resonance spectra showed good qualitative agreement with the related experimental results for radical-bearing molecules at high temperature (~77 K, the boiling point of liquid nitrogen). This work therefore lays a solid theoretical cornerstone for optically driven quantum gate operations in radical-bearing molecular materials, aiming toward high-temperature quantum information processing
    • …
    corecore