107 research outputs found

    Text-Only Image Captioning with Multi-Context Data Generation

    Full text link
    Text-only Image Captioning (TIC) is an approach that aims to construct a model solely based on text that can accurately describe images. Recently, diffusion models have demonstrated remarkable capabilities in generating high-quality images that are semantically coherent with given texts. This presents an opportunity to generate synthetic training images for TIC. However, we have identified a challenge that the images generated from simple descriptions typically exhibit a single perspective with one or limited contexts, which is not aligned with the complexity of real-world scenes in the image domain. In this paper, we propose a novel framework that addresses this issue by introducing multi-context data generation. Starting with an initial text corpus, our framework employs a large language model to select multiple sentences that describe the same scene from various perspectives. These sentences are then summarized into a single sentence with multiple contexts. We generate simple images using the straightforward sentences and complex images using the summarized sentences through diffusion models. Finally, we train the model exclusively using the synthetic image-text pairs obtained from this process. Experimental results demonstrate that our proposed framework effectively tackles the central challenge we have identified, achieving the state-of-the-art performance on popular datasets such as MSCOCO, Flickr30k, and SS1M

    ARTâ‹…\boldsymbol{\cdot}V: Auto-Regressive Text-to-Video Generation with Diffusion Models

    Full text link
    We present ARTâ‹…\boldsymbol{\cdot}V, an efficient framework for auto-regressive video generation with diffusion models. Unlike existing methods that generate entire videos in one-shot, ARTâ‹…\boldsymbol{\cdot}V generates a single frame at a time, conditioned on the previous ones. The framework offers three distinct advantages. First, it only learns simple continual motions between adjacent frames, therefore avoiding modeling complex long-range motions that require huge training data. Second, it preserves the high-fidelity generation ability of the pre-trained image diffusion models by making only minimal network modifications. Third, it can generate arbitrarily long videos conditioned on a variety of prompts such as text, image or their combinations, making it highly versatile and flexible. To combat the common drifting issue in AR models, we propose masked diffusion model which implicitly learns which information can be drawn from reference images rather than network predictions, in order to reduce the risk of generating inconsistent appearances that cause drifting. Moreover, we further enhance generation coherence by conditioning it on the initial frame, which typically contains minimal noise. This is particularly useful for long video generation. When trained for only two weeks on four GPUs, ARTâ‹…\boldsymbol{\cdot}V already can generate videos with natural motions, rich details and a high level of aesthetic quality. Besides, it enables various appealing applications, e.g., composing a long video from multiple text prompts.Comment: 24 pages, 21 figures. Project page at https://warranweng.github.io/art.

    Needs and views on healthy lifestyles for the prevention of dementia and the potential role for mobile health (mHealth) interventions in China: A qualitative study

    Get PDF
    Objectives Over the coming decades, China is expected to face the largest worldwide increase in dementia incidence. Mobile health (mHealth) may improve the accessibility of dementia prevention strategies, targeting lifestyle-related risk factors. Our aim is to explore the needs and views of Chinese older adults regarding healthy lifestyles to prevent cardiovascular disease (CVD) and dementia through mHealth, supporting the Prevention of Dementia using Mobile Phone Applications (PRODEMOS) study. Design Qualitative semi-structured interview study, using thematic analysis. Setting Primary and secondary care in Beijing and Tai’an, China. Participants Older adults aged 55 and over without dementia with an increased dementia risk, possessing a smartphone. Participants were recruited through seven hospitals participating in the PRODEMOS study, purposively sampled on age, sex, living area and history of CVD and diabetes. Results We performed 26 interviews with participants aged 55–86 years. Three main themes were identified: valuing a healthy lifestyle, sociocultural expectations and need for guidance. First, following a healthy lifestyle was generally deemed important. In addition to generic healthy behaviours, participants regarded certain specific Chinese lifestyle practices as important to prevent disease. Second, the sociocultural context played a crucial role, as an important motive to avoid disease was to limit the care burden put on family members. However, time-consuming family obligations and other social values could also impede healthy behaviours such as regular physical activity. Finally, there seemed to be a need for reliable and personalised lifestyle advice and for guidance from a health professional. Conclusions The Chinese older adults included in this study highly value a healthy lifestyle. They express a need for personalised lifestyle support in order to adopt healthy behaviours. Potentially, the PRODEMOS mHealth intervention can meet these needs through blended lifestyle support to improve risk factors for dementia and CVD

    Dementia with Lewy bodies research consortia: A global perspective from the ISTAART Lewy Body Dementias Professional Interest Area working group

    Get PDF
    Dementia with Lewy bodies (DLB) research has seen a significant growth in international collaboration over the last three decades. However, researchers face a challenge in identifying large and diverse samples capable of powering longitudinal studies and clinical trials. The DLB research community has begun to focus efforts on supporting the development and harmonization of consortia, while also continuing to forge networks within which data and findings can be shared. This article describes the current state of DLB research collaborations on each continent. We discuss several established DLB cohorts, many of whom have adopted a common framework, and identify emerging collaborative initiatives that hold the potential to expand DLB networks and diversify research cohorts. Our findings identify geographical areas into which the global DLB networks should seek to expand, and we propose strategies, such as the creation of data-sharing platforms and the harmonization of protocols, which may further potentiate international collaboration.publishedVersio

    Prevention of dementia using mobile phone applications (PRODEMOS): protocol for an international randomised controlled trial.

    Get PDF
    IntroductionProfiles of high risk for future dementia are well understood and are likely to concern mostly those in low-income and middle-income countries and people at greater disadvantage in high-income countries. Approximately 30%-40% of dementia cases have been estimated to be attributed to modifiable risk factors, including hypertension, smoking and sedentary lifestyle. Tailored interventions targeting these risk factors can potentially prevent or delay the onset of dementia. Mobile health (mHealth) improves accessibility of such prevention strategies in hard-to-reach populations while at the same time tailoring such approaches. In the current study, we will investigate the effectiveness and implementation of a coach-supported mHealth intervention, targeting dementia risk factors, to reduce dementia risk.Methods and analysisThe prevention of dementia using mobile phone applications (PRODEMOS) randomised controlled trial will follow an effectiveness-implementation hybrid design, taking place in the UK and China. People are eligible if they are 55-75 years old, of low socioeconomic status (UK) or from the general population (China); have ≥2 dementia risk factors; and own a smartphone. 2400 participants will be randomised to either a coach-supported, interactive mHealth platform, facilitating self-management of dementia risk factors, or a static control platform. The intervention and follow-up period will be 18 months. The primary effectiveness outcome is change in the previously validated Cardiovascular Risk Factors, Ageing and Incidence of Dementia dementia risk score. The main secondary outcomes include improvement of individual risk factors and cost-effectiveness. Implementation outcomes include acceptability, adoption, feasibility and sustainability of the intervention.Ethics and disseminationThe PRODEMOS trial is sponsored in the UK by the University of Cambridge and is granted ethical approval by the London-Brighton and Sussex Research Ethics Committee (reference: 20/LO/01440). In China, the trial is approved by the medical ethics committees of Capital Medical University, Beijing Tiantan Hospital, Beijing Geriatric Hospital, Chinese People's Liberation Army General Hospital, Taishan Medical University and Xuanwu Hospital. Results will be published in a peer-reviewed journal.Trial registration numberISRCTN15986016

    Robust estimation of bacterial cell count from optical density

    Get PDF
    Optical density (OD) is widely used to estimate the density of cells in liquid culture, but cannot be compared between instruments without a standardized calibration protocol and is challenging to relate to actual cell count. We address this with an interlaboratory study comparing three simple, low-cost, and highly accessible OD calibration protocols across 244 laboratories, applied to eight strains of constitutive GFP-expressing E. coli. Based on our results, we recommend calibrating OD to estimated cell count using serial dilution of silica microspheres, which produces highly precise calibration (95.5% of residuals <1.2-fold), is easily assessed for quality control, also assesses instrument effective linear range, and can be combined with fluorescence calibration to obtain units of Molecules of Equivalent Fluorescein (MEFL) per cell, allowing direct comparison and data fusion with flow cytometry measurements: in our study, fluorescence per cell measurements showed only a 1.07-fold mean difference between plate reader and flow cytometry data

    A Multi-Classification Method Based on Optimized Binary Tree Mahalanobis-Taguchi System for Imbalanced Data

    No full text
    Data imbalance is a common problem in classification tasks. The Mahalanobis-Taguchi system (MTS) has proven to be promising due to its lack of requirements for data distribution. The MTS is a binary classifier. However, multi-classification problems are more common in real life and the diversity of categories may further aggravate the difficulty of classifying imbalanced data. Imbalanced multi-classification has become an important research topic. To improve the performance of MTS in imbalanced multi-classification, we propose an algorithm called optimized binary tree MTS (Optimized BT-MTS). Mahalanobis space (MS) construction, feature selection, and threshold determination are incorporated in a unified classification framework, and joint optimization is carried out according to the principles of maximizing separability, signal-to-noise ratio, dimensionality reduction, and minimizing misclassification cost. Experimental results on several datasets show that the method can significantly reduce the overall misclassification cost and improve the performance of imbalanced data multi-classification
    • …
    corecore