394 research outputs found

    Channel and spatial attention mechanism for fashion image captioning

    Image captioning aims to automatically generate one or more descriptive sentences for a given input image. Most existing captioning methods use an encoder-decoder model that mainly focuses on recognizing and capturing the relationships between objects appearing in the input image. However, when generating captions for fashion images, it is important not only to describe the items and their relationships, but also to mention the attribute features of the clothes (shape, texture, style, fabric, and more). In this study, a novel model is proposed for the fashion image captioning task that captures not only the items and their relationships, but also their attribute features. Two different attention mechanisms (spatial attention and channel-wise attention) are incorporated into the traditional encoder-decoder model, which dynamically interprets the caption sentence in the multi-layer feature map in addition to the depth dimension of the feature map. We evaluate our proposed architecture on Fashion-Gen using three different metrics (CIDEr, ROUGE-L, and BLEU-1), and achieve scores of 89.7, 50.6, and 45.6, respectively. Based on experiments, our proposed method shows a significant performance improvement for the task of fashion image captioning and outperforms other state-of-the-art image captioning methods.

    Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP

    In video streaming over HTTP, bitrate adaptation selects the quality of video chunks depending on the current network condition. Some previous works have applied deep reinforcement learning (DRL) algorithms to determine the chunk's bitrate from the observed states to maximize the quality of experience (QoE). However, to build an intelligent model that can predict in various environments, such as 3G, 4G, Wi-Fi, etc., the states observed from these environments must be sent to a server for centralized training. In this work, we integrate federated learning (FL) into DRL-based rate adaptation to train a model appropriate for different environments. The clients in the proposed framework train their models locally and send only the updated weights to the server. The simulations show that our federated DRL-based rate adaptations, called FDRLABR, with different DRL algorithms, such as deep Q-learning, advantage actor-critic, and proximal policy optimization, yield better performance than the traditional bitrate adaptation methods in various environments. Comment: 13 pages, 1 column
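
    The server-side step described in this abstract (clients train locally, only weights reach the server) can be illustrated with a minimal sketch. It assumes the server combines client weights with a sample-weighted average in the style of FedAvg; the abstract does not state which aggregation rule FDRLABR uses, and the names `federated_average`, `client_weights`, and `client_sizes` are hypothetical.

```python
from typing import Dict, List

# A model is represented here as a mapping from layer name to a flat list of weights.
Weights = Dict[str, List[float]]


def federated_average(client_weights: List[Weights], client_sizes: List[int]) -> Weights:
    """Combine locally trained weights into one global model.

    Assumption: a FedAvg-style mean weighted by each client's sample count.
    The listed paper may aggregate differently.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    averaged: Weights = {}
    for name in client_weights[0]:
        length = len(client_weights[0][name])
        # Weighted mean of each parameter across all clients.
        averaged[name] = [
            sum(w[name][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(length)
        ]
    return averaged


# Hypothetical clients, e.g. one trained on 3G traces and one on Wi-Fi traces.
global_model = federated_average(
    [{"layer": [1.0, 2.0]}, {"layer": [3.0, 6.0]}],
    [1, 3],
)
```

    Only the weight dictionaries cross the network in this sketch; the observed states stay on the clients, which is the property the abstract emphasizes.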

    UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering

    In recent years, artificial intelligence has played an important role in medicine and disease diagnosis, with many applications, one of which is Medical Visual Question Answering (MedVQA). By combining computer vision and natural language processing, MedVQA systems can assist experts in extracting relevant information from medical images based on a given question and in providing precise diagnostic answers. The ImageCLEFmed-MEDVQA-GI-2023 challenge included a visual question answering task in the gastrointestinal domain, which covers gastroscopy and colonoscopy images. Our team approached Task 1 of the challenge by proposing a multimodal learning method with image enhancement to improve VQA performance on gastrointestinal images. The multimodal architecture is set up with a BERT encoder and different pre-trained vision models based on convolutional neural network (CNN) and Transformer architectures for feature extraction from the question and the endoscopy image. The results of this study highlight the dominance of Transformer-based vision models over CNNs and demonstrate the effectiveness of the image enhancement process, with six out of the eight vision models achieving a better F1-Score. Our best method, which takes advantage of BERT+BEiT fusion and image enhancement, achieves up to 87.25% accuracy and 91.85% F1-Score on the development test set, while also producing good results on the private test set with an accuracy of 82.01%. Comment: ImageCLEF2023 published version: https://ceur-ws.org/Vol-3497/paper-129.pd

    Indoor PM₀.₁ and PM₂.₅ in Hanoi: Chemical characterization, source identification, and health risk assessment

    This study attempted to provide comprehensive insights into the chemical composition, source identification, and health risk assessment of indoor particulate matter (PM) in urban areas of Vietnam. Three hundred and twenty daily samples of PM₀.₁ and PM₂.₅ were collected at three different types of dwellings in Hanoi in two seasons, namely summer and winter. The samples were analyzed for 10 trace elements (TEs), namely Cr, Mn, Co, Cu, Ni, Zn, As, Cd, Sn, and Pb. The daily average concentrations of indoor PM₀.₁ and PM₂.₅ in the city were in the ranges of 7.0–8.9 μg/m³ and 43.3–106 μg/m³, respectively. The average concentrations of TEs bound to indoor PM ranged from 66.2 ng/m³ to 216 ng/m³ for PM₀.₁ and from 391 ng/m³ to 2360 ng/m³ for PM₂.₅. Principal component analysis and enrichment factors were applied to identify the possible sources of indoor PM. Results showed that indoor PM₂.₅ was mainly derived from outdoor sources, whereas indoor PM₀.₁ was derived from both indoor and outdoor sources. Domestic coal burning and industrial and traffic emissions were observed as outdoor sources, whereas household dust and indoor combustion were found as indoor sources. 80% of PM₂.₅ was deposited in the head airways, whereas 75% of PM₀.₁ was deposited in the alveolar region. Monte Carlo simulation indicated that the intake of TEs in PM₂.₅ can lead to a high carcinogenic risk for people over 60 years old and unacceptable non-carcinogenic risks for all ages at the roadside house in winter.

    Financial Inclusion and Macroeconomic Stability in Emerging and Frontier Markets

    Financial inclusion, considered a key enabler of reducing poverty and boosting prosperity in emerging and frontier markets such as Vietnam, is the process by which individuals and small businesses are provided with access to useful and affordable financial products and services. The empirical evidence in the extant literature on the contribution of financial inclusion to macroeconomic stability is mixed. This paper investigates the linkages between financial inclusion and macroeconomic stability, which have not yet been thoroughly examined in the literature, for 22 emerging and frontier economies from 2008 to 2015, with particular focus on a potential optimal level. Using the panel threshold estimation technique, the empirical findings show that financial inclusion, as approximated by the growth rate in the number of bank branches per 100,000 account holders, enhances financial stability under a certain threshold. Financial inclusion is also found to help maintain stable inflation and output growth. Policy implications are discussed on the basis of these empirical findings.

    Development of a Web application for gene association

    Frost during reproductive developmental stages, especially post head emergence frost (PHEF), can result in catastrophic yield loss for wheat producers. Breeding for improved PHEF tolerance may allow greater yield to be achieved by (i) reducing direct frost damage and (ii) facilitating earlier crop sowing to reduce the risk of late-season drought and/or heat stress. This paper provides an economic feasibility analysis of breeding options for PHEF-tolerant wheat varieties. It compares the economic benefit to growers with the cost of a wheat breeding program aimed at developing PHEF-tolerant varieties. The APSIM wheat model, with a frost-impact module and a phenology gene-based module, was employed to simulate direct and indirect yield benefits for various levels of improved frost tolerance. The economic model considers optimal profit, based on sowing date and nitrogen use, rather than maximum yield. The total estimated fixed cost of the breeding program was AUD 1293 million, including large-scale seed production to meet seed demand, with AUD 1.2 million per year to run the breeding program after advanced development and large-scale field experiments. The results reveal that PHEF-tolerant varieties would lead to a significant increase in economic benefits through a reduction in direct damage and an increase in yield through early sowing. Economic benefits to growers of up to AUD 4841 million could be realised from growing PHEF-tolerant lines if useful genetic variation can be found. Sensitivity analyses indicated that the benefits are particularly sensitive to increases in fixed costs, seed replacement, and discount rate, and to delays in variety release. However, the investment remains viable for most tested scenarios. Based on comparative economic benefits, if breeders were able to develop PHEF-tolerant varieties that could withstand cold temperatures -4 °C below the current damage threshold, there would be very little further economic value in breeding totally frost-tolerant varieties.

    VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition

    In this new era of rapid AI development, especially in language processing, the demand for AI in the legal domain is increasingly critical. While research in other languages such as English, Japanese, and Chinese is well established, we introduce the first fundamental research for the Vietnamese language in the legal domain: legal textual entailment recognition, organized through the Vietnamese Language and Speech Processing workshop. In analyzing participants' results, we discuss certain linguistic aspects critical in the legal domain that pose challenges still to be addressed.

    Measure representation and multifractal analysis of complete genomes

    This paper introduces the notion of measure representation of DNA sequences. Spectral analysis and multifractal analysis are then performed on the measure representations of a large number of complete genomes. The main aim of this paper is to discuss the multifractal property of the measure representation and the classification of bacteria. From the measure representations and the values of the $D_q$ spectra and related $C_q$ curves, it is concluded that these complete genomes are not random sequences. In fact, the spectral analyses performed indicate that these measure representations, considered as time series, exhibit strong long-range correlation. For substrings with length K=8, the $D_q$ spectra of all organisms studied are multifractal-like and sufficiently smooth for the $C_q$ curves to be meaningful. The $C_q$ curves of all bacteria resemble a classical phase transition at a critical point. But the 'analogous' phase transitions of chromosomes of non-bacterial organisms are different. Apart from Chromosome 1 of C. elegans, they exhibit the shape of a double-peaked specific heat function. Comment: 12 pages with 9 figures and 1 table
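
    As a rough illustration of what a measure representation of a DNA sequence can look like, the sketch below counts every substring of length K and normalizes the counts into a probability measure over the 4^K possible substrings. The abstract does not give the construction, so the details here are assumptions: the base-4 ordering a, c, g, t, the skipping of substrings with other symbols, and the function name `measure_representation` are all hypothetical.

```python
from collections import Counter
from typing import List

# Assumed ordering of the nucleotides; each one acts as a base-4 digit.
ALPHABET = "acgt"


def measure_representation(sequence: str, k: int) -> List[float]:
    """Return the relative frequency of each length-k substring.

    The result has 4**k entries, indexed by reading the substring as a
    base-4 number, so entry 0 is 'aa...a' and the last entry is 'tt...t'.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    digit = {base: i for i, base in enumerate(ALPHABET)}
    seq = sequence.lower()

    counts: Counter = Counter()
    for start in range(len(seq) - k + 1):
        word = seq[start:start + k]
        # Skip windows containing symbols outside a, c, g, t (e.g. 'n').
        if all(ch in digit for ch in word):
            index = 0
            for ch in word:
                index = index * 4 + digit[ch]
            counts[index] += 1

    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no valid substring of length k")
    return [counts[i] / total for i in range(4 ** k)]


# Toy input; the abstract uses K=8 on complete genomes.
measure = measure_representation("aaca", 2)
```

    The resulting list can be read as a time series over substring indices, which is the form on which spectral and multifractal quantities such as $D_q$ would then be computed.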

    The three-way relationship of polymorphisms of porcine genes encoding terminal complement components, their differential expression, and health-related phenotypes

    Background: The complement system is an evolutionarily ancient mechanism that plays an essential role in innate immunity and contributes to the acquired immune response. Three modes of activation, known as the classical, alternative, and lectin pathways, lead to the initiation of a common terminal lytic pathway. The terminal complement components (TCCs: C6, C7, C8A, C8B, and C9) are encoded by the genes C6, C7, C8A, C8B, C8G, and C9. We aimed to experimentally test the porcine genes encoding TCCs as candidate genes for immune competence and disease resistance by addressing the three-way relationship of genotype, health-related phenotype, and mRNA expression. Results: Comparative sequencing of cDNAs of animals of the breeds German Landrace, Piétrain, Hampshire, Duroc, Vietnamese Potbelly Pig, and Berlin Miniature Pig (BMP) revealed 30 SNPs (21 in protein domains, 12 with AA exchange). The promoter regions (each ~1.5 kb upstream of the transcription start sites) of C6, C7, C8A, C8G, and C9 exhibited 29 SNPs. Significant effects of the TCC-encoding genes on hemolytic complement activity were shown in a cross of Duroc and BMP after vaccination against Mycoplasma hyopneumoniae, Aujeszky disease virus, and PRRSV by analysis of variance using repeated-measures mixed models. Family-based association tests (FBAT) confirmed the associations. The promoter SNPs were associated with the relative abundance of TCC transcripts obtained by real-time RT-PCR of 311 liver samples of commercial slaughter pigs. Complement gene expression showed a significant relationship with the prevalence of acute and chronic lung lesions. Conclusions: The analyses point to considerable variation of the porcine TCC genes and promote the genes as candidate genes for disease resistance.