394 research outputs found
Channel and spatial attention mechanism for fashion image captioning
Image captioning aims to automatically generate one or more descriptive sentences for a given input image. Most existing captioning methods use an encoder-decoder model that mainly focuses on recognizing the objects in the input image and capturing the relationships between them. However, when generating captions for fashion images, it is important not only to describe the items and their relationships but also to mention the attributes of the clothes (shape, texture, style, fabric, and more). In this study, a novel model is proposed for the fashion image captioning task that captures not only the items and their relationships but also their attribute features. Two attention mechanisms (spatial attention and channel-wise attention) are incorporated into the traditional encoder-decoder model, which dynamically interprets the caption sentence across the multi-layer feature maps in addition to the depth dimension of the feature map. We evaluate our proposed architecture on Fashion-Gen using three metrics (CIDEr, ROUGE-L, and BLEU-1) and achieve scores of 89.7, 50.6, and 45.6, respectively. In our experiments, the proposed method shows a significant performance improvement on fashion image captioning and outperforms other state-of-the-art image captioning methods.
Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP
In video streaming over HTTP, the bitrate adaptation selects the quality of
video chunks depending on the current network condition. Some previous works
have applied deep reinforcement learning (DRL) algorithms to determine the
chunk's bitrate from the observed states to maximize the quality-of-experience
(QoE). However, to build an intelligent model that can predict in various
environments, such as 3G, 4G, Wi-Fi, etc., the states observed from
these environments must be sent to a server for training centrally. In this
work, we integrate federated learning (FL) into DRL-based rate adaptation to
train a model appropriate for different environments. The clients in the
proposed framework train their models locally and send only the weight updates
to the server. The simulations show that our federated DRL-based rate adaptations,
called FDRLABR with different DRL algorithms, such as deep Q-learning,
advantage actor-critic, and proximal policy optimization, yield better
performance than the traditional bitrate adaptation methods in various
environments. Comment: 13 pages, 1 column
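The weight-only exchange described in this abstract can be illustrated with a minimal sketch. The abstract does not say how the server combines the client weights, so the sample-weighted averaging below (in the style of federated averaging) is an assumption, and the function name, the `"layer"` key, and the client values are hypothetical.

```python
from typing import Dict, List

# A model is represented here as named lists of floats (an assumption made
# for illustration; a real DRL agent would hold tensors per layer).
Weights = Dict[str, List[float]]


def federated_average(client_weights: List[Weights], client_sizes: List[int]) -> Weights:
    """Combine locally trained weights into one global model.

    Each client's contribution is weighted by its number of local samples.
    Only weights reach the server; the observed states never leave the client.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    averaged: Weights = {}
    for name in client_weights[0]:
        length = len(client_weights[0][name])
        averaged[name] = [
            sum(w[name][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(length)
        ]
    return averaged


# Hypothetical clients, e.g. one per network environment (3G, 4G, Wi-Fi).
clients = [
    {"layer": [1.0, 2.0]},
    {"layer": [3.0, 4.0]},
    {"layer": [5.0, 6.0]},
]
sizes = [1, 1, 2]  # hypothetical local sample counts
global_weights = federated_average(clients, sizes)
```

In a full training loop the server would send `global_weights` back to the clients, which would continue local training from it in the next round.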
UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering
In recent years, artificial intelligence has played an important role in
medicine and disease diagnosis. One of its many applications is Medical
Visual Question Answering (MedVQA). By combining computer
vision and natural language processing, MedVQA systems can assist experts in
extracting relevant information from medical images based on a given question
and providing precise diagnostic answers. The ImageCLEFmed-MEDVQA-GI-2023
challenge posed a visual question answering task in the gastrointestinal
domain, which includes gastroscopy and colonoscopy images. Our team approached
Task 1 of the challenge by proposing a multimodal learning method with image
enhancement to improve VQA performance on gastrointestinal images. The
multimodal architecture combines a BERT encoder with different pre-trained
vision models based on convolutional neural network (CNN) and Transformer
architectures to extract features from the question and the endoscopy image. The
results of this study highlight the dominance of Transformer-based vision
models over CNNs and demonstrate the effectiveness of the image
enhancement process, with six of the eight vision models achieving a better
F1-Score. Our best method, which takes advantage of BERT+BEiT fusion and image
enhancement, achieves up to 87.25% accuracy and 91.85% F1-Score on the
development test set, while also producing good results on the private test set
with an accuracy of 82.01%. Comment: ImageCLEF2023 published version:
https://ceur-ws.org/Vol-3497/paper-129.pd
Indoor PM0.1 and PM2.5 in Hanoi: Chemical characterization, source identification, and health risk assessment
This study attempted to provide comprehensive insights into the chemical composition, source identification, and health risk assessment of indoor particulate matter (PM) in urban areas of Vietnam. Three hundred and twenty daily samples of PM0.1 and PM2.5 were collected at three different types of dwellings in Hanoi in two seasons, namely summer and winter. The samples were analyzed for 10 trace elements (TEs), namely Cr, Mn, Co, Cu, Ni, Zn, As, Cd, Sn, and Pb. The daily average concentrations of indoor PM0.1 and PM2.5 in the city were in the ranges of 7.0–8.9 μg/m³ and 43.3–106 μg/m³, respectively. The average concentrations of TEs bound to indoor PM ranged from 66.2 ng/m³ to 216 ng/m³ for PM0.1 and from 391 ng/m³ to 2360 ng/m³ for PM2.5. Principal component analysis and enrichment factors were applied to identify the possible sources of indoor PM. Results showed that indoor PM2.5 was mainly derived from outdoor sources, whereas indoor PM0.1 was derived from both indoor and outdoor sources. Domestic coal burning and industrial and traffic emissions were identified as outdoor sources, whereas household dust and indoor combustion were identified as indoor sources. 80% of PM2.5 was deposited in the head airways, whereas 75% of PM0.1 was deposited in the alveolar region. Monte Carlo simulation indicated that the intake of TEs in PM2.5 can lead to a high carcinogenic risk for people over 60 years old and unacceptable non-carcinogenic risks for all ages at the roadside house in winter.
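The enrichment factor mentioned in this abstract is conventionally defined as the ratio of an element to a reference element in the sample, divided by the same ratio in the Earth's crust. The abstract gives neither the reference element nor the crustal abundances used, so the sketch below assumes this conventional definition, and all numbers in it are hypothetical.

```python
def enrichment_factor(
    sample_element: float,
    sample_reference: float,
    crust_element: float,
    crust_reference: float,
) -> float:
    """Return EF = (X / Ref)_sample / (X / Ref)_crust.

    All four concentrations must be positive. The element and the reference
    must share one unit within the sample and one unit within the crust.
    """
    values = (sample_element, sample_reference, crust_element, crust_reference)
    if min(values) <= 0:
        raise ValueError("all concentrations must be positive")
    return (sample_element / sample_reference) / (crust_element / crust_reference)


# Hypothetical values: an element at 50 ng/m3 against a reference element at
# 100 ng/m3 in the sample, with crustal abundances of 1 and 20 (arbitrary units).
ef = enrichment_factor(50.0, 100.0, 1.0, 20.0)
```

An EF close to 1 is usually read as a crustal origin, and a much larger EF as a sign of non-crustal sources. The cut-off applied in the study is not stated in the abstract.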
Financial Inclusion and Macroeconomic Stability in Emerging and Frontier Markets
Financial inclusion, considered a key enabler of reducing poverty and boosting prosperity
in emerging and frontier markets such as Vietnam, is the process by which individuals and small
businesses are provided with access to useful and affordable financial products and services.
The extant literature on the empirical evidence regarding the contribution of financial inclusion to
macroeconomic stability is mixed. This paper investigates the linkages between financial inclusion
and macroeconomic stability, which has not yet been thoroughly examined in the literature, for 22
emerging and frontier economies from 2008 to 2015, with particular focus on a potential optimal
level. Using the panel threshold estimation technique, the empirical findings show that financial
inclusion, as approximated by the growth rate in the number of bank branches over 100,000
account holders, enhances financial stability under a certain threshold. Financial
inclusion is also found to help maintain stable inflation and output growth. Policy
implications are discussed on the basis of these empirical findings.
Development of a Web application for gene association (Ανάπτυξη Web εφαρμογής για συσχέτιση γονιδίων)
Frost during reproductive developmental stages, especially post head emergence frost (PHEF), can result in catastrophic yield loss for wheat producers. Breeding for improved PHEF tolerance may allow greater yield to be achieved by (i) reducing direct frost damage and (ii) facilitating earlier crop sowing to reduce the risk of late-season drought and/or heat stress. This paper provides an economic feasibility analysis of breeding options for PHEF-tolerant wheat varieties. It compares the economic benefit to growers with the cost of a wheat breeding program aimed at developing PHEF-tolerant varieties. The APSIM wheat model, with a frost-impact module and a gene-based phenology module, was employed to simulate direct and indirect yield benefits for various levels of improved frost tolerance. The economic model considers optimal profit, based on sowing date and nitrogen use, rather than maximum yield. The total estimated fixed cost of the breeding program was AUD 1293 million, including large-scale seed production to meet seed demand, with AUD 1.2 million per year to run the breeding program after advanced development and large-scale field experiments. The results reveal that PHEF-tolerant varieties would lead to a significant increase in economic benefits through a reduction in direct damage and an increase in yield through early sowing. Economic benefits to growers of up to AUD 4841 million could be realised from growing PHEF-tolerant lines if useful genetic variation can be found. Sensitivity analyses indicated that the benefits are particularly sensitive to increases in fixed costs, seed replacement, and discount rate, and to delays in variety release. However, the investment remains viable for most tested scenarios.
Based on comparative economic benefits, if breeders were able to develop PHEF-tolerant varieties that could withstand temperatures 4 °C below the current damage threshold, there would be very little further economic value in breeding totally frost-tolerant varieties.
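The sensitivity to discount rate and to delays in variety release noted in this abstract can be illustrated with a plain net-present-value calculation. This is only a sketch under stated assumptions: the abstract does not describe how the study discounts costs and benefits, and the cash-flow layout, function names, and every number below are hypothetical.

```python
from typing import List, Sequence


def net_present_value(cash_flows: Sequence[float], discount_rate: float) -> float:
    """Discount yearly cash flows (year 0 first) back to the present."""
    if discount_rate <= -1.0:
        raise ValueError("discount rate must be greater than -1")
    return sum(cf / (1.0 + discount_rate) ** year for year, cf in enumerate(cash_flows))


def program_cash_flows(
    fixed_cost: float,
    annual_cost: float,
    annual_benefit: float,
    release_year: int,
    horizon: int,
) -> List[float]:
    """Hypothetical layout: the fixed cost falls in year 0, the running cost
    in every later year, and grower benefits start in the release year."""
    flows = [-fixed_cost]
    for year in range(1, horizon + 1):
        benefit = annual_benefit if year >= release_year else 0.0
        flows.append(benefit - annual_cost)
    return flows


# Hypothetical scenarios in arbitrary money units over a 20-year horizon.
on_time = net_present_value(program_cash_flows(100.0, 1.0, 30.0, 5, 20), 0.05)
delayed = net_present_value(program_cash_flows(100.0, 1.0, 30.0, 10, 20), 0.05)
high_rate = net_present_value(program_cash_flows(100.0, 1.0, 30.0, 5, 20), 0.10)
```

With these inputs, both a later release and a higher discount rate lower the net present value. Both shrink the discounted benefits, which start years after the up-front cost, by more than they shrink the costs.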
VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition
In this new era of rapid AI development, especially in language processing,
the demand for AI in the legal domain is increasingly critical. In the context
where research in other languages such as English, Japanese, and Chinese has
been well-established, we introduce the first fundamental research for the
Vietnamese language in the legal domain: legal textual entailment recognition
through the Vietnamese Language and Speech Processing workshop. In analyzing
participants' results, we discuss linguistic aspects critical in the
legal domain that pose challenges still to be addressed.
Measure representation and multifractal analysis of complete genomes
This paper introduces the notion of measure representation of DNA sequences.
Spectral analysis and multifractal analysis are then performed on the measure
representations of a large number of complete genomes. The main aim of this
paper is to discuss the multifractal property of the measure representation and
the classification of bacteria. From the measure representations and the values
of the spectra and related curves, it is concluded that these
complete genomes are not random sequences. In fact, spectral analyses performed
indicate that these measure representations, considered as time series, exhibit
strong long-range correlation. For substrings with length K=8, the
spectra of all organisms studied are multifractal-like and sufficiently smooth
for the curves to be meaningful. The curves of all bacteria
resemble a classical phase transition at a critical point. But the 'analogous'
phase transitions of chromosomes of non-bacteria organisms are different. Apart
from Chromosome 1 of C. elegans, they exhibit the shape of a double-peaked
specific heat function. Comment: 12 pages with 9 figures and 1 table
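The abstract introduces the measure representation without defining it. One plausible construction, sketched below, counts every length-K substring of the sequence, maps each substring to a point in [0, 1) by reading it as a base-4 fraction, and assigns that point the substring's relative frequency. The nucleotide-to-digit mapping and the function name are assumptions, not details taken from the abstract.

```python
from collections import Counter
from typing import Dict

# Assumed digit for each nucleotide when a substring is read as a base-4 fraction.
DIGIT = {"a": 0, "c": 1, "g": 2, "t": 3}


def measure_representation(sequence: str, k: int) -> Dict[float, float]:
    """Map each length-k substring to (left endpoint in [0, 1), frequency).

    Substrings containing symbols other than a, c, g, t are skipped.
    The returned frequencies sum to 1, so they form a probability measure.
    """
    seq = sequence.lower()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if all(ch in DIGIT for ch in seq[i:i + k])
    )
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no valid substrings of the requested length")

    measure: Dict[float, float] = {}
    for substring, n in counts.items():
        # Left endpoint of the interval of width 4**-k owned by this substring.
        x = sum(DIGIT[ch] / 4 ** (j + 1) for j, ch in enumerate(substring))
        measure[x] = n / total
    return dict(sorted(measure.items()))


# Toy sequence; the abstract reports results for K = 8 on complete genomes.
mu = measure_representation("acgtacgt", 2)
```

Ordering the frequencies by their position on [0, 1) yields a series that can be passed to spectral or multifractal analysis, in line with the abstract's treatment of the measure as a time series.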
The three-way relationship of polymorphisms of porcine genes encoding terminal complement components, their differential expression, and health-related phenotypes
Background: The complement system is an evolutionarily ancient mechanism that plays an essential role in innate immunity and contributes to the acquired immune response. Three modes of activation, known as the classical, alternative, and lectin pathways, lead to the initiation of a common terminal lytic pathway. The terminal complement components (TCCs: C6, C7, C8A, C8B, and C9) are encoded by the genes C6, C7, C8A, C8B, C8G, and C9. We aimed to experimentally test the porcine genes encoding TCCs as candidate genes for immune competence and disease resistance by addressing the three-way relationship of genotype, health-related phenotype, and mRNA expression. Results: Comparative sequencing of cDNAs of animals of the breeds German Landrace, Piétrain, Hampshire, Duroc, Vietnamese Potbelly Pig, and Berlin Miniature Pig (BMP) revealed 30 SNPs (21 in protein domains, 12 with AA exchange). The promoter regions (each ~1.5 kb upstream of the transcription start sites) of C6, C7, C8A, C8G, and C9 exhibited 29 SNPs. Significant effects of the TCC-encoding genes on hemolytic complement activity were shown in a cross of Duroc and BMP after vaccination against Mycoplasma hyopneumoniae, Aujeszky disease virus, and PRRSV by analysis of variance using repeated-measures mixed models. Family-based association tests (FBAT) confirmed the associations. The promoter SNPs were associated with the relative abundance of TCC transcripts obtained by real-time RT-PCR of 311 liver samples of commercial slaughter pigs. Complement gene expression showed a significant relationship with the prevalence of acute and chronic lung lesions. Conclusions: The analyses point to considerable variation of the porcine TCC genes and promote the genes as candidate genes for disease resistance.