50 research outputs found
A Structured SVM Semantic Parser Augmented by Semantic Tagging with Conditional Random Field
PACLIC 19 / Taipei, taiwan / December 1-3, 200
Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech
Converting written texts into their spoken forms is an essential problem in
any text-to-speech (TTS) systems. However, building an effective text
normalization solution for a real-world TTS system face two main challenges:
(1) the semantic ambiguity of non-standard words (NSWs), e.g., numbers, dates,
ranges, scores, abbreviations, and (2) transforming NSWs into pronounceable
syllables, such as URL, email address, hashtag, and contact name. In this
paper, we propose a new two-phase normalization approach to deal with these
challenges. First, a model-based tagger is designed to detect NSWs. Then,
depending on NSW types, a rule-based normalizer expands those NSWs into their
final verbal forms. We conducted three empirical experiments for NSW detection
using Conditional Random Fields (CRFs), BiLSTM-CNN-CRF, and BERT-BiGRU-CRF
models on a manually annotated dataset including 5819 sentences extracted from
Vietnamese news articles. In the second phase, we propose a forward
lexicon-based maximum matching algorithm to split down the hashtag, email, URL,
and contact name. The experimental results of the tagging phase show that the
average F1 scores of the BiLSTM-CNN-CRF and CRF models are above 90.00%,
reaching the highest F1 of 95.00% with the BERT-BiGRU-CRF model. Overall, our
approach has low sentence error rates, at 8.15% with CRF and 7.11% with
BiLSTM-CNN-CRF taggers, and only 6.67% with BERT-BiGRU-CRF tagger.Comment: The 14th International Conference on Knowledge and Systems
Engineering (KSE 2022
Performance Analysis of Hybrid ALOHA/CDMA RFID Systems with Quasi-decorrelating Detector in Noisy Channels
In this paper we investigate the performance of a hybrid Aloha/CDMA radio frequency identification (RFID) system with quasi-decorrelating detector (QDD). Motivated by the fact that the QDD outperforms the conventional decorrelating detector (DD) in noisy network scenarios, we study and propose using QDD as one of the most promising candidates for the structure of RFID readers. Performance analysis in terms of bit error rate and the RFID system efficiency is considered considering CDMA code collision and detection error. Computer simulations are also performed, and the obtained results of QDD-based structure are compared with those of DD-based one to confirm the correctness of the design suggestion in different practical applications of tag identification and missing-tag detection
Review and Evaluation of Agricultural Policies in Years 2015-2017
This journal article describes main results of the OECD Annual Report published in 2018, titled “Review and evaluation of agricultural policy in 2017” for 51 selected countries in the world, including Vietnam. The report is closely prepared by MARD and OECD experts. The journal article emphasizes more on Vietnam by updating and adjusting data, information and policies in 2017 and 2018. The description presents the changing trend of agricultural policies applied in the world, considering whether this trend is in the direction of achieving sustainable productivity growth, environmental protection, and adaptation to climate change. On average in the last 20 years, trend of world policies has been better but far to catch above purposes. The development of international trade has made the commodity movement more freely and price gaps narrowed between countries and regions. This trend made agricultural markets developed more toward reflecting the scarcity of good and services. Average level of total agricultural supports has been reducing. Consequently, the world price indices and the total support have been converted between countries and commodities. However, the total agricultural support reduction is mainly in developed countries like OECD countries. Emerging and developing countries have increased their agricultural supports. Relative to GDP, the level of the total agriculture support in Vietnam has been reducing.Inside the total agricultural support, producer supports accounted 78% while general service support accounted only for 14%. Inside the producer support, market price support accounted for more than 50% in many countries. Payments based on outputs and inputs also accounted more than 50% in many countries. In Vietnam, the producer support is very small, negative level in 2015 and 2016 and became positive in 2017 and 2018. The agricultural producer support in 2017 is about 900 million USD. In the overall service support, many countries mainly invest in infrastructure construction, for example in Japan and Vietnam over 70%, while investments in other items are too small, for example that in Vietnam is only about 16%.In conclusion, OECD suggests that market price support should be reduced and finally eliminated. Similarly, output and input payments should be reduced and eliminated. Future policies should focus on general support service that helps producers to achieve sustainable productivity growth in the context of a changing and uncertain climate. OECD especially emphasizes on appropriate investments in research, together with efforts to ensure that the outputs of this research reach farmers. OECD also emphasizes on research that help producers to better manage risks including business risk, weather risk, and climate changes. Agricultural production and climate changes are strongly interacted. Future research should be the better co-operation between public and private sectors with the leading role of public sector. The future research should be co-operated more strongly between countries and regions because of the differences in histories, cultures, geology and climate
A PubMed-Wide Associational Study of Infectious Diseases
Background: Computational discovery is playing an ever-greater role in supporting the processes of knowledge synthesis. A significant proportion of the more than 18 million manuscripts indexed in the PubMed database describe infectious disease syndromes and various infectious agents. This study is the first attempt to integrate online repositories of text-based publications and microbial genome databases in order to explore the dynamics of relationships between pathogens and infectious diseases. Methodology/Principal Findings: Herein we demonstrate how the knowledge space of infectious diseases can be computationally represented and quantified, and tracked over time. The knowledge space is explored by mapping of the infectious disease literature, looking at dynamics of literature deposition, zooming in from pathogen to genome level and searching for new associations. Syndromic signatures for different pathogens can be created to enable a new and clinically focussed reclassification of the microbial world. Examples of syndrome and pathogen networks illustrate how multilevel network representations of the relationships between infectious syndromes, pathogens and pathogen genomes can illuminate unexpected biological similarities in disease pathogenesis and epidemiology. Conclusions/Significance: This new approach based on text and data mining can support the discovery of previously hidden associations between diseases and microbial pathogens, clinically relevant reclassification of pathogeni
Vietnamese Word Segmentation with CRFs and SVMs: An Investigation
PACLIC 20 / Wuhan, China / 1-3 November, 200
Current advances in seagrass research: A review from Viet Nam
Seagrass meadows provide valuable ecosystem services but are fragile and threatened ecosystems all over the world. This review highlights the current advances in seagrass research from Viet Nam. One goal is to support decision makers in developing science-based conservation strategies. In recent years, several techniques were applied to estimate the size of seagrass meadows. Independent from the method used, there is an alarming decline in the seagrass area in almost all parts of Viet Nam. Since 1990, a decline of 46.5% or 13,549 ha was found. Only in a few protected and difficult-to-reach areas was an increase observed. Conditions at those sites could be investigated in more detail to make suggestions for conservation and recovery of seagrass meadows. Due to their lifestyle and morphology, seagrasses take up compounds from their environment easily. Phytoremediation processes of Thalassia hemprichii and Enhalus acoroides are described exemplarily. High accumulation of heavy metals dependent on their concentration in the environment in different organs can be observed. On the one hand, seagrasses play a role in phytoremediation processes in polluted areas; on the other hand, they might suffer at high concentrations, and pollution will contribute to their overall decline. Compared with the neighboring countries, the total Corg stock from seagrass beds in Viet Nam was much lower than in the Philippines and Indonesia but higher than that of Malaysia and Myanmar. Due to an exceptionally long latitudinal coastline of 3,260 km covering cool to warm water environments, the seagrass species composition in Viet Nam shows a high diversity and a high plasticity within species boundaries. This leads to challenges in taxonomic issues, especially with the Halophila genus, which can be better deduced from genetic diversity/population structures of members of Hydrocharitaceae. Finally, the current seagrass conservation and management efforts in Viet Nam are presented and discussed. Only decisions based on the interdisciplinary cooperation of scientists from all disciplines mentioned will finally lead to conserve this valuable ecosystem for mankind and biodiversity