20 research outputs found
Crowdsourcing Argumentation Structures in Chinese Hotel Reviews
Argumentation mining aims at automatically extracting the premises-claim
discourse structures in natural language texts. There is a great demand for
argumentation corpora for customer reviews. However, due to the controversial
nature of the argumentation annotation task, there exist very few large-scale
argumentation corpora for customer reviews. In this work, we novelly use the
crowdsourcing technique to collect argumentation annotations in Chinese hotel
reviews. As the first Chinese argumentation dataset, our corpus includes 4814
argument component annotations and 411 argument relation annotations, and its
annotations qualities are comparable to some widely used argumentation corpora
in other languages.Comment: 6 pages,3 figures,This article has been submitted to "The 2017 IEEE
International Conference on Systems, Man, and Cybernetics (SMC2017)
Using Argument-based Features to Predict and Analyse Review Helpfulness
We study the helpful product reviews identification problem in this paper. We
observe that the evidence-conclusion discourse relations, also known as
arguments, often appear in product reviews, and we hypothesise that some
argument-based features, e.g. the percentage of argumentative sentences, the
evidences-conclusions ratios, are good indicators of helpful reviews. To
validate this hypothesis, we manually annotate arguments in 110 hotel reviews,
and investigate the effectiveness of several combinations of argument-based
features. Experiments suggest that, when being used together with the
argument-based features, the state-of-the-art baseline features can enjoy a
performance boost (in terms of F1) of 11.01\% in average.Comment: 6 pages, EMNLP201
Using Argument-based Features to Predict and Analyse Review Helpfulness
We study the helpful product reviews identification problem in this paper. We
observe that the evidence-conclusion discourse relations, also known as
arguments, often appear in product reviews, and we hypothesise that some
argument-based features, e.g. the percentage of argumentative sentences, the
evidences-conclusions ratios, are good indicators of helpful reviews. To
validate this hypothesis, we manually annotate arguments in 110 hotel reviews,
and investigate the effectiveness of several combinations of argument-based
features. Experiments suggest that, when being used together with the
argument-based features, the state-of-the-art baseline features can enjoy a
performance boost (in terms of F1) of 11.01\% in average.Comment: 6 pages, EMNLP201
Using Self-Organizing Map and Multivariate Statistical Methods for Groundwater Quality Assessment in the Urban Area of Linyi City, China
Groundwater holds an important role in the water supply in Linyi city, China. Investigating the hydrochemical characteristics of groundwater, and revealing the factors governing groundwater geochemistry, is a primary step for ensuring the safe and rational exploitation of groundwater resources. This study used a self-organizing map (SOM) and multivariate statistical methods to assess groundwater quality in the urban area of Linyi city. Based on the hydrochemical dataset consisting of nine parameters (i.e., pH, Ca2+, Mg2+, Na+, K+, HCO3−, Cl−, SO42−, and NO3−) from 89 groundwater samples, the SOM was first applied to obtain the weight vectors of the output nodes. Hierarchical cluster analysis (HCA) was used for organizing the nodes into four clusters. The node cluster indices were then remapped to the groundwater samples according to the winner node for each sample. The hydrochemical characteristics and factors controlling the groundwater geochemistry of the four clusters were analyzed using principal component analysis (PCA) and graphical methods including Piper and Gibbs diagrams, as well as binary plots of the major ions in groundwater. Results indicated that groundwater geochemistry in this area is primarily governed by water–rock interactions, such as the dissolution of halite, calcite, and gypsum, along with the influence of municipal sewage and the degradation of organic matter. This study demonstrates that the integration of an SOM and multivariate statistical methods improves the understanding of groundwater geochemistry and hydrochemical evolution in complex groundwater flow systems impacted by utilization
Epidemiological and transcriptome data identify potential key genes involved in iron overload for type 2 diabetes
Abstract Background Many previous studies have reported the association between iron overload (IO) and type 2 diabetes mellitus (T2DM). However, the underlying molecular mechanism is not clear. Methods Epidemiological data from the National Health and Nutrition Examination Survey 2017–2018 (NHANES) was used to systematically explore the association between IO and diabetes. Furthermore, transcriptome data from Gene Expression Omnibus (GEO) were analyzed using bioinformatics methods to explore the underlying functional mechanisms at the molecular level. Results Data from NHANES showed a “W” shape relationship between serum iron (frozen) and the risk of diabetes (P < 0.001) as well as a “∧” shape correlation between serum unsaturated iron binding capacity (UIBC) and the risk of diabetes (P = 0.007). Furthermore, the serum iron (frozen) was positively associated with fasting plasma glucose and HOMAB (P < 0.05), and UIBC was positively associated with fasting insulin (P < 0.05). Transcriptome data showed that two IO-related genes [Transferrin receptor (TFRC) and Solute carrier family-11 member-2 (SLC11A2)] were down-regulated in T2DM. The correlation analysis showed that expression levels of TFRC and SLC11A2 were significantly and positively correlated with genes involved in insulin secretion (P < 0.05). Protein–protein interaction network analysis showed that TFRC and SLC11A2 interacted with four key genes, including VAMP2, HIF1A, SLC2A1, and RAB11FIP2. Conclusion We found that IO status was associated with increased FPG and aggravated HOMAB, and two IO-related genes (TFRC and SLC11A2) might induce the occurrence of T2DM by influencing insulin secretion, which provides potential therapeutic targets for T2DM patients