168 research outputs found

    DA-Net: A Disentangled and Adaptive Network for Multi-Source Cross-Lingual Transfer Learning

    Full text link
    Multi-Source cross-lingual transfer learning deals with the transfer of task knowledge from multiple labelled source languages to an unlabeled target language under the language shift. Existing methods typically focus on weighting the predictions produced by language-specific classifiers of different sources that follow a shared encoder. However, all source languages share the same encoder, which is updated by all these languages. The extracted representations inevitably contain different source languages' information, which may disturb the learning of the language-specific classifiers. Additionally, due to the language gap, language-specific classifiers trained with source labels are unable to make accurate predictions for the target language. Both facts impair the model's performance. To address these challenges, we propose a Disentangled and Adaptive Network (DA-Net). Firstly, we devise a feedback-guided collaborative disentanglement method that seeks to purify input representations of classifiers, thereby mitigating mutual interference from multiple sources. Secondly, we propose a class-aware parallel adaptation method that aligns class-level distributions for each source-target language pair, thereby alleviating the language pairs' language gap. Experimental results on three different tasks involving 38 languages validate the effectiveness of our approach.Comment: AAAI 202

    Extending Conditional Dependencies with Built-in Predicates

    Get PDF
    This paper proposes a natural extension of conditional functional dependencies (CFDs [1]) and conditional inclusion dependencies (CINDs [2]), denoted by CFDps and CIND(p)s, respectively, by specifying patterns of data values with not equal, <, <=, >, and >= predicates. As data quality rules, CFDps and CIND(p)s are able to capture errors that commonly arise in practice but cannot be detected by CFDs and CINDs. We establish two sets of results for central technical problems associated with CFD(p)s and CIND(p)s. (a) One concerns the satisfiability and implication problems for CFD(p)s and CIND(p)s, taken separately or together. These are important for, e.g. deciding whether data quality rules are dirty themselves, and for removing redundant rules. We show that despite the increased expressive power, the static analyses of CFD(p)s and CIND(p)s retain the same complexity as their CFDs and CINDs counterparts. (b) The other concerns validation of CFD(p)s and CIND(p)s. We show that given a set Sigma of CFD(p)s and CIND(p)s on a database D, a set of SQL queries can be automatically generated that, when evaluated against D, return all tuples in D that violate some dependencies in Sigma. We also experimentally verified the efficiency and effectiveness of our SQL based error detection techniques, using real-life data. This provides commercial DBMS with an immediate capability to detect errors based on CFD(p)s and CIND(p)s.973 program [2014CB340300, 2012CB316200, 2014CB340302]; NSFC [61322207, 61133002]; Guangdong Innovative Research Team Program [2011D005]; Shenzhen Peacock Program [1105100030834361]; EPSRC [EP/J015377/1, EP/M025268/1]; NSF III [1302212]; Google Faculty Research Award; [ERC-2014-AdG 652976]SCI(E)[email protected]; [email protected]; [email protected]; [email protected]; [email protected]

    Different responses of soil fungal and bacterial communities to nitrogen addition in a forest grassland ecotone

    Get PDF
    IntroductionContinuous nitrogen deposition increases the nitrogen content of terrestrial ecosystem and affects the geochemical cycle of soil nitrogen. Forest-grassland ecotone is the interface area of forest and grassland and is sensitive to global climate change. However, the structure composition and diversity of soil microbial communities and their relationship with soil environmental factors at increasing nitrogen deposition have not been sufficiently studied in forest-grassland ecotone.MethodsIn this study, experiments were carried out with four nitrogen addition treatments (0 kgN·hm−2·a−1, 10 kgN·hm−2·a−1, 20 kgN·hm−2·a−1 and 40 kgN·hm−2·a−1) to simulate nitrogen deposition in a forest-grassland ecotone in northwest Liaoning Province, China. High-throughput sequencing and qPCR technologies were used to analyze the composition, structure, and diversity characteristics of the soil microbial communities under different levels of nitrogen addition.Results and discussionThe results showed that soil pH decreased significantly at increasing nitrogen concentrations, and the total nitrogen and ammonium nitrogen contents first increased and then decreased, which were significantly higher in the N10 treatment than in other treatments (N:0.32 ~ 0.48 g/kg; NH4+-N: 11.54 ~ 13 mg/kg). With the increase in nitrogen concentration, the net nitrogen mineralization, nitrification, and ammoniation rates decreased. The addition of nitrogen had no significant effect on the diversity and structure of the fungal community, while the diversity of the bacterial community decreased significantly at increasing nitrogen concentrations. Ascomycetes and Actinomycetes were the dominant fungal and bacterial phyla, respectively. The relative abundance of Ascomycetes was negatively correlated with total nitrogen content, while that of Actinomycetes was positively correlated with soil pH. The fungal community diversity was significantly negatively correlated with nitrate nitrogen, while the diversity of the bacterial community was significantly positively correlated with soil pH. No significant differences in the abundance of functional genes related to soil nitrogen transformations under the different treatments were observed. Overall, the distribution pattern and driving factors were different in soil microbial communities in a forest-grassland ecotone in northwest Liaoning. Our study enriches research content related to factors that affect the forest-grassland ecotone

    The effect of Shengmai injection in patients with coronary heart disease in real world and its personalized medicine research using machine learning techniques

    Get PDF
    Objective: Shengmai injection is a common treatment for coronary heart disease. The accurate dose regimen is important to maximize effectiveness and minimize adverse reactions. We aim to explore the effect of Shengmai injection in patients with coronary heart disease based on real-world data and establish a personalized medicine model using machine learning and deep learning techniques.Methods: 211 patients were enrolled. The length of hospital stay was used to explore the effect of Shengmai injection in a case-control study. We applied propensity score matching to reduce bias and Wilcoxon rank sum test to compare results between the experimental group and the control group. Important variables influencing the dose regimen of Shengmai injection were screened by XGBoost. A personalized medicine model of Shengmai injection was established by XGBoost selected from nine algorithm models. SHapley Additive exPlanations and confusion matrix were used to interpret the results clinically.Results: Patients using Shengmai injection had shorter length of hospital stay than those not using Shengmai injection (median 10.00 days vs. 11.00 days, p = 0.006). The personalized medicine model established via XGBoost shows accuracy = 0.81 and AUC = 0.87 in test cohort and accuracy = 0.84 and AUC = 0.84 in external verification. The important variables influencing the dose regimen of Shengmai injection include lipid-lowering drugs, platelet-lowering drugs, levels of GGT, hemoglobin, prealbumin, and cholesterol at admission. Finally, the personalized model shows precision = 75%, recall rate = 83% and F1-score = 79% for predicting 40 mg of Shengmai injection; and precision = 86%, recall rate = 79% and F1-score = 83% for predicting 60 mg of Shengmai injection.Conclusion: This study provides evidence supporting the clinical effectiveness of Shengmai injection, and established its personalized medicine model, which may help clinicians make better decisions
    • …
    corecore