108 research outputs found

    Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing

    In the context-dependent Text-to-SQL task, the generated SQL statements are refined iteratively based on the user's input utterance in each interaction. The input text from each interaction can be viewed as a set of component modifications to the previous SQL statement, which can be further extracted as modification patterns. Since these modification patterns can also be combined with other SQL statements, models are expected to generalize compositionally to these novel combinations. This work is the first exploration of compositional generalization in context-dependent Text-to-SQL scenarios. To facilitate related studies, we constructed two challenging benchmarks named CoSQL-CG and SParC-CG by recombining the modification patterns and existing SQL statements. Our experiments show that all current models struggle on the proposed benchmarks. Furthermore, we found that better aligning the previous SQL statements with the input utterance gives models better compositional generalization ability. Based on these observations, we propose a method named p-align to improve the compositional generalization of Text-to-SQL models. Further experiments validate the effectiveness of our method. Source code and data are available. Comment: Accepted to ACL 2023 (Findings), Long Paper, 11 pages

    RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction

    How can semantic relations among entities in a document be identified when only a few labeled documents are available? Few-shot document-level relation extraction (FSDLRE) is crucial for addressing the pervasive data scarcity problem in real-world scenarios. Metric-based meta-learning, which constructs class prototypes for classification, is an effective framework widely adopted for FSDLRE. However, existing works often struggle to obtain class prototypes with accurate relational semantics: 1) To build a prototype for a target relation type, they aggregate the representations of all entity pairs holding that relation, but these entity pairs may also hold other relations, which disturbs the prototype. 2) They use a set of generic NOTA (none-of-the-above) prototypes across all tasks, neglecting that NOTA semantics differ across tasks with different target relation types. In this paper, we propose a relation-aware prototype learning method for FSDLRE to strengthen the relational semantics of prototype representations. By judiciously leveraging the relation descriptions and realistic NOTA instances as guidance, our method effectively refines the relation prototypes and generates task-specific NOTA prototypes. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches by an average of 2.61% $F_1$ across various settings of two FSDLRE benchmarks. Comment: Accepted to EMNLP 202

    Integrating Overlapping Structures and Background Information of Words Significantly Improves Biological Sequence Comparison

    Word-based models have achieved promising results in sequence comparison. However, how to use the overlapping structures and background information of words, which are important statistical properties of words in biological sequences, to improve sequence comparison remains an open problem. This paper proposes a new statistical method that integrates the overlapping structures and the background information of the words in biological sequences. To assess the effectiveness of this integration for sequence comparison, two sets of evaluation experiments were conducted to test the proposed model. The first, performed via receiver operating characteristic (ROC) curve analysis, applies the proposed method to discriminating between functionally related regulatory sequences and unrelated sequences, intron and exon. The second evaluates the performance of the proposed method, using the F-measure, for clustering Hepatitis E virus genotypes. The results demonstrate that the proposed method, by integrating the overlapping structures and the background information of words, significantly improves biological sequence comparison and outperforms the existing models.

    Mutations in an AP2 Transcription Factor-Like Gene Affect Internode Length and Leaf Shape in Maize

    Background: Plant height is an important agronomic trait that affects yield and tolerance to certain abiotic stresses. Understanding the genetic control of plant height is important for elucidating the regulation of maize development and has practical implications for trait improvement in plant breeding. Methodology/Principal Findings: In this study, two independent, semi-dwarf maize EMS mutants, referred to as dwarf & irregular leaf (dil1), were isolated and confirmed to be allelic. In comparison to wild-type plants, the mutant plants have shorter internodes, shorter, wider and wrinkled leaves, as well as smaller leaf angles. Cytological analysis indicated that the leaf epidermal cells and internode parenchyma cells are irregular in shape and are arranged in a more random fashion, and the mutants have disrupted leaf epidermal patterning. In addition, parenchyma cells in the dil1 mutants are significantly smaller than those in wild-type plants. The dil1 mutation was mapped on the long arm of chromosome 6 and a candidate gene, annotated as an AP2 transcription factor-like, was identified through positional cloning. Point mutations near exon-intron junctions were identified in both dil1 alleles, resulting in mis-spliced variants. Conclusion: An AP2 transcription factor-like gene involved in stalk and leaf development in maize has been identified. Mutations near exon-intron junctions of the AP2 gene give mis-spliced transcript variants, which result in shorter internodes and wrinkled leaves.

    The selenoproteome exhibits widely varying, tissue-specific dependence on selenoprotein P for selenium supply

    Selenoprotein P (Sel P) is a selenium-rich glycoprotein believed to play a key role in selenium (Se) transport throughout the body. Development of a Sel P knockout mouse model has supported this notion, and initial studies have indicated that selenium supply to various tissues is differentially affected by genetic deletion of Sel P. Se, in the form of the amino acid selenocysteine, is incorporated into selenoproteins at UGA codons. Thus, Se availability affects not only selenoprotein levels, but also the turnover of selenoprotein mRNAs via the nonsense-mediated decay pathway. We investigated how genetic deletion of Sel P in mice affected levels of the mRNAs encoding all known members of the murine selenoprotein family, as well as three non-selenoprotein factors involved in their synthesis: selenophosphate synthetase 1 (SPS1), SECIS-binding protein 2 (SBP2) and SECp43. Our findings present a comprehensive description of selenoprotein mRNA expression in the following murine tissues: brain, heart, intestine, kidney, liver, lung, spleen and testes. We also describe how the abundance of selenoproteins and selenoprotein-synthesis factors is affected by genetic deletion of Sel P in some of these tissues, providing insight into how the presence of this selenoprotein influences selenoprotein mRNA levels, and thus, the selenoproteome.

    Descope of the ALIA mission

    The present work reports on a feasibility study commissioned by the Chinese Academy of Sciences to explore possible mission options for detecting gravitational waves in space as alternatives to the eLISA/LISA mission concept. Based on the relative merits assigned to science and technological viability, a few representative mission options descoped from the ALIA mission are considered. A semi-analytic Monte Carlo simulation is carried out to understand the cosmic black hole merger histories starting from intermediate mass black holes at high redshift, as well as the possible scientific merits of the mission options considered in probing the light seed black holes and their coevolution with galaxies in the early Universe. The study indicates that, by choosing the armlength of the interferometer to be three million kilometers and shifting the sensitivity floor to around one-hundredth Hz, together with a very moderate improvement on the position noise budget, there are certain mission options capable of exploring light seed, intermediate mass black hole binaries at high redshift that are not readily accessible to eLISA/LISA, and yet the technological requirements seem to be within reach for China in the next few decades.

    A Search for Light Super Symmetric Baryons

    We have searched for the production and decay of light super-symmetric baryons produced in 800 GeV/c proton-copper interactions in a charged hyperon beam experiment. We observe no evidence for the decays $R^+(uud\tilde{g}) \to S(uds\tilde{g})\,\pi^+$ and $X^-(ssd\tilde{g}) \to S(uds\tilde{g})\,\pi^-$ in the predicted parent mass and lifetime ranges of 1700-2500 MeV/c^2 and 50-500 ps. Production upper limits for $R^+$ at $x_F=0.47$, $P_t=1.4$ GeV/c^2 and $X^-$ at $x_F=0.48$, $P_t=0.65$ GeV/c^2 of less than $10^{-3}$ of all charged secondary particles produced are obtained for all but the highest masses and shortest lifetimes predicted. Comment: 9 pages, uuencoded postscript 4 figures uuencoded, tar-compressed file (submitted to PRL)