108 research outputs found
Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing
In the context-dependent Text-to-SQL task, the generated SQL statements are
refined iteratively based on the user input utterance from each interaction.
The input text from each interaction can be viewed as component modifications
to the previous SQL statements, which could be further extracted as the
modification patterns. Since these modification patterns could also be combined
with other SQL statements, the models are supposed to have the compositional
generalization to these novel combinations. This work is the first exploration
of compositional generalization in context-dependent Text-to-SQL scenarios. To
facilitate related studies, we constructed two challenging benchmarks named
\textsc{CoSQL-CG} and \textsc{SParC-CG} by recombining the modification
patterns and existing SQL statements. The following experiments show that all
current models struggle on our proposed benchmarks. Furthermore, we found that
better aligning the previous SQL statements with the input utterance could give
models better compositional generalization ability. Based on these
observations, we propose a method named \texttt{p-align} to improve the
compositional generalization of Text-to-SQL models. Further experiments
validate the effectiveness of our method. Source code and data are available.Comment: Accepted to ACL 2023 (Findings), Long Paper, 11 page
RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction
How to identify semantic relations among entities in a document when only a
few labeled documents are available? Few-shot document-level relation
extraction (FSDLRE) is crucial for addressing the pervasive data scarcity
problem in real-world scenarios. Metric-based meta-learning is an effective
framework widely adopted for FSDLRE, which constructs class prototypes for
classification. However, existing works often struggle to obtain class
prototypes with accurate relational semantics: 1) To build prototype for a
target relation type, they aggregate the representations of all entity pairs
holding that relation, while these entity pairs may also hold other relations,
thus disturbing the prototype. 2) They use a set of generic NOTA
(none-of-the-above) prototypes across all tasks, neglecting that the NOTA
semantics differs in tasks with different target relation types. In this paper,
we propose a relation-aware prototype learning method for FSDLRE to strengthen
the relational semantics of prototype representations. By judiciously
leveraging the relation descriptions and realistic NOTA instances as guidance,
our method effectively refines the relation prototypes and generates
task-specific NOTA prototypes. Extensive experiments demonstrate that our
method outperforms state-of-the-art approaches by average 2.61% across
various settings of two FSDLRE benchmarks.Comment: Accepted to EMNLP 202
Integrating Overlapping Structures and Background Information of Words Significantly Improves Biological Sequence Comparison
Word-based models have achieved promising results in sequence comparison. However, as the important statistical properties of words in biological sequence, how to use the overlapping structures and background information of the words to improve sequence comparison is still a problem. This paper proposed a new statistical method that integrates the overlapping structures and the background information of the words in biological sequences. To assess the effectiveness of this integration for sequence comparison, two sets of evaluation experiments were taken to test the proposed model. The first one, performed via receiver operating curve analysis, is the application of proposed method in discrimination between functionally related regulatory sequences and unrelated sequences, intron and exon. The second experiment is to evaluate the performance of the proposed method with f-measure for clustering Hepatitis E virus genotypes. It was demonstrated that the proposed method integrating the overlapping structures and the background information of words significantly improves biological sequence comparison and outperforms the existing models
Mutations in an AP2 Transcription Factor-Like Gene Affect Internode Length and Leaf Shape in Maize
Background
Plant height is an important agronomic trait that affects yield and tolerance to certain abiotic stresses. Understanding the genetic control of plant height is important for elucidating the regulation of maize development and has practical implications for trait improvement in plant breeding.
Methodology/Principal Findings
In this study, two independent, semi-dwarf maize EMS mutants, referred to as dwarf & irregular leaf (dil1), were isolated and confirmed to be allelic. In comparison to wild type plants, the mutant plants have shorter internodes, shorter, wider and wrinkled leaves, as well as smaller leaf angles. Cytological analysis indicated that the leaf epidermal cells and internode parenchyma cells are irregular in shape and are arranged in a more random fashion, and the mutants have disrupted leaf epidermal patterning. In addition, parenchyma cells in the dil1 mutants are significantly smaller than those in wild-type plants. The dil1 mutation was mapped on the long arm of chromosome 6 and a candidate gene, annotated as an AP2 transcription factor-like, was identified through positional cloning. Point mutations near exon-intron junctions were identified in both dil1 alleles, resulting in mis-spliced variants.
Conclusion
An AP2 transcription factor-like gene involved in stalk and leaf development in maize has been identified. Mutations near exon-intron junctions of the AP2 gene give mis-spliced transcript variants, which result in shorter internodes and wrinkled leaves
The selenoproteome exhibits widely varying, tissue-specific dependence on selenoprotein P for selenium supply
Selenoprotein P (Sel P) is a selenium-rich glycoprotein believed to play a key role in selenium (Se) transport throughout the body. Development of a Sel P knockout mouse model has supported this notion and initial studies have indicated that selenium supply to various tissues is differentially affected by genetic deletion of Sel P. Se in the form of the amino acid, selenocysteine, is incorporated into selenoproteins at UGA codons. Thus, Se availability affects not only selenoprotein levels, but also the turnover of selenoprotein mRNAs via the nonsense-mediated decay pathway. We investigated how genetic deletion of Sel P in mice affected levels of the mRNAs encoding all known members of the murine selenoprotein family, as well as three non-selenoprotein factors involved in their synthesis, selenophosphate synthetase 1 (SPS1), SECIS-binding protein 2 (SBP2) and SECp43. Our findings present a comprehensive description of selenoprotein mRNA expression in the following murine tissues: brain, heart, intestine, kidney, liver, lung, spleen and testes. We also describe how abundance of selenoproteins and selenoprotein-synthesis factors are affected by genetic deletion of Sel P in some of these tissues, providing insight into how the presence of this selenoprotein influences selenoprotein mRNA levels, and thus, the selenoproteome
Descope of the ALIA mission
The present work reports on a feasibility study commissioned by the Chinese
Academy of Sciences of China to explore various possible mission options to
detect gravitational waves in space alternative to that of the eLISA/LISA
mission concept. Based on the relative merits assigned to science and
technological viability, a few representative mission options descoped from the
ALIA mission are considered. A semi-analytic Monte Carlo simulation is carried
out to understand the cosmic black hole merger histories starting from
intermediate mass black holes at high redshift as well as the possible
scientific merits of the mission options considered in probing the light seed
black holes and their coevolution with galaxies in early Universe. The study
indicates that, by choosing the armlength of the interferometer to be three
million kilometers and shifting the sensitivity floor to around one-hundredth
Hz, together with a very moderate improvement on the position noise budget,
there are certain mission options capable of exploring light seed, intermediate
mass black hole binaries at high redshift that are not readily accessible to
eLISA/LISA, and yet the technological requirements seem to within reach in the
next few decades for China
A Search for Light Super Symmetric Baryons
We have searched for the production and decay of light super-symmetric
baryons produced in 800 GeV/c proton copper interactions in a charged hyperon
beam experiment. We observe no evidence for the decays R+(uud \g^~) -> S(uds
\g^~) pi+ and X-(ssd \g^~) -> S(uds \g^~) pi- in the predicted parent mass and
lifetime ranges of 1700-2500 Mev/c2 and 50-500 ps. Production upper limits for
R+ at xF=0.47, Pt=1.4 GeV/c2 and X- at xF=0.48, Pt=0.65 GeV/c2 of less than
10^-3 of all charged secondary particles produced are obtained for all but the
highest masses and shortest lifetimes predicted.Comment: 9 pages, uuencoded postscript 4 figures uuencoded, tar-compressed
file (submitted to PRL
- …