1,848 research outputs found

    Structural Embedding of Syntactic Trees for Machine Comprehension

    Full text link
    Deep neural networks for machine comprehension typically utilizes only word or character embeddings without explicitly taking advantage of structured linguistic information such as constituency trees and dependency trees. In this paper, we propose structural embedding of syntactic trees (SEST), an algorithm framework to utilize structured information and encode them into vector representations that can boost the performance of algorithms for the machine comprehension. We evaluate our approach using a state-of-the-art neural attention model on the SQuAD dataset. Experimental results demonstrate that our model can accurately identify the syntactic boundaries of the sentences and extract answers that are syntactically coherent over the baseline methods

    Age Problem in Lemaitre-Tolman-Bondi Void Models

    Full text link
    As is well known, one can explain the current cosmic acceleration by considering an inhomogeneous and/or anisotropic universe (which violates the cosmological principle), without invoking dark energy or modified gravity. The well-known one of this kind of models is the so-called Lema\^{\i}tre-Tolman-Bondi (LTB) void model, in which the universe is spherically symmetric and radially inhomogeneous, and we are living in a locally underdense void centered nearby our location. In the present work, we test various LTB void models with some old high redshift objects (OHROs). Obviously, the universe cannot be younger than its constituents. We find that an unusually large r0r_0 (characterizing the size of the void) is required to accommodate these OHROs in LTB void models. There is a serious tension between this unusually large r0r_0 and the much smaller r0r_0 inferred from other observations (e.g. SNIa, CMB and so on). However, if we instead consider the lowest limit 1.7\,Gyr for the quasar APM 08279+5255 at redshift z=3.91z=3.91, this tension could be greatly alleviated.Comment: 17 pages, 9 figures, revtex4; v2: discussions added, Phys. Lett. B in press; v3: published versio

    New Generalizations of Cosmography Inspired by the Pade Approximant

    Get PDF
    The current accelerated expansion of the universe has been one of the most important fields in physics and astronomy since 1998. Many cosmological models have been proposed in the literature to explain this mysterious phenomenon. Since the nature and cause of the cosmic acceleration are still unknown, model-independent approaches to study the evolution of the universe are welcome. One of the powerful model-independent approaches is the so-called cosmography. It only relies on the cosmological principle, without postulating any underlying theoretical model. However, there are several shortcomings in the usual cosmography. For instance, it is plagued with the problem of divergence (or an unacceptably large error), and it fails to predict the future evolution of the universe. In the present work, we try to overcome or at least alleviate these problems, and we propose two new generalizations of cosmography inspired by the Pad\'e approximant. One is to directly parameterize the luminosity distance based on the Pad\'e approximant, while the other is to generalize cosmography with respect to a so-called yβy_\beta-shift yβ=z/(1+βz)y_\beta=z/(1+\beta z), which is also inspired by the Pad\'e approximant. Then, we confront them with the observational data with the help of the Markov chain Monte Carlo (MCMC) code emcee, and find that they work fairly well.Comment: 16 pages, 3 tables, 5 figures, revtex4; v2: discussions added, Eur. Phys. J. C in press; v3: published versio

    Alignment is not sufficient to prevent large language models from generating harmful information: A psychoanalytic perspective

    Full text link
    Large Language Models (LLMs) are central to a multitude of applications but struggle with significant risks, notably in generating harmful content and biases. Drawing an analogy to the human psyche's conflict between evolutionary survival instincts and societal norm adherence elucidated in Freud's psychoanalysis theory, we argue that LLMs suffer a similar fundamental conflict, arising between their inherent desire for syntactic and semantic continuity, established during the pre-training phase, and the post-training alignment with human values. This conflict renders LLMs vulnerable to adversarial attacks, wherein intensifying the models' desire for continuity can circumvent alignment efforts, resulting in the generation of harmful information. Through a series of experiments, we first validated the existence of the desire for continuity in LLMs, and further devised a straightforward yet powerful technique, such as incomplete sentences, negative priming, and cognitive dissonance scenarios, to demonstrate that even advanced LLMs struggle to prevent the generation of harmful information. In summary, our study uncovers the root of LLMs' vulnerabilities to adversarial attacks, hereby questioning the efficacy of solely relying on sophisticated alignment methods, and further advocates for a new training idea that integrates modal concepts alongside traditional amodal concepts, aiming to endow LLMs with a more nuanced understanding of real-world contexts and ethical considerations

    Promoting cold-start items in recommender systems

    Full text link
    As one of major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, so-called the item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. To our surprise, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.Comment: 6 pages, 6 figure

    Swept blade influence on aerodynamic performance of steam turbine nozzle cascades

    Get PDF
    To improve the aerodynamic performance of steam turbine nozzle cascades, it is significant to study the effect of swept blades to control the flow field within the cascade. Numerical simulations of three different sweep angle blades (−20°, +20° and 0°) were carried out, using CFD modelling. Simulation results showed that the aft-swept blade can effectively improve the corresponding flow characteristics and reduce the total pressure loss. Meanwhile, it has better aerodynamic performance than the straight blade and the fore-swept blade

    Radial Angular Momentum Transfer and Magnetic Barrier for Short-Type Gamma-Ray Burst Central Engine Activity

    Get PDF
    Soft extended emission (EE) following initial hard spikes up to 100 seconds was observed with {\em Swift}/BAT for about half of short-type gamma-ray bursts (SGRBs). This challenges the conversional central engine models of SGRBs, i.e., compact star merger models. In the framework of the black hole-neutron star merger models, we study the roles of the radial angular momentum transfer in the disk and the magnetic barrier around the black hole for the activity of SGRB central engines. We show that the radial angular momentum transfer may significantly prolong the lifetime of the accretion process and multiple episodes may be switched by the magnetic barrier. Our numerical calculations based on the models of the neutrino-dominated accretion flows suggest that the disk mass is critical for producing the observed EE. In case of the mass being ∼0.8M⊙\sim 0.8M_{\odot}, our model can reproduce the observed timescale and luminosity of both the main and EE episodes in a reasonable parameter set. The predicted luminosity of the EE component is lower than the observed EE with about one order of magnitude and the timescale is shorter than 20 seconds if the disk mass being ∼0.2M⊙\sim 0.2M_{\odot}. {\em Swift}/BAT-like instruments may be not sensitive enough to detect the EE component in this case. We argue that the EE component would be a probe for merger process and disk formation for compact star mergers.Comment: 9 pages, 3 figures, accepted for publication in Ap
    • …
    corecore