Search CORE

1,848 research outputs found

Structural Embedding of Syntactic Trees for Machine Comprehension

Author: Hu Junjie
Liu Rui
Nyberg Eric
Wei Wei
Yang Zi
Publication venue
Publication date: 01/01/2017
Field of study

Deep neural networks for machine comprehension typically utilizes only word or character embeddings without explicitly taking advantage of structured linguistic information such as constituency trees and dependency trees. In this paper, we propose structural embedding of syntactic trees (SEST), an algorithm framework to utilize structured information and encode them into vector representations that can boost the performance of algorithms for the machine comprehension. We evaluate our approach using a state-of-the-art neural attention model on the SQuAD dataset. Experimental results demonstrate that our model can accurately identify the syntactic boundaries of the sentences and extract answers that are syntactically coherent over the baseline methods

arXiv.org e-Print Archive

Crossref

Age Problem in Lemaitre-Tolman-Bondi Void Models

Author: Liu De-Zi
Wei Hao
Yan Xiao-Peng
Publication venue: 'Elsevier BV'
Publication date: 03/02/2015
Field of study

As is well known, one can explain the current cosmic acceleration by considering an inhomogeneous and/or anisotropic universe (which violates the cosmological principle), without invoking dark energy or modified gravity. The well-known one of this kind of models is the so-called Lema\^{\i}tre-Tolman-Bondi (LTB) void model, in which the universe is spherically symmetric and radially inhomogeneous, and we are living in a locally underdense void centered nearby our location. In the present work, we test various LTB void models with some old high redshift objects (OHROs). Obviously, the universe cannot be younger than its constituents. We find that an unusually large

r_0

(characterizing the size of the void) is required to accommodate these OHROs in LTB void models. There is a serious tension between this unusually large

r_0

and the much smaller

r_0

inferred from other observations (e.g. SNIa, CMB and so on). However, if we instead consider the lowest limit 1.7\,Gyr for the quasar APM 08279+5255 at redshift

z=3.91

, this tension could be greatly alleviated.Comment: 17 pages, 9 figures, revtex4; v2: discussions added, Phys. Lett. B in press; v3: published versio

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Directory of Open Access Journals

New Generalizations of Cosmography Inspired by the Pade Approximant

Author: Liu De-Zi
Wei Hao
Zhou Ya-Nan
Zou Xiao-Bo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

The current accelerated expansion of the universe has been one of the most important fields in physics and astronomy since 1998. Many cosmological models have been proposed in the literature to explain this mysterious phenomenon. Since the nature and cause of the cosmic acceleration are still unknown, model-independent approaches to study the evolution of the universe are welcome. One of the powerful model-independent approaches is the so-called cosmography. It only relies on the cosmological principle, without postulating any underlying theoretical model. However, there are several shortcomings in the usual cosmography. For instance, it is plagued with the problem of divergence (or an unacceptably large error), and it fails to predict the future evolution of the universe. In the present work, we try to overcome or at least alleviate these problems, and we propose two new generalizations of cosmography inspired by the Pad\'e approximant. One is to directly parameterize the luminosity distance based on the Pad\'e approximant, while the other is to generalize cosmography with respect to a so-called

y_\beta

-shift

y_\beta=z/(1+\beta z)

, which is also inspired by the Pad\'e approximant. Then, we confront them with the observational data with the help of the Markov chain Monte Carlo (MCMC) code emcee, and find that they work fairly well.Comment: 16 pages, 3 tables, 5 figures, revtex4; v2: discussions added, Eur. Phys. J. C in press; v3: published versio

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Springer - Publisher Connector

Alignment is not sufficient to prevent large language models from generating harmful information: A psychoanalytic perspective

Author: Ding Wei
Liu Jia
Yin Zi
Publication venue
Publication date: 14/11/2023
Field of study

Large Language Models (LLMs) are central to a multitude of applications but struggle with significant risks, notably in generating harmful content and biases. Drawing an analogy to the human psyche's conflict between evolutionary survival instincts and societal norm adherence elucidated in Freud's psychoanalysis theory, we argue that LLMs suffer a similar fundamental conflict, arising between their inherent desire for syntactic and semantic continuity, established during the pre-training phase, and the post-training alignment with human values. This conflict renders LLMs vulnerable to adversarial attacks, wherein intensifying the models' desire for continuity can circumvent alignment efforts, resulting in the generation of harmful information. Through a series of experiments, we first validated the existence of the desire for continuity in LLMs, and further devised a straightforward yet powerful technique, such as incomplete sentences, negative priming, and cognitive dissonance scenarios, to demonstrate that even advanced LLMs struggle to prevent the generation of harmful information. In summary, our study uncovers the root of LLMs' vulnerabilities to adversarial attacks, hereby questioning the efficacy of solely relying on sophisticated alignment methods, and further advocates for a new training idea that integrates modal concepts alongside traditional amodal concepts, aiming to endow LLMs with a more nuanced understanding of real-world contexts and ethical considerations

arXiv.org e-Print Archive

Promoting cold-start items in recommender systems

Author: Li Wei-Min
Liu Chuang
Liu Jin-Hu
Yang Zimo
Zhang Zi-Ke
Zhou Tao
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 19/04/2014
Field of study

As one of major challenges, cold-start problem plagues nearly all recommender systems. In particular, new items will be overlooked, impeding the development of new products online. Given limited resources, how to utilize the knowledge of recommender systems and design efficient marketing strategy for new items is extremely important. In this paper, we convert this ticklish issue into a clear mathematical problem based on a bipartite network representation. Under the most widely used algorithm in real e-commerce recommender systems, so-called the item-based collaborative filtering, we show that to simply push new items to active users is not a good strategy. To our surprise, experiments on real recommender systems indicate that to connect new items with some less active users will statistically yield better performance, namely these new items will have more chance to appear in other users' recommendation lists. Further analysis suggests that the disassortative nature of recommender systems contributes to such observation. In a word, getting in-depth understanding on recommender systems could pave the way for the owners to popularize their cold-start products with low costs.Comment: 6 pages, 6 figure

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare

Swept blade influence on aerodynamic performance of steam turbine nozzle cascades

Author: Feng Zi-Ming
Liu Xiaolei
Tan Jingjing
Wei Cui
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2018
Field of study

To improve the aerodynamic performance of steam turbine nozzle cascades, it is significant to study the effect of swept blades to control the flow field within the cascade. Numerical simulations of three different sweep angle blades (−20°, +20° and 0°) were carried out, using CFD modelling. Simulation results showed that the aft-swept blade can effectively improve the corresponding flow characteristics and reduce the total pressure loss. Meanwhile, it has better aerodynamic performance than the straight blade and the fore-swept blade

Crossref

Cranfield CERES

Enlighten

Radial Angular Momentum Transfer and Magnetic Barrier for Short-Type Gamma-Ray Burst Central Engine Activity

Author: Dai Zi-Gao
Gu Wei-Min
Hou Shu-Jin
Lei Wei-Hua
Liang En-Wei
Lin Lin
Liu Tong
Zhang Shuang-Nan
顾为民
Publication venue: 'IOP Publishing'
Publication date: 30/09/2012
Field of study

Soft extended emission (EE) following initial hard spikes up to 100 seconds was observed with {\em Swift}/BAT for about half of short-type gamma-ray bursts (SGRBs). This challenges the conversional central engine models of SGRBs, i.e., compact star merger models. In the framework of the black hole-neutron star merger models, we study the roles of the radial angular momentum transfer in the disk and the magnetic barrier around the black hole for the activity of SGRB central engines. We show that the radial angular momentum transfer may significantly prolong the lifetime of the accretion process and multiple episodes may be switched by the magnetic barrier. Our numerical calculations based on the models of the neutrino-dominated accretion flows suggest that the disk mass is critical for producing the observed EE. In case of the mass being

\sim 0.8M_{\odot}

, our model can reproduce the observed timescale and luminosity of both the main and EE episodes in a reasonable parameter set. The predicted luminosity of the EE component is lower than the observed EE with about one order of magnitude and the timescale is shorter than 20 seconds if the disk mass being

\sim 0.2M_{\odot}

. {\em Swift}/BAT-like instruments may be not sensitive enough to detect the EE component in this case. We argue that the EE component would be a probe for merger process and disk formation for compact star mergers.Comment: 9 pages, 3 figures, accepted for publication in Ap

arXiv.org e-Print Archive

Xiamen University Institutional Repository