Search CORE

424 research outputs found

Diversifying Top-K Results

Author: Chang Lijun
Qin Lu
Yu Jeffrey Xu
Publication venue
Publication date: 01/01/2012
Field of study

Top-k query processing finds a list of k results that have largest scores w.r.t the user given query, with the assumption that all the k results are independent to each other. In practice, some of the top-k results returned can be very similar to each other. As a result some of the top-k results returned are redundant. In the literature, diversified top-k search has been studied to return k results that take both score and diversity into consideration. Most existing solutions on diversified top-k search assume that scores of all the search results are given, and some works solve the diversity problem on a specific problem and can hardly be extended to general cases. In this paper, we study the diversified top-k search problem. We define a general diversified top-k search problem that only considers the similarity of the search results themselves. We propose a framework, such that most existing solutions for top-k query processing can be extended easily to handle diversified top-k search, by simply applying three new functions, a sufficient stop condition sufficient(), a necessary stop condition necessary(), and an algorithm for diversified top-k search on the current set of generated results, div-search-current(). We propose three new algorithms, namely, div-astar, div-dp, and div-cut to solve the div-search-current() problem. div-astar is an A* based algorithm, div-dp is an algorithm that decomposes the results into components which are searched using div-astar independently and combined using dynamic programming. div-cut further decomposes the current set of generated results using cut points and combines the results using sophisticated operations. We conducted extensive performance studies using two real datasets, enwiki and reuters. Our div-cut algorithm finds the optimal solution for diversified top-k search problem in seconds even for k as large as 2,000.Comment: VLDB201

arXiv.org e-Print Archive

OPUS - University of Technology Sydney

Low Density Nuclear Matter in Heavy Ion Collisions

Author: Qin Lijun
Publication venue
Publication date: 14/01/2010
Field of study

The symmetry energy is the energy difference between symmetric nuclear matter and pure neutron matter at a given density. Around normal nuclear density, i.e. p/p0 =1, and temperature, i.e. T = 0, the symmetry energy is approximately 23.5 MeV/nucleon for finite nuclear matter and 30 MeV/nucleon for infinite nuclear matter, but at other densities, the symmetry energies are very poorly understood. Since the symmetry energy is very important in understanding many aspects of heavy ion reactions, structure, and nuclear astrophysics, many different models have been developed and some predications of the density dependence of symmetry energy have been made. Intermediate energy heavy ion collisions provide a unique tool to probe the nuclear equation of state. The initial compression and the thermal shock in Fermi- Energy heavy ion collisions lead naturally to the production of nucleonic matter at varying temperatures and densities which are interesting in this context. Since the light particle emission during this stage witnesses each stage of the reaction, it carries essential information on the early dynamics and on the degree of equilibration at each stage of the reaction. The kinematic features and yields of emitted light particles and clusters in the invairant velocity frame have been exploited to probe the nature of the intermediate system and information on the Equation Of State (EOS) with emphasis on the properties of the low density participant matter produced in such collisions. In order to pursue this effort and broaden the density range over which the symmetry energies are experimentally determined we have now carried out a series of experiments in which the reactions of 112Sn and 124Sn with projectiles, ranging from 4He,10B, 20Ne, 40Ar to 64Zn, all at the same energy per nucleon, 47 Mev/u, were performed. In this series of experiments different collision systems should lead to different average densities. By careful comparisons of the yields, spectra and angular distributions observed for particle emission from these different systems we attempted to cleanly separate early emission resulting from nucleon-nucleon collisions from that resulting from evaporation from the thermalized system and obtain a much cleaner picture of the dynamic evolution of the hotter systems. The Albergo Model has been used to calculate the density and temperature, symmetry free energies with the isoscaling technique for systems with different N/Z ratios. Those are compared with Roepke Model results. Also other models like VEOS, Lattimer, and Shen-Toki have been added to calculate the alpha mass fraction in order to understand the properties of low density matter further

OAKTrust Digital Repository (Texas A&M Univ)

A Validation Approach to Over-parameterized Matrix and Image Recovery

Author: Ding Lijun
Jiang Liwei
Qin Zhen
Zhou Jinxin
Zhu Zhihui
Publication venue
Publication date: 21/09/2022
Field of study

In this paper, we study the problem of recovering a low-rank matrix from a number of noisy random linear measurements. We consider the setting where the rank of the ground-truth matrix is unknown a prior and use an overspecified factored representation of the matrix variable, where the global optimal solutions overfit and do not correspond to the underlying ground-truth. We then solve the associated nonconvex problem using gradient descent with small random initialization. We show that as long as the measurement operators satisfy the restricted isometry property (RIP) with its rank parameter scaling with the rank of ground-truth matrix rather than scaling with the overspecified matrix variable, gradient descent iterations are on a particular trajectory towards the ground-truth matrix and achieve nearly information-theoretically optimal recovery when stop appropriately. We then propose an efficient early stopping strategy based on the common hold-out method and show that it detects nearly optimal estimator provably. Moreover, experiments show that the proposed validation approach can also be efficiently used for image restoration with deep image prior which over-parameterizes an image with a deep network.Comment: 29 pages and 9 figure

arXiv.org e-Print Archive