Search CORE

4,904 research outputs found

A text-based approach to the ImageCLEF 2010 photo annotation task

Author: Jones Gareth J.F.
Li Wei B.
Min Jinming
Publication venue
Publication date: 01/01/2010
Field of study

The challenges of searching the increasingly large collections of digital images which are appearing in many places mean that automated annotation of images is becoming an important task. We describe our participation in the ImageCLEF 2010 Visual Concept Detection and Annotation Task. Our approach used only the textual features (Flickr user tags and EXIF information) to perform the automatic annotation. Our approach was to explore the use of different techniques to improve the results of textual annotation. We identify the drawbacks of our approach and how these might be addressed and optimized in further work

CiteSeerX

Irish Universities

DCU Online Research Access Service

Enhanced information retrieval using domain-specific recommender models

Author: Ganguly Debasis
Jones Gareth J.F.
Li Wei B.
Publication venue
Publication date: 01/09/2011
Field of study

The objective of an information retrieval (IR) system is to retrieve relevant items which meet a user information need. There is currently significant interest in personalized IR which seeks to improve IR effectiveness by incorporating a model of the user’s interests. However, in some situations there may be no opportunity to learn about the interests of a specific user on a certain topic. In our work, we propose an IR approach which combines a recommender algorithm with IR methods to improve retrieval for domains where the system has no opportunity to learn prior information about the user’s knowledge of a domain for which they have not previously entered a query. We use search data from other previous users interested in the same topic to build a recommender model for this topic. When a user enters a query on a topic, new to this user, an appropriate recommender model is selected and used to predict a ranking which the user may find interesting based on the behaviour of previous users with similar queries. The recommender output is integrated with a standard IR method in a weighted linear combination to provide a final result for the user. Experiments using the INEX 2009 data collection with a simulated recommender training set show that our approach can improve on a baseline IR system

CiteSeerX

Irish Universities

DCU Online Research Access Service

A proposal for the evaluation of adaptive information retrieval systems using simulated interaction

Author: Jones Gareth J.F.
Lawless Séamus
Li Wei B.
Mulwa Catherine
Publication venue
Publication date: 23/07/2010
Field of study

The Centre for Next Generation Localisation (CNGL) is involved in building interactive adaptive systems which combine Information Retrieval (IR), Adaptive Hypermedia (AH) and adaptive web techniques and technologies. The complex functionality of these systems coupled with the variety of potential users means that the experiments necessary to evaluate such systems are difficult to plan, implement and execute. This evaluation requires both component-level scientific evaluation and user-based evaluation. Automated replication of experiments and simulation of user interaction would be hugely beneficial in the evaluation of adaptive information retrieval systems (AIRS). This paper proposes a methodology for the evaluation of AIRS which leverages simulated interaction. The hybrid approach detailed combines: (i) user-centred methods for simulating interaction and personalisation; (ii) evaluation metrics that combine Human Computer Interaction (HCI), AH and IR techniques; and (iii) the use of qualitative and quantitative evaluations. The benefits and limitations of evaluations based on user simulations are also discussed

CiteSeerX

Irish Universities

DCU Online Research Access Service

Towards evaluation of personalized and collaborative information retrieval

Author: Ganguly Debasis
Jones Gareth J.F.
Leveling Johannes
Li Wei B.
Publication venue
Publication date: 01/01/2011
Field of study

We propose to extend standard information retrieval (IR) ad-hoc test collection design to facilitate research on personalized and collaborative IR by gathering additional meta-information during the topic (query) development process. We propose a controlled query generation process with activity logging for each topic developer. The standard ad-hoc collection will thus be accompanied by a new set of thematically related topics and the associated log information, and has the potential to simulate a real-world search scenario to encourage retrieval systems to mine user information from the logs to improve IR effectiveness. The proposed methodology described in this paper will be applied in a pilot task which is scheduled to run in the FIRE 2011 evaluation campaign. The task aims at investigating the research question of whether personalized and collaborative IR retrieval experiments and evaluation can be pursued by enriching a standard ad-hoc collection with such meta-information

CiteSeerX

Crossref

DCU Online Research Access Service

A study on mutual information-based feature selection for text categorization

Author: Jones Gareth J.F.
Li Jintao
Sun ChunMing
Wang Bin
Xu Yang
Publication venue: 'Binary Information Press'
Publication date: 01/01/2007
Field of study

Feature selection plays an important role in text categorization. Automatic feature selection methods such as document frequency thresholding (DF), information gain (IG), mutual information (MI), and so on are commonly applied in text categorization. Many existing experiments show IG is one of the most effective methods, by contrast, MI has been demonstrated to have relatively poor performance. According to one existing MI method, the mutual information of a category c and a term t can be negative, which is in conflict with the definition of MI derived from information theory where it is always non-negative. We show that the form of MI used in TC is not derived correctly from information theory. There are two different MI based feature selection criteria which are referred to as MI in the TC literature. Actually, one of them should correctly be termed "pointwise mutual information" (PMI). In this paper, we clarify the terminological confusion surrounding the notion of "mutual information" in TC, and detail an MI method derived correctly from information theory. Experiments with the Reuters-21578 collection and OHSUMED collection show that the corrected MI method’s performance is similar to that of IG, and it is considerably better than PMI

Irish Universities

DCU Online Research Access Service

Overview of the personalized and collaborative information retrieval (PIR) track at FIRE-2011

Author: Curtis Keith
Ganguly Debasis
Jones Gareth J.F.
Leveling Johannes
Li Wei B.
Publication venue
Publication date: 02/12/2011
Field of study

The Personalized and collaborative Information Retrieval (PIR) track at FIRE 2011 was organized with an aim to extend standard information retrieval (IR) ad-hoc test collection design to facilitate research on personalized and collaborative IR by collecting additional meta-information during the topic (query) development process. A controlled query generation process through task-based activities with activity logging was used for each topic developer to construct the final list of topics. The standard ad-hoc collection is thus accompanied by a new set of thematically related topics and the associated log information. We believe this can better simulate a real-world search scenario and encourage mining user information from the logs to improve IR effectiveness. A set of 25 TREC formatted topics and the associated metadata of activity logs were released for the participants to use. In this paper we illustrate the data construction phase in detail and also outline two simple ways of using the additional information from the logs to improve retrieval effectiveness

Irish Universities

DCU Online Research Access Service

Effective Dynamic Range in Measurements with Flash Analog-to-Digital Convertor

Author: D.Z. Liu
H.B. Li
H.T. Wong
J. Li
J.F. Qiu
Lai
Li
Li
Liu
Morris
Q. Yue
S.T. Lin
V. Singh
W.C. Chang
W.P. Lai
Wang
Wong
Zucker
Publication venue: 'Elsevier BV'
Publication date: 14/04/2003
Field of study

Flash Analog to Digital Convertor (FADC) is frequently used in nuclear and particle physics experiments, often as the major component in big multi-channel systems. The large data volume makes the optimization of operating parameters necessary. This article reports a study of a method to extend the dynamic range of an 8-bit FADC from the nominal

\rm{2^8}

value. By comparing the integrated pulse area with that of a reference profile, good energy reconstruction and event identification can be achieved on saturated events from CsI(Tl) crystal scintillators. The effective dynamic range can be extended by at least 4 more bits. The algorithm is generic and is expected to be applicable to other detector systems with FADC readout.Comment: 19 pages, 1 table, 10 figure

arXiv.org e-Print Archive

Crossref

CERN Document Server

SUSY-QCD Effect on Top-Charm Associated Production at Linear Collider

Author: A. Denner
B.A. Campbell
C.S. Huang
C.S. Li
C.S. Li
Chong Sheng Li
G. Eilam
G. Eilam
G. Passarino
H.E. Haber
J. Ellis
J.-M. Yang
J.-M. Yang
J.F. Donoghue
J.F. Gunion
J.F. Gunion
J.M. Yang
M.J. Duncan
M.J. Duncan
R. Barbieri
R. Mertig
R.D. Peccei
Shou Hua Zhu
T. Han
T. Han
T. Inami
Xinmin Zhang
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1999
Field of study

We evaluate the contribution of SUSY-QCD to top-charm associated production at next generation linear colliders. Our results show that the production cross section of the process

e^+e^-\to t\bar c{or}\bar t c

could be as large as 0.1 fb, which is larger than the prediction of the SM by a factor of

10^8

.Comment: version to appear in PR

arXiv.org e-Print Archive

Crossref

CERN Document Server

Enhanced information retrieval by exploiting recommender techniques in cluster-based link analysis

Author: Jones Gareth J.F.
Li Wei B.
Publication venue
Publication date: 01/01/2013
Field of study

Inspired by the use of PageRank algorithms in document ranking, we develop and evaluate a cluster-based PageRank algorithm to re-rank information retrieval (IR) output with the objective of improving ad hoc search effectiveness. Unlike existing work, our methods exploit recommender techniques to extract the correlation between documents and apply detected correlations in a cluster-based PageRank algorithm to compute the importance of each document in a dataset. In this study two popular recommender techniques are examined in four proposed PageRank models to investigate the effectiveness of our approach. Comparison of our methods with strong baselines demonstrates the solid performance of our approach. Experimental results are reported on an extended version of the FIRE 2011 personal information retrieval (PIR) data collection which includes topically related queries with click-through data and relevance assessment data collected from the query creators. The search logs of the query creators are categorized based on their different topical interests. The experimental results show the significant improvement of our approach compared to results using standard IR and cluster-based PageRank methods

Crossref

Irish Universities

DCU Online Research Access Service

Comparative evaluation of query expansion methods for enhanced search on microblog data: DCU ADAPT @ SMERP 2017 workshop data challenge

Author: Jones Gareth J.F.
Li Wei B.
Publication venue: CEUR-WS
Publication date: 01/04/2017
Field of study

The rapid growth in the availability of social media content posted during emergency situations is creating significant interest in research into how this information can be exploited to assist emergency relief operations and to help with emergency preparedness and in early warning systems. We describe the DCU ADAPT Centre participation in the microblog search data challenge at the SMERP 2017 workshop. This task aimed to promote development of information retrieval (IR) methods for practical challenges that need to be addressed during an emergency event, along with comparative evaluation of the methodologies developed for this task. The task is based on a large dataset of microblogs posted during the earthquake in Italy in August 2016, together with a set of query topics provided by the task organisers. For our participation in this task we explored use of three different IR techniques: standard IR query expansion based on an external resource, query expansion based on WordNet and use of query expansio

Irish Universities

DCU Online Research Access Service