193 research outputs found
Using the quantum probability ranking principle to rank interdependent documents
A known limitation of the Probability Ranking Principle (PRP) is that it does not cater for dependence between documents. Recently, the Quantum Probability Ranking Principle (QPRP) has been proposed, which implicitly captures dependencies between documents through “quantum interference”. This paper explores whether this new ranking principle leads to improved performance for subtopic retrieval, where novelty and diversity is required. In a thorough empirical investigation, models based on the PRP, as well as other recently proposed ranking strategies for subtopic retrieval (i.e. Maximal Marginal Relevance (MMR) and Portfolio Theory(PT)), are compared against the QPRP. On the given task, it is shown that the QPRP outperforms these other ranking strategies. And unlike MMR and PT, one of the main advantages of the QPRP is that no parameter estimation/tuning is required; making the QPRP both simple and effective. This research demonstrates that the application of quantum theory to problems within information retrieval can lead to significant improvements
Evolving text classification rules with genetic programming
We describe a novel method for using genetic programming to create compact classification rules using combinations of N-grams (character strings). Genetic programs acquire fitness by producing rules that are effective classifiers in terms of precision and recall when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from a classification task using the Reuters 21578 dataset. We also suggest that the rules may have a number of other uses beyond classification and provide a basis for text mining applications
A Compromise between Neutrino Masses and Collider Signatures in the Type-II Seesaw Model
A natural extension of the standard gauge
model to accommodate massive neutrinos is to introduce one Higgs triplet and
three right-handed Majorana neutrinos, leading to a neutrino mass
matrix which contains three sub-matrices ,
and . We show that three light Majorana neutrinos (i.e., the mass
eigenstates of , and ) are exactly massless in this
model, if and only if
exactly holds. This no-go theorem implies that small but non-vanishing neutrino
masses may result from a significant but incomplete cancellation between
and terms in the Type-II
seesaw formula, provided three right-handed Majorana neutrinos are of TeV and experimentally detectable at the LHC. We propose three simple
Type-II seesaw scenarios with the flavor symmetry to
interpret the observed neutrino mass spectrum and neutrino mixing pattern. Such
a TeV-scale neutrino model can be tested in two complementary ways: (1)
searching for possible collider signatures of lepton number violation induced
by the right-handed Majorana neutrinos and doubly-charged Higgs particles; and
(2) searching for possible consequences of unitarity violation of the neutrino mixing matrix in the future long-baseline neutrino oscillation
experiments.Comment: RevTeX 19 pages, no figure
Semantic distillation: a method for clustering objects by their contextual specificity
Techniques for data-mining, latent semantic analysis, contextual search of
databases, etc. have long ago been developed by computer scientists working on
information retrieval (IR). Experimental scientists, from all disciplines,
having to analyse large collections of raw experimental data (astronomical,
physical, biological, etc.) have developed powerful methods for their
statistical analysis and for clustering, categorising, and classifying objects.
Finally, physicists have developed a theory of quantum measurement, unifying
the logical, algebraic, and probabilistic aspects of queries into a single
formalism. The purpose of this paper is twofold: first to show that when
formulated at an abstract level, problems from IR, from statistical data
analysis, and from physical measurement theories are very similar and hence can
profitably be cross-fertilised, and, secondly, to propose a novel method of
fuzzy hierarchical clustering, termed \textit{semantic distillation} --
strongly inspired from the theory of quantum measurement --, we developed to
analyse raw data coming from various types of experiments on DNA arrays. We
illustrate the method by analysing DNA arrays experiments and clustering the
genes of the array according to their specificity.Comment: Accepted for publication in Studies in Computational Intelligence,
Springer-Verla
Cracking the code of oscillatory activity
Neural oscillations are ubiquitous measurements of cognitive processes and dynamic routing and gating of information. The fundamental and so far unresolved problem for neuroscience remains to understand how oscillatory activity in the brain codes information for human cognition. In a biologically relevant cognitive task, we instructed six human observers to categorize facial expressions of emotion while we measured the observers' EEG. We combined state-of-the-art stimulus control with statistical information theory analysis to quantify how the three parameters of oscillations (i.e., power, phase, and frequency) code the visual information relevant for behavior in a cognitive task. We make three points: First, we demonstrate that phase codes considerably more information (2.4 times) relating to the cognitive task than power. Second, we show that the conjunction of power and phase coding reflects detailed visual features relevant for behavioral response-that is, features of facial expressions predicted by behavior. Third, we demonstrate, in analogy to communication technology, that oscillatory frequencies in the brain multiplex the coding of visual features, increasing coding capacity. Together, our findings about the fundamental coding properties of neural oscillations will redirect the research agenda in neuroscience by establishing the differential role of frequency, phase, and amplitude in coding behaviorally relevant information in the brai
An optimized TOPS+ comparison method for enhanced TOPS models
This article has been made available through the Brunel Open Access Publishing Fund.Background
Although methods based on highly abstract descriptions of protein structures, such as VAST and TOPS, can perform very fast protein structure comparison, the results can lack a high degree of biological significance. Previously we have discussed the basic mechanisms of our novel method for structure comparison based on our TOPS+ model (Topological descriptions of Protein Structures Enhanced with Ligand Information). In this paper we show how these results can be significantly improved using parameter optimization, and we call the resulting optimised TOPS+ method as advanced TOPS+ comparison method i.e. advTOPS+.
Results
We have developed a TOPS+ string model as an improvement to the TOPS [1-3] graph model by considering loops as secondary structure elements (SSEs) in addition to helices and strands, representing ligands as first class objects, and describing interactions between SSEs, and SSEs and ligands, by incoming and outgoing arcs, annotating SSEs with the interaction direction and type. Benchmarking results of an all-against-all pairwise comparison using a large dataset of 2,620 non-redundant structures from the PDB40 dataset [4] demonstrate the biological significance, in terms of SCOP classification at the superfamily level, of our TOPS+ comparison method.
Conclusions
Our advanced TOPS+ comparison shows better performance on the PDB40 dataset [4] compared to our basic TOPS+ method, giving 90 percent accuracy for SCOP alpha+beta; a 6 percent increase in accuracy compared to the TOPS and basic TOPS+ methods. It also outperforms the TOPS, basic TOPS+ and SSAP comparison methods on the Chew-Kedem dataset [5], achieving 98 percent accuracy. Software Availability: The TOPS+ comparison server is available at http://balabio.dcs.gla.ac.uk/mallika/WebTOPS/.This article is available through the Brunel Open Access Publishing Fun
Investigating non-classical correlations between decision fused multi-modal documents
Correlation has been widely used to facilitate various information retrieval methods such as query expansion, relevance feedback, document clustering, and multi-modal fusion. Especially, correlation and independence are important issues when fusing different modalities that influence a multi-modal information retrieval process. The basic idea of correlation is that an observable can help predict or enhance another observable. In quantum mechanics, quantum correlation, called entanglement, is a sort of correlation between the observables measured in atomic-size particles when these particles are not necessarily collected in ensembles. In this paper, we examine a multimodal fusion scenario that might be similar to that encountered in physics by firstly measuring two observables (i.e., text-based relevance and image-based relevance) of a multi-modal document without counting on an ensemble of multi-modal documents already labeled in terms of these two variables. Then, we investigate the existence of non-classical correlations between pairs of multi-modal documents. Despite there are some basic differences between entanglement and classical correlation encountered in the macroscopic world, we investigate the existence of this kind of non-classical correlation through the Bell inequality violation. Here, we experimentally test several novel association methods in a small-scale experiment. However, in the current experiment we did not find any violation of the Bell inequality. Finally, we present a series of interesting discussions, which may provide theoretical and empirical insights and inspirations for future development of this direction
Perspectives of pregnant women on broadening the scope of noninvasive prenatal testing from screening for foetal aneuploidies to prediction of adverse pregnancy outcomes: A qualitative study
Objective: To explore the perspectives of pregnant women on broadening the scope of noninvasive prenatal testing (NIPT) from screening for foetal aneuploidies to prediction of adverse pregnancy outcomes. Methods: Four online focus groups (n = 23 participants) and 14 individual semi-structured interviews were conducted. Participants included pregnant women with and without a history of adverse pregnancy outcomes. Results: Both women at low and high risk of adverse pregnancy outcomes had a positive attitude towards using NIPT to predict adverse pregnancy outcomes. Perceived benefits included the possibility to potentially improve maternal and foetal outcomes by taking risk-reducing measures and/or intensified monitoring during pregnancy and the ability to mentally prepare for the potential adverse outcome. Perceived concerns included anxiety and stress caused by a high-risk test result, a false sense of control over pregnancy, and potential false reassurance. Additionally, women reasoned that broadening the scope of NIPT could increase the complexity of prenatal screening and raised concerns on the combined screening aims in one test (prediction of adverse pregnancy outcomes to improve foetal and maternal health vs. screening for foetal aneuploidies to increase reproductive autonomy). On a societal level, considerations on the risk of medicalising pregnancy and overall pressure to opt for NIPT were mentioned. Conclusion: In general, pregnant women have a positive attitude towards broadening the scope of NIPT to the prediction of pregnancy outcomes, although some concerns are acknowledged
- …