Search CORE

144,078 research outputs found

The Dependence of Routine Bayesian Model Selection Methods on Irrelevant Alternatives

Author: Smith Jim Q.
Zwiernik Piotr
Publication venue
Publication date: 01/01/2012
Field of study

Bayesian methods - either based on Bayes Factors or BIC - are now widely used for model selection. One property that might reasonably be demanded of any model selection method is that if a model

{M}_{1}

is preferred to a model

{M}_{0}

, when these two models are expressed as members of one model class

\mathbb{M}

, this preference is preserved when they are embedded in a different class

\mathbb{M}'

. However, we illustrate in this paper that with the usual implementation of these common Bayesian procedures this property does not hold true even approximately. We therefore contend that to use these methods it is first necessary for there to exist a "natural" embedding class. We argue that in any context like the one illustrated in our running example of Bayesian model selection of binary phylogenetic trees there is no such embedding

arXiv.org e-Print Archive

Repository TU/e

Skill-Based Few-Shot Selection for In-Context Learning

Author: An Shengnan
Chen Bei
Chen Weizhu
Fu Qiang
Lin Zeqi
Lou Jian-Guang
Zheng Nanning
Zhou Bo
Publication venue
Publication date: 10/10/2023
Field of study

In-context learning is the paradigm that adapts large language models to downstream tasks by providing a few examples. Few-shot selection -- selecting appropriate examples for each test instance separately -- is important for in-context learning. In this paper, we propose Skill-KNN, a skill-based few-shot selection method for in-context learning. The key advantages of Skill-KNN include: (1) it addresses the problem that existing methods based on pre-trained embeddings can be easily biased by surface natural language features that are not important for the target task; (2) it does not require training or fine-tuning of any models, making it suitable for frequently expanding or changing example banks. The key insight is to optimize the inputs fed into the embedding model, rather than tuning the model itself. Technically, Skill-KNN generates the skill-based descriptions for each test case and candidate example by utilizing a pre-processing few-shot prompting, thus eliminating unimportant surface features. Experimental results across five cross-domain semantic parsing datasets and six backbone models show that Skill-KNN significantly outperforms existing methods.Comment: Accepted by EMNLP 2023 main conferenc

arXiv.org e-Print Archive

Recommended from our members

Protection of medical images and patient related information in healthcare: Using an intelligent and reversible watermarking technique

Author: Alattar
An
Aqsa Saeed Qureshi
Arsalan
Asifullah Khan
Boland
Caldelli
Chen
Coltuc
Cox
Cox
Dragoi
Fridrich
Gao
Gonzalez
Huang
Koza
Lee
Lin
Memon
Muhammad Arsalan
Muttukrishnan Rajarajan
Parmar
Pursley
Rathi
Sachnev
Siddiqa
Tian
Ulutas
Usman
Usman
Wakatani
Wu
Xu
Xuan
Xuan
Xuan
Publication venue: 'Elsevier BV'
Publication date: 01/02/2017
Field of study

This work presents an intelligent technique based on reversible watermarking for protecting patient and medical related information. In the proposed technique ‘IRW-Med’, the concept of companding function is exploited for reducing embedding distortion, while Integer Wavelet Transform (IWT) is used as an embedding domain for achieving reversibility. Histogram processing is employed to avoid underflow/overflow. In addition, the learning capabilities of Genetic Programming (GP) are exploited for intelligent wavelet coefficient selection. In this context, GP is used to evolve models that not only make an optimal tradeoff between imperceptibility and capacity of the watermark, but also exploit the wavelet coefficient hidden dependencies and information related to the type of sub band. The novelty of the proposed IRW-Med technique lies in its ability to generate a model that can find optimal wavelet coefficients for embedding, and also acts as a companding factor for watermark embedding. The proposed IRW-Med is thus able to embed watermark with low distortion, take out the hidden information, and also recovers the original image. The proposed IRW-Med technique is effective with respect to capacity and imperceptibility and effectiveness is demonstrated through experimental comparisons with existing techniques using standard images as well as a publically available medical image dataset

City Research Online

Crossref

Implementation of Multiple-Instance Learning in Drug Activity Prediction

Author: Fu Gang
Publication venue: eGrove
Publication date: 01/01/2012
Field of study

In the context of drug discovery and development, much effort has been exerted to determine which conformers of a given molecule are responsible for the observed biological activity. In this work we aimed to predict bioactive conformers using a variant of supervised learning, named multiple-instance learning. A single molecule, treated as a bag of conformers, is biologically active if and only if at least one of its conformers, treated as an instance, is responsible for the observed bioactivity; and a molecule is inactive if none of its conformers is responsible for the observed bioactivity. The implementation requires instance-based embedding, and joint feature selection and classification. The goal of the present project is to implement multiple-instance learning in drug activity prediction, and subsequently to identify the bioactive conformers for each molecule. We encoded the 3-dimensional structures using pharmacophore fingerprints which are binary strings, and accomplished instance-based embedding using calculated dissimilarity distances. Four dissimilarity measures were employed and their performances were compared. 1-norm SVM was used for joint feature selection and classification. The approach was applied to four data sets, and the best proposed model for each data set was determined by using the dissimilarity measure yielding the smallest number of selected features. The predictive abilities of the proposed approach were compared with three classical predictive models without instance-based embedding. The proposed approach produced the best predictive models for one data set and second best predictive models for the rest of the data sets, based on the external validations. To validate the ability of the proposed approach to find bioactive conformers, 12 small molecules with co-crystallized structures were seeded in one data set. 10 out of 12 co-crystallized structures were indeed identified as significant conformers using the proposed approach. The proposed approach was demonstrated to be highly competitive with classical predictive models, hence it is very powerful for drug activity prediction. The approach was also validated as a useful method for pursuit of bioactive conformers

eGrove (Univ. of Mississippi)