Search CORE

394,997 research outputs found

Probabilistic models of information retrieval based on measuring the divergence from randomness

Author: Allan J.
Amati G.
Bookstein A.
Carpineto C.
Cornelis Joost Van Rijsbergen
Croft W.
Damerau F.
Gianni Amati
Harman D.
Harter S. P.
Harter S. P.
Lafferty J.
Margulis E.
Ponte J.
Robertson S.
Robertson S.
Robertson S. E.
Robertson S. E.
Solomonoff R.
Solomonoff R.
van Rijsbergen C.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/10/2002
Field of study

We introduce and create a framework for deriving probabilistic models of Information Retrieval. The models are nonparametric models of IR obtained in the language model approach. We derive term-weighting models by measuring the divergence of the actual term distribution from that obtained under a random process. Among the random processes we study the binomial distribution and Bose--Einstein statistics. We define two types of term frequency normalization for tuning term weights in the document--query matching process. The first normalization assumes that documents have the same length and measures the information gain with the observed term once it has been accepted as a good descriptor of the observed document. The second normalization is related to the document length and to other statistics. These two normalization methods are applied to the basic models in succession to obtain weighting formulae. Results show that our framework produces different nonparametric models forming baseline alternatives to the standard tf-idf model

Crossref

Enlighten

Combining vocal tract length normalization with hierarchial linear transformations

Author: Dines J.
Garner P.N.
Saheer L.
Yamagishi J.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a rapid adaptation technique for statistical parametric speech synthesis. VTLN produces speech with naturalness preferable to that of MLLR-based adaptation techniques, being much closer in quality to that generated by the original av-erage voice model. However with only a single parameter, VTLN captures very few speaker specific characteristics when compared to linear transform based adaptation techniques. This paper pro-poses that the merits of VTLN can be combined with those of linear transform based adaptation in a hierarchial Bayesian frame-work, where VTLN is used as the prior information. A novel tech-nique for propagating the gender information from the VTLN prior through constrained structural maximum a posteriori linear regres-sion (CSMAPLR) adaptation is presented. Experiments show that the resulting transformation has improved speech quality with better naturalness, intelligibility and improved speaker similarity. Index Terms — Statistical parametric speech synthesis, hidden Markov models, speaker adaptation, vocal tract length normaliza-tion, constrained structural maximum a posteriori linear regression 1

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX

Crossref

Edinburgh Research Explorer

Constraining Implicit Space with Minimum Description Length: An Unsupervised Attention Mechanism across Neural Network Layers

Author: Lin Baihan
Publication venue
Publication date: 10/09/2020
Field of study

Inspired by the adaptation phenomenon of neuronal firing, we propose the regularity normalization (RN) as an unsupervised attention mechanism (UAM) which computes the statistical regularity in the implicit space of neural networks under the Minimum Description Length (MDL) principle. Treating the neural network optimization process as a partially observable model selection problem, UAM constrains the implicit space by a normalization factor, the universal code length. We compute this universal code incrementally across neural network layers and demonstrated the flexibility to include data priors such as top-down attention and other oracle information. Empirically, our approach outperforms existing normalization methods in tackling limited, imbalanced and non-stationary input distribution in image classification, classic control, procedurally-generated reinforcement learning, generative modeling, handwriting generation and question answering tasks with various neural network architectures. Lastly, UAM tracks dependency and critical learning stages across layers and recurrent time steps of deep networks

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

PubMed Central

Correlation Between the Deuteron Characteristics and the Low-energy Triplet np Scattering Parameters

Author: A. M. Mukhamedzhanov
D. W. L. Sprung
G. G. Simon
G. L. Greene
H. P. Noyes
I. Borbély
I. Borbély
J. Horáček
J. W. Humberstone
K. Holinde
L. D. Blokhintsev
L. Hulthén
M. Lacombe
M. P. Locher
M. W. Kermode
N. J. McGurk
N. K. Glendenning
O. Dumbrajs
R. B. Viringa
R. Machleidt
R. Machleidt
R. V. Reid Jr.
R. W. Bérard
R. Wilson
S. Klarsfeld
S. Klarsfeld
S. Klarsfeld
T. E. O. Ericson
T. E. O. Ericson
T. L. Houk
T. L. Houk
V. G. J. Stoks
V. G. J. Stoks
V. I. Kukulin
W. Dilg
Publication venue: 'Pleiades Publishing Ltd'
Publication date: 30/06/2003
Field of study

The correlation relationship between the deuteron asymptotic normalization constant,

A_{S}

, and the triplet np scattering length,

a_{t}

, is investigated. It is found that 99.7% of the asymptotic constant

A_{S}

is determined by the scattering length

a_{t}

. It is shown that the linear correlation relationship between the quantities

A_{S}^{-2}

and

1/a_{t}

provides a good test of correctness of various models of nucleon-nucleon interaction. It is revealed that, for the normalization constant

A_{S}

and for the root-mean-square deuteron radius

r_{d}

, the results obtained with the experimental value recommended at present for the triplet scattering length

a_{t}

are exaggerated with respect to their experimental counterparts. By using the latest experimental phase shifts of Arndt et al., we obtain, for the low-energy scattering parameters (

a_{t}

r_{t}

P_{t}

) and for the deuteron characteristics (

A_{S}

r_{d}

), results that comply well with experimental data.Comment: 19 pages, 1 figure, To be published in Physics of Atomic Nucle

arXiv.org e-Print Archive

Crossref

CERN Document Server

Bounding normalization time through intersection types

Author: De Benedetti Erika
Della Rocca Simona Ronchi
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2013
Field of study

Non-idempotent intersection types are used in order to give a bound of the length of the normalization beta-reduction sequence of a lambda term: namely, the bound is expressed as a function of the size of the term.Comment: In Proceedings ITRS 2012, arXiv:1307.784

arXiv.org e-Print Archive

Directory of Open Access Journals

Institutional Research Information System University of Turin