Search CORE

38,572 research outputs found

Some aspects of (r,k)-parking functions

Author: Stanley Richard
Wang Yinghui
Publication venue: 'Elsevier BV'
Publication date: 24/01/2018
Field of study

An \emph{

(r,k)

-parking function} of length

n

may be defined as a sequence

(a_1,\dots,a_n)

of positive integers whose increasing rearrangement

b_1\leq\cdots\leq b_n

satisfies

b_i\leq k+(i-1)r

. The case

r=k=1

corresponds to ordinary parking functions. We develop numerous properties of

(r,k)

-parking functions. In particular, if

F_n^{(r,k)}

denotes the Frobenius characteristic of the action of the symmetric group

\mathfrak{S}_n

on the set of all

(r,k)

-parking functions of length

n

, then we find a combinatorial interpretation of the coefficients of the power series

\left( \sum_{n\geq 0}F_n^{(r,1)}t^n\right)^k

for any

k\in \mathbb{Z}

. When

k>0

, this power series is just

\sum_{n\geq 0} F_n^{(r,k)} t^n

; when

k<0

, we obtain a dual to

(r,k)

-parking functions. We also give a

q

-analogue of this result. For fixed

r

, we can use the symmetric functions

F_n^{(r,1)}

to define a multiplicative basis for the ring

\Lambda

of symmetric functions. We investigate some of the properties of this basis

arXiv.org e-Print Archive

Crossref

University of Miami: Scholarship Miami

A useful variant of the Davis--Kahan theorem for statisticians

Author: Samworth Richard J.
Wang Tengyao
Yu Yi
Publication venue
Publication date: 04/05/2014
Field of study

The Davis--Kahan theorem is used in the analysis of many statistical procedures to bound the distance between subspaces spanned by population eigenvectors and their sample versions. It relies on an eigenvalue separation condition between certain relevant population and sample eigenvalues. We present a variant of this result that depends only on a population eigenvalue separation condition, making it more natural and convenient for direct application in statistical contexts, and improving the bounds in some cases. We also provide an extension to situations where the matrices under study may be asymmetric or even non-square, and where interest is in the distance between subspaces spanned by corresponding singular vectors.Comment: 12 page

arXiv.org e-Print Archive

Crossref

UCL Discovery

Warwick Research Archives Portal Repository

Explore Bristol Research

Mining Pure, Strict Epistatic Interactions from High-Dimensional Datasets: Ameliorating the Curse of Dimensionality

Author: Richard E. Neapolitan
Xia Jiang
Xiaofeng Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 12/10/2012
Field of study

Background: The interaction between loci to affect phenotype is called epistasis. It is strict epistasis if no proper subset of the interacting loci exhibits a marginal effect. For many diseases, it is likely that unknown epistatic interactions affect disease susceptibility. A difficulty when mining epistatic interactions from high-dimensional datasets concerns the curse of dimensionality. There are too many combinations of SNPs to perform an exhaustive search. A method that could locate strict epistasis without an exhaustive search can be considered the brass ring of methods for analyzing high-dimensional datasets. Methodology/Findings: A SNP pattern is a Bayesian network representing SNP-disease relationships. The Bayesian score for a SNP pattern is the probability of the data given the pattern, and has been used to learn SNP patterns. We identified a bound for the score of a SNP pattern. The bound provides an upper limit on the Bayesian score of any pattern that could be obtained by expanding a given pattern. We felt that the bound might enable the data to say something about the promise of expanding a 1-SNP pattern even when there are no marginal effects. We tested the bound using simulated datasets and semi-synthetic high-dimensional datasets obtained from GWAS datasets. We found that the bound was able to dramatically reduce the search time for strict epistasis. Using an Alzheimer's dataset, we showed that it is possible to discover an interaction involving the APOE gene based on its score because of its large marginal effect, but that the bound is most effective at discovering interactions without marginal effects. Conclusions/Significance: We conclude that the bound appears to ameliorate the curse of dimensionality in high-dimensional datasets. This is a very consequential result and could be pivotal in our efforts to reveal the dark matter of genetic disease risk from high-dimensional datasets. © 2012 Jiang, Neapolitan

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

D-Scholarship@Pitt