123 research outputs found

Shifting Goals And Mounting Challenges For Statistical Methodology

Author: Sen Pranab K.
Publication venue: DigitalCommons@WayneState
Publication date: 01/05/2002

Modern interdisciplinary research in statistical science encompasses a wide field: agriculture, biology, biomedical sciences along with bioinformatics, clinical sciences, education, environmental and public health disciplines, genomic science, industry, molecular genetics, socio-behavior, socio-economics, toxicology, and a variety of other disciplines. Statistical science has historically had mathematical perspectives dominating theoretical and methodological developments. Yet, the advent of modern information technology has opened the doors for highly computation-intensive statistical tools (i.e., software), wherein mathematical aspects are often de-emphasized. Knowledge discovery and data mining (KDDM) is now becoming a dominating force, with bioinformatics as a notable example. In view of this apparent discordance between mathematical (frequentist as well as Bayesian) and computational approaches to statistical resolutions, and a genuine need to formulate training as well as research curricula to meet growing demands, a critical appraisal of statistical innovations is made with due respect to their mathematical heritage, as well as scope of application. Some of the challenging statistical tasks are illustrated.

Crossref

Digital Commons@Wayne State University

Kendall's tau in high-dimensional genomic parsimony

Author: Jayanta K. Ghosh
Pranab K. Sen
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2008

High-dimensional data models, often with low sample size, abound in many interdisciplinary studies, genomics and large biological systems being most noteworthy. The conventional assumption of multinormality or linearity of regression may not be plausible for such models, which are likely to be statistically complex due to a large number of parameters as well as various underlying restraints. As such, parametric approaches may not be very effective. Approaches beyond parametrics, although broader in scope and more robust, may generally be hampered by the low sample size and hence unable to give reasonable margins of error. Kendall's tau statistic is exploited in this context with emphasis on dimensional rather than sample size asymptotics. The Chen-Stein theorem has been thoroughly appraised in this study. Applications of these findings in some microarray data models are illustrated. Comment: Published at http://dx.doi.org/10.1214/074921708000000183 in the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org).
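For readers unfamiliar with the statistic named in this abstract, the following is a minimal sketch of the ordinary sample Kendall's tau (the tau-a form, with no tie correction) for two equal-length vectors. It is not the paper's high-dimensional procedure; the function name `kendall_tau` and the toy inputs are illustrative assumptions.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Sample Kendall's tau-a: (concordant - discordant) / number of pairs.

    Hypothetical helper for illustration; tied pairs contribute zero.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    score = 0
    for i, j in combinations(range(len(x)), 2):
        prod = (x[i] - x[j]) * (y[i] - y[j])
        # +1 for a concordant pair, -1 for a discordant pair, 0 for a tie
        score += (prod > 0) - (prod < 0)
    n_pairs = len(x) * (len(x) - 1) / 2
    return score / n_pairs


if __name__ == "__main__":
    # Toy expression-like values for two genes across four samples
    print(kendall_tau([1.0, 2.0, 3.0, 4.0], [0.5, 0.7, 0.6, 0.9]))
```

Because the statistic depends only on the ordering of the observations, it makes no use of multinormality or linearity, which is the property the abstract relies on when the sample size is small.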

arXiv.org e-Print Archive

CiteSeerX

Crossref

An asymptotically normal test for the selective neutrality hypothesis

Author: Aluísio Pinheiro
Hildete P. Pinheiro
Pranab K. Sen
Samara Kiihl
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2008

An important parameter in the study of population evolution is $\theta=4N\nu$, where $N$ is the effective population size and $\nu$ is the rate of mutation per locus per generation. Therefore, $\theta$ represents the mean number of mutations per site per generation. There are many estimators of $\theta$, one of them being the mean number of pairwise nucleotide differences, which we call $\mathcal{T}_2$. Other estimators are $\mathcal{T}_1$, based on the number of segregating sites, and $\mathcal{T}_3$, based on the number of singletons. The concept of selective neutrality can be interpreted as a differentiated nucleotide distribution for mutant sites when compared to the overall nucleotide distribution. Tajima (1989) has proposed the so-called Tajima's test of selective neutrality based on $\mathcal{T}_2-\mathcal{T}_1$. Its complex empirical behavior (Kiihl, 2005) motivates us to propose a test statistic solely based on $\mathcal{T}_2$. We are thus able to prove asymptotic normality under different assumptions on the number of sequences and number of sites via $U$-statistics theory. Comment: Published at http://dx.doi.org/10.1214/193940307000000293 in the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org).
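As a rough illustration of two of the estimators described in this abstract, the sketch below computes the mean number of pairwise nucleotide differences and a segregating-sites estimator from a small set of aligned sequences. The abstract does not give the normalization for the segregating-sites estimator; the code assumes the standard Watterson form, the number of segregating sites divided by the harmonic sum $\sum_{i=1}^{n-1} 1/i$. The function name `theta_estimates` and the toy alignment are hypothetical, and values are per aligned region rather than per site.

```python
from itertools import combinations


def theta_estimates(seqs):
    """Return (t1, t2) for a list of equal-length aligned sequences.

    t2: mean number of pairwise nucleotide differences.
    t1: segregating sites divided by the harmonic sum a_n
        (Watterson normalization, assumed here, not stated in the abstract).
    """
    n = len(seqs)
    if n < 2 or len({len(s) for s in seqs}) != 1:
        raise ValueError("need at least two sequences of equal length")

    # t2: average Hamming distance over all unordered pairs of sequences
    pair_diffs = [
        sum(a != b for a, b in zip(s, t)) for s, t in combinations(seqs, 2)
    ]
    t2 = sum(pair_diffs) / len(pair_diffs)

    # t1: count alignment columns showing more than one nucleotide
    segregating = sum(len(set(col)) > 1 for col in zip(*seqs))
    a_n = sum(1.0 / i for i in range(1, n))
    t1 = segregating / a_n

    return t1, t2


if __name__ == "__main__":
    toy_alignment = ["AAAA", "AAAT", "AATT", "ATTT"]
    t1, t2 = theta_estimates(toy_alignment)
    # The difference t2 - t1 is the quantity on which Tajima's test is based
    print(t1, t2, t2 - t1)
```

The pairwise-difference estimator is an average of a symmetric kernel over all pairs of sequences, which is why $U$-statistics theory is a natural tool for its asymptotics.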

arXiv.org e-Print Archive

CiteSeerX

Crossref

Two-Stage Likelihood Ratio and Union–Intersection Tests for One-Sided Alternatives Multivariate Mean with Nuisance Dispersion Matrix

Author: Sen Pranab K.
Tsai Ming-Tien
Publication date: 01/01/1999

For a multinormal distribution with an unknown dispersion matrix, union-intersection (UI) tests for the mean against one-sided alternatives are considered. The null distribution of the UI test statistic is derived and its power monotonicity properties are studied. A Stein-type two-stage procedure is proposed to eliminate some of the inherent drawbacks of such tests. Comparisons are also made with some recently proposed alternative conditional likelihood ratio tests.

Elsevier - Publisher Connector

Carolina Digital Repository

Second-Order Pitman Admissibility and Pitman Closeness: The Multiparameter Case and Stein-Rule Estimators

Author: Ghosh Jayanta K.
Mukerjee Rahul
Sen Pranab K.
Publication date: 01/01/1996

In a multiparameter estimation problem, second-order Pitman admissibility and Pitman closeness properties are studied for first-order efficient estimators. Bearing in mind the dominant role of Stein-rule estimators in multiparameter estimation theory, such second-order properties are also studied for shrinkage maximum likelihood estimators.

Elsevier - Publisher Connector

Carolina Digital Repository

A New Smooth Density Estimator for Non-Negative Random Variables

Author: Chaubey Yogendra P.
Sen Arusharka
Sen Pranab K.
Publication venue: Concordia University. Department of Mathematics & Statistics
Publication date: 01/01/2007

Commonly used kernel density estimators may not provide admissible values of the density or its functionals at the boundaries for densities with restricted support. For smoothing the empirical distribution, a generalization of Hille's lemma, considered here, alleviates some of the problems of kernel density estimators near the boundaries. For nonnegative random variables, which crop up in reliability and survival analysis, the proposed procedure is thoroughly explored; its consistency and asymptotic distributional results are established under appropriate regularity assumptions. Methods of obtaining smoothing parameters through cross-validation are given, and graphical illustrations of the estimator for continuous (at zero) as well as discontinuous densities are provided.

Concordia University Research Repository

On Nonparametric Estimators of the Density of a Non-negative Function of Observations

Author: Chaubey Yogendra P.
Sen Pranab K.
Publication date: 15/05/2013

Concordia University Research Repository

On the Selection of the Smoothing Parameter in Poisson Smoothing of Histogram Estimator: Computational Aspects

Author: Chaubey Yogendra P.
Sen Pranab K.
Publication venue: Concordia University. Department of Mathematics & Statistics
Publication date: 01/12/2008

Concordia University Research Repository