6,748 research outputs found

    Comparing large covariance matrices under weak conditions on the dependence structure and its application to gene clustering

    Get PDF
    Comparing large covariance matrices has important applications in modern genomics, where scientists are often interested in understanding whether relationships (e.g., dependencies or co-regulations) among a large number of genes vary between different biological states. We propose a computationally fast procedure for testing the equality of two large covariance matrices when the dimensions of the covariance matrices are much larger than the sample sizes. A distinguishing feature of the new procedure is that it imposes no structural assumptions on the unknown covariance matrices. Hence the test is robust with respect to various complex dependence structures that frequently arise in genomics. We prove that the proposed procedure is asymptotically valid under weak moment conditions. As an interesting application, we derive a new gene clustering algorithm which shares the same nice property of avoiding restrictive structural assumptions for high-dimensional genomics data. Using an asthma gene expression dataset, we illustrate how the new test helps compare the covariance matrices of the genes across different gene sets/pathways between the disease group and the control group, and how the gene clustering algorithm provides new insights on the way gene clustering patterns differ between the two groups. The proposed methods have been implemented in an R-package HDtest and is available on CRAN.Comment: The original title dated back to May 2015 is "Bootstrap Tests on High Dimensional Covariance Matrices with Applications to Understanding Gene Clustering

    Simulation-Based Hypothesis Testing of High Dimensional Means Under Covariance Heterogeneity

    Get PDF
    In this paper, we study the problem of testing the mean vectors of high dimensional data in both one-sample and two-sample cases. The proposed testing procedures employ maximum-type statistics and the parametric bootstrap techniques to compute the critical values. Different from the existing tests that heavily rely on the structural conditions on the unknown covariance matrices, the proposed tests allow general covariance structures of the data and therefore enjoy wide scope of applicability in practice. To enhance powers of the tests against sparse alternatives, we further propose two-step procedures with a preliminary feature screening step. Theoretical properties of the proposed tests are investigated. Through extensive numerical experiments on synthetic datasets and an human acute lymphoblastic leukemia gene expression dataset, we illustrate the performance of the new tests and how they may provide assistance on detecting disease-associated gene-sets. The proposed methods have been implemented in an R-package HDtest and are available on CRAN.Comment: 34 pages, 10 figures; Accepted for biometric

    Cram\'{e}r-type moderate deviations for Studentized two-sample UU-statistics with applications

    Full text link
    Two-sample UU-statistics are widely used in a broad range of applications, including those in the fields of biostatistics and econometrics. In this paper, we establish sharp Cram\'{e}r-type moderate deviation theorems for Studentized two-sample UU-statistics in a general framework, including the two-sample tt-statistic and Studentized Mann-Whitney test statistic as prototypical examples. In particular, a refined moderate deviation theorem with second-order accuracy is established for the two-sample tt-statistic. These results extend the applicability of the existing statistical methodologies from the one-sample tt-statistic to more general nonlinear statistics. Applications to two-sample large-scale multiple testing problems with false discovery rate control and the regularized bootstrap method are also discussed.Comment: Published at http://dx.doi.org/10.1214/15-AOS1375 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Positive solutions to a system of adjointable operator equations over Hilbert C∗-modules

    Get PDF
    AbstractWe present necessary and sufficient conditions for the existence of a positive solution to the system of adjointable operator equations A1X=C1,XB2=C2,A3XB3=C3 over Hilbert C∗-modules. We also derive a representation for a general positive solution to this system when the solvability conditions are satisfied. The results of this paper extend some known results in the literature

    Width-tuned magnetic order oscillation on zigzag edges of honeycomb nanoribbons

    Full text link
    Quantum confinement and interference often generate exotic properties in nanostructures. One recent highlight is the experimental indication of a magnetic phase transition in zigzag-edged graphene nanoribbons at the critical ribbon width of about 7 nm [G. Z. Magda et al., Nature \textbf{514}, 608 (2014)]. Here we show theoretically that with further increase in the ribbon width, the magnetic correlation of the two edges can exhibit an intriguing oscillatory behavior between antiferromagnetic and ferromagnetic, driven by acquiring the positive coherence between the two edges to lower the free energy. The oscillation effect is readily tunable in applied magnetic fields. These novel properties suggest new experimental manifestation of the edge magnetic orders in graphene nanoribbons, and enhance the hopes of graphene-like spintronic nanodevices functioning at room temperature.Comment: 22 pages, 9 figure
    corecore