Search CORE

349 research outputs found

The fused Kolmogorov filter: A nonparametric model-free screening method

Author: Mai Qing
Zou Hui
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 30/07/2015
Field of study

A new model-free screening method called the fused Kolmogorov filter is proposed for high-dimensional data analysis. This new method is fully nonparametric and can work with many types of covariates and response variables, including continuous, discrete and categorical variables. We apply the fused Kolmogorov filter to deal with variable screening problems emerging from a wide range of applications, such as multiclass classification, nonparametric regression and Poisson regression, among others. It is shown that the fused Kolmogorov filter enjoys the sure screening property under weak regularity conditions that are much milder than those required for many existing nonparametric screening methods. In particular, the fused Kolmogorov filter can still be powerful when covariates are strongly dependent on each other. We further demonstrate the superior performance of the fused Kolmogorov filter over existing screening methods by simulations and real data examples.Comment: Published at http://dx.doi.org/10.1214/14-AOS1303 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Statistical analysis for a penalized EM algorithm in high-dimensional mixture linear regression model

Author: Mai Qing
Wang Ning
Zhang Xin
Publication venue
Publication date: 21/07/2023
Field of study

The expectation-maximization (EM) algorithm and its variants are widely used in statistics. In high-dimensional mixture linear regression, the model is assumed to be a finite mixture of linear regression and the number of predictors is much larger than the sample size. The standard EM algorithm, which attempts to find the maximum likelihood estimator, becomes infeasible for such model. We devise a group lasso penalized EM algorithm and study its statistical properties. Existing theoretical results of regularized EM algorithms often rely on dividing the sample into many independent batches and employing a fresh batch of sample in each iteration of the algorithm. Our algorithm and theoretical analysis do not require sample-splitting, and can be extended to multivariate response cases. The proposed methods also have encouraging performances in numerical studies

arXiv.org e-Print Archive

Envelopes and principal component regression

Author: Deng Kai
Mai Qing
Zhang Xin
Publication venue
Publication date: 12/07/2022
Field of study

Envelope methods offer targeted dimension reduction for various models. The overarching goal is to improve efficiency in multivariate parameter estimation by projecting the data onto a lower-dimensional subspace known as the envelope. Envelope approaches have advantages in analyzing data with highly correlated variables, but their iterative Grassmannian optimization algorithms do not scale very well with ultra high-dimensional data. While the connections between envelopes and partial least squares in multivariate linear regression have promoted recent progress in high-dimensional studies of envelopes, we propose a more straightforward way of envelope modeling from a novel principal components regression perspective. The proposed procedure, Non-Iterative Envelope Component Estimation (NIECE), has excellent computational advantages over the iterative Grassmannian optimization alternatives in high dimensions. We develop a unified NIECE theory that bridges the gap between envelope methods and principal components in regression. The new theoretical insights also shed light on the envelope subspace estimation error as a function of eigenvalue gaps of two symmetric positive definite matrices used in envelope modeling. We apply the new theory and algorithm to several envelope models, including response and predictor reduction in multivariate linear models, logistic regression, and Cox proportional hazard model. Simulations and illustrative data analysis show the potential for NIECE to improve standard methods in linear and generalized linear models significantly

arXiv.org e-Print Archive

Slicing-free Inverse Regression in High-dimensional Sufficient Dimension Reduction

Author: Mai Qing
Shao Xiaofeng
Wang Runmin
Zhang Xin
Publication venue: 'Institute of Statistical Science'
Publication date: 12/04/2023
Field of study

Sliced inverse regression (SIR, Li 1991) is a pioneering work and the most recognized method in sufficient dimension reduction. While promising progress has been made in theory and methods of high-dimensional SIR, two remaining challenges are still nagging high-dimensional multivariate applications. First, choosing the number of slices in SIR is a difficult problem, and it depends on the sample size, the distribution of variables, and other practical considerations. Second, the extension of SIR from univariate response to multivariate is not trivial. Targeting at the same dimension reduction subspace as SIR, we propose a new slicing-free method that provides a unified solution to sufficient dimension reduction with high-dimensional covariates and univariate or multivariate response. We achieve this by adopting the recently developed martingale difference divergence matrix (MDDM, Lee & Shao 2018) and penalized eigen-decomposition algorithms. To establish the consistency of our method with a high-dimensional predictor and a multivariate response, we develop a new concentration inequality for sample MDDM around its population counterpart using theories for U-statistics, which may be of independent interest. Simulations and real data analysis demonstrate the favorable finite sample performance of the proposed method

arXiv.org e-Print Archive

Microenvironment Signals and Mechanisms in the Regulation of Osteosarcoma

Author: Mai Qing
Xie Chunyuan
Zhang Xiaowen
Zhang Yan
Zhang Yu
Publication venue: 'IntechOpen'
Publication date: 26/04/2017
Field of study

Osteosarcoma (OS) is the most common malignant primary bone tumor in children and adolescents and features rapid development, strong metastatic ability, and poor prognosis. It has been well established that diverse genetic aberrations and metabolic alterations confer the tumorigenesis and development of OS. The intricate metabolism and vascularization that contributes to the nutrient and structural support for tumor progression should be thoroughly clarified to help us gain novel insights into OS and its clinical diagnoses and treatments. With regard to the complex bone extracellular matrix (ECM) and local cell populations, we intend to illustrate the interrelationship between various microenvironmental signals and the different stages of OS evolution. Solid evidence has noted two crucial factors of the OS microenvironment in the acquisition of stem cell phenotypes - transforming growth factor-β1 (TGF-β1) signaling and hypoxia. Different cell subtypes in the local environment might also serve as unique contributors that interact with each other and communicate with distant cells, thus participating in local invasion and metastasis. Proper models have been established and improved to reveal the evolutionary footsteps of how normal cells transform into a neoplastic state and progress toward malignancy

IntechOpen

Recommended from our members

Free-standing kinked nanowire transistor probes for targeted intracellular recording in three dimensions

Author: Gao Ruixuan
Jiang Zhe
Lieber Charles M.
Mai Liqiang
Qing Quan
Xu Lin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/09/2014
Field of study

Recording intracellular bioelectrical signals is central to understanding the fundamental behaviour of cells and cell-networks in, for example, neural and cardiac systems1–4. The standard tool for intracellular recording, the patch-clamp micropipette5 is widely applied, yet remains limited in terms of reducing the tip size, the ability to reuse the pipette5, and ion exchange with the cytoplasm6. Recent efforts have been directed towards developing new chip-based tools1–4,7–13, including micro-to-nanoscale metal pillars7–9, transistor-based kinked nanowire10,11 and nanotube devices12,13. These nanoscale tools are interesting with respect to chip-based multiplexing, but, to date, preclude targeted recording from specific cell regions and/or subcellular structures. Here we overcome this limitation in a general manner by fabricating free-standing probes where a kinked silicon nanowire with encoded field-effect transistor detector serves as the tip end. These probes can be manipulated in three dimensions (3D) within a standard microscope to target specific cells/cell regions, and record stable full-amplitude intracellular action potentials from different targeted cells without the need to clean or change the tip. Simultaneous measurements from the same cell made with free-standing nanowire and patch-clamp probes show that the same action potential amplitude and temporal properties are recorded without corrections to the raw nanowire signal. In addition, we demonstrate real-time monitoring of changes in the action potential as different ion-channel blockers are applied to cells, and multiplexed recording from cells by independent manipulation of two free-standing nanowire probes

Harvard University - DASH

Recommended from our members

Design and Synthesis of Diverse Functional Kinked Nanowire Structures for Nanoelectronic Bioprobes

Author: Jiang Zhe
Lieber Charles M.
Mai Liqiang
Qing Quan
Xu Lin
Zhang Qingjie
Publication venue: 'American Chemical Society (ACS)'
Publication date: 13/03/2013
Field of study

Functional kinked nanowires (KNWs) represent a new class of nanowire building blocks, in which functional devices, for example, nanoscale field-effect transistors (nanoFETs), are encoded in geometrically controlled nanowire superstructures during synthesis. The bottom-up control of both structure and function of KNWs enables construction of spatially isolated point-like nanoelectronic probes that are especially useful for monitoring biological systems where finely tuned feature size and structure are highly desired. Here we present three new types of functional KNWs including (1) the zero-degree KNW structures with two parallel heavily doped arms of U-shaped structures with a nanoFET at the tip of the “U”, (2) series multiplexed functional KNW integrating multi-nanoFETs along the arm and at the tips of V-shaped structures, and (3) parallel multiplexed KNWs integrating nanoFETs at the two tips of W-shaped structures. First, U-shaped KNWs were synthesized with separations as small as 650 nm between the parallel arms and used to fabricate three-dimensional nanoFET probes at least 3 times smaller than previous V-shaped designs. In addition, multiple nanoFETs were encoded during synthesis in one of the arms/tip of V-shaped and distinct arms/tips of W-shaped KNWs. These new multiplexed KNW structures were structurally verified by optical and electron microscopy of dopant-selective etched samples and electrically characterized using scanning gate microscopy and transport measurements. The facile design and bottom-up synthesis of these diverse functional KNWs provides a growing toolbox of building blocks for fabricating highly compact and multiplexed three-dimensional nanoprobes for applications in life sciences, including intracellular and deep tissue/cell recordings.Chemistry and Chemical Biolog

Harvard University - DASH