Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete subpopulations

Abstract

We present a mathematical model, and the corresponding mathematical analysis, that justifies and quantifies the use of principal component analysis of biallelic genetic marker data for a set of individuals to detect the number of subpopulations represented in the data. We indicate that the power of the technique relies more on the number of individuals genotyped than on the number of markers.

Comment: Corrected typos in Section 3.1 (M=120, N=2500) and proof of Lemma
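To make the idea concrete, the following is a minimal simulation sketch and not the authors' procedure. It assumes a Balding-Nichols model for subpopulation allele frequencies, a hypothetical differentiation level `fst=0.1`, and a Marchenko-Pastur upper edge as the cutoff for "large" eigenvalues. The sizes M=120 and N=2500 are borrowed from the comment and read here as individuals and markers, which is also an assumption. The function name `count_separated_eigenvalues` is illustrative only.

```python
import numpy as np

def count_separated_eigenvalues(K=3, M=120, N=2500, fst=0.1, seed=0):
    """Simulate biallelic genotypes for K subpopulations and count the
    eigenvalues of the individual-by-individual covariance matrix that
    exceed a Marchenko-Pastur edge (an assumed cutoff, not the paper's)."""
    rng = np.random.default_rng(seed)

    # Ancestral allele frequencies, kept away from 0 and 1 (assumption).
    p_anc = rng.uniform(0.2, 0.8, size=N)

    # Balding-Nichols model (assumption): each subpopulation draws its own
    # frequency from a Beta distribution centred on the ancestral one.
    a = p_anc * (1 - fst) / fst
    b = (1 - p_anc) * (1 - fst) / fst
    p_sub = rng.beta(a, b, size=(K, N))

    # Assign individuals to subpopulations in (nearly) equal numbers.
    labels = np.arange(M) % K

    # Genotypes are counts (0, 1, 2) of one allele at each marker.
    G = rng.binomial(2, p_sub[labels]).astype(float)  # shape (M, N)

    # Drop markers that are rare or nearly fixed in the sample.
    p_hat = G.mean(axis=0) / 2
    keep = (p_hat > 0.05) & (p_hat < 0.95)
    G, p_hat = G[:, keep], p_hat[keep]
    n_markers = G.shape[1]

    # Centre and scale each marker, then form the M x M covariance matrix.
    X = (G - 2 * p_hat) / np.sqrt(2 * p_hat * (1 - p_hat))
    S = X @ X.T / n_markers

    # Eigenvalues in descending order.
    eigvals = np.linalg.eigvalsh(S)[::-1]

    # Marchenko-Pastur upper edge for unit-variance noise (assumed cutoff).
    edge = (1 + np.sqrt(M / n_markers)) ** 2
    return int(np.sum(eigvals > edge)), eigvals

n_large, eigvals = count_separated_eigenvalues()
print("eigenvalues above the edge:", n_large)
print("leading eigenvalues:", np.round(eigvals[:5], 2))
```

In this sketch, K subpopulations should typically produce K-1 eigenvalues well above the cutoff, because column centring removes one direction. The cutoff is deliberately crude: for unstructured data the top eigenvalue sits close to the same edge, so the sketch is meant to illustrate the separation, not to serve as a calibrated test.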
