48 research outputs found
Kernel PCA and the Nyström method
This thesis treats kernel PCA and the Nystrom method. We present a novel incre- ¨
mental algorithm for calculation of kernel PCA, which we extend to incremental
calculation of the Nystrom approximation. We suggest a new data-dependent ¨
method to select the number of data points to include in the Nystrom subset, ¨
and create a statistical hypothesis test for the same purpose. We further present
a cross-validation procedure for kernel PCA to select the number of principal
components to retain. Finally, we derive kernel PCA with the Nystrom method ¨
in line with linear PCA and study its statistical accuracy through a confidence
bound
Human consumption of seaweed and freshwater aquatic plants in ancient Europe
During the Mesolithic in Europe, there is widespread evidence for an increase in exploitation of aquatic resources. In contrast, the subsequent Neolithic is characterised by the spread of farming, land ownership, and full sedentism, which lead to the perception of marine resources subsequently representing marginal or famine food or being abandoned altogether even at the furthermost coastal limits of Europe. Here, we examine biomarkers extracted from human dental calculus, using sequential thermal desorption- and pyrolysis-GCMS, to report direct evidence for widespread consumption of seaweed and submerged aquatic and freshwater plants across Europe. Notably, evidence of consumption of these resources extends through the Neolithic transition to farming and into the Early Middle Ages, suggesting that these resources, now rarely eaten in Europe, only became marginal much more recently. Understanding ancient foodstuffs is crucial to reconstructing the past, while a better knowledge of local, forgotten resources is likewise important today
The genetic prehistory of the Baltic Sea region
Correction: Nature communications 9 (2018), art. no. 1494 doi:10.1038/s41467-018-03872-yWhile the series of events that shaped the transition between foraging societies and food producers are well described for Central and Southern Europe, genetic evidence from Northern Europe surrounding the Baltic Sea is still sparse. Here, we report genome-wide DNA data from 38 ancient North Europeans ranging from similar to 9500 to 2200 years before present. Our analysis provides genetic evidence that hunter-gatherers settled Scandinavia via two routes. We reveal that the first Scandinavian farmers derive their ancestry from Anatolia 1000 years earlier than previously demonstrated. The range of Mesolithic Western hunter-gatherers extended to the east of the Baltic Sea, where these populations persisted without gene-flow from Central European farmers during the Early and Middle Neolithic. The arrival of steppe pastoralists in the Late Neolithic introduced a major shift in economy and mediated the spread of a new ancestry associated with the Corded Ware Complex in Northern Europe.Peer reviewe
A gene-centric approach to biomarker discovery identifies transglutaminase 1 as an epidermal autoantigen
Publisher Copyright: © 2021 National Academy of Sciences. All rights reserved.Autoantigen discovery is a critical challenge for the understanding and diagnosis of autoimmune diseases. While autoantibody markers in current clinical use have been identified through studies focused on individual disorders, we postulated that a reverse approach starting with a putative autoantigen to explore multiple disorders might hold promise. We here targeted the epidermal protein transglutaminase 1 (TGM1) as a member of a protein family prone to autoimmune attack. By screening sera from patients with various acquired skin disorders, we identified seropositive subjects with the blistering mucocutaneous disease paraneoplastic pemphigus. Validation in further subjects confirmed TGM1 autoantibodies as a 55% sensitive and 100% specific marker for paraneoplastic pemphigus. This gene-centric approach leverages the wealth of data available for human genes and may prove generally applicable for biomarker discovery in autoimmune diseases.Peer reviewe
Massive migration from the steppe is a source for Indo-European languages in Europe
We generated genome-wide data from 69 Europeans who lived between 8,000-3,000
years ago by enriching ancient DNA libraries for a target set of almost four
hundred thousand polymorphisms. Enrichment of these positions decreases the
sequencing required for genome-wide ancient DNA analysis by a median of around
250-fold, allowing us to study an order of magnitude more individuals than
previous studies and to obtain new insights about the past. We show that the
populations of western and far eastern Europe followed opposite trajectories
between 8,000-5,000 years ago. At the beginning of the Neolithic period in
Europe, ~8,000-7,000 years ago, closely related groups of early farmers
appeared in Germany, Hungary, and Spain, different from indigenous
hunter-gatherers, whereas Russia was inhabited by a distinctive population of
hunter-gatherers with high affinity to a ~24,000 year old Siberian6 . By
~6,000-5,000 years ago, a resurgence of hunter-gatherer ancestry had occurred
throughout much of Europe, but in Russia, the Yamnaya steppe herders of this
time were descended not only from the preceding eastern European
hunter-gatherers, but from a population of Near Eastern ancestry. Western and
Eastern Europe came into contact ~4,500 years ago, as the Late Neolithic Corded
Ware people from Germany traced ~3/4 of their ancestry to the Yamnaya,
documenting a massive migration into the heartland of Europe from its eastern
periphery. This steppe ancestry persisted in all sampled central Europeans
until at least ~3,000 years ago, and is ubiquitous in present-day Europeans.
These results provide support for the theory of a steppe origin of at least
some of the Indo-European languages of Europe
Recommended from our members
Author Correction: The genetic prehistory of the Baltic Sea region
The original version of this Article omitted references to previous work, which are detailed in the associated Author Correction. These omissions have been corrected in both the PDF and HTML versions of the Article
Ten millennia of hepatitis B virus evolution
Hepatitis B virus (HBV) has been infecting humans for millennia and remains a global health problem, but its past diversity and dispersal routes are largely unknown. We generated HBV genomic data from 137 Eurasians and Native Americans dated between ~10,500 and ~400 years ago. We date the most recent common ancestor of all HBV lineages to between ~20,000 and 12,000 years ago, with the virus present in European and South American hunter-gatherers during the early Holocene. After the European Neolithic transition, Mesolithic HBV strains were replaced by a lineage likely disseminated by early farmers that prevailed throughout western Eurasia for ~4000 years, declining around the end of the 2nd millennium BCE. The only remnant of this prehistoric HBV diversity is the rare genotype G, which appears to have reemerged during the HIV pandemic
Analyse en composantes principales avec noyaux et la méthode Nyström
47 pages, 6 figuresThe Nyström method is one of the most popular techniques for improving the scalability of kernel methods. However, it has not yet been derived for kernel PCA in line with classical PCA. In this paper we derive kernel PCA with the Nyström method, thereby providing one of the few available options to make kernel PCA scalable. We further study its statistical accuracy through a finite-sample confidence bound on the empirical reconstruction error compared to the full method. The behaviours of the method and bound are illustrated through computer experiments on multiple real-world datasets. As an application of the method we present kernel principal component regression with the Nyström method, as an alternative to Nyström kernel ridge regression for efficient regularized regression with kernels.La méthode Nyström est une des solutions les plus populaires pour rendre les méthodes de noyaux plus efficace. Pourtant, ce méthode n'a pas été dérivée pour l'analyse en composantes principales avec noyaux. Ici, nous présentons l'analyse en componsantes principales avec noyaux et la méthode Nyström et nous étudions sa précision statistique, fournissant une borne de confiance en échantillon fini. Nous démontrons le comportement de la méthode présentée et la borne avec des expériences informatiques utilisant des données réalistes. De plus, nous appliquons la méthode vers la tâche de régression pour présenter régression sur composantes principales avec la méthode Nyström