218 research outputs found

Personalized Anomaly Detection in PPG Data using Representation Learning and Biometric Identification

Author: Ghorbani Ramin
Reinders Marcel J. T.
Tax David M. J.
Publication date: 12/07/2023

Photoplethysmography (PPG) signals, typically acquired from wearable devices, hold significant potential for continuous fitness-health monitoring. In particular, heart conditions that manifest in rare and subtle deviating heart patterns may be interesting. However, robust and reliable anomaly detection within these data remains a challenge due to the scarcity of labeled data and high inter-subject variability. This paper introduces a two-stage framework leveraging representation learning and personalization to improve anomaly detection performance in PPG data. The proposed framework first employs representation learning to transform the original PPG signals into a more discriminative and compact representation. We then apply three different unsupervised anomaly detection methods for movement detection and biometric identification. We validate our approach using two different datasets in both generalized and personalized scenarios. The results show that representation learning significantly improves anomaly detection performance while reducing the high inter-subject variability. Personalized models further enhance anomaly detection performance, underscoring the role of personalization in PPG-based fitness-health monitoring systems. The results from biometric identification show that it is easier to distinguish a new user from one intended authorized user than from a group of users. Overall, this study provides evidence of the effectiveness of representation learning and personalization for anomaly detection in PPG data.

arXiv.org e-Print Archive

Erratum to: Breast cancer subtype predictors revisited: from consensus to concordance?

Author: Herman M. J. Sontrop
Marcel J. T. Reinders
Perry D. Moerland
Publication venue: Springer Science and Business Media LLC

Crossref

Single-cell immune profiling reveals thymus-seeding populations, T cell commitment, and multilineage development in the human thymus

Author: Akker Erik B. van den
Cordes Martijn
Dongen J. J. M. van
García-Perez Laura
Kiełbasa Szymon M.
Morett Federico A.
Pike-Overzet Karin
Reinders Marcel J. T.
Staal Frank J. T.
Teodosio Cristina
Vloemans Sandra A.
Publication venue: American Association for the Advancement of Science
Publication date: 09/03/2023

T cell development in the mouse thymus has been studied extensively, but less is known regarding T cell development in the human thymus. We used a combination of single-cell techniques and functional assays to perform deep immune profiling of human T cell development, focusing on the initial stages of prelineage commitment. We identified three thymus-seeding progenitor populations that also have counterparts in the bone marrow. In addition, we found that the human thymus physiologically supports the development of monocytes, dendritic cells, and NK cells, as well as limited development of B cells. These results are an important step toward monitoring and guiding regenerative therapies in patients after hematopoietic stem cell transplantation.

Digital.CSIC

De novo sequencing, assembly and analysis of the genome of the laboratory strain Saccharomyces cerevisiae CEN.PK113-7D, a model for modern industrial biotechnology

Author: Bosman Lizanne
Daran Jean-Marc
Daran-Lapujade Pascale
Datema Erwin
de Kok Stefan
de Ridder Dick
Heijne Wilbert HM
Klaassen Paul
Kötter Peter
Luttik Marijke A
Nielsen Jens
Nijkamp Jurgen F
Paddon Chris J
Platt Darren
Pronk Jack T
Reinders Marcel JT
van den Broek Marcel
van Ham Roeland C
Vongsangnak Wanwipa
Publication venue: BioMed Central
Publication date: 01/01/2012

Saccharomyces cerevisiae CEN.PK113-7D is widely used for metabolic engineering and systems biology research in industry and academia. We sequenced, assembled, annotated and analyzed its genome. Single-nucleotide variations (SNV), insertions/deletions (indels) and differences in genome organization compared to the reference strain S. cerevisiae S288C were analyzed. In addition to a few large deletions and duplications, nearly 3000 indels were identified in the CEN.PK113-7D genome relative to S288C. These differences were overrepresented in genes whose functions are related to transcriptional regulation and chromatin remodelling. Some of these variations were caused by unstable tandem repeats, suggesting an innate evolvability of the corresponding genes. Besides a previously characterized mutation in adenylate cyclase, the CEN.PK113-7D genome sequence revealed a significant enrichment of non-synonymous mutations in genes encoding components of the cAMP signalling pathway. Some phenotypic characteristics of the CEN.PK113-7D strains were explained by the presence of additional specific metabolic genes relative to S288C. In particular, the presence of the BIO1 and BIO6 genes correlated with a biotin prototrophy of CEN.PK113-7D. Furthermore, the copy number, chromosomal location and sequences of the MAL loci were resolved. The assembled sequence reveals that CEN.PK113-7D has a mosaic genome that combines characteristics of laboratory strains and wild-industrial strains.

Crossref

TU Delft Repository

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Chalmers Research

Chalmers Publication Library

Hochschulschriftenserver - Universität Frankfurt am Main

Knowledge driven decomposition of tumor expression profiles

Author: A Bild
A Subramanian
AE Teschendorff
AH Bild
B Kreike
CJ Creighton
CK Anders
CM Perou
CR Acharya
D Hanahan
E Segal
G Golub
HY Chuang
I Vastrik
J Lamb
JP Brunet
KR Hess
L van't Veer
Lodewyk FA Wessels
M Ashburner
M Kanehisa
M Karnaugh
Marcel JT Reinders
Martin H van Vliet
R Tibshirani
T Hastie
T Sjoblom
V Mootha
Y Gong
Z Hu
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Tumors have been hypothesized to be the result of a mixture of oncogenic events, some of which will be reflected in the gene expression of the tumor. Based on this hypothesis, a variety of data-driven methods have been employed to decompose tumor expression profiles into component profiles, hypothetically linked to these events. Interpretation of the resulting data-driven components is often done by post-hoc comparison to, for instance, functional groupings of genes into gene sets. None of the data-driven methods allow the incorporation of that type of knowledge directly into the decomposition. Results: We present a linear model which uses knowledge-driven, pre-defined components to perform the decomposition. We solve this decomposition model in a constrained linear least squares fashion. From a variety of options, a lasso-based solution to the model performs best in linking single gene perturbation data to mouse data. Moreover, we show the decomposition of expression profiles from human breast cancer samples into single gene perturbation profiles and gene sets that are linked to the hallmarks of cancer. For these breast cancer samples we were able to discern several links between clinical parameters and the decomposition weights, providing new insights into the biology of these tumors. Lastly, we show that the order in which the lasso regularization shrinks the weights unveils consensus patterns within clinical subgroups of the breast cancer samples. Conclusion: The proposed lasso-based constrained least squares decomposition provides a stable and relevant relation between samples and knowledge-based components, and is thus a viable alternative to data-driven methods. In addition, the consensus order of component importance within clinical subgroups provides a better molecular characterization of the subtypes.

Crossref

Directory of Open Access Journals

PubMed Central

Unsupervised protein embeddings outperform hand-crafted sequence and structure features at predicting molecular function

Author: Alley
Altschul
Amelia Villegas-Morcillo
Anfinsen
Angel M Gomez
Arne Elofsson
Ashburner
Bartoli
Bepler
Berman
Bonetta
Cao
Cheng
Clark
Cozzetto
Devlin
Doersch
Duarte
Eddy
Fa
Fout
Fu
Gidaris
Gligorijevic
Heinzinger
Jiang
Jones
Kabsch
Kane
Kimura
Kingma
Kipf
Kulmanov
Liu
Lyons
Marcel J T Reinders
Mathis
McCann
Pesquita
Peters
Radivojac
Rao
Rives
Roeland C H J van Ham
Srivastava
Stavros Makrodimitris
Sureyya Rifaioglu
Victoria Sanchez
Wang
Weinhold
Wilson
Zamora-Resendiz
Zheng
Zhou
Zhu
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2020

This work was supported by Keygene N.V., a crop innovation company in the Netherlands and by the Spanish MINECO/FEDER Project TEC201680141-P with the associated FPI grant BES-2017-079792. The authors thank Dr. Elvin Isufi and Chirag Raman for their valuable comments and feedback. Motivation: Protein function prediction is a difficult bioinformatics problem. Many recent methods use deep neural networks to learn complex sequence representations and predict function from these. Deep supervised models require a lot of labeled training data which are not available for this task. However, a very large amount of protein sequences without functional labels is available. Results: We applied an existing deep sequence model that had been pretrained in an unsupervised setting on the supervised task of protein molecular function prediction. We found that this complex feature representation is effective for this task, outperforming hand-crafted features such as one-hot encoding of amino acids, k-mer counts, secondary structure and backbone angles. Also, it partly negates the need for complex prediction models, as a two-layer perceptron was enough to achieve competitive performance in the third Critical Assessment of Functional Annotation benchmark. We also show that combining this sequence representation with protein 3D structure information does not lead to performance improvement, hinting that 3D structure is also potentially learned during the unsupervised pretraining.

Crossref

TU Delft Repository

Repositorio Institucional Universidad de Granada

Unified relevance models for rating prediction in collaborative filtering

Author: Arjen P. de Vries
Breese J.
Dempster A. P.
Jun Wang
Lafferty J.
Marcel J. T. Reinders
Pennock D. M.
Robertson S.
Robertson S. E.
Sarwar B. M.
Schölkopf B.
Si L.
Skurichina M.
Publication venue: Association for Computing Machinery (ACM)

Crossref

Identification of Networks of Co-Occurring, Tumor-Related DNA Copy Number Changes Using a Genome-Wide Scoring Approach

Author: A Horlein
AG Uren
AS Dhillon
CG Mullighan
Christiaan Klijn
D Chitale
D Hanahan
D Pinkel
David J. Adams
F Furuya
F Michor
H Dohner
H Rajagopalan
H Seifert
HG Yoon
J de Ridder
J Drach
J Zhang
Jan Bot
JC Strefford
Jos Jonkers
K Jepsen
K Quon
L Jensen
L Michalik
Lodewyk Wessels
M Berglund
M Bredel
Marcel Reinders
NA Heerema
O Sieber
P Dowell
P Haverty
PA Futreal
R Schmitz
R Thomas
S Stilgenbauer
T Suzuki
V Ibanez
W Chen
William Stafford Noble
X Feng
Z Yan
Publication venue: Public Library of Science
Publication date: 01/01/2010

Tumorigenesis is a multi-step process in which normal cells transform into malignant tumors following the accumulation of genetic mutations that enable them to evade the growth control checkpoints that would normally suppress their growth or result in apoptosis. It is therefore important to identify those combinations of mutations that collaborate in cancer development and progression. DNA copy number alterations (CNAs) are one of the ways in which cancer genes are deregulated in tumor cells. We hypothesized that synergistic interactions between cancer genes might be identified by looking for regions of co-occurring gain and/or loss. To this end we developed a scoring framework to separate truly co-occurring aberrations from passenger mutations and dominant single signals present in the data. The resulting regions of high co-occurrence can be investigated for between-region functional interactions. Analysis of high-resolution DNA copy number data from a panel of 95 hematological tumor cell lines correctly identified co-occurring recombinations at the T-cell receptor and immunoglobulin loci in T- and B-cell malignancies, respectively, showing that we can recover truly co-occurring genomic alterations. In addition, our analysis revealed networks of co-occurring genomic losses and gains that are enriched for cancer genes. These networks are also highly enriched for functional relationships between genes. We further examine sub-networks of these networks, core networks, which contain many known cancer genes. The core network for co-occurring DNA losses we find seems to be independent of the canonical cancer genes within the network. Our findings suggest that large-scale, low-intensity copy number alterations may be an important feature of cancer development or maintenance by affecting gene dosage of a large interconnected network of functionally related genes.

Public Library of Science (PLOS)

Crossref

TU Delft Repository

Directory of Open Access Journals

PubMed Central