Search CORE

52 research outputs found

A Kalman Filter Approach for Biomolecular Systems with Noise Covariance Updating

Author: Chakrabarti Kushal
Dey Abhishek
Gola Krishan Kumar
Sen Shaunak
Publication venue
Publication date: 12/11/2018
Field of study

An important part of system modeling is determining parameter values, particularly for biomolecular systems, where direct measurements of individual parameters are typically hard. While Extended Kalman Filters have been used for this purpose, the choice of the process noise covariance is generally unclear. In this chapter, we address this issue for biomolecular systems using a combination of Monte Carlo simulations and experimental data, exploiting the dependence of the process noise covariance on the states and parameters, as given in the Langevin framework. We adapt a Hybrid Extended Kalman Filtering technique by updating the process noise covariance at each time step based on estimates. We compare the performance of this framework with different fixed values of process noise covariance in biomolecular system models, including an oscillator model, as well as in experimentally measured data for a negative transcriptional feedback circuit. We find that the Extended Kalman Filter with such process noise covariance update is closer to the optimality condition in the sense that the innovation sequence becomes white and in achieving a balance between the mean square estimation error and parameter convergence time. The results of this chapter may help in the use of Extended Kalman Filters for systems where process noise covariance depends on states and/or parameters.Comment: 23 pages, 9 figure

arXiv.org e-Print Archive

Crossref

SAIGE-GENE plus improves the efficiency and accuracy of set-based rare variant association tests

Author: Bi Wenjian
Daly Mark J.
Dey Kushal K.
Jagadeesh Karthik A.
Karczewski Konrad J.
Lee Seunggeun
Neale Benjamin M.
Zhao Zhangchen
Zhou Wei
Publication venue
Publication date: 01/01/2022
Field of study

Several biobanks, including UK Biobank (UKBB), are generating large-scale sequencing data. An existing method, SAIGE-GENE, performs well when testing variants with minor allele frequency (MAF) SAIGE-GENE+ performs set-based rare variant association tests with improved type 1 error control and computational efficiency by collapsing ultra-rare variants and conducting multiple tests corresponding to different minor allele frequency cutoffs and annotations.Peer reviewe

PubMed Central

Helsingin yliopiston digitaalinen arkisto

Strengths and limitations of microarray-based phenotype prediction: lessons learned from the IMPROVER Diagnostic Signature Challenge

Author: Bilal Erhan
Boue Stephanie
Hoeng Julia
Koeppl Heinz
Kumar Dey Kushal
Lauria Mario
Martin Florian
Meyer Pablo
Nandy Preetam
Norel Raquel
Peitsch Manuel
Rice Jeremy J.
Romero Roberto
Stolovitzky Gustavo
Talikka Marja
Tarca Adi L.
Unger Michael
Xiang Yang
Zechner Christoph
Publication venue
Publication date: 02/08/2017
Field of study

Motivation: After more than a decade since microarrays were used to predict phenotype of biological samples, real-life applications for disease screening and identification of patients who would best benefit from treatment are still emerging. The interest of the scientific community in identifying best approaches to develop such prediction models was reaffirmed in a competition style international collaboration called IMPROVER Diagnostic Signature Challenge whose results we describe herein. Results: Fifty-four teams used public data to develop prediction models in four disease areas including multiple sclerosis, lung cancer, psoriasis and chronic obstructive pulmonary disease, and made predictions on blinded new data that we generated. Teams were scored using three metrics that captured various aspects of the quality of predictions, and best performers were awarded. This article presents the challenge results and introduces to the community the approaches of the best overall three performers, as well as an R package that implements the approach of the best overall team. The analyses of model performance data submitted in the challenge as well as additional simulations that we have performed revealed that (i) the quality of predictions depends more on the disease endpoint than on the particular approaches used in the challenge; (ii) the most important modeling factor (e.g. data preprocessing, feature selection and classifier type) is problem dependent; and (iii) for optimal results datasets and methods have to be carefully matched. Biomedical factors such as the disease severity and confidence in diagnostic were found to be associated with the misclassification rates across the different teams. Availability: The lung cancer dataset is available from Gene Expression Omnibus (accession, GSE43580). The maPredictDSC R package implementing the approach of the best overall team is available at www.bioconductor.org or http://bioinformaticsprb.med.wayne.edu/. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin

RERO DOC Digital Library

Recommended from our members

Model based Visualization of Structure in Biological Data

Author: Dey Kushal Kumar
Publication venue: University of Chicago
Publication date: 21/09/2018
Field of study

Knowledge UChicago