Search CORE

4 research outputs found

Analysis and improvement of security and privacy techniques for genomic information

Author: García Perez José Antonio
Publication venue: Universitat Politècnica de Catalunya
Publication date: 28/10/2021
Field of study

The purpose of this thesis is to review the current literature of privacy preserving techniques for genomic information on the last years. Based on the analysis, we propose a long-term classification system for the reviewed techniques. We also develop a security improvement proposal for the Beacon system without hindering research utility

UPCommons. Portal del coneixement obert de la UPC

Preface

Author: Pape-Haugaard Louise B.
Scott Philip
Publication venue: 'IOS Press'
Publication date: 16/06/2020
Field of study

Portsmouth University Research Portal (Pure)

Protecting Genomic Data Privacy with Probabilistic Modeling

Author: Berger Leighton Bonnie
Sahinalp Cenk
Simmons Sean
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 07/11/2019
Field of study

As genetic sequencing becomes less expensive and data sets linking genetic data and medical records (e.g., Biobanks) become larger and more common, issues of data privacy and computational challenges become more necessary to address in order to realize the benefits of these datasets. One possibility for alleviating these issues is through the use of already-computed summary statistics (e.g., slopes and standard errors from a regression model of a phenotype on a genotype). If groups share summary statistics from their analyses of biobanks, many of the privacy issues and computational challenges concerning the access of these data could be bypassed. In this paper we explore the possibility of using summary statistics from simple linear models of phenotype on genotype in order to make inferences about more complex phenotypes (those that are derived from two or more simple phenotypes). We provide exact formulas for the slope, intercept, and standard error of the slope for linear regressions when combining phenotypes. Derived equations are validated via simulation and tested on a real data set exploring the genetics of fatty acids. Keywords: privacy; biobank; genetics; genome-wide association study; single nucleotide variant; computational challenges; data security; phenotype

DSpace@MIT