8 research outputs found
Analysis and improvement of security and privacy techniques for genomic information
The purpose of this thesis is to review the current literature of privacy preserving techniques for genomic information on the last years. Based on the analysis, we propose a long-term classification system for the reviewed techniques. We also develop a security improvement proposal for the Beacon system without hindering research utility
Systematizing Genome Privacy Research: A Privacy-Enhancing Technologies Perspective
Rapid advances in human genomics are enabling researchers to gain a better
understanding of the role of the genome in our health and well-being,
stimulating hope for more effective and cost efficient healthcare. However,
this also prompts a number of security and privacy concerns stemming from the
distinctive characteristics of genomic data. To address them, a new research
community has emerged and produced a large number of publications and
initiatives.
In this paper, we rely on a structured methodology to contextualize and
provide a critical analysis of the current knowledge on privacy-enhancing
technologies used for testing, storing, and sharing genomic data, using a
representative sample of the work published in the past decade. We identify and
discuss limitations, technical challenges, and issues faced by the community,
focusing in particular on those that are inherently tied to the nature of the
problem and are harder for the community alone to address. Finally, we report
on the importance and difficulty of the identified challenges based on an
online survey of genome data privacy expertsComment: To appear in the Proceedings on Privacy Enhancing Technologies
(PoPETs), Vol. 2019, Issue
Privacy-preserving data sharing infrastructures for medical research: systematization and comparison
Background: Data sharing is considered a crucial part of modern medical research. Unfortunately, despite its advantages, it often faces obstacles, especially data privacy challenges. As a result, various approaches and infrastructures have been developed that aim to ensure that patients and research participants remain anonymous when data is shared. However, privacy protection typically comes at a cost, e.g. restrictions regarding the types of analyses that can be performed on shared data. What is lacking is a systematization making the trade-offs taken by different approaches transparent. The aim of the work described in this paper was to develop a systematization for the degree of privacy protection provided and the trade-offs taken by different data sharing methods. Based on this contribution, we categorized popular data sharing approaches and identified research gaps by analyzing combinations of promising properties and features that are not yet supported by existing approaches.
Methods: The systematization consists of different axes. Three axes relate to privacy protection aspects and were adopted from the popular Five Safes Framework: (1) safe data, addressing privacy at the input level, (2) safe settings, addressing privacy during shared processing, and (3) safe outputs, addressing privacy protection of analysis results. Three additional axes address the usefulness of approaches: (4) support for de-duplication, to enable the reconciliation of data belonging to the same individuals, (5) flexibility, to be able to adapt to different data analysis requirements, and (6) scalability, to maintain performance with increasing complexity of shared data or common analysis processes.
Results: Using the systematization, we identified three different categories of approaches: distributed data analyses, which exchange anonymous aggregated data, secure multi-party computation protocols, which exchange encrypted data, and data enclaves, which store pooled individual-level data in secure environments for access for analysis purposes. We identified important research gaps, including a lack of approaches enabling the de-duplication of horizontally distributed data or providing a high degree of flexibility.
Conclusions: There are fundamental differences between different data sharing approaches and several gaps in their functionality that may be interesting to investigate in future work. Our systematization can make the properties of privacy-preserving data sharing infrastructures more transparent and support decision makers and regulatory authorities with a better understanding of the trade-offs taken
Innovative Verfahren fĂĽr die standortĂĽbergreifende Datennutzung in der medizinischen Forschung
Implementing modern data-driven medical research approaches ("Artificial intelligence", "Data Science") requires access to large amounts of data ("Big Data"). Typically, this can only be achieved through cross-institutional data use and exchange ("Data Sharing"). In this process, the protection of the privacy of patients and probands affected is a central challenge. Various methods can be used to meet this challenge, such as anonymization or federation. However, data sharing is currently put into practice only to a limited extent, although it is demanded and promoted from many sides. One reason for this is the lack of clarity about the advantages and disadvantages of different data sharing approaches. The first goal of this thesis was to develop an instrument that makes these advantages and disadvantages more transparent. The instrument systematizes approaches based on two dimensions - utility and protection - where each dimension is further differentiated with three axes describing different aspects of the dimensions, such as the degree of privacy protection provided by the results of performed analyses or the flexibility of a platform regarding the types of analyses that can be performed. The instrument was used for evaluation purposes to analyze the status quo and to identify gaps and potentials for innovative approaches. Next, and as a second goal, an innovative tool for the practical use of cryptographic data sharing methods has been designed and implemented. So far, such approaches are only rarely used in practice due to two main obstacles: (1) the technical complexity of setting up a cryptography-based data sharing infrastructure and (2) a lack of user-friendliness of cryptographic data sharing methods, especially for medical researchers. The tool EasySMPC, which was developed as part of this work, is characterized by the fact that it allows cryptographically secure computation of sums (e.g., frequencies of diagnoses) across institutional boundaries based on an easy-to-use graphical user interface. Neither technical expertise nor the deployment of specific infrastructure components is necessary for its practical use. The practicability of EasySMPC was analyzed experimentally in a detailed performance evaluation.Moderne datengetriebene medizinische Forschungsansätze („Künstliche Intelligenz“,
„Data Science“) benötigen große Datenmengen („Big Data“). Dies kann im Regelfall nur
durch eine institutionsübergreifende Datennutzung erreicht werden („Data Sharing“).
Datenschutz und der Schutz der Privatsphäre der Betroffenen ist dabei eine zentrale
Herausforderung. Um dieser zu begegnen, können verschiedene Methoden, wie etwa
Anonymisierungsverfahren oder föderierte Auswertungen, eingesetzt werden. Allerdings
findet Data Sharing in der Praxis nur selten statt, obwohl es von vielen Seiten gefordert
und gefördert wird. Ein Grund hierfür ist die Unklarheit ¸über Vor- und Nachteile
verschiedener Data Sharing-Ansätze. Erstes Ziel dieser Arbeit war es, ein Instrument zu
entwickeln, welches diese Vor- und Nachteile transparent macht. Das Instrument
bewertet Ansätze anhand von zwei Dimensionen - Nutzen und Schutz - wobei jede
Dimension mit drei Achsen weiter differenziert ist. Die Achsen bestehen etwa aus dem
Grad des Schutzes der Privatsphäre, der durch die Ergebnisse der durchgeführten
Analysen gewährleistet wird oder der Flexibilität einer Plattform hinsichtlich der Arten von
Analysen, die durchgeführt werden können. Das Instrument wurde zu
Evaluationszwecken fĂĽr die Analyse des Status Quo sowie zur Identifikation von LĂĽcken
und Potenzialen fĂĽr innovative Verfahren eingesetzt. Als zweites Ziel wurde anschlieĂźend
ein innovatives Werkzeug fĂĽr den praktischen Einsatz von kryptographischen Data
Sharing-Verfahren entwickelt. Der Einsatz entsprechender Ansätze scheitert bisher vor
allem an zwei Barrieren: (1) der technischen Komplexität beim Aufbau einer
Kryptographie-basierten Data Sharing-Infrastruktur und (2) der Benutzerfreundlichkeit
kryptographischer Data Sharing-Verfahren, insbesondere fĂĽr medizinische Forschende.
Das neue Werkzeug EasySMPC zeichnet sich dadurch aus, dass es eine
kryptographisch sichere Berechnung von Summen (beispielsweise Häufigkeiten von
Diagnosen) ĂĽber Institutionsgrenzen hinweg auf Basis einer einfach zu bedienenden
graphischen Benutzeroberfläche ermöglicht. Zur Anwendung ist weder technische
Expertise noch der Aufbau spezieller Infrastrukturkomponenten notwendig. Die
Praxistauglichkeit von EasySMPC wurde in einer ausfĂĽhrlichen Performance-Evaluation
experimentell analysiert
Blockchain for Genomics:A Systematic Literature Review
Human genomic data carry unique information about an individual and offer
unprecedented opportunities for healthcare. The clinical interpretations
derived from large genomic datasets can greatly improve healthcare and pave the
way for personalized medicine. Sharing genomic datasets, however, pose major
challenges, as genomic data is different from traditional medical data,
indirectly revealing information about descendants and relatives of the data
owner and carrying valid information even after the owner passes away.
Therefore, stringent data ownership and control measures are required when
dealing with genomic data. In order to provide secure and accountable
infrastructure, blockchain technologies offer a promising alternative to
traditional distributed systems. Indeed, the research on blockchain-based
infrastructures tailored to genomics is on the rise. However, there is a lack
of a comprehensive literature review that summarizes the current
state-of-the-art methods in the applications of blockchain in genomics. In this
paper, we systematically look at the existing work both commercial and
academic, and discuss the major opportunities and challenges. Our study is
driven by five research questions that we aim to answer in our review. We also
present our projections of future research directions which we hope the
researchers interested in the area can benefit from
Blockchain for Genomics:A Systematic Literature Review
Human genomic data carry unique information about an individual and offer
unprecedented opportunities for healthcare. The clinical interpretations
derived from large genomic datasets can greatly improve healthcare and pave the
way for personalized medicine. Sharing genomic datasets, however, pose major
challenges, as genomic data is different from traditional medical data,
indirectly revealing information about descendants and relatives of the data
owner and carrying valid information even after the owner passes away.
Therefore, stringent data ownership and control measures are required when
dealing with genomic data. In order to provide secure and accountable
infrastructure, blockchain technologies offer a promising alternative to
traditional distributed systems. Indeed, the research on blockchain-based
infrastructures tailored to genomics is on the rise. However, there is a lack
of a comprehensive literature review that summarizes the current
state-of-the-art methods in the applications of blockchain in genomics. In this
paper, we systematically look at the existing work both commercial and
academic, and discuss the major opportunities and challenges. Our study is
driven by five research questions that we aim to answer in our review. We also
present our projections of future research directions which we hope the
researchers interested in the area can benefit from
The Contribution of Ethical Governance of Artificial Intelligence & Machine Learning in Healthcare
With the Internet Age and technology progressively advancing every year, the usage of Artificial Intelligence (AI) along with Machine Learning (ML) algorithms has only increased since its introduction to society. Specifically, in the healthcare field, AI/ML has proven to its end-users how beneficial its assistance has been. However, despite its effectiveness and efficiencies, AI/ML has also been under scrutiny due to its unethical outcomes. As a result of this, two polarizing views are typically debated when discussing AI/ML. One side believes that AI/ML usage should continue regardless of its unsureness, while the other side argues that this technology is too dangerous and should not be utilized at all. Given the fact that AI/ML can provide prompt and fairly accurate results, it is unrealistic to assume that AI/ML usage will end any time soon. Therefore, governance of AI/ML is needed to ensure that these technologies are reliable.
Notably, AI governance has been positively reviewed and pushed for by scholars in the field. While AI governance does guarantee a sense of oversight on AI/ML, this form of governance is not sustainable. AI governance primarily focuses on the safety of the technology, with ethical, legal, and social factors serving as elements of AI governance. The safety of AI/ML is only one of the considerations for producing and ensuring ethical AI/ML. Ethical governance of AI/ML, which concentrates on incorporating ethics into all aspects of AI/ML—specifically, narrowing in on the stakeholders involved, will lead to not only a safer product but a more viable one as well. Thus, ethical governance of AI/ML must be advocated for in order to bring more awareness, which would lead to greater research and implementation of this type of governance.
Although AI/ML can be used for a multitude of areas, the healthcare industry is slightly more significant, especially since these technologies directly affect the patients’ health. This dissertation explores the contribution of ethical governance of AI/ML in several facets of healthcare. As AI/ML requires big data to provide outcomes, the context of data analytics is discussed. Other areas the dissertation explores are clinical decision-making, end-of-life decisions, and biotechnology. While these topics certainly do not cover the whole healthcare field, the dissertation attempts to include a wide range of AI/ML functions from the beginning of its process (with data analytics) to the future of AI/ML (with biotechnology). With each of these areas of interest, various ethical governance principles are introduced and endorsed for to develop ethical AI/ML. The goal of this dissertation in discussing the contribution of ethical governance of AI/ML in healthcare is to provide a foundational groundwork for more future research of the ethical governance of AI/ML