Search CORE

953 research outputs found

Evolutionary tree-based quasi identifier and federated gradient privacy preservations over big healthcare data

Author: Krishna Sujatha
Vinayaka Murthy Udayarani
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/02/2022
Field of study

Big data has remodeled the way organizations supervise, examine and leverage data in any industry. To safeguard sensitive data from public contraventions, several countries investigated this issue and carried out privacy protection mechanism. With the aid of quasi-identifiers privacy is not said to be preserved to a greater extent. This paper proposes a method called evolutionary tree-based quasi-identifier and federated gradient (ETQI-FD) for privacy preservations over big healthcare data. The first step involved in the ETQI-FD is learning quasi-identifiers. Learning quasi-identifiers by employing information loss function separately for categorical and numerical attributes accomplishes both the largest dissimilarities and partition without a comprehensive exploration between tuples of features or attributes. Next with the learnt quasi-identifiers, privacy preservation of data item is made by applying federated gradient arbitrary privacy preservation learning model. This model attains optimal balance between privacy and accuracy. In the federated gradient privacy preservation learning model, we evaluate the determinant of each attribute to the outputs. Then injecting Adaptive Lorentz noise to data attributes our ETQI-FD significantly minimizes the influence of noise on the final results and therefore contributing to privacy and accuracy. An experimental evaluation of ETQI-FD method achieves better accuracy and privacy than the existing methods

ZENODO

Institute of Advanced Engineering and Science

The Future of Information Sciences : INFuture2015 : e-Institutions – Openness, Accessibility, and Preservation

Author
Publication venue: Department of Information and Communication Sciences, Faculty of Humanities and Social Sciences, University of Zagreb
Publication date: 01/11/2015
Field of study

Repozitorij Filozofskog fakulteta u Zagrebu' at University of Zagreb

Protecting sensitive data using differential privacy and role-based access control

Author: Torabian Hajaralsadat
Publication venue
Publication date: 23/04/2018
Field of study

Dans le monde d'aujourd'hui où la plupart des aspects de la vie moderne sont traités par des systèmes informatiques, la vie privée est de plus en plus une grande préoccupation. En outre, les données ont été générées massivement et traitées en particulier dans les deux dernières années, ce qui motive les personnes et les organisations à externaliser leurs données massives à des environnements infonuagiques offerts par des fournisseurs de services. Ces environnements peuvent accomplir les tâches pour le stockage et l'analyse de données massives, car ils reposent principalement sur Hadoop MapReduce qui est conçu pour traiter efficacement des données massives en parallèle. Bien que l'externalisation de données massives dans le nuage facilite le traitement de données et réduit le coût de la maintenance et du stockage de données locales, elle soulève de nouveaux problèmes concernant la protection de la vie privée. Donc, comment on peut effectuer des calculs sur de données massives et sensibles tout en préservant la vie privée. Par conséquent, la construction de systèmes sécurisés pour la manipulation et le traitement de telles données privées et massives est cruciale. Nous avons besoin de mécanismes pour protéger les données privées, même lorsque le calcul en cours d'exécution est non sécurisé. Il y a eu plusieurs recherches ont porté sur la recherche de solutions aux problèmes de confidentialité et de sécurité lors de l'analyse de données dans les environnements infonuagique. Dans cette thèse, nous étudions quelques travaux existants pour protéger la vie privée de tout individu dans un ensemble de données, en particulier la notion de vie privée connue comme confidentialité différentielle. Confidentialité différentielle a été proposée afin de mieux protéger la vie privée du forage des données sensibles, assurant que le résultat global publié ne révèle rien sur la présence ou l'absence d'un individu donné. Enfin, nous proposons une idée de combiner confidentialité différentielle avec une autre méthode de préservation de la vie privée disponible.In nowadays world where most aspects of modern life are handled and managed by computer systems, privacy has increasingly become a big concern. In addition, data has been massively generated and processed especially over the last two years. The rate at which data is generated on one hand, and the need to efficiently store and analyze it on the other hand, lead people and organizations to outsource their massive amounts of data (namely Big Data) to cloud environments supported by cloud service providers (CSPs). Such environments can perfectly undertake the tasks for storing and analyzing big data since they mainly rely on Hadoop MapReduce framework, which is designed to efficiently handle big data in parallel. Although outsourcing big data into the cloud facilitates data processing and reduces the maintenance cost of local data storage, it raises new problem concerning privacy protection. The question is how one can perform computations on sensitive and big data while still preserving privacy. Therefore, building secure systems for handling and processing such private massive data is crucial. We need mechanisms to protect private data even when the running computation is untrusted. There have been several researches and work focused on finding solutions to the privacy and security issues for data analytics on cloud environments. In this dissertation, we study some existing work to protect the privacy of any individual in a data set, specifically a notion of privacy known as differential privacy. Differential privacy has been proposed to better protect the privacy of data mining over sensitive data, ensuring that the released aggregate result gives almost nothing about whether or not any given individual has been contributed to the data set. Finally, we propose an idea of combining differential privacy with another available privacy preserving method

CorpusUL

Secure big data ecosystem architecture : challenges and solutions

Author: Anwar Memoona
Gill Asif
Hussain Farookh
Imran Muhammad
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Big data ecosystems are complex data-intensive, digital–physical systems. Data-intensive ecosystems offer a number of benefits; however, they present challenges as well. One major challenge is related to the privacy and security. A number of privacy and security models, techniques and algorithms have been proposed over a period of time. The limitation is that these solutions are primarily focused on an individual or on an isolated organizational context. There is a need to study and provide complete end-to-end solutions that ensure security and privacy throughout the data lifecycle across the ecosystem beyond the boundary of an individual system or organizational context. The results of current study provide a review of the existing privacy and security challenges and solutions using the systematic literature review (SLR) approach. Based on the SLR approach, 79 applicable articles were selected and analyzed. The information from these articles was extracted to compile a catalogue of security and privacy challenges in big data ecosystems and to highlight their interdependencies. The results were categorized from theoretical viewpoint using adaptive enterprise architecture and practical viewpoint using DAMA framework as guiding lens. The findings of this research will help to identify the research gaps and draw novel research directions in the context of privacy and security in big data-intensive ecosystems. © 2021, The Author(s)

Federation ResearchOnline

Integration of Differential Privacy Mechanism to Map-Reduce Platform for Preserving Privacy in Cloud Environments

Author: Vosough Tehrani Melissa
Publication venue
Publication date: 01/05/2020
Field of study

Le cloud computing peut être désigné comme utilisant les capacités de ressources matérielles et logicielles basées sur Internet; C’est la tendance de la dernière décennie dans le monde numérique d’aujourd’hui, de plus en plus rapide. Cela a changé le monde qui nous entoure. L’utilisation du cloud est devenue une norme et les utilisateurs transfèrent leurs données vers le cloud à mesure que les données grossissent et qu’il est nécessaire d’accéder aux données à partir de nombreux appareils. Des tonnes de données sont créées chaque jour et toutes les organisations, des instituts scientifiques aux entreprises industrielles, ont pour objectif d’analyser les données et d’en extraire les schémas afin d’améliorer leurs services ou à d’autres fins. Dans l’intervalle, les sociétés d’analyse de données utilisent les informations de millions de personnes et il est de plus en plus nécessaire de garantir la protection de leurs données. Des techniques d’ingénierie sociale aux attaques techniques malveillantes, les données risquent toujours de fuir et nous devrions proposer des solutions pour protéger les données des individus. Dans cette thèse, nous présentons «Parmanix», une plateforme de protection de la confidentialité pour l’analyse de données. Il est basé sur le système MapReduce et fournit des garanties de confidentialité pour les données sensibles dans les calculs distribués sur des données sensibles. Sur cette plate-forme, les fournisseurs de données définissent la politique de sécurité de leurs données. Le fournisseur de calcul peut écrire du code Mapper non approuvé et utiliser l’un des réducteurs de confiance déjà définis dans Parmanix. Comme le système garantit une surcharge acceptable, il n’y aura aucune fuite de données individuelles lors des calculs de la plate-forme.----------ABSTRACT: Cloud computing can be referred to as using the capabilities of hardware and software resources that are based on the Internet; It is the trend of the past decade growing among today’s digital world at a fast pace. It has changed the world around us. Using the cloud has become a norm and people are moving their data to the cloud since data is getting bigger and there is the need to access the data from many devices. Tones of data are creating every day and all the organizations, from science institutes to industrial companies aim to analyze the data and extract the patterns within them to improve their services or for other purposes. In between, information of millions of people is getting used by data analytic companies and there is an increasing need to guarantee the protection of their data. From social engineering techniques to malicious technical attacks, the data is always at the risk of leakage and we should propose solutions to keep an individual’s data protected. In this thesis, we present “Parmanix”, a privacy preserve module for data analytics. It is based on the MapReduce system and provides privacy guarantees for sensitive data in distributed computations on sensitive data. With this module, data providers define the security policy for their data, and computation provider can write untrusted Mapper code and use one of the trusted Reducers that we have already defined within Parmanix. As system guarantees with an acceptable amount of overhead, there would be no leakage of individual’s data through the platform computations

PolyPublie

Big data y privacidad. Estudio bibliométrico

Author: Muñoz Díaz Alejandro
Publication venue
Publication date: 01/01/2017
Field of study

Revisión bibliográfica sobre la Privacidad de los datos personales en la actividad relacionada con el concepto “Big Data”.Universidad de Sevilla. Máster Universitario en Estudios Avanzados en Dirección de Empresa

idUS. Depósito de Investigación Universidad de Sevilla