3 research outputs found

    Assessing Blockchain’s Potential to Ensure Data Integrity and Security for AI and Machine Learning Applications

    Get PDF
    The increasing use of data-centric approaches in the fields of Machine Learning and Artificial Intelligence (ML/AI) has raised substantial issues over the security, integrity, and trustworthiness of data. In response to this challenge, Blockchain technology offered a promising and practical solution, as its inherent characteristics as a decentralized distributed ledger, coupled with cryptographic processes, offer an unprecedented level of data confidentiality and immutability. This study examines the mutually beneficial connection between Blockchain technology and ML/AI, using Blockchain\u27s inherent capacity to protect against unauthorized alterations of data during the training phase of ML models. The method involves building valid blocks of data from the training dataset and then sending them to the mining process using smart contracts and the Proof of Work (PoW) consensus method. Using SHA256 to produce a cryptographic signature for each data block improves the aforementioned procedure. The public Ethereum blockchain serves as a secure repository for these signatures, whereas a cloud-based infrastructure houses the original data file. Particularly during the training phase of Machine Learning (ML) models, this cryptographic framework is critical in ensuring the data verification procedure. This research investigates the potential collaboration between Blockchain technology and ML/AI, bolstering data quality and trust to enhance data-driven decision-making fortifying the models\u27 ability to provide precise and dependable results

    Study and Performance Analysis of Different Techniques for Computing Data Cubes

    Get PDF
    Data is an integrated form of observable and recordable facts in operational or transactional systems in the data warehouse. Usually, data warehouse stores aggregated and historical data in multi-dimensional schemas. Data only have value to end-users when it is formulated and represented as information. And Information is a composed collection of facts for decision making. Cube computation is the most efficient way for answering this decision making queries and retrieve information from data. Online Analytical Process (OLAP) used in this purpose of the cube computation. There are two types of OLAP: Relational Online Analytical Processing (ROLAP) and Multidimensional Online Analytical Processing (MOLAP). This research worked on ROLAP and MOLAP and then compare both methods to find out the computation times by the data volume. Generally, a large data warehouse produces an extensive output, and it takes a larger space with a huge amount of empty data cells. To solve this problem, data compression is inevitable. Therefore, Compressed Row Storage (CRS) is applied to reduce empty cell overhead

    Study and Performance Analysis of Different Techniques for Computing Data Cubes

    Get PDF
    Data is an integrated form of observable and recordable facts in operational or transactional systems in the data warehouse. Usually, data warehouse stores aggregated and historical data in multi-dimensional schemas. Data only have value to end-users when it is formulated and represented as information. And Information is a composed collection of facts for decision making. Cube computation is the most efficient way for answering this decision making queries and retrieve information from data. Online Analytical Process (OLAP) used in this purpose of the cube computation. There are two types of OLAP: Relational Online Analytical Processing (ROLAP) and Multidimensional Online Analytical Processing (MOLAP). This research worked on ROLAP and MOLAP and then compare both methods to find out the computation times by the data volume. Generally, a large data warehouse produces an extensive output, and it takes a larger space with a huge amount of empty data cells. To solve this problem, data compression is inevitable. Therefore, Compressed Row Storage (CRS) is applied to reduce empty cell overhead
    corecore