9 research outputs found

    Secure Split Learning against Property Inference, Data Reconstruction, and Feature Space Hijacking Attacks

    Full text link
    Split learning of deep neural networks (SplitNN) has provided a promising solution to learning jointly for the mutual interest of a guest and a host, which may come from different backgrounds, holding features partitioned vertically. However, SplitNN creates a new attack surface for the adversarial participant, holding back its practical use in the real world. By investigating the adversarial effects of highly threatening attacks, including property inference, data reconstruction, and feature hijacking attacks, we identify the underlying vulnerability of SplitNN and propose a countermeasure. To prevent potential threats and ensure the learning guarantees of SplitNN, we design a privacy-preserving tunnel for information exchange between the guest and the host. The intuition is to perturb the propagation of knowledge in each direction with a controllable unified solution. To this end, we propose a new activation function named R3eLU, transferring private smashed data and partial loss into randomized responses in forward and backward propagations, respectively. We give the first attempt to secure split learning against three threatening attacks and present a fine-grained privacy budget allocation scheme. The analysis proves that our privacy-preserving SplitNN solution provides a tight privacy budget, while the experimental results show that our solution performs better than existing solutions in most cases and achieves a good tradeoff between defense and model usability.Comment: 23 page

    Strategies for Searching Video Content with Text Queries or Video Examples

    Full text link
    The large number of user-generated videos uploaded on to the Internet everyday has led to many commercial video search engines, which mainly rely on text metadata for search. However, metadata is often lacking for user-generated videos, thus these videos are unsearchable by current search engines. Therefore, content-based video retrieval (CBVR) tackles this metadata-scarcity problem by directly analyzing the visual and audio streams of each video. CBVR encompasses multiple research topics, including low-level feature design, feature fusion, semantic detector training and video search/reranking. We present novel strategies in these topics to enhance CBVR in both accuracy and speed under different query inputs, including pure textual queries and query by video examples. Our proposed strategies have been incorporated into our submission for the TRECVID 2014 Multimedia Event Detection evaluation, where our system outperformed other submissions in both text queries and video example queries, thus demonstrating the effectiveness of our proposed approaches

    Can we share models if sharing data is not an option?

    No full text
    In the big data era, vast volumes of data are generated daily as the foundation of data-driven scientific discovery. Thanks to the recent open data movement, much of these data are being made available to the public, significantly advancing scientific research and accelerating socio-technical development. However, not all data are suitable for opening or sharing because of concerns over privacy, ownership, trust, and incentive. Therefore, data sharing remains a challenge for specific data types and holders, making a bottleneck for further unleashing the potential of these “closed data.” To address this challenge, in this perspective, we conceptualize the current practices and technologies in data collaboration in a data-sharing-free manner and propose a concept of the model-sharing strategy for using closed data without sharing them. Supported by emerging advances in artificial intelligence, this strategy will unleash the large potential in closed data. Moreover, we show the advantages of the model-sharing strategy and explain how it will lead to a new paradigm of big data governance and collaboration

    Seasonal Cycles of Phytoplankton Expressed by Sine Equations Using the Daily Climatology from Satellite-Retrieved Chlorophyll-a Concentration (1997–2019) Over Global Ocean

    No full text
    The global coverage of Chlorophyll-a concentration (Chl-a) has been continuously available from ocean color satellite sensors since September 1997 and the Chl-a data (1997–2019) were used to produce a climatological dataset by averaging Chl-a values at same locations and same day of year. The constructed climatology can remarkably reduce the variability of satellite data and clearly exhibit the seasonal cycles, demonstrating that the growth and decay of phytoplankton recurs with similarly seasonal cycles year after year. As the shapes of time series of the climatology exhibit strong periodical change, we wonder whether the seasonality of Chl-a can be expressed by a mathematic equation. Our results show that sinusoid functions are suitable to describe cyclical variations of data in time series and patterns of the daily climatology can be matched by sine equations with parameters of mean, amplitude, phase, and frequency. Three types of sine equations were used to match the climatological Chl-a with Mean Relative Differences (MRD) of 7.1%, 4.5%, and 3.3%, respectively. The sine equation with four sinusoids can modulate the shapes of the fitted values to match various patterns of climatology with small MRD values (less than 5%) in about 90% of global oceans. The fitted values can reflect an overall pattern of seasonal cycles of Chl-a which can be taken as a time series of biomass baseline for describing the state of seasonal variations of phytoplankton. The amplitude images, the spatial patterns of seasonal variations of phytoplankton, can be used to identify the transition zone chlorophyll fronts. The timing of phytoplankton blooms is identified by the biggest peak of the fitted values and used to classify oceans as different bloom seasons, indicating that blooms occur in all four seasons with regional features. In global oceans within latitude domains (48°N–48°S), blooms occupy approximately half of the ocean (50.6%) during boreal winter (December–February) in the northern hemisphere and more than half (58.0%) during austral winter (June–August) in the southern hemisphere. Therefore, the sine equation can be used to match the daily Chl-a climatology and the fitted values can reflect the seasonal cycles of phytoplankton, which can be used to investigate the underlying phenological characteristics
    corecore