9 research outputs found
Secure Split Learning against Property Inference, Data Reconstruction, and Feature Space Hijacking Attacks
Split learning of deep neural networks (SplitNN) has provided a promising
solution to learning jointly for the mutual interest of a guest and a host,
which may come from different backgrounds, holding features partitioned
vertically. However, SplitNN creates a new attack surface for the adversarial
participant, holding back its practical use in the real world. By investigating
the adversarial effects of highly threatening attacks, including property
inference, data reconstruction, and feature hijacking attacks, we identify the
underlying vulnerability of SplitNN and propose a countermeasure. To prevent
potential threats and ensure the learning guarantees of SplitNN, we design a
privacy-preserving tunnel for information exchange between the guest and the
host. The intuition is to perturb the propagation of knowledge in each
direction with a controllable unified solution. To this end, we propose a new
activation function named R3eLU, transferring private smashed data and partial
loss into randomized responses in forward and backward propagations,
respectively. We give the first attempt to secure split learning against three
threatening attacks and present a fine-grained privacy budget allocation
scheme. The analysis proves that our privacy-preserving SplitNN solution
provides a tight privacy budget, while the experimental results show that our
solution performs better than existing solutions in most cases and achieves a
good tradeoff between defense and model usability.Comment: 23 page
Strategies for Searching Video Content with Text Queries or Video Examples
The large number of user-generated videos uploaded on to the Internet
everyday has led to many commercial video search engines, which mainly rely on
text metadata for search. However, metadata is often lacking for user-generated
videos, thus these videos are unsearchable by current search engines.
Therefore, content-based video retrieval (CBVR) tackles this metadata-scarcity
problem by directly analyzing the visual and audio streams of each video. CBVR
encompasses multiple research topics, including low-level feature design,
feature fusion, semantic detector training and video search/reranking. We
present novel strategies in these topics to enhance CBVR in both accuracy and
speed under different query inputs, including pure textual queries and query by
video examples. Our proposed strategies have been incorporated into our
submission for the TRECVID 2014 Multimedia Event Detection evaluation, where
our system outperformed other submissions in both text queries and video
example queries, thus demonstrating the effectiveness of our proposed
approaches
Can we share models if sharing data is not an option?
In the big data era, vast volumes of data are generated daily as the foundation of data-driven scientific discovery. Thanks to the recent open data movement, much of these data are being made available to the public, significantly advancing scientific research and accelerating socio-technical development. However, not all data are suitable for opening or sharing because of concerns over privacy, ownership, trust, and incentive. Therefore, data sharing remains a challenge for specific data types and holders, making a bottleneck for further unleashing the potential of these “closed data.” To address this challenge, in this perspective, we conceptualize the current practices and technologies in data collaboration in a data-sharing-free manner and propose a concept of the model-sharing strategy for using closed data without sharing them. Supported by emerging advances in artificial intelligence, this strategy will unleash the large potential in closed data. Moreover, we show the advantages of the model-sharing strategy and explain how it will lead to a new paradigm of big data governance and collaboration
Seasonal Cycles of Phytoplankton Expressed by Sine Equations Using the Daily Climatology from Satellite-Retrieved Chlorophyll-a Concentration (1997–2019) Over Global Ocean
The global coverage of Chlorophyll-a concentration (Chl-a) has been continuously available from ocean color satellite sensors since September 1997 and the Chl-a data (1997–2019) were used to produce a climatological dataset by averaging Chl-a values at same locations and same day of year. The constructed climatology can remarkably reduce the variability of satellite data and clearly exhibit the seasonal cycles, demonstrating that the growth and decay of phytoplankton recurs with similarly seasonal cycles year after year. As the shapes of time series of the climatology exhibit strong periodical change, we wonder whether the seasonality of Chl-a can be expressed by a mathematic equation. Our results show that sinusoid functions are suitable to describe cyclical variations of data in time series and patterns of the daily climatology can be matched by sine equations with parameters of mean, amplitude, phase, and frequency. Three types of sine equations were used to match the climatological Chl-a with Mean Relative Differences (MRD) of 7.1%, 4.5%, and 3.3%, respectively. The sine equation with four sinusoids can modulate the shapes of the fitted values to match various patterns of climatology with small MRD values (less than 5%) in about 90% of global oceans. The fitted values can reflect an overall pattern of seasonal cycles of Chl-a which can be taken as a time series of biomass baseline for describing the state of seasonal variations of phytoplankton. The amplitude images, the spatial patterns of seasonal variations of phytoplankton, can be used to identify the transition zone chlorophyll fronts. The timing of phytoplankton blooms is identified by the biggest peak of the fitted values and used to classify oceans as different bloom seasons, indicating that blooms occur in all four seasons with regional features. In global oceans within latitude domains (48°N–48°S), blooms occupy approximately half of the ocean (50.6%) during boreal winter (December–February) in the northern hemisphere and more than half (58.0%) during austral winter (June–August) in the southern hemisphere. Therefore, the sine equation can be used to match the daily Chl-a climatology and the fitted values can reflect the seasonal cycles of phytoplankton, which can be used to investigate the underlying phenological characteristics