5 research outputs found
Content-Based Medical Image Retrieval with Opponent Class Adaptive Margin Loss
Broadspread use of medical imaging devices with digital storage has paved the
way for curation of substantial data repositories. Fast access to image samples
with similar appearance to suspected cases can help establish a consulting
system for healthcare professionals, and improve diagnostic procedures while
minimizing processing delays. However, manual querying of large data
repositories is labor intensive. Content-based image retrieval (CBIR) offers an
automated solution based on dense embedding vectors that represent image
features to allow quantitative similarity assessments. Triplet learning has
emerged as a powerful approach to recover embeddings in CBIR, albeit
traditional loss functions ignore the dynamic relationship between opponent
image classes. Here, we introduce a triplet-learning method for automated
querying of medical image repositories based on a novel Opponent Class Adaptive
Margin (OCAM) loss. OCAM uses a variable margin value that is updated
continually during the course of training to maintain optimally discriminative
representations. CBIR performance of OCAM is compared against state-of-the-art
loss functions for representational learning on three public databases
(gastrointestinal disease, skin lesion, lung disease). Comprehensive
experiments in each application domain demonstrate the superior performance of
OCAM against baselines.Comment: 10 pages, 6 figure
A Novel Approach for Effective Multi-View Clustering with Information-Theoretic Perspective
Multi-view clustering (MVC) is a popular technique for improving clustering
performance using various data sources. However, existing methods primarily
focus on acquiring consistent information while often neglecting the issue of
redundancy across multiple views. This study presents a new approach called
Sufficient Multi-View Clustering (SUMVC) that examines the multi-view
clustering framework from an information-theoretic standpoint. Our proposed
method consists of two parts. Firstly, we develop a simple and reliable
multi-view clustering method SCMVC (simple consistent multi-view clustering)
that employs variational analysis to generate consistent information. Secondly,
we propose a sufficient representation lower bound to enhance consistent
information and minimise unnecessary information among views. The proposed
SUMVC method offers a promising solution to the problem of multi-view
clustering and provides a new perspective for analyzing multi-view data.
To verify the effectiveness of our model, we conducted a theoretical analysis
based on the Bayes Error Rate, and experiments on multiple multi-view datasets
demonstrate the superior performance of SUMVC
Contrastive Difference Predictive Coding
Predicting and reasoning about the future lie at the heart of many
time-series questions. For example, goal-conditioned reinforcement learning can
be viewed as learning representations to predict which states are likely to be
visited in the future. While prior methods have used contrastive predictive
coding to model time series data, learning representations that encode
long-term dependencies usually requires large amounts of data. In this paper,
we introduce a temporal difference version of contrastive predictive coding
that stitches together pieces of different time series data to decrease the
amount of data required to learn predictions of future events. We apply this
representation learning method to derive an off-policy algorithm for
goal-conditioned RL. Experiments demonstrate that, compared with prior RL
methods, ours achieves median improvement in success rates and can
better cope with stochastic environments. In tabular settings, we show that our
method is about more sample efficient than the successor
representation and more sample efficient than the standard (Monte
Carlo) version of contrastive predictive coding.Comment: Website (https://chongyi-zheng.github.io/td_infonce) and code
(https://github.com/chongyi-zheng/td_infonce
The artefacts of intelligence: governing scientists' contribution to AI proliferation
This DPhil dissertation is about attempts to govern how artificial intelligence (AI) researchers share their work. There is growing concern that the software artefacts built by AI researchers will have adverse impacts on society if made freely available online. AI research is a scientific field, and openly sharing these artefacts is routine and expected, as part of the functioning of the scientific field. Recently, there have been a number of occasions where members of the AI research community have trialled new ways of sharing their work, in response to their concerns that it poses risks to society. The case study follows: the ‘staged release’ of the GPT-2 language model, where more capable models were gradually released; the platform through which researchers and developers could access GPT-3, the successor to GPT-2; and a wave of new ethics regimes for AI conference publications. The study relies on 42 qualitative interviews with members of the AI research community, conducted between 2019 and 2021, as well as many other publicly available sources such as blog posts and Twitter. The aim is to understand how concerns about risk can become a feature of the way AI research is shared. Major themes are: the relationship between science and society; the relationship between industry AI labs and academia; the interplay between AI risks and AI governance regimes; and how the existing scientific field provides an insecure footing for new governance regimes