12 research outputs found
Out-of-Distribution Detection in Long-Tailed Recognition with Calibrated Outlier Class Learning
Existing out-of-distribution (OOD) methods have shown great success on
balanced datasets but become ineffective in long-tailed recognition (LTR)
scenarios where 1) OOD samples are often wrongly classified into head classes
and/or 2) tail-class samples are treated as OOD samples. To address these
issues, current studies fit a prior distribution of auxiliary/pseudo OOD data
to the long-tailed in-distribution (ID) data. However, it is difficult to
obtain such an accurate prior distribution given the unknowingness of real OOD
samples and heavy class imbalance in LTR. A straightforward solution to avoid
the requirement of this prior is to learn an outlier class to encapsulate the
OOD samples. The main challenge is then to tackle the aforementioned confusion
between OOD samples and head/tail-class samples when learning the outlier
class. To this end, we introduce a novel calibrated outlier class learning
(COCL) approach, in which 1) a debiased large margin learning method is
introduced in the outlier class learning to distinguish OOD samples from both
head and tail classes in the representation space and 2) an outlier-class-aware
logit calibration method is defined to enhance the long-tailed classification
confidence. Extensive empirical results on three popular benchmarks CIFAR10-LT,
CIFAR100-LT, and ImageNet-LT demonstrate that COCL substantially outperforms
state-of-the-art OOD detection methods in LTR while being able to improve the
classification accuracy on ID data. Code is available at
https://github.com/mala-lab/COCL.Comment: AAAI2024, with supplementary materia
Learning Adversarial Semantic Embeddings for Zero-Shot Recognition in Open Worlds
Zero-Shot Learning (ZSL) focuses on classifying samples of unseen classes
with only their side semantic information presented during training. It cannot
handle real-life, open-world scenarios where there are test samples of unknown
classes for which neither samples (e.g., images) nor their side semantic
information is known during training. Open-Set Recognition (OSR) is dedicated
to addressing the unknown class issue, but existing OSR methods are not
designed to model the semantic information of the unseen classes. To tackle
this combined ZSL and OSR problem, we consider the case of "Zero-Shot Open-Set
Recognition" (ZS-OSR), where a model is trained under the ZSL setting but it is
required to accurately classify samples from the unseen classes while being
able to reject samples from the unknown classes during inference. We perform
large experiments on combining existing state-of-the-art ZSL and OSR models for
the ZS-OSR task on four widely used datasets adapted from the ZSL task, and
reveal that ZS-OSR is a non-trivial task as the simply combined solutions
perform badly in distinguishing the unseen-class and unknown-class samples. We
further introduce a novel approach specifically designed for ZS-OSR, in which
our model learns to generate adversarial semantic embeddings of the unknown
classes to train an unknowns-informed ZS-OSR classifier. Extensive empirical
results show that our method 1) substantially outperforms the combined
solutions in detecting the unknown classes while retaining the classification
accuracy on the unseen classes and 2) achieves similar superiority under
generalized ZS-OSR settings
Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects
This perspective paper proposes a series of interactive scenarios that
utilize Artificial Intelligence (AI) to enhance classroom teaching, such as
dialogue auto-completion, knowledge and style transfer, and assessment of
AI-generated content. By leveraging recent developments in Large Language
Models (LLMs), we explore the potential of AI to augment and enrich
teacher-student dialogues and improve the quality of teaching. Our goal is to
produce innovative and meaningful conversations between teachers and students,
create standards for evaluation, and improve the efficacy of AI-for-Education
initiatives. In Section 3, we discuss the challenges of utilizing existing LLMs
to effectively complete the educated tasks and present a unified framework for
addressing diverse education dataset, processing lengthy conversations, and
condensing information to better accomplish more downstream tasks. In Section
4, we summarize the pivoting tasks including Teacher-Student Dialogue
Auto-Completion, Expert Teaching Knowledge and Style Transfer, and Assessment
of AI-Generated Content (AIGC), providing a clear path for future research. In
Section 5, we also explore the use of external and adjustable LLMs to improve
the generated content through human-in-the-loop supervision and reinforcement
learning. Ultimately, this paper seeks to highlight the potential for AI to aid
the field of education and promote its further exploration.Comment: 16 pages, 2 figure
Carbon Price Forecasting with Quantile Regression and Feature Selection
Carbon futures has recently emerged as a novel financial asset in the trading
markets such as the European Union and China. Monitoring the trend of the
carbon price has become critical for both national policy-making as well as
industrial manufacturing planning. However, various geopolitical, social, and
economic factors can impose substantial influence on the carbon price. Due to
its volatility and non-linearity, predicting accurate carbon prices is
generally a difficult task. In this study, we propose to improve carbon price
forecasting with several novel practices. First, we collect various influencing
factors, including commodity prices, export volumes such as oil and natural
gas, and prosperity indices. Then we select the most significant factors and
disclose their optimal grouping for explainability. Finally, we use the Sparse
Quantile Group Lasso and Adaptive Sparse Quantile Group Lasso for robust price
predictions. We demonstrate through extensive experimental studies that our
proposed methods outperform existing ones. Also, our quantile predictions
provide a complete profile of future prices at different levels, which better
describes the distributions of the carbon market
Physiochemical properties of modified starch under yogurt manufacturing conditions and its relation to the properties of yogurt
The characteristics of three acetylated distarch phosphates with different degree of cross linking (ADP-L < ADP-M < ADP-H) and acetylation were studied under yogurt manufacture conditions, and the properties of yogurts made with these starches were evaluated. The modified starch showed lower solubility and viscosity than native starch (NS), but better resistance to acid and shear force was obtained. The acid milk gels containing modified starches exhibited well-organized and homogenized microstructure, while much denser structure with large aggregates were observed in control and NS samples. The modified starch improved the properties of yogurt more effectively than NS at 0.5% concentration, in terms of yield stress, consistency, apparent viscosity, thixotropy, pseudoplasticity. By increasing the concentration, ADP-M showed increasing positive effect on apparent viscosity, thixotropy, pseudoplasticity, firmness, adhesiveness of yogurt; while no significant difference or adverse effect was seen with ADP-L or ADP-H