172 research outputs found
Leveraging Language Representation for Material Recommendation, Ranking, and Exploration
Data-driven approaches for material discovery and design have been
accelerated by emerging efforts in machine learning. While there is enormous
progress towards learning the structure to property relationship of materials,
methods that allow for general representations of crystals to effectively
explore the vast material search space and identify high-performance candidates
remain limited. In this work, we introduce a material discovery framework that
uses natural language embeddings derived from material science-specific
language models as representations of compositional and structural features.
The discovery framework consists of a joint scheme that, given a query
material, first recalls candidates based on representational similarity, and
ranks the candidates based on target properties through multi-task learning.
The contextual knowledge encoded in language representations is found to convey
information about material properties and structures, enabling both similarity
analysis for recall, and multi-task learning to share information for related
properties. By applying the discovery framework to thermoelectric materials, we
demonstrate diversified recommendations of prototype structures and identify
under-studied high-performance material spaces, including halide perovskite,
delafossite-like, and spinel-like structures. By leveraging material language
representations, our framework provides a generalized means for effective
material recommendation, which is task-agnostic and can be applied to various
material systems
Pseudo Replay-based Class Continual Learning for Online New Category Anomaly Detection in Additive Manufacturing
The incorporation of advanced sensors and machine learning techniques has
enabled modern manufacturing enterprises to perform data-driven in-situ quality
monitoring based on the sensor data collected in manufacturing processes.
However, one critical challenge is that newly presented defect category may
manifest as the manufacturing process continues, resulting in monitoring
performance deterioration of previously trained machine learning models. Hence,
there is an increasing need for empowering machine learning model to learn
continually. Among all continual learning methods, memory-based continual
learning has the best performance but faces the constraints of data storage
capacity. To address this issue, this paper develops a novel pseudo
replay-based continual learning by integrating class incremental learning and
oversampling-based data generation. Without storing all the data, the developed
framework could generate high-quality data representing previous classes to
train machine learning model incrementally when new category anomaly occurs. In
addition, it could even enhance the monitoring performance since it also
effectively improves the data quality. The effectiveness of the proposed
framework is validated in an additive manufacturing process, which leverages
supervised classification problem for anomaly detection. The experimental
results show that the developed method is very promising in detecting novel
anomaly while maintaining a good performance on the previous task and brings up
more flexibility in model architecture
MEET: Mobility-Enhanced Edge inTelligence for Smart and Green 6G Networks
Edge intelligence is an emerging paradigm for real-time training and
inference at the wireless edge, thus enabling mission-critical applications.
Accordingly, base stations (BSs) and edge servers (ESs) need to be densely
deployed, leading to huge deployment and operation costs, in particular the
energy costs. In this article, we propose a new framework called
Mobility-Enhanced Edge inTelligence (MEET), which exploits the sensing,
communication, computing, and self-powering capabilities of intelligent
connected vehicles for the smart and green 6G networks. Specifically, the
operators can incorporate infrastructural vehicles as movable BSs or ESs, and
schedule them in a more flexible way to align with the communication and
computation traffic fluctuations. Meanwhile, the remaining compute resources of
opportunistic vehicles are exploited for edge training and inference, where
mobility can further enhance edge intelligence by bringing more compute
resources, communication opportunities, and diverse data. In this way, the
deployment and operation costs are spread over the vastly available vehicles,
so that the edge intelligence is realized cost-effectively and sustainably.
Furthermore, these vehicles can be either powered by renewable energy to reduce
carbon emissions, or charged more flexibly during off-peak hours to cut
electricity bills.Comment: This paper has been accepted by IEEE Communications Magazin
Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations
Recommender models excel at providing domain-specific item recommendations by
leveraging extensive user behavior data. Despite their ability to act as
lightweight domain experts, they struggle to perform versatile tasks such as
providing explanations and engaging in conversations. On the other hand, large
language models (LLMs) represent a significant step towards artificial general
intelligence, showcasing remarkable capabilities in instruction comprehension,
commonsense reasoning, and human interaction. However, LLMs lack the knowledge
of domain-specific item catalogs and behavioral patterns, particularly in areas
that diverge from general world knowledge, such as online e-commerce.
Finetuning LLMs for each domain is neither economic nor efficient.
In this paper, we bridge the gap between recommender models and LLMs,
combining their respective strengths to create a versatile and interactive
recommender system. We introduce an efficient framework called InteRecAgent,
which employs LLMs as the brain and recommender models as tools. We first
outline a minimal set of essential tools required to transform LLMs into
InteRecAgent. We then propose an efficient workflow within InteRecAgent for
task execution, incorporating key components such as a memory bus, dynamic
demonstration-augmented task planning, and reflection. InteRecAgent enables
traditional recommender systems, such as those ID-based matrix factorization
models, to become interactive systems with a natural language interface through
the integration of LLMs. Experimental results on several public datasets show
that InteRecAgent achieves satisfying performance as a conversational
recommender system, outperforming general-purpose LLMs.Comment: 16 pages, 15 figures, 4 table
RecExplainer: Aligning Large Language Models for Recommendation Model Interpretability
Recommender systems are widely used in various online services, with
embedding-based models being particularly popular due to their expressiveness
in representing complex signals. However, these models often lack
interpretability, making them less reliable and transparent for both users and
developers. With the emergence of large language models (LLMs), we find that
their capabilities in language expression, knowledge-aware reasoning, and
instruction following are exceptionally powerful. Based on this, we propose a
new model interpretation approach for recommender systems, by using LLMs as
surrogate models and learn to mimic and comprehend target recommender models.
Specifically, we introduce three alignment methods: behavior alignment,
intention alignment, and hybrid alignment. Behavior alignment operates in the
language space, representing user preferences and item information as text to
learn the recommendation model's behavior; intention alignment works in the
latent space of the recommendation model, using user and item representations
to understand the model's behavior; hybrid alignment combines both language and
latent spaces for alignment training. To demonstrate the effectiveness of our
methods, we conduct evaluation from two perspectives: alignment effect, and
explanation generation ability on three public datasets. Experimental results
indicate that our approach effectively enables LLMs to comprehend the patterns
of recommendation models and generate highly credible recommendation
explanations.Comment: 12 pages, 8 figures, 4 table
Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing
International audienceOptimally solving decentralized partially observable Markov decision processes (Dec-POMDPs) under either full or no information sharing received significant attention in recent years. However, little is known about how partial information sharing affects existing theory and algorithms. This paper addresses this question for a team of two agents, with one-sided information sharing, i.e. both agents have imperfect information about the state of the world, but only one has access to what the other sees and does. From the perspective of a central planner, we show that the original problem can be reformulated into an equivalent information-state Markov decision process and solved as such. Besides, we prove that the optimal value function exhibits a specific form of uniform continuity. We also present heuristic search algorithms utilizing this property and providing the first results for this family of problems
Time-varying resonant mass at collider and beam dump experiments
A new particle usually manifests itself as a single resonant peak located at its mass. We propose if the new particle mass is time-varying due to environmental effects, then its mass spectrum typically has a novel double-peak feature. A representative model is the kinetic mixing dark photon interacting with an ultralight complex scalar dark matter charged under U(1)\u27. We reanalyze the existing experiments, showing the constraints on such a model are drastically weakened than those on the traditional single-peak resonance model, due to the reduction of the luminosity exposure in each resonant mass bin. Consequently, for mass around tens of MeV, the muon gÎĽ -2 solution from the kinetic mixing dark photon becomes viable again. The scenario can be further tested by reanalyzing the existing data with timing information included
- …