172 research outputs found

Leveraging Language Representation for Material Recommendation, Ranking, and Exploration

Authors: Ertekin Elif; Qu Jiaxing; Xie Yuxuan Richard
Publication date: 01/05/2023

Data-driven approaches for material discovery and design have been accelerated by emerging efforts in machine learning. While there is enormous progress towards learning the structure-to-property relationship of materials, methods that allow for general representations of crystals to effectively explore the vast material search space and identify high-performance candidates remain limited. In this work, we introduce a material discovery framework that uses natural language embeddings derived from material science-specific language models as representations of compositional and structural features. The discovery framework consists of a joint scheme that, given a query material, first recalls candidates based on representational similarity, and then ranks the candidates based on target properties through multi-task learning. The contextual knowledge encoded in language representations is found to convey information about material properties and structures, enabling both similarity analysis for recall and multi-task learning to share information for related properties. By applying the discovery framework to thermoelectric materials, we demonstrate diversified recommendations of prototype structures and identify under-studied high-performance material spaces, including halide perovskite, delafossite-like, and spinel-like structures. By leveraging material language representations, our framework provides a generalized means for effective material recommendation, which is task-agnostic and can be applied to various material systems.
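
The recall-then-rank scheme described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the candidate names, the toy two-dimensional embeddings, and the precomputed property scores are all hypothetical, and cosine similarity is assumed as the similarity measure. In the paper the property scores would come from a multi-task model; here they are simply given.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def recall_then_rank(query_vec, candidates, top_k):
    """candidates maps a name to (embedding, predicted_property_score).

    Step 1 (recall): keep the top_k candidates most similar to the query.
    Step 2 (rank): order the recalled candidates by predicted property score.
    """
    recalled = sorted(
        candidates,
        key=lambda name: cosine(query_vec, candidates[name][0]),
        reverse=True,
    )[:top_k]
    return sorted(recalled, key=lambda name: candidates[name][1], reverse=True)

# Hypothetical toy data: (embedding, assumed predicted property score).
toy_candidates = {
    "mat_A": ([0.9, 0.1], 0.2),
    "mat_B": ([0.8, 0.2], 0.9),
    "mat_C": ([0.0, 1.0], 1.5),  # high score, but dissimilar to the query
    "mat_D": ([0.7, 0.7], 0.5),
}
ranking = recall_then_rank([1.0, 0.0], toy_candidates, top_k=2)
print(ranking)
```

In this toy run, mat_C has the highest score but is never recalled because its embedding is orthogonal to the query, which shows how the recall step bounds what the ranking step can return.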

arXiv.org e-Print Archive

Pseudo Replay-based Class Continual Learning for Online New Category Anomaly Detection in Additive Manufacturing

Authors: Li Yuxuan; Liu Chenang; Shi Zhangyue; Xie Tianxin
Publication date: 04/12/2023

The incorporation of advanced sensors and machine learning techniques has enabled modern manufacturing enterprises to perform data-driven in-situ quality monitoring based on the sensor data collected in manufacturing processes. However, one critical challenge is that new defect categories may emerge as the manufacturing process continues, degrading the monitoring performance of previously trained machine learning models. Hence, there is an increasing need to empower machine learning models to learn continually. Among all continual learning methods, memory-based continual learning has the best performance but faces the constraints of data storage capacity. To address this issue, this paper develops a novel pseudo replay-based continual learning framework by integrating class incremental learning and oversampling-based data generation. Without storing all the data, the developed framework can generate high-quality data representing previous classes to train the machine learning model incrementally when a new category of anomaly occurs. In addition, it can even enhance the monitoring performance, since it also effectively improves the data quality. The effectiveness of the proposed framework is validated in an additive manufacturing process, which leverages a supervised classification problem for anomaly detection. The experimental results show that the developed method is very promising in detecting novel anomalies while maintaining good performance on the previous task, and that it offers more flexibility in model architecture.
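
The idea of replaying generated data instead of stored data can be sketched roughly as follows. This is an illustration under stated assumptions, not the paper's method: the abstract only says "oversampling-based data generation", so the SMOTE-style linear interpolation between two retained samples, the function names, and the class labels are all hypothetical choices made here.

```python
import random

def interpolate_oversample(samples, n_new, rng):
    """Generate n_new synthetic points, each on the line segment between
    two distinct retained samples (assumed SMOTE-style interpolation)."""
    synthetic = []
    for _ in range(n_new):
        a, b = rng.sample(samples, 2)  # needs at least two retained samples
        t = rng.random()               # interpolation weight in [0, 1)
        synthetic.append([x + t * (y - x) for x, y in zip(a, b)])
    return synthetic

def build_replay_set(memory, new_data, n_per_class, seed=0):
    """memory maps an old class label to a few retained feature vectors.
    new_data is a list of (features, label) pairs for the new category.
    Returns pseudo samples for the old classes followed by the new data."""
    rng = random.Random(seed)
    train = []
    for label, samples in memory.items():
        for point in interpolate_oversample(samples, n_per_class, rng):
            train.append((point, label))
    train.extend(new_data)
    return train

# Hypothetical toy data: two old classes kept as two samples each.
memory = {"normal": [[0.0, 0.0], [1.0, 1.0]], "defect_a": [[5.0, 5.0], [6.0, 6.0]]}
new_data = [([10.0, 10.0], "defect_b")]
train_set = build_replay_set(memory, new_data, n_per_class=3)
print(len(train_set))
```

The resulting set would then be used to update a classifier so that it learns the new category while still seeing representative data for the earlier ones.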

arXiv.org e-Print Archive

MEET: Mobility-Enhanced Edge inTelligence for Smart and Green 6G Networks

Authors: Niu Zhisheng; Sun Yuxuan; Xie Bowen; Zhou Sheng
Publication date: 26/10/2022

Edge intelligence is an emerging paradigm for real-time training and inference at the wireless edge, thus enabling mission-critical applications. Accordingly, base stations (BSs) and edge servers (ESs) need to be densely deployed, leading to huge deployment and operation costs, in particular energy costs. In this article, we propose a new framework called Mobility-Enhanced Edge inTelligence (MEET), which exploits the sensing, communication, computing, and self-powering capabilities of intelligent connected vehicles for smart and green 6G networks. Specifically, operators can incorporate infrastructural vehicles as movable BSs or ESs, and schedule them more flexibly to align with fluctuations in communication and computation traffic. Meanwhile, the remaining compute resources of opportunistic vehicles are exploited for edge training and inference, where mobility can further enhance edge intelligence by bringing more compute resources, communication opportunities, and diverse data. In this way, the deployment and operation costs are spread over the vastly available vehicles, so that edge intelligence is realized cost-effectively and sustainably. Furthermore, these vehicles can be either powered by renewable energy to reduce carbon emissions, or charged more flexibly during off-peak hours to cut electricity bills.

Comment: This paper has been accepted by IEEE Communications Magazine

arXiv.org e-Print Archive

Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations

Authors: Huang Xu; Lei Yuxuan; Lian Defu; Lian Jianxun; Xie Xing; Yao Jing
Publication date: 01/09/2023

Recommender models excel at providing domain-specific item recommendations by leveraging extensive user behavior data. Despite their ability to act as lightweight domain experts, they struggle to perform versatile tasks such as providing explanations and engaging in conversations. On the other hand, large language models (LLMs) represent a significant step towards artificial general intelligence, showcasing remarkable capabilities in instruction comprehension, commonsense reasoning, and human interaction. However, LLMs lack the knowledge of domain-specific item catalogs and behavioral patterns, particularly in areas that diverge from general world knowledge, such as online e-commerce. Finetuning LLMs for each domain is neither economical nor efficient. In this paper, we bridge the gap between recommender models and LLMs, combining their respective strengths to create a versatile and interactive recommender system. We introduce an efficient framework called InteRecAgent, which employs LLMs as the brain and recommender models as tools. We first outline a minimal set of essential tools required to transform LLMs into InteRecAgent. We then propose an efficient workflow within InteRecAgent for task execution, incorporating key components such as a memory bus, dynamic demonstration-augmented task planning, and reflection. InteRecAgent enables traditional recommender systems, such as ID-based matrix factorization models, to become interactive systems with a natural language interface through the integration of LLMs. Experimental results on several public datasets show that InteRecAgent achieves satisfying performance as a conversational recommender system, outperforming general-purpose LLMs.

Comment: 16 pages, 15 figures, 4 tables

arXiv.org e-Print Archive

RecExplainer: Aligning Large Language Models for Recommendation Model Interpretability

Authors: Huang Xu; Lei Yuxuan; Lian Defu; Lian Jianxun; Xie Xing; Yao Jing
Publication date: 17/11/2023

Recommender systems are widely used in various online services, with embedding-based models being particularly popular due to their expressiveness in representing complex signals. However, these models often lack interpretability, making them less reliable and transparent for both users and developers. With the emergence of large language models (LLMs), we find that their capabilities in language expression, knowledge-aware reasoning, and instruction following are exceptionally powerful. Based on this, we propose a new model interpretation approach for recommender systems that uses LLMs as surrogate models, which learn to mimic and comprehend target recommender models. Specifically, we introduce three alignment methods: behavior alignment, intention alignment, and hybrid alignment. Behavior alignment operates in the language space, representing user preferences and item information as text to learn the recommendation model's behavior; intention alignment works in the latent space of the recommendation model, using user and item representations to understand the model's behavior; hybrid alignment combines both language and latent spaces for alignment training. To demonstrate the effectiveness of our methods, we evaluate them on three public datasets from two perspectives: alignment effect and explanation generation ability. Experimental results indicate that our approach effectively enables LLMs to comprehend the patterns of recommendation models and generate highly credible recommendation explanations.

Comment: 12 pages, 8 figures, 4 tables

arXiv.org e-Print Archive

Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing

Authors: Buffet Olivier; Dibangoye Jilles; Xie Yuxuan
Publication venue: HAL CCSD
Publication date: 12/07/2020

Optimally solving decentralized partially observable Markov decision processes (Dec-POMDPs) under either full or no information sharing has received significant attention in recent years. However, little is known about how partial information sharing affects existing theory and algorithms. This paper addresses this question for a team of two agents with one-sided information sharing, i.e., both agents have imperfect information about the state of the world, but only one has access to what the other sees and does. From the perspective of a central planner, we show that the original problem can be reformulated into an equivalent information-state Markov decision process and solved as such. In addition, we prove that the optimal value function exhibits a specific form of uniform continuity. We also present heuristic search algorithms that utilize this property and provide the first results for this family of problems.

INRIA a CCSD electronic archive server

Time-varying resonant mass at collider and beam dump experiments

Authors: Guo Jinhui; He Yuxuan; Liu Jia; Wang Xiao-Ping; Xie Ke-Pan
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 28/06/2022

A new particle usually manifests itself as a single resonant peak located at its mass. We propose that if the new particle mass is time-varying due to environmental effects, then its mass spectrum typically has a novel double-peak feature. A representative model is the kinetic mixing dark photon interacting with an ultralight complex scalar dark matter charged under U(1)'. We reanalyze the existing experiments, showing that the constraints on such a model are drastically weaker than those on the traditional single-peak resonance model, due to the reduction of the luminosity exposure in each resonant mass bin. Consequently, for masses around tens of MeV, the muon g_μ - 2 solution from the kinetic mixing dark photon becomes viable again. The scenario can be further tested by reanalyzing the existing data with timing information included.

DigitalCommons@University of Nebraska