Search CORE

74 research outputs found

FedHM: Efficient Federated Learning for Heterogeneous Models via Low-rank Factorization

Author: Dai Yutong
Jin Hai
O'Neill Michael J
Pan Wanning
Sun Lichao
Wan Yao
Yao Dezhong
Publication venue
Publication date: 26/05/2022
Field of study

One underlying assumption of recent federated learning (FL) paradigms is that all local models usually share the same network architecture and size, which becomes impractical for devices with different hardware resources. A scalable federated learning framework should address the heterogeneity that clients have different computing capacities and communication capabilities. To this end, this paper proposes FedHM, a novel heterogeneous federated model compression framework, distributing the heterogeneous low-rank models to clients and then aggregating them into a full-rank model. Our solution enables the training of heterogeneous models with varying computational complexities and aggregates them into a single global model. Furthermore, FedHM significantly reduces the communication cost by using low-rank models. Extensive experimental results demonstrate that FedHM is superior in the performance and robustness of models of different sizes, compared with state-of-the-art heterogeneous FL methods under various FL settings. Additionally, the convergence guarantee of FL for heterogeneous devices is first theoretically analyzed

arXiv.org e-Print Archive

I Think, Therefore I am: Benchmarking Awareness of Large Language Models Using AwareBench

Author: Huang Yue
Li Yuan
Lin Yuli
Sun Lichao
Wan Yao
Wu Siyuan
Publication venue
Publication date: 16/02/2024
Field of study

Do large language models (LLMs) exhibit any forms of awareness similar to humans? In this paper, we introduce AwareBench, a benchmark designed to evaluate awareness in LLMs. Drawing from theories in psychology and philosophy, we define awareness in LLMs as the ability to understand themselves as AI models and to exhibit social intelligence. Subsequently, we categorize awareness in LLMs into five dimensions, including capability, mission, emotion, culture, and perspective. Based on this taxonomy, we create a dataset called AwareEval, which contains binary, multiple-choice, and open-ended questions to assess LLMs' understandings of specific awareness dimensions. Our experiments, conducted on 13 LLMs, reveal that the majority of them struggle to fully recognize their capabilities and missions while demonstrating decent social intelligence. We conclude by connecting awareness of LLMs with AI alignment and safety, emphasizing its significance to the trustworthy and ethical development of LLMs. Our dataset and code are available at https://github.com/HowieHwong/Awareness-in-LLM

arXiv.org e-Print Archive

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

Author: Fan Chenrui
Gong Neil Zhenqiang
Huang Yue
Li Yuan
Liu Yixin
Shi Jiawen
Sun Lichao
Wan Yao
Wu Siyuan
Zhang Qihui
Zhou Pan
Publication venue
Publication date: 23/10/2023
Field of study

Large language models (LLMs) have garnered significant attention due to their impressive natural language processing (NLP) capabilities. Recently, many studies have focused on the tool utilization ability of LLMs. They primarily investigated how LLMs effectively collaborate with given specific tools. However, in scenarios where LLMs serve as intelligent agents, as seen in applications like AutoGPT and MetaGPT, LLMs are expected to engage in intricate decision-making processes that involve deciding whether to employ a tool and selecting the most suitable tool(s) from a collection of available tools to fulfill user requests. Therefore, in this paper, we introduce MetaTool, a benchmark designed to evaluate whether LLMs have tool usage awareness and can correctly choose tools. Specifically, we create a dataset called ToolE within the benchmark. This dataset contains various types of user queries in the form of prompts that trigger LLMs to use tools, including both single-tool and multi-tool scenarios. Subsequently, we set the tasks for both tool usage awareness and tool selection. We define four subtasks from different perspectives in tool selection, including tool selection with similar choices, tool selection in specific scenarios, tool selection with possible reliability issues, and multi-tool selection. We conduct experiments involving nine popular LLMs and find that the majority of them still struggle to effectively select tools, highlighting the existing gaps between LLMs and genuine intelligent agents. However, through the error analysis, we found there is still significant room for improvement. Finally, we conclude with insights for tool developers that follow ChatGPT to provide detailed descriptions that can enhance the tool selection performance of LLMs

arXiv.org e-Print Archive

Instance Segmentation of Buildings using Keypoints

Author: Hua Yuansheng
Jin Pu
Li Qingyu
Mou LiChao
Shi Yilei
Sun Yao
Zhu Xiao Xiang
Publication venue
Publication date: 06/06/2020
Field of study

Building segmentation is of great importance in the task of remote sensing imagery interpretation. However, the existing semantic segmentation and instance segmentation methods often lead to segmentation masks with blurred boundaries. In this paper, we propose a novel instance segmentation network for building segmentation in high-resolution remote sensing images. More specifically, we consider segmenting an individual building as detecting several keypoints. The detected keypoints are subsequently reformulated as a closed polygon, which is the semantic boundary of the building. By doing so, the sharp boundary of the building could be preserved. Experiments are conducted on selected Aerial Imagery for Roof Segmentation (AIRS) dataset, and our method achieves better performance in both quantitative and qualitative results with comparison to the state-of-the-art methods. Our network is a bottom-up instance segmentation method that could well preserve geometric details

Institute of Transport Research:Publications

arXiv.org e-Print Archive

Aggressive Environment Resistance of Concrete Products Modified With Nano Alumina and Nano Silica

Author: Lichao Feng
Na Cui
Ning Xie
Ning Xie
Pengkun Hou
Shuyuan Zhao
Yao Zhao
Yunzhe Zhu
Publication venue: 'Frontiers Media SA'
Publication date: 01/06/2021
Field of study

The service life of concrete products with exposure to an aggressive environment has raised great concerns in the past decades. Nanomaterials have been used as a promising approach to improve the environmental resistance of concrete products when exposed to synergistic attacks. The impacts of CaCl2 on nano-modified concrete, especially along with freeze/thaw (F/T) and wet/dry (W/D) cycles, were barely discussed. In this study, the impacts of CaCl2 along with F/T and W/D cycles on the nano SiO2 and Al2O3 modified concrete were investigated. The mass loss, flexural strength, compressive strength, and relative dynamic modulus of elasticity were tested to evaluate the durability of concrete products. The testing results indicate that the addition of nanoparticles has a distinctive effect on the environment resistance enhancement of concrete samples. The microstructure analysis demonstrates that with the addition of nanoparticles, high-density hydration products were formed, which is beneficial to the properties enhancement of concrete products. This study not only provides an approach to realize the nano modification on the durability of concrete products but also helps to design and fabricate environmentally resistant concrete products when exposed to a synergistic aggressive environment

Directory of Open Access Journals

Permian-Triassic boundary microbialites (PTBMs) in soutwest China: implications for paleoenvironment reconstruction

Author: A Baud
A Baud
AD Woods
AH Knoll
AH Knoll
BJ Wang
C Dupraz
C Pietsch
C Richmond
CE Savrda
Cheng Huang
Chengbo Lian
D Yao
DC Rhoads
DJ Lehrmann
DJ Lehrmann
DJ Lehrmann
DJ Lehrmann
E Vennin
Fei Li
G Luo
GR Loope
Guang Hu
H Jiang
H Jiang
H Liu
H Rong
H Rong
H Song
H Song
H Tang
H Yang
Hao Tang
Hong Liu
J He
J Shen
J Wheeley
JG Ogg
JJ Sepkoski
JK Schubert
JL Payne
JP Grotzinger
K Hips
L He
L Liu
L Wang
L Xiang
Lichao Wang
Ling Li
M Hautmann
M Hautmann
M Mary
M Schobben
MA Arthur
MB Forel
ML Fraiser
N Adachi
N Li
N Zhong
P Chen
PB Wignall
PJ Marenco
PJ Marenco
PM Novack-Gottshall
PY Collin
R Wang
RA Berner
RJ Twitchett
RS Shapiro
S Kershaw
S Kershaw
S Kershaw
S Kershaw
S Kershaw
S Kershaw
S Kershaw
SA Mata
SB Pruss
SB Pruss
SB Pruss
SE Grasby
Stephen Kershaw
W Liao
W Oschmann
X Fu
X Mu
X Tan
X Xie
X Zhao
Xiucheng Tan
Xuefei Yang
Y Ezaki
Y Ezaki
Y Isozaki
Y Wang
Y Wu
YS Wu
YS Wu
ZQ Chen
Ø Hammer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Permian–Triassic boundary microbialites (PTBMs) are commonly interpreted to be a sedimentary response to upwelling of anoxic alkaline seawater and indicate a harsh marine environment in the Permian–Triassic transition. However, recent studies propose that PTBMs may instead be developed in an oxic environment, therefore necessitating the need to reassess the paleoenvironment of formation of PTBMs. This paper is an integrated study of the PTBM sequence at Yudongzi, northwest Sichuan Basin, which is one of the thickest units of PTBMs in south China. Analysis of conodont biostratigraphy, mega- to microscopic microbialite structures, stratigraphic variations in abundance and size of metazoan fossils, and total organic carbon (TOC) and total sulfur (TS) contents within the PTBM reveals the following results: (1) the microbialites occur mainly in the Hindeodus parvus Zone but may cross the Permian–Triassic boundary, and are comprised of, from bottom to top: lamellar thrombolites, dendritic thrombolites and lamellar-reticular thrombolites; (2) most metazoan fossils of the microbialite succession increase in abundance upsection, so does the sizes of bivalve and brachiopod fossils; (3) TOC and TS values of microbialites account respectively for 0.07 and 0.31 wt% on average, both of which are very low. The combination of increase in abundance and size of metazoan fossils upsection, together with the low TOC and TS contents, is evidence that the Yudongzi PTBMs developed in oxic seawater. We thus dispute the previous view, at least for the Chinese sequences, of low-oxygen seawater for microbialite growth, and question whether it is now appropriate to associate PTBMs with anoxic, harsh environments associated with the end-Permian extinction. Instead, we interpret those conditions as fully oxygenated.13th Five-Year Plan National Scientific and Technology Major Project (2016ZX05004002-001); National Natural Science Foundation of China (41602166)

Crossref

Springer - Publisher Connector

Brunel University Research Archive

Research of ontology model of heading face

Author: CHEN Lichao
PAN Lihu
YAO Huiting
Publication venue: Editorial Department of Industry and Mine Automation
Publication date: 01/09/2015
Field of study

The paper pointed out that building ontology model of heading face was an important step towards the construction of intelligent decision-making system based on deep analysis of production model of the heading face, obtained relationship between the staff, devices and environment of underground heading face by analyzing factors of mine production safety, and built safety production ontology model of the heading face. Hazardous workplace was taken as an example to simply verify the validity of construction method of the ontology model of heading face

Directory of Open Access Journals

Large-scale Building Height Estimation from Single VHR SAR image Using Fully Convolutional Network and GIS building footprints

Author: Hua Yuansheng
Mou LiChao
Sun Yao
Zhu Xiaoxiang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Institute of Transport Research:Publications

CG-Net: Conditional GIS-aware Network for Individual Building Segmentation in VHR SAR Images

Author: Hua Yuansheng
Mou LiChao
Sun Yao
Zhu Xiao Xiang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2022
Field of study

Object retrieval and reconstruction from very high resolution (VHR) synthetic aperture radar (SAR) images are of great importance for urban SAR applications, yet highly challenging owing to the complexity of SAR data. This paper addresses the issue of individual building segmentation from a single VHR SAR image in large-scale urban areas. To achieve this, we introduce building footprints from GIS data as complementary information and propose a novel conditional GIS-aware network (CG-Net). The proposed model learns multi-level visual features and employs building footprints to normalize the features for predicting building masks in the SAR image. We validate our method using a high resolution spotlight TerraSAR-X image collected over Berlin. Experimental results show that the proposed CG-Net effectively brings improvements with variant backbones. We further compare two representations of building footprints, namely complete building footprints and sensor-visible footprint segments, for our task, and conclude that the use of the former leads to better segmentation results. Moreover, we investigate the impact of inaccurate GIS data on our CG-Net, and this study shows that CG-Net is robust against positioning errors in GIS data. In addition, we propose an approach of ground truth generation of buildings from an accurate digital elevation model (DEM), which can be used to generate large-scale SAR image datasets. The segmentation results can be applied to reconstruct 3D building models at level-of-detail (LoD) 1, which is demonstrated in our experiments

Institute of Transport Research:Publications