177 research outputs found

GLIME: General, Stable and Local LIME Explanation

Author: Li Jian
Tan Zeren
Tian Yang
Publication date: 27/11/2023

As black-box machine learning models grow in complexity and find applications in high-stakes scenarios, it is imperative to provide explanations for their predictions. Although Local Interpretable Model-agnostic Explanations (LIME) [22] is a widely adopted method for understanding model behaviors, it is unstable with respect to random seeds [35,24,3] and exhibits low local fidelity (i.e., how well the explanation approximates the model's local behaviors) [21,16]. Our study shows that this instability stems from small sample weights, which lead to the dominance of regularization and slow convergence. Additionally, LIME's sampling neighborhood is non-local and biased towards the reference, resulting in poor local fidelity and sensitivity to the choice of reference. To tackle these challenges, we introduce GLIME, an enhanced framework that extends LIME and unifies several prior methods. Within the GLIME framework, we derive an equivalent formulation of LIME that achieves significantly faster convergence and improved stability. By employing a local and unbiased sampling distribution, GLIME generates explanations with higher local fidelity than LIME. GLIME explanations are independent of the choice of reference. Moreover, GLIME offers users the flexibility to choose a sampling distribution based on their specific scenarios. Comment: Accepted by NeurIPS 2023 as a Spotlight paper
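The claim that small sample weights let regularization dominate can be illustrated with a minimal sketch. It assumes, beyond what the abstract states, that each of d binary features is switched off independently with probability 0.5 and that a sample with k features switched off receives weight exp(-k/σ²). The function names, the values of σ, and the ridge strength of 1.0 are hypothetical choices for illustration, not the paper's settings.

```python
import math


def expected_kernel_weight(d, sigma, p_off=0.5):
    """Mean of exp(-k / sigma**2) when k ~ Binomial(d, p_off).

    Assumption: k counts the features switched off in a perturbed sample,
    and the kernel is exponential in k. Both are illustrative choices.
    """
    t = 1.0 / sigma ** 2
    # Closed form of E[exp(-t * k)] for a binomial count k.
    return (1.0 - p_off + p_off * math.exp(-t)) ** d


def expected_total_weight(n_samples, d, sigma):
    """Expected sum of the sample weights over n_samples perturbations."""
    return n_samples * expected_kernel_weight(d, sigma)


if __name__ == "__main__":
    ridge = 1.0  # hypothetical regularization strength
    for sigma in (0.25, 1.0, 5.0):
        total = expected_total_weight(1000, d=20, sigma=sigma)
        # When the total weight falls below the ridge strength, the penalty
        # term outweighs the data-fitting term in the weighted regression.
        print(sigma, total, total < ridge)
```

Under these assumptions, a narrow kernel makes the combined weight of a thousand samples smaller than the penalty, which is one way to read the abstract's remark about slow convergence.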

arXiv.org e-Print Archive

Bi-level Actor-Critic for Multi-agent Coordination

Author: Chen Weizhe
Huang Zeren
Li Minne
Wang Jun
Yang Yaodong
Zhang Haifeng
Zhang Weinan
Publication date: 03/04/2020

Coordination is one of the essential problems in multi-agent systems. Typically, multi-agent reinforcement learning (MARL) methods treat agents equally, and the goal is to solve the Markov game to an arbitrary Nash equilibrium (NE) when multiple equilibria exist, thus lacking a solution for NE selection. In this paper, we treat agents unequally and consider Stackelberg equilibrium as a potentially better convergence point than Nash equilibrium in terms of Pareto superiority, especially in cooperative environments. Under Markov games, we formally define the bi-level reinforcement learning problem of finding a Stackelberg equilibrium. We propose a novel bi-level actor-critic learning method that allows agents to have different knowledge bases (thus intelligent), while their actions can still be executed simultaneously and in a distributed manner. A convergence proof is given, and the resulting learning algorithm is tested against state-of-the-art methods. We found that the proposed bi-level actor-critic algorithm successfully converged to the Stackelberg equilibria in matrix games and found an asymmetric solution in a highway merge environment.
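The difference between settling on an arbitrary Nash equilibrium and selecting a Stackelberg equilibrium can be sketched on a small matrix game. The payoff tables below are hypothetical and are not taken from the paper. The code enumerates pure-strategy equilibria by brute force, which is only a stand-in for the learning method the abstract describes.

```python
# Hypothetical 2x2 cooperative game: both agents receive the same payoff.
# Rows index the leader's action, columns index the follower's action.
LEADER = [[10, 0], [0, 5]]
FOLLOWER = [[10, 0], [0, 5]]


def pure_nash(leader, follower):
    """Return all pure-strategy Nash equilibria as (row, column) pairs."""
    rows, cols = len(leader), len(leader[0])
    equilibria = []
    for a in range(rows):
        for b in range(cols):
            # Neither agent can gain by deviating unilaterally.
            leader_ok = all(leader[a][b] >= leader[a2][b] for a2 in range(rows))
            follower_ok = all(follower[a][b] >= follower[a][b2] for b2 in range(cols))
            if leader_ok and follower_ok:
                equilibria.append((a, b))
    return equilibria


def stackelberg(leader, follower):
    """Leader commits first; follower best-responds to the committed action."""
    best = None
    for a in range(len(leader)):
        b = max(range(len(follower[a])), key=lambda j: follower[a][j])
        if best is None or leader[a][b] > leader[best[0]][best[1]]:
            best = (a, b)
    return best


if __name__ == "__main__":
    print(pure_nash(LEADER, FOLLOWER))
    print(stackelberg(LEADER, FOLLOWER))
```

In this toy game there are two pure Nash equilibria with different payoffs, while the leader-follower ordering picks out the one with the higher payoff for both agents.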

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors

Author: Jiang Zeren
Li Xiaojie
Liao Yue
Liu Si
Qian Chen
Wang Fei
Wang Luting
Wu Jianlong
Publication date: 12/07/2022

Conventional knowledge distillation (KD) methods for object detection mainly concentrate on homogeneous teacher-student detectors. However, the design of a lightweight detector for deployment is often significantly different from that of a high-capacity detector. Thus, we investigate KD among heterogeneous teacher-student pairs for wider applicability. We observe that the core difficulty for heterogeneous KD (hetero-KD) is the significant semantic gap between the backbone features of heterogeneous detectors, caused by their different optimization manners. Conventional homogeneous KD (homo-KD) methods suffer from this gap and struggle to obtain satisfactory performance for hetero-KD directly. In this paper, we propose the HEtero-Assists Distillation (HEAD) framework, leveraging heterogeneous detection heads as assistants to guide the optimization of the student detector and reduce this gap. In HEAD, the assistant is an additional detection head, with an architecture homogeneous to the teacher head, attached to the student backbone. Thus, a hetero-KD is transformed into a homo-KD, allowing efficient knowledge transfer from the teacher to the student. Moreover, we extend HEAD into a Teacher-Free HEAD (TF-HEAD) framework for cases where a well-trained teacher detector is unavailable. Our method achieves significant improvement compared to current detection KD methods. For example, on the MS-COCO dataset, TF-HEAD helps R18 RetinaNet achieve 33.9 mAP (+2.2), while HEAD further pushes the limit to 36.2 mAP (+4.5). Comment: ECCV 2022, Code: https://github.com/LutingWang/HEA

arXiv.org e-Print Archive

Siamese DETR

Author: Chen Zeren
Huang Gengshi
Li Wei
Loy Chen Change
Shao Jing
Sheng Lu
Teng Jianing
Wang Kun
Publication date: 31/03/2023

Recent self-supervised methods are mainly designed for representation learning with the base model, e.g., ResNets or ViTs. They cannot be easily transferred to DETR, which has task-specific Transformer modules. In this work, we present Siamese DETR, a Siamese self-supervised pretraining approach for the Transformer architecture in DETR. We consider learning view-invariant and detection-oriented representations simultaneously through two complementary tasks, i.e., localization and discrimination, in a novel multi-view learning framework. Two self-supervised pretext tasks are designed: (i) Multi-View Region Detection aims at learning to localize regions-of-interest between augmented views of the input, and (ii) Multi-View Semantic Discrimination attempts to improve object-level discrimination for each region. The proposed Siamese DETR achieves state-of-the-art transfer performance on COCO and PASCAL VOC detection using different DETR variants in all setups. Code is available at https://github.com/Zx55/SiameseDETR. Comment: 10 pages, 11 figures. Accepted in CVPR 202

arXiv.org e-Print Archive

Theoretical foundations of studying criticality in the brain

Author: Chen Chun
Cheng Aohua
Hou Hedong
Li Guoqi
Qiu Yike
Sun Pei
Tan Zeren
Tian Yang
Weng Kangyu
Publication venue: 'MIT Press - Journals'
Publication date: 08/06/2023

Criticality is hypothesized as a physical mechanism underlying efficient transitions between cortical states and remarkable information processing capacities in the brain. While considerable evidence generally supports this hypothesis, non-negligible controversies persist regarding the ubiquity of criticality in neural dynamics and its role in information processing. Validity issues frequently arise when identifying potential brain criticality from empirical data. Moreover, the functional benefits implied by brain criticality are frequently misconceived or unduly generalized. These problems stem from the non-triviality and immaturity of the physical theories that analytically derive brain criticality and the statistical techniques that estimate brain criticality from empirical data. To help solve these problems, we present a systematic review and reformulate the foundations of studying brain criticality, i.e., ordinary criticality (OC), quasi-criticality (qC), self-organized criticality (SOC), and self-organized quasi-criticality (SOqC), using the terminology of neuroscience. We offer accessible explanations of the physical theories and statistical techniques of brain criticality, providing step-by-step derivations to characterize neural dynamics as a physical system with avalanches. We summarize error-prone details and existing limitations in brain criticality analysis and suggest possible solutions. Moreover, we present a forward-looking perspective on how optimizing the foundations of studying brain criticality can deepen our understanding of various neuroscience questions.

arXiv.org e-Print Archive

Seismo-ionospheric anomalies in ionospheric TEC and plasma density before the 17 July 2006 M 7.7 south of Java earthquake

Author: Dan Tao
Jinbin Cao
Lanwei Wang
Liuyuan Li
Malcolm Wray Dunlop
Roberto Battiston
Wenlong Liu
Yuduan Ma
Zeren Zhima
Publication date: 01/04/2017

In this paper, we report significant evidence for preseismic ionospheric anomalies in total electron content (TEC) of the global ionosphere map (GIM) and plasma density appearing on day 2 before the 17 July 2006 M7.7 south of Java earthquake. After distinguishing other anomalies related to geomagnetic activity, we found a temporal precursor around the epicenter on day 2 before the earthquake (15 July 2006), which agrees well with the spatial variations in latitude–longitude–time (LLT) maps. Meanwhile, the sequences of latitude–time–TEC (LTT) plots reveal that the TECs on the epicenter side decrease anomalously, leading to an anomalous asymmetric structure with respect to the magnetic equator in the daytime from day 2 before the earthquake. This anomalous asymmetric structure disappears after the earthquake. To further confirm these anomalies, we studied the plasma data from the DEMETER satellite in the earthquake preparation zone (2046.4 km in radius) during the period from day 45 before to day 10 after the earthquake, and also found that the electron and total ion densities in the daytime increase significantly on day 2 before the earthquake. Very interestingly, O+ density increases significantly and H+ density decreases, while He+ remains relatively stable. These results indicate that there exists a distinct preseismic signal (preseismic ionospheric anomaly) over the epicenter.
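The abstract does not say how the 2046.4 km radius of the earthquake preparation zone was obtained. One relation commonly used for this purpose is Dobrovolsky's empirical formula, R = 10^(0.43 M) km, and it reproduces the quoted figure for M = 7.7. The sketch below assumes that this is the relation the authors applied; the function name is hypothetical.

```python
def preparation_zone_radius_km(magnitude):
    """Radius of the earthquake preparation zone in kilometres.

    Assumption: Dobrovolsky's empirical relation R = 10 ** (0.43 * M).
    The abstract quotes only the resulting radius, not the formula.
    """
    return 10 ** (0.43 * magnitude)


if __name__ == "__main__":
    # Magnitude of the 17 July 2006 south of Java earthquake, per the abstract.
    print(round(preparation_zone_radius_km(7.7), 1))
```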

Crossref

Directory of Open Access Journals

Open Access Repository

A Retrospective Analysis of the Clinical Features of Inpatients With Epilepsy in the Ganzi Tibetan Autonomous Prefecture

Author: Dong Zhou
Jiani Chen
Jianmei Peng
Jie Chen
Jie Mu
Sisi Li
Wenyu Fang
Xintong Wu
Yingjuan Li
Yongqiao He
Yongyi Deng
Zhamu Zeren
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018

Background: There is limited detailed clinical information for patients with epilepsy in Tibet. This study sought to provide data about the clinical features of epilepsy in the Ganzi Tibetan Autonomous Prefecture to improve strategies for epilepsy prevention and management in this region. Methods: We reviewed the clinical records of patients with epilepsy in the Neurology Department, Ganzi Tibetan Autonomous Prefecture People's Hospital, and compared their clinical features with those of a control group from West China Hospital in Chengdu. Results: This retrospective study included 165 patients with epilepsy admitted between January 2015 and February 2018. The majority of patients (97%) in this study had active epilepsy; 28.5% had generalized onset seizures and 68.5% had focal onset seizures. Fifty-four patients had received anti-epileptic drug (AED) treatment prior to hospitalization; however, 38 (70.4%) of them took the medication irregularly. The leading etiology in this cohort was head trauma (20.6%), followed by stroke (10.9%), neurocysticercosis (7.9%), brain hydatidosis (6.7%) and tuberculous infection (5.5%). Compared with in-patients in Chengdu, epilepsy in Ganzi was more frequently caused by infection (OR = 4.216, 95% CI, 2.124–8.367), including neurocysticercosis (OR = 29.301, 95% CI, 1.727–497.167) and brain hydatidosis (OR = 24.637, 95% CI, 1.439–421.670). Conclusions: These data suggest that the control of cerebral infections, especially parasitic infection, is essential for the prevention of epilepsy in the Ganzi Tibetan Autonomous Prefecture. Educating local primary care doctors and patients about epilepsy will enable better management of epilepsy in this population.
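The reported odds ratios and confidence intervals can be checked for internal consistency using only the numbers in the abstract. The sketch below assumes the intervals are symmetric on the log scale, as for a Wald-type interval; the abstract does not state the method used. Under that assumption, the geometric mean of the two bounds should recover the point estimate, and the interval width implies a standard error for log(OR). The names are hypothetical.

```python
import math

# (odds ratio, lower 95% bound, upper 95% bound) as quoted in the abstract.
REPORTED = {
    "infection": (4.216, 2.124, 8.367),
    "neurocysticercosis": (29.301, 1.727, 497.167),
    "brain hydatidosis": (24.637, 1.439, 421.670),
}


def log_scale_midpoint(lower, upper):
    """Geometric mean of the bounds, i.e. the midpoint on the log scale."""
    return math.sqrt(lower * upper)


def implied_log_se(lower, upper, z=1.96):
    """Standard error of log(OR) implied by a log-symmetric 95% interval."""
    return (math.log(upper) - math.log(lower)) / (2 * z)


if __name__ == "__main__":
    for name, (odds_ratio, lower, upper) in REPORTED.items():
        print(name, odds_ratio, log_scale_midpoint(lower, upper), implied_log_se(lower, upper))
```

The much wider intervals for the two parasitic infections correspond to larger implied standard errors. That pattern is typical when one cell of the comparison table holds very few cases, but the abstract does not give the underlying counts.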

Directory of Open Access Journals

Frontiers - Publisher Connector

Preparation and electrochemical properties of pomegranate-shaped Fe₂O₃/C anodes for Li-ion batteries

Author: Chen Zhihong
Ke Chengzhi
Li Meixian
Qin Chunling
Tan Minghui
Tan Taizhe
Wang Yichao
Wang Zhifeng
Zhang Xiaomin
Zhao Yan
Zhao Zeren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

Due to severe volume expansion and poor cycle stability, transition metal oxide anodes still do not meet the requirements for commercial use. We herein demonstrate, for the first time, a synthetic method for a core-shell pomegranate-shaped Fe₂O₃/C nano-composite via a one-step hydrothermal process. Its electrochemical performance was measured as an anode material for Li-ion batteries. It exhibits excellent cycling performance, sustaining a reversible capacity of 705 mAh g⁻¹ after 100 cycles at 100 mA g⁻¹. The anodes also showed good rate stability, with a discharge capacity of 480 mAh g⁻¹ when cycling at a rate of 2000 mA g⁻¹. The excellent Li storage properties can be attributed to the unique core-shell pomegranate structure, which not only ensures good electrical conductivity for the active Fe₂O₃ but also accommodates the huge volume change during cycling and facilitates the fast diffusion of Li ions.

Institutional Repository of Institute of Automation, CAS

Deakin Research Online

Shenyang Institute of Automation,Chinese Academy Of Sciences

Directory of Open Access Journals

Western Sydney ResearchDirect