Search CORE

238 research outputs found

CD-CNN: A Partially Supervised Cross-Domain Deep Learning Model for Urban Resident Recognition

Author: He Xu
Wang Jingyuan
Wang Ze
Wu Junjie
Xie Xing
Xiong Zhang
Yuan Nicholas Jing
Publication venue
Publication date: 25/04/2018
Field of study

Driven by the wave of urbanization in recent decades, the research topic about migrant behavior analysis draws great attention from both academia and the government. Nevertheless, subject to the cost of data collection and the lack of modeling methods, most of existing studies use only questionnaire surveys with sparse samples and non-individual level statistical data to achieve coarse-grained studies of migrant behaviors. In this paper, a partially supervised cross-domain deep learning model named CD-CNN is proposed for migrant/native recognition using mobile phone signaling data as behavioral features and questionnaire survey data as incomplete labels. Specifically, CD-CNN features in decomposing the mobile data into location domain and communication domain, and adopts a joint learning framework that combines two convolutional neural networks with a feature balancing scheme. Moreover, CD-CNN employs a three-step algorithm for training, in which the co-training step is of great value to partially supervised cross-domain learning. Comparative experiments on the city Wuxi demonstrate the high predictive power of CD-CNN. Two interesting applications further highlight the ability of CD-CNN for in-depth migrant behavioral analysis.Comment: 8 pages, 5 figures, conferenc

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Full Bayesian Significance Testing for Neural Networks

Author: He Yue
Li Zimeng
Liu Zehua
Wang Jingyuan
Publication venue
Publication date: 24/01/2024
Field of study

Significance testing aims to determine whether a proposition about the population distribution is the truth or not given observations. However, traditional significance testing often needs to derive the distribution of the testing statistic, failing to deal with complex nonlinear relationships. In this paper, we propose to conduct Full Bayesian Significance Testing for neural networks, called \textit{n}FBST, to overcome the limitation in relationship characterization of traditional approaches. A Bayesian neural network is utilized to fit the nonlinear and multi-dimensional relationships with small errors and avoid hard theoretical derivation by computing the evidence value. Besides, \textit{n}FBST can test not only global significance but also local and instance-wise significance, which previous testing methods don't focus on. Moreover, \textit{n}FBST is a general framework that can be extended based on the measures selected, such as Grad-\textit{n}FBST, LRP-\textit{n}FBST, DeepLIFT-\textit{n}FBST, LIME-\textit{n}FBST. A range of experiments on both simulated and real data are conducted to show the advantages of our method.Comment: Published as a conference paper at AAAI 202

arXiv.org e-Print Archive

TransFusion: Covariate-Shift Robust Transfer Learning for High-Dimensional Regression

Author: He Zelin
Li Runze
Liu Jingyuan
Sun Ying
Publication venue
Publication date: 01/04/2024
Field of study

The main challenge that sets transfer learning apart from traditional supervised learning is the distribution shift, reflected as the shift between the source and target models and that between the marginal covariate distributions. In this work, we tackle model shifts in the presence of covariate shifts in the high-dimensional regression setting. Specifically, we propose a two-step method with a novel fused-regularizer that effectively leverages samples from source tasks to improve the learning performance on a target task with limited samples. Nonasymptotic bound is provided for the estimation error of the target model, showing the robustness of the proposed method to covariate shifts. We further establish conditions under which the estimator is minimax-optimal. Additionally, we extend the method to a distributed setting, allowing for a pretraining-finetuning strategy, requiring just one round of communication while retaining the estimation rate of the centralized version. Numerical tests validate our theory, highlighting the method's robustness to covariate shifts.Comment: Accepted by the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024

arXiv.org e-Print Archive

Self-Supervised Deconfounding Against Spatio-Temporal Shifts: Theory and Modeling

Author: He Yue
Huang Chao
Ji Jiahao
Wang Jingyuan
Zhang Wentao
Publication venue
Publication date: 06/03/2024
Field of study

As an important application of spatio-temporal (ST) data, ST traffic forecasting plays a crucial role in improving urban travel efficiency and promoting sustainable development. In practice, the dynamics of traffic data frequently undergo distributional shifts attributed to external factors such as time evolution and spatial differences. This entails forecasting models to handle the out-of-distribution (OOD) issue where test data is distributed differently from training data. In this work, we first formalize the problem by constructing a causal graph of past traffic data, future traffic data, and external ST contexts. We reveal that the failure of prior arts in OOD traffic data is due to ST contexts acting as a confounder, i.e., the common cause for past data and future ones. Then, we propose a theoretical solution named Disentangled Contextual Adjustment (DCA) from a causal lens. It differentiates invariant causal correlations against variant spurious ones and deconfounds the effect of ST contexts. On top of that, we devise a Spatio-Temporal sElf-superVised dEconfounding (STEVE) framework. It first encodes traffic data into two disentangled representations for associating invariant and variant ST contexts. Then, we use representative ST contexts from three conceptually different perspectives (i.e., temporal, spatial, and semantic) as self-supervised signals to inject context information into both representations. In this way, we improve the generalization ability of the learned context-oriented representations to OOD ST traffic forecasting. Comprehensive experiments on four large-scale benchmark datasets demonstrate that our STEVE consistently outperforms the state-of-the-art baselines across various ST OOD scenarios.Comment: 14 pages, 9 figure

arXiv.org e-Print Archive

Distance measurements via the morphogen gradient of Bicoid in Drosophila embryos

Author: Cheung David
Deng Jingyuan
He Feng
Jiao Renjie
Lu Long J
Ma Jun
Wen Ying
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Patterning along the anterior-posterior (A-P) axis in <it>Drosophila </it>embryos is instructed by the morphogen gradient of Bicoid (Bcd). Despite extensive studies of this morphogen, how embryo geometry may affect gradient formation and target responses has not been investigated experimentally. Results In this report, we systematically compare the Bcd gradient profiles and its target expression patterns on the dorsal and ventral sides of the embryo. Our results support a hypothesis that proper distance measurement and the encoded positional information of the Bcd gradient are along the perimeter of the embryo. Our results also reveal that the dorsal and ventral sides of the embryo have a fundamentally similar relationship between Bcd and its target Hunchback (Hb), suggesting that Hb expression properties on the two sides of the embryo can be directly traced to Bcd gradient properties. Our 3-D simulation studies show that a curvature difference between the two sides of an embryo is sufficient to generate Bcd gradient properties that are consistent with experimental observations. Conclusions The findings described in this report provide a first quantitative, experimental evaluation of embryo geometry on Bcd gradient formation and target responses. They demonstrate that the physical features of an embryo, such as its shape, are integral to how pattern is formed.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Validating Multimedia Content Moderation Software via Semantic Fusion

Author: Chen Chang
Gu Jiazhen
He Pinjia
Huang Jingyuan
Lyu Michael
Wang Wenxuan
Wu Weibin
Zhang Jianping
Publication venue
Publication date: 22/05/2023
Field of study

The exponential growth of social media platforms, such as Facebook and TikTok, has revolutionized communication and content publication in human society. Users on these platforms can publish multimedia content that delivers information via the combination of text, audio, images, and video. Meanwhile, the multimedia content release facility has been increasingly exploited to propagate toxic content, such as hate speech, malicious advertisements, and pornography. To this end, content moderation software has been widely deployed on these platforms to detect and blocks toxic content. However, due to the complexity of content moderation models and the difficulty of understanding information across multiple modalities, existing content moderation software can fail to detect toxic content, which often leads to extremely negative impacts. We introduce Semantic Fusion, a general, effective methodology for validating multimedia content moderation software. Our key idea is to fuse two or more existing single-modal inputs (e.g., a textual sentence and an image) into a new input that combines the semantics of its ancestors in a novel manner and has toxic nature by construction. This fused input is then used for validating multimedia content moderation software. We realized Semantic Fusion as DUO, a practical content moderation software testing tool. In our evaluation, we employ DUO to test five commercial content moderation software and two state-of-the-art models against three kinds of toxic content. The results show that DUO achieves up to 100% error finding rate (EFR) when testing moderation software. In addition, we leverage the test cases generated by DUO to retrain the two models we explored, which largely improves model robustness while maintaining the accuracy on the original test set.Comment: Accepted by ISSTA 202

arXiv.org e-Print Archive

The Model 2.0 and Friends: An Interim Report

Author: Agarwal Shivaank
Cottrell Garrison W
Fischer Thomas
Gahl Martha
He Jingyuan
Kulkarni Shubham
Li Cody
Long Keyu
Shah Yash
Venkatramani Shashank
Zhi Xuzhe
Publication venue: 'Purdue University (bepress)'
Publication date: 16/05/2023
Field of study

Last year, I reported on preliminary results of an anatomically-inspired deep learning model of the visual system and its role in explaining the face inversion effect. This year, I will report on new results and some variations on network architectures that we have explored, mainly as a way to generate discussion and get feedback. This is by no means a polished, final presentation! We look forward to the group’s suggestions for these projects

Purdue E-Pubs

Amino acid Formula induces Microbiota Dysbiosis and Depressive-Like Behavior in Mice

Author: Bi Guoqiang
Dou Yiping
He Kaixin
Hu Ji
Huang Chuan
Niu Chaoshi
Wang Hao
Wang Jingyuan
Yang Yifei
Zhang Guorong
Zhang Lan
Zhu Shu
Publication venue: DigitalCommons@TMC
Publication date: 26/03/2024
Field of study

Amino acid formula (AAF) is increasingly consumed in infants with cow\u27s milk protein allergy; however, the long-term influences on health are less described. In this study, we established a mouse model by subjecting neonatal mice to an amino acid diet (AAD) to mimic the feeding regimen of infants on AAF. Surprisingly, AAD-fed mice exhibited dysbiotic microbiota and increased neuronal activity in both the intestine and brain, as well as gastrointestinal peristalsis disorders and depressive-like behavior. Furthermore, fecal microbiota transplantation from AAD-fed mice or AAF-fed infants to recipient mice led to elevated neuronal activations and exacerbated depressive-like behaviors compared to that from normal chow-fed mice or cow\u27s-milk-formula-fed infants, respectively. Our findings highlight the necessity to avoid the excessive use of AAF, which may influence the neuronal development and mental health of children

DigitalCommons@The Texas Medical Center