Search CORE

77 research outputs found

Latent Embeddings for Collective Activity Recognition

Author: Hu Jian-Fang
Tang Yongyi
Zhang Peizhen
Zheng Wei-Shi
Publication venue
Publication date: 20/09/2017
Field of study

Rather than simply recognizing the action of a person individually, collective activity recognition aims to find out what a group of people is acting in a collective scene. Previ- ous state-of-the-art methods using hand-crafted potentials in conventional graphical model which can only define a limited range of relations. Thus, the complex structural de- pendencies among individuals involved in a collective sce- nario cannot be fully modeled. In this paper, we overcome these limitations by embedding latent variables into feature space and learning the feature mapping functions in a deep learning framework. The embeddings of latent variables build a global relation containing person-group interac- tions and richer contextual information by jointly modeling broader range of individuals. Besides, we assemble atten- tion mechanism during embedding for achieving more com- pact representations. We evaluate our method on three col- lective activity datasets, where we contribute a much larger dataset in this work. The proposed model has achieved clearly better performance as compared to the state-of-the- art methods in our experiments.Comment: 6pages, accepted by IEEE-AVSS201

arXiv.org e-Print Archive

Crossref

Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity

Author: Hu Wei
Yan Junchi
Yang Xiaojiang
Yang Yongyi
Zhou Zhanpeng
Publication venue
Publication date: 13/11/2023
Field of study

Recent work has revealed many intriguing empirical phenomena in neural network training, despite the poorly understood and highly complex loss landscapes and training dynamics. One of these phenomena, Linear Mode Connectivity (LMC), has gained considerable attention due to the intriguing observation that different solutions can be connected by a linear path in the parameter space while maintaining near-constant training and test losses. In this work, we introduce a stronger notion of linear connectivity, Layerwise Linear Feature Connectivity (LLFC), which says that the feature maps of every layer in different trained networks are also linearly connected. We provide comprehensive empirical evidence for LLFC across a wide range of settings, demonstrating that whenever two trained networks satisfy LMC (via either spawning or permutation methods), they also satisfy LLFC in nearly all the layers. Furthermore, we delve deeper into the underlying factors contributing to LLFC, which reveal new insights into the spawning and permutation approaches. The study of LLFC transcends and advances our understanding of LMC by adopting a feature-learning perspective.Comment: 25 pages, 23 figure

arXiv.org e-Print Archive

DSCom: A Data-Driven Self-Adaptive Community-Based Framework for Influence Maximization in Social Networks

Author: Gao Xiaofeng
Guo Jianxiong
Hu Yongyi
Sun Haojia
Zuo Yuxin
Publication venue
Publication date: 18/11/2023
Field of study

Influence maximization aims to find a subset of seeds that maximize the influence spread under a given budget. In this paper, we mainly address the data-driven version of this problem, where the diffusion model is not given but needs to be inferred from the history cascades. Several previous works have addressed this topic in a statistical way and provided efficient algorithms with theoretical guarantee. However, in their settings, though the diffusion parameters are inferred, they still need users to preset the diffusion model, which can be an intractable problem in real-world practices. In this paper, we reformulate the problem on the attributed network and leverage the node attributes to estimate the closeness between the connected nodes. Specifically, we propose a machine learning-based framework, named DSCom, to address this problem in a heuristic way. Under this framework, we first infer the users' relationship from the diffusion dataset through attention mechanism and then leverage spectral clustering to overcome the influence overlap problem in the lack of exact diffusion formula. Compared to the previous theoretical works, we carefully designed empirical experiments with parameterized diffusion models based on real-world social networks, which prove the efficiency and effectiveness of our algorithm

arXiv.org e-Print Archive

Skywork: A More Open Bilingual Foundation Model

Author: Chen Yifu
Cheng Cheng
Cheng Peng
Cheng Wenjun
Dong Chuanhai
Fang Han
Hu Rui
Li Biye
Li Chenxia
Liang Xiaojuan
Lin Lei
Liu Lunan
Luo Xilin
Lü Weiwei
Ma Yutuan
Peng Yongyi
Sun Yanqi
Wang Lijie
Wang Xiaokun
Wei Tianwen
Wu Xuejie
Yan Shuicheng
Yang Haihua
Yang Liu
Zhang Jianhao
Zhang Lichang
Zhang Xiaoyu
Zhao Liang
Zhou Yahui
Zhu Bo
Publication venue
Publication date: 30/10/2023
Field of study

In this technical report, we present Skywork-13B, a family of large language models (LLMs) trained on a corpus of over 3.2 trillion tokens drawn from both English and Chinese texts. This bilingual foundation model is the most extensively trained and openly published LLMs of comparable size to date. We introduce a two-stage training methodology using a segmented corpus, targeting general purpose training and then domain-specific enhancement training, respectively. We show that our model not only excels on popular benchmarks, but also achieves \emph{state of the art} performance in Chinese language modeling on diverse domains. Furthermore, we propose a novel leakage detection method, demonstrating that test data contamination is a pressing issue warranting further investigation by the LLM community. To spur future research, we release Skywork-13B along with checkpoints obtained during intermediate stages of the training process. We are also releasing part of our SkyPile corpus, a collection of over 150 billion tokens of web text, which is the largest high quality open Chinese pre-training corpus to date. We hope Skywork-13B and our open corpus will serve as a valuable open-source resource to democratize access to high-quality LLMs

arXiv.org e-Print Archive

Association of Glomerular Filtration Rate with High-Sensitivity Cardiac Troponin T in a Community-Based Population Study in Beijing

Author: A Shankar
AE Gabr
AV Chobanian
B Conway
BJ Freda
BM Everett
CR deFilippi
DW Kehl
EM Antman
ES Ford
Fan Wang
FS Apple
Hongmei Wu
J Hu
JG Dickhout
JH Diris
JK Inrig
John E. Mendelson
JS Alpert
K Thygesen
L Zhang
Leiming Luo
LX Zhang
M Kanbay
M Nakayama
MP Turakhia
NA Abbas
Ping Ye
PJ Best
PW Wilson
R Latini
R Sharma
RN Foley
Ruyi Xu
Ruyi Xu
SJ Saunders
T Berl
T Ninomiya
T Omland
T Otsuka
T Tsutamoto
TW Wallace
YC Ma
Yongyi Bai
Publication venue: Public Library of Science
Publication date: 31/05/2012
Field of study

BACKGROUND: Reduced renal function is an independent risk factor for cardiovascular disease mortality, and persistently elevated cardiac troponin T (cTnT) is frequently observed in patients with end-stage renal disease. In the general population the relationship between renal function and cTnT levels may not be clear because of the low sensitivity of the assay. In this study, we investigated the level of cTnT using a highly sensitive assay (hs-cTnT) and evaluated the association of estimated glomerular filtration rate (eGFR) with detectable hs-cTnT levels in a community-based population. METHODS: The serum hs-cTnT levels were measured in 1365 community dwelling population aged ≥45 years in Beijing, China. eGFR was determined by the Chinese modifying modification of diet in renal disease (C-MDRD) equation. RESULTS: With the highly sensitive assay, cTnT levels were detectable (≥3pg/mL) in 744 subjects (54.5%). The result showed that eGFR was associated with Log hs-cTnT (r = -0.14, P<0.001). After adjustment for the high predicted Framingham Coronary Heart Disease (CHD) risk (10-year risk >20%) and other prognostic indicators, moderate to severe reduced eGFR was independently associated with detectable hs-cTnT, whereas normal to mildly reduced eGFR was not independently associated with detectable hs-cTnT. In addition, after adjustment for other risk factors, the high predicted Framingham CHD risk was associated with detectable hs-cTnT in the subjects with different quartile levels of eGFR. CONCLUSION: The levels of hs-cTnT are detectable in a community-based Chinese population and low eGFR is associated with detectable hs-cTnT. Moreover, eGFR and high predicted Framingham CHD risk are associated with detectable hs-cTnT in subjects with moderate-to-severe reduced renal function

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Multiple sex partner behavior in female undergraduate students in China: A multi-campus survey

Abstract Background China is realizing increases in women engaged in premarital sex and multiple sex partner behavior. Our aim was to examine prevalence and determinants of multiple sex partner behavior among female undergraduates in China. Methods Anonymously completed questionnaires were received from 4,769 unmarried female undergraduates, recruited using randomized cluster sampling by type of university and students' major and grade. Items captured demographic, family, peer and work influence, and student factors (major, academic performance, and sex-related knowledge and attitudes). To examine risk factors for sexual behaviors, we used multi-level logistic regression, yielding odds ratios (OR) and 95% confidence intervals (95% CI). Results Of 4,769 female students, 863 (18.10%) reported ever having sexual intercourse, and 5.31% reported having multiple sex partners (29.32% of all women having sexual intercourse). Several demographic, family, peer and work influences, and student factors (including major, performance, knowledge, and attitude toward sex) were risk factors for ever having sex. However, risk factors for multiple sex partners only included working in a place of entertainment, having current close friends that were living with boyfriends, poor academic performance, and positive attitudes toward multiple partners. These women also were more likely to practice masturbation, start having sex at a younger age, have sex with married men and/or men not their "boyfriends" at first coitus, and not use condoms consistently. Conclusion A small but important subset of Chinese female undergraduates is engaged in unprotected sex with multiple sex partners. Interventions need to target at risk women, stressing the importance of consistent condom use.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Progesterone Inhibits Epithelial-to-Mesenchymal Transition in Endometrial Cancer

Background: Every year approximately 74,000 women die of endometrial cancer, mainly due to recurrent or metastatic disease. The presence of tumor infiltrating lymphocytes (TILs) as well as progesterone receptor (PR) positivity has been correlated with improved prognosis. This study describes two mechanisms by which

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

Erasmus University Digital Repository

The Performance Effect of Inter-Firm Trust in Corrupt Environments: A Signaling Theory Perspective

Author: Hu Wenjin
Shou Yongyi
Publication venue
Publication date: 01/01/2019
Field of study

Explore Bristol Research

To See from Where You Stand: Supply Chain Network Positions and Merger and Acquisition Likelihood

Author: Dong Yan
Hu Wenjin
Shou Yongyi
Publication venue
Publication date: 01/01/2020
Field of study

Explore Bristol Research

Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations

Author: Hu Wei
Steinhardt Jacob
Yang Yongyi
Publication venue
Publication date: 29/06/2023
Field of study

Recent work has observed an intriguing ''Neural Collapse'' phenomenon in well-trained neural networks, where the last-layer representations of training samples with the same label collapse into each other. This appears to suggest that the last-layer representations are completely determined by the labels, and do not depend on the intrinsic structure of input distribution. We provide evidence that this is not a complete description, and that the apparent collapse hides important fine-grained structure in the representations. Specifically, even when representations apparently collapse, the small amount of remaining variation can still faithfully and accurately captures the intrinsic structure of input distribution. As an example, if we train on CIFAR-10 using only 5 coarse-grained labels (by combining two classes into one super-class) until convergence, we can reconstruct the original 10-class labels from the learned representations via unsupervised clustering. The reconstructed labels achieve

93\%

accuracy on the CIFAR-10 test set, nearly matching the normal CIFAR-10 accuracy for the same architecture. We also provide an initial theoretical result showing the fine-grained representation structure in a simplified synthetic setting. Our results show concretely how the structure of input data can play a significant role in determining the fine-grained structure of neural representations, going beyond what Neural Collapse predicts.Comment: This paper has been accepted as a conference paper at ICML 202

arXiv.org e-Print Archive