77 research outputs found

    Latent Embeddings for Collective Activity Recognition

    Full text link
    Rather than simply recognizing the action of a person individually, collective activity recognition aims to find out what a group of people is acting in a collective scene. Previ- ous state-of-the-art methods using hand-crafted potentials in conventional graphical model which can only define a limited range of relations. Thus, the complex structural de- pendencies among individuals involved in a collective sce- nario cannot be fully modeled. In this paper, we overcome these limitations by embedding latent variables into feature space and learning the feature mapping functions in a deep learning framework. The embeddings of latent variables build a global relation containing person-group interac- tions and richer contextual information by jointly modeling broader range of individuals. Besides, we assemble atten- tion mechanism during embedding for achieving more com- pact representations. We evaluate our method on three col- lective activity datasets, where we contribute a much larger dataset in this work. The proposed model has achieved clearly better performance as compared to the state-of-the- art methods in our experiments.Comment: 6pages, accepted by IEEE-AVSS201

    Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity

    Full text link
    Recent work has revealed many intriguing empirical phenomena in neural network training, despite the poorly understood and highly complex loss landscapes and training dynamics. One of these phenomena, Linear Mode Connectivity (LMC), has gained considerable attention due to the intriguing observation that different solutions can be connected by a linear path in the parameter space while maintaining near-constant training and test losses. In this work, we introduce a stronger notion of linear connectivity, Layerwise Linear Feature Connectivity (LLFC), which says that the feature maps of every layer in different trained networks are also linearly connected. We provide comprehensive empirical evidence for LLFC across a wide range of settings, demonstrating that whenever two trained networks satisfy LMC (via either spawning or permutation methods), they also satisfy LLFC in nearly all the layers. Furthermore, we delve deeper into the underlying factors contributing to LLFC, which reveal new insights into the spawning and permutation approaches. The study of LLFC transcends and advances our understanding of LMC by adopting a feature-learning perspective.Comment: 25 pages, 23 figure

    DSCom: A Data-Driven Self-Adaptive Community-Based Framework for Influence Maximization in Social Networks

    Full text link
    Influence maximization aims to find a subset of seeds that maximize the influence spread under a given budget. In this paper, we mainly address the data-driven version of this problem, where the diffusion model is not given but needs to be inferred from the history cascades. Several previous works have addressed this topic in a statistical way and provided efficient algorithms with theoretical guarantee. However, in their settings, though the diffusion parameters are inferred, they still need users to preset the diffusion model, which can be an intractable problem in real-world practices. In this paper, we reformulate the problem on the attributed network and leverage the node attributes to estimate the closeness between the connected nodes. Specifically, we propose a machine learning-based framework, named DSCom, to address this problem in a heuristic way. Under this framework, we first infer the users' relationship from the diffusion dataset through attention mechanism and then leverage spectral clustering to overcome the influence overlap problem in the lack of exact diffusion formula. Compared to the previous theoretical works, we carefully designed empirical experiments with parameterized diffusion models based on real-world social networks, which prove the efficiency and effectiveness of our algorithm

    Skywork: A More Open Bilingual Foundation Model

    Full text link
    In this technical report, we present Skywork-13B, a family of large language models (LLMs) trained on a corpus of over 3.2 trillion tokens drawn from both English and Chinese texts. This bilingual foundation model is the most extensively trained and openly published LLMs of comparable size to date. We introduce a two-stage training methodology using a segmented corpus, targeting general purpose training and then domain-specific enhancement training, respectively. We show that our model not only excels on popular benchmarks, but also achieves \emph{state of the art} performance in Chinese language modeling on diverse domains. Furthermore, we propose a novel leakage detection method, demonstrating that test data contamination is a pressing issue warranting further investigation by the LLM community. To spur future research, we release Skywork-13B along with checkpoints obtained during intermediate stages of the training process. We are also releasing part of our SkyPile corpus, a collection of over 150 billion tokens of web text, which is the largest high quality open Chinese pre-training corpus to date. We hope Skywork-13B and our open corpus will serve as a valuable open-source resource to democratize access to high-quality LLMs

    Association of Glomerular Filtration Rate with High-Sensitivity Cardiac Troponin T in a Community-Based Population Study in Beijing

    Get PDF
    BACKGROUND: Reduced renal function is an independent risk factor for cardiovascular disease mortality, and persistently elevated cardiac troponin T (cTnT) is frequently observed in patients with end-stage renal disease. In the general population the relationship between renal function and cTnT levels may not be clear because of the low sensitivity of the assay. In this study, we investigated the level of cTnT using a highly sensitive assay (hs-cTnT) and evaluated the association of estimated glomerular filtration rate (eGFR) with detectable hs-cTnT levels in a community-based population. METHODS: The serum hs-cTnT levels were measured in 1365 community dwelling population aged β‰₯45 years in Beijing, China. eGFR was determined by the Chinese modifying modification of diet in renal disease (C-MDRD) equation. RESULTS: With the highly sensitive assay, cTnT levels were detectable (β‰₯3pg/mL) in 744 subjects (54.5%). The result showed that eGFR was associated with Log hs-cTnT (rβ€Š=β€Š-0.14, P<0.001). After adjustment for the high predicted Framingham Coronary Heart Disease (CHD) risk (10-year risk >20%) and other prognostic indicators, moderate to severe reduced eGFR was independently associated with detectable hs-cTnT, whereas normal to mildly reduced eGFR was not independently associated with detectable hs-cTnT. In addition, after adjustment for other risk factors, the high predicted Framingham CHD risk was associated with detectable hs-cTnT in the subjects with different quartile levels of eGFR. CONCLUSION: The levels of hs-cTnT are detectable in a community-based Chinese population and low eGFR is associated with detectable hs-cTnT. Moreover, eGFR and high predicted Framingham CHD risk are associated with detectable hs-cTnT in subjects with moderate-to-severe reduced renal function

    Multiple sex partner behavior in female undergraduate students in China: A multi-campus survey

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>China is realizing increases in women engaged in premarital sex and multiple sex partner behavior. Our aim was to examine prevalence and determinants of multiple sex partner behavior among female undergraduates in China.</p> <p>Methods</p> <p>Anonymously completed questionnaires were received from 4,769 unmarried female undergraduates, recruited using randomized cluster sampling by type of university and students' major and grade. Items captured demographic, family, peer and work influence, and student factors (major, academic performance, and sex-related knowledge and attitudes). To examine risk factors for sexual behaviors, we used multi-level logistic regression, yielding odds ratios (OR) and 95% confidence intervals (95% CI).</p> <p>Results</p> <p>Of 4,769 female students, 863 (18.10%) reported ever having sexual intercourse, and 5.31% reported having multiple sex partners (29.32% of all women having sexual intercourse). Several demographic, family, peer and work influences, and student factors (including major, performance, knowledge, and attitude toward sex) were risk factors for ever having sex. However, risk factors for multiple sex partners only included working in a place of entertainment, having current close friends that were living with boyfriends, poor academic performance, and positive attitudes toward multiple partners. These women also were more likely to practice masturbation, start having sex at a younger age, have sex with married men and/or men not their "boyfriends" at first coitus, and not use condoms consistently.</p> <p>Conclusion</p> <p>A small but important subset of Chinese female undergraduates is engaged in unprotected sex with multiple sex partners. Interventions need to target at risk women, stressing the importance of consistent condom use.</p

    Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations

    Full text link
    Recent work has observed an intriguing ''Neural Collapse'' phenomenon in well-trained neural networks, where the last-layer representations of training samples with the same label collapse into each other. This appears to suggest that the last-layer representations are completely determined by the labels, and do not depend on the intrinsic structure of input distribution. We provide evidence that this is not a complete description, and that the apparent collapse hides important fine-grained structure in the representations. Specifically, even when representations apparently collapse, the small amount of remaining variation can still faithfully and accurately captures the intrinsic structure of input distribution. As an example, if we train on CIFAR-10 using only 5 coarse-grained labels (by combining two classes into one super-class) until convergence, we can reconstruct the original 10-class labels from the learned representations via unsupervised clustering. The reconstructed labels achieve 93%93\% accuracy on the CIFAR-10 test set, nearly matching the normal CIFAR-10 accuracy for the same architecture. We also provide an initial theoretical result showing the fine-grained representation structure in a simplified synthetic setting. Our results show concretely how the structure of input data can play a significant role in determining the fine-grained structure of neural representations, going beyond what Neural Collapse predicts.Comment: This paper has been accepted as a conference paper at ICML 202
    • …
    corecore