110 research outputs found

    I2^2MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation

    Full text link
    Recent progresses on self-supervised 3D human action representation learning are largely attributed to contrastive learning. However, in conventional contrastive frameworks, the rich complementarity between different skeleton modalities remains under-explored. Moreover, optimized with distinguishing self-augmented samples, models struggle with numerous similar positive instances in the case of limited action categories. In this work, we tackle the aforementioned problems by introducing a general Inter- and Intra-modal Mutual Distillation (I2^2MD) framework. In I2^2MD, we first re-formulate the cross-modal interaction as a Cross-modal Mutual Distillation (CMD) process. Different from existing distillation solutions that transfer the knowledge of a pre-trained and fixed teacher to the student, in CMD, the knowledge is continuously updated and bidirectionally distilled between modalities during pre-training. To alleviate the interference of similar samples and exploit their underlying contexts, we further design the Intra-modal Mutual Distillation (IMD) strategy, In IMD, the Dynamic Neighbors Aggregation (DNA) mechanism is first introduced, where an additional cluster-level discrimination branch is instantiated in each modality. It adaptively aggregates highly-correlated neighboring features, forming local cluster-level contrasting. Mutual distillation is then performed between the two branches for cross-level knowledge exchange. Extensive experiments on three datasets show that our approach sets a series of new records.Comment: submitted to IJCV. arXiv admin note: substantial text overlap with arXiv:2208.1244

    Masked Motion Predictors are Strong 3D Action Representation Learners

    Full text link
    In 3D human action recognition, limited supervised data makes it challenging to fully tap into the modeling potential of powerful networks such as transformers. As a result, researchers have been actively investigating effective self-supervised pre-training strategies. In this work, we show that instead of following the prevalent pretext task to perform masked self-component reconstruction in human joints, explicit contextual motion modeling is key to the success of learning effective feature representation for 3D action recognition. Formally, we propose the Masked Motion Prediction (MAMP) framework. To be specific, the proposed MAMP takes as input the masked spatio-temporal skeleton sequence and predicts the corresponding temporal motion of the masked human joints. Considering the high temporal redundancy of the skeleton sequence, in our MAMP, the motion information also acts as an empirical semantic richness prior that guide the masking process, promoting better attention to semantically rich temporal regions. Extensive experiments on NTU-60, NTU-120, and PKU-MMD datasets show that the proposed MAMP pre-training substantially improves the performance of the adopted vanilla transformer, achieving state-of-the-art results without bells and whistles. The source code of our MAMP is available at https://github.com/maoyunyao/MAMP.Comment: To appear in ICCV 202

    Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

    Full text link
    LiDAR and Radar are two complementary sensing approaches in that LiDAR specializes in capturing an object's 3D shape while Radar provides longer detection ranges as well as velocity hints. Though seemingly natural, how to efficiently combine them for improved feature representation is still unclear. The main challenge arises from that Radar data are extremely sparse and lack height information. Therefore, directly integrating Radar features into LiDAR-centric detection networks is not optimal. In this work, we introduce a bi-directional LiDAR-Radar fusion framework, termed Bi-LRFusion, to tackle the challenges and improve 3D detection for dynamic objects. Technically, Bi-LRFusion involves two steps: first, it enriches Radar's local features by learning important details from the LiDAR branch to alleviate the problems caused by the absence of height information and extreme sparsity; second, it combines LiDAR features with the enhanced Radar features in a unified bird's-eye-view representation. We conduct extensive experiments on nuScenes and ORR datasets, and show that our Bi-LRFusion achieves state-of-the-art performance for detecting dynamic objects. Notably, Radar data in these two datasets have different formats, which demonstrates the generalizability of our method. Codes are available at https://github.com/JessieW0806/BiLRFusion.Comment: accepted by CVPR202

    Effects of different physical activity interventions on children with attention-deficit/hyperactivity disorder: A network meta-analysis of randomized controlled trials

    Get PDF
    BackgroundPrevious studies have shown that physical activity interventions positively affect core symptoms and executive functioning in children with attention-deficit/hyperactivity disorder (ADHD). However, comparisons between different physical activity interventions still need to be made. This study is the first to analyze the effects of 10 different types of physical activity on children with ADHD through a network meta-analysis.MethodsPubMed, Embase, Web of Science, and Cochrane Library databases were searched for randomized controlled trials on the effects of physical activity interventions on children with ADHD. The search time frame was from database creation to October 2022. Two investigators independently performed literature screening, extraction, and quality assessment. Network meta-analysis was performed with Stata 15.1.ResultsA total of 31 studies were included, and the results indicated that perceptual-motor training was the most effective in terms of motor ability and working memory (SUCRA = 82.7 and 73.3%, respectively). For attention problems and cognitive flexibility, aquatic exercise was the most effective (SUCRA = 80.9 and 86.6%, respectively). For social problems, horsemanship was the most effective (SUCRA = 79.4%). For inhibition switching, cognitive-motor training was the most effective (SUCRA = 83.5%).ConclusionOur study revealed that aquatic exercise and perceptual-motor training had a superior overall performance. However, the effects of various physical activity interventions on different indicators in children with ADHD can vary depending on the individual and the intervention’s validity. To ensure an appropriate physical activity intervention is selected, it is important to assess the severity of symptoms exhibited by children with ADHD beforehand

    Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions

    Full text link
    Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a new instruct-ReID task that requires the model to retrieve images according to the given image or language instructions.Our instruct-ReID is a more general ReID setting, where existing ReID tasks can be viewed as special cases by designing different instructions. We propose a large-scale OmniReID benchmark and an adaptive triplet loss as a baseline method to facilitate research in this new setting. Experimental results show that the baseline model trained on our OmniReID benchmark can improve +0.5%, +3.3% mAP on Market1501 and CUHK03 for traditional ReID, +2.1%, +0.2%, +15.3% mAP on PRCC, VC-Clothes, LTCC for clothes-changing ReID, +12.5% mAP on COCAS+ real2 for clothestemplate based clothes-changing ReID when using only RGB images, +25.5% mAP on COCAS+ real2 for our newly defined language-instructed ReID. The dataset, model, and code will be available at https://github.com/hwz-zju/Instruct-ReID

    RETRACTED: Quercetin Inhibits Tumorigenesis of Colorectal Cancer Through Downregulation of hsa_circ_0006990

    Get PDF
    Quercetin can significantly inhibit the progression of colorectal cancer (CRC). However, its specific mechanism remains largely unclear. In this study, we aimed to explore the correlation among quercetin, tumour-associated macrophages (TAMs) and circular RNAs (circRNAs) in the progression of CRC and to present a novel strategy for the treatment of CRC. In this study, we revealed that quercetin could suppress the autophagy of M2-TAMs and induced their differentiation into M1-TAMs, by which quercetin significantly reversed the inhibition of M2-TAMS on CRC cell apoptosis and the promotion of M2-TAMS on CRC cell proliferation. Moreover, quercetin could promote the expression of downregulated hsa_circ_0006990 in CRC cells co-cultured with M2-TAMs, and the overexpression of hsa_circ_0006990 significantly reversed the anti-tumour effect of quercetin on CRC. Furthermore, we found quercetin can notably suppress the progression of CRC via mediation of the hsa_circ_0006990/miR-132-3p/MUC13 axis. In conclusion, our results suggested that quercetin inhibits the tumorigenesis of CRC via inhibiting the polarisation of M2 macrophages and downregulating hsa_circ_0006990. Our study provides useful insights for those exploring new methods of treating CRC
    corecore