223 research outputs found

    Face Recognition from Sequential Sparse 3D Data via Deep Registration

    Full text link
    Previous works have shown that face recognition with high accurate 3D data is more reliable and insensitive to pose and illumination variations. Recently, low-cost and portable 3D acquisition techniques like ToF(Time of Flight) and DoE based structured light systems enable us to access 3D data easily, e.g., via a mobile phone. However, such devices only provide sparse(limited speckles in structured light system) and noisy 3D data which can not support face recognition directly. In this paper, we aim at achieving high-performance face recognition for devices equipped with such modules which is very meaningful in practice as such devices will be very popular. We propose a framework to perform face recognition by fusing a sequence of low-quality 3D data. As 3D data are sparse and noisy which can not be well handled by conventional methods like the ICP algorithm, we design a PointNet-like Deep Registration Network(DRNet) which works with ordered 3D point coordinates while preserving the ability of mining local structures via convolution. Meanwhile we develop a novel loss function to optimize our DRNet based on the quaternion expression which obviously outperforms other widely used functions. For face recognition, we design a deep convolutional network which takes the fused 3D depth-map as input based on AMSoftmax model. Experiments show that our DRNet can achieve rotation error 0.95{\deg} and translation error 0.28mm for registration. The face recognition on fused data also achieves rank-1 accuracy 99.2% , FAR-0.001 97.5% on Bosphorus dataset which is comparable with state-of-the-art high-quality data based recognition performance.Comment: To be appeared in ICB201

    SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction

    Full text link
    Segment Anything Model (SAM) has received remarkable attention as it offers a powerful and versatile solution for object segmentation in images. However, fine-tuning SAM for downstream segmentation tasks under different scenarios remains a challenge, as the varied characteristics of different scenarios naturally requires diverse model parameter spaces. Most existing fine-tuning methods attempt to bridge the gaps among different scenarios by introducing a set of new parameters to modify SAM's original parameter space. Unlike these works, in this paper, we propose fine-tuning SAM efficiently by parameter space reconstruction (SAM-PARSER), which introduce nearly zero trainable parameters during fine-tuning. In SAM-PARSER, we assume that SAM's original parameter space is relatively complete, so that its bases are able to reconstruct the parameter space of a new scenario. We obtain the bases by matrix decomposition, and fine-tuning the coefficients to reconstruct the parameter space tailored to the new scenario by an optimal linear combination of the bases. Experimental results show that SAM-PARSER exhibits superior segmentation performance across various scenarios, while reducing the number of trainable parameters by 290\approx 290 times compared with current parameter-efficient fine-tuning methods

    Inequalities for Permanents and Permanental Minors of Row Substochastic Matrices

    Get PDF
    In this paper, some inequalities for permanents and permanental minors of row substochastic matrices are proved. The convexity of the permanent function on the interval between the identity matrix and an arbitrary row substochastic matrix is also proved. In addition, a conjecture about the permanent and permanental minors of square row substochastic matrices with fixed row and column sums is formulated

    Machine Learning for Predictive Deployment of UAVs with Multiple Access

    Full text link
    In this paper, a machine learning based deployment framework of unmanned aerial vehicles (UAVs) is studied. In the considered model, UAVs are deployed as flying base stations (BS) to offload heavy traffic from ground BSs. Due to time-varying traffic distribution, a long short-term memory (LSTM) based prediction algorithm is introduced to predict the future cellular traffic. To predict the user service distribution, a KEG algorithm, which is a joint K-means and expectation maximization (EM) algorithm based on Gaussian mixture model (GMM), is proposed for determining the service area of each UAV. Based on the predicted traffic, the optimal UAV positions are derived and three multi-access techniques are compared so as to minimize the total transmit power. Simulation results show that the proposed method can reduce up to 24\% of the total power consumption compared to the conventional method without traffic prediction. Besides, rate splitting multiple access (RSMA) has the lower required transmit power compared to frequency domain multiple access (FDMA) and time domain multiple access (TDMA)

    A106: Aerobic Exercise Modulates GPCR/cAMP/PKA Signaling Pathway and Complement-Microglia Axis to Prevent Synaptic Loss in APP/PS1 Mice

    Get PDF
    Purpose: Synaptic failure serves as a primary contributor to memory dysfunction in Alzheimer\u27s disease (AD). Physical exercise has demonstrated the potential to thwart and delay degenerative alterations in memory functions linked to AD. Investigating the underlying mechanisms may unveil crucial insights into early pathological changes, offering breakthroughs for both understanding and treating AD. Methods: We utilized 3-month-old APP/PS1 mice and subjected them to a 12-week aerobic exercise intervention. The spatial learning and memory functions of the mice were assessed using the Morris water maze test, while Golgi staining was employed to determine dendritic spine density in each mouse group. To analyze the potential mechanisms mediating the effects of exercise intervention in the AD brain, we conducted RNA sequencing. Subsequently, pathway enrichment analysis, immunofluorescence, real-time quantitative PCR, and western blotting were employed to elucidate the impact of regular aerobic exercise on the GPCR/cAMP/PKA signaling pathway and complement-microglia axis. Results: Our findings reveal that a 12-week aerobic exercise intervention significantly enhanced spatial learning and memory function in APP/PS1 mice. Moreover, it led to a substantial increase in dendritic spine density and elevated expression of postsynaptic density protein 95 (PSD-95) in the cortex and hippocampus. Aerobic exercise demonstrated the ability to improve the expression of certain genes and enhance synaptic pathways in the brains of APP/PS1 mice. This suggests that aerobic exercise facilitates synaptic growth in APP/PS1 mice by modulating G protein-coupled receptors (GPCRs) and activating the cAMP signaling pathway, with significant alterations observed in the expressions of Hcar1 and Vipr2 genes. Furthermore, exercise intervention resulted in the significant down-regulation (P \u3c 0.05 or P \u3c 0.01) of cAMP, p-PKA/PKA, GluA1, and CaMKII protein expressions in the brain tissue of APP/PS1 mice, which were subsequently up-regulated after exercise (P \u3c 0.01). Notably, regular aerobic exercise effectively suppressed the activation of IBA-1+ microglia cells (P \u3c 0.01), reversed changes in M1 phenotype markers (Cd86 and iNOS) and M2 phenotype markers (Arg-1) of microglia cells (P \u3c 0.05), reduced the production of promoters C1q and central factor C3 in the macrosomatic cascade (P \u3c 0.05), and prevented the colocalization of microglia and PSD-95 (P \u3c 0.01). Conclusion: In conclusion, our results indicate that physical exercise plays a pivotal role in fostering early synaptic growth and averting synaptic loss in Alzheimer\u27s disease (AD). This effect may be attributed to the regulation of the G protein-coupled receptors (GPCRs)/cAMP/PKA signaling pathway and the suppression of complement-mediated microglial phagocytosis of synapses. This mechanistic insight underscores the inherent contribution of exercise to health promotion, offering potential avenues for synaptic-focused interventions in the early stages of AD treatment

    Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation From Scratch

    Full text link
    Unsupervised contrastive learning methods have recently seen significant improvements, particularly through data augmentation strategies that aim to produce robust and generalizable representations. However, prevailing data augmentation methods, whether hand designed or based on foundation models, tend to rely heavily on prior knowledge or external data. This dependence often compromises their effectiveness and efficiency. Furthermore, the applicability of most existing data augmentation strategies is limited when transitioning to other research domains, especially science-related data. This limitation stems from the paucity of prior knowledge and labeled data available in these domains. To address these challenges, we introduce DiffAug-a novel and efficient Diffusion-based data Augmentation technique. DiffAug aims to ensure that the augmented and original data share a smoothed latent space, which is achieved through diffusion steps. Uniquely, unlike traditional methods, DiffAug first mines sufficient prior semantic knowledge about the neighborhood. This provides a constraint to guide the diffusion steps, eliminating the need for labels, external data/models, or prior knowledge. Designed as an architecture-agnostic framework, DiffAug provides consistent improvements. Specifically, it improves image classification and clustering accuracy by 1.6%~4.5%. When applied to biological data, DiffAug improves performance by up to 10.1%, with an average improvement of 5.8%. DiffAug shows good performance in both vision and biological domains.Comment: arXiv admin note: text overlap with arXiv:2302.07944 by other author

    Pan-cancer analysis of the prevalence and associated factors of lung metastasis and the construction of the lung metastatic classification system

    Get PDF
    This study first presents an analysis of the prevalence and associated factors of the lung metastasis (LM) database and then uses this analysis to construct an LM classification system. Using cancer patient data gathered from the surveillance, epidemiology, and end results (SEER) database, this study shows that the prevalence of LM is not consistent among different cancers; that is, the prevalence of LM ranges from 0.0013 [brain; 95% confidence interval (95% CI); 0.0010–0.0018] to 0.234 (“other digestive organs”; 95% CI; 0.221–0.249). This study finds that advanced age, poor grade, higher tumor or node stage, and metastases including bone, brain, and liver are positively related to LM occurrence, while female gender, income, marital status, and insured status are negatively related. Then, this study generates four categories from 58 cancer types based on prevalence and influence factors and satisfactorily validates these. This classification system reflects the LM risk of different cancers. It can guide individualized treatment and the management of these synchronous metastatic cancer patients and help clinicians better distribute medical resources

    A Survey on Label-efficient Deep Image Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction

    Full text link
    The rapid development of deep learning has made a great progress in image segmentation, one of the fundamental tasks of computer vision. However, the current segmentation algorithms mostly rely on the availability of pixel-level annotations, which are often expensive, tedious, and laborious. To alleviate this burden, the past years have witnessed an increasing attention in building label-efficient, deep-learning-based image segmentation algorithms. This paper offers a comprehensive review on label-efficient image segmentation methods. To this end, we first develop a taxonomy to organize these methods according to the supervision provided by different types of weak labels (including no supervision, inexact supervision, incomplete supervision and inaccurate supervision) and supplemented by the types of segmentation problems (including semantic segmentation, instance segmentation and panoptic segmentation). Next, we summarize the existing label-efficient image segmentation methods from a unified perspective that discusses an important question: how to bridge the gap between weak supervision and dense prediction -- the current methods are mostly based on heuristic priors, such as cross-pixel similarity, cross-label constraint, cross-view consistency, and cross-image relation. Finally, we share our opinions about the future research directions for label-efficient deep image segmentation.Comment: Accepted to IEEE TPAM

    Competitive voting-based multi-class prediction for ore selection

    Get PDF
    Sensor-based intelligent sorting technology is a mineral separation technology with the merits of high-efficiency, energy-saving and water-saving. However, the prediction accuracy of conventional machine learning methods is unstable in multi-class selection of ores. The purpose of this study is to propose a competitive voting method to improve the multi-class prediction accuracy of ores in machine vision-based sorting system by combining the classification advantages of various machine learning methods. The operations of image segmentation, feature extraction and feature selection are presented to obtain the multi-class datasets. Three ones of traditional machine learning models with higher classification accuracies are used to establish competitive voting classification models. A case study using the image data of a gas coal shows the merits of the proposed approach. Results derived using this competitive voting approach reveal that it outperforms pre-existing approaches
    corecore