23 research outputs found

    Self-supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes

    Full text link
    Robust autonomous driving requires agents to accurately identify unexpected areas in urban scenes. To this end, some critical issues remain open: how to design advisable metric to measure anomalies, and how to properly generate training samples of anomaly data? Previous effort usually resorts to uncertainty estimation and sample synthesis from classification tasks, which ignore the context information and sometimes requires auxiliary datasets with fine-grained annotations. On the contrary, in this paper, we exploit the strong context-dependent nature of segmentation task and design an energy-guided self-supervised frameworks for anomaly segmentation, which optimizes an anomaly head by maximizing the likelihood of self-generated anomaly pixels. To this end, we design two estimators for anomaly likelihood estimation, one is a simple task-agnostic binary estimator and the other depicts anomaly likelihood as residual of task-oriented energy model. Based on proposed estimators, we further incorporate our framework with likelihood-guided mask refinement process to extract informative anomaly pixels for model training. We conduct extensive experiments on challenging Fishyscapes and Road Anomaly benchmarks, demonstrating that without any auxiliary data or synthetic models, our method can still achieves competitive performance to other SOTA schemes

    Learning with Noisy labels via Self-supervised Adversarial Noisy Masking

    Full text link
    Collecting large-scale datasets is crucial for training deep models, annotating the data, however, inevitably yields noisy labels, which poses challenges to deep learning algorithms. Previous efforts tend to mitigate this problem via identifying and removing noisy samples or correcting their labels according to the statistical properties (e.g., loss values) among training samples. In this paper, we aim to tackle this problem from a new perspective, delving into the deep feature maps, we empirically find that models trained with clean and mislabeled samples manifest distinguishable activation feature distributions. From this observation, a novel robust training approach termed adversarial noisy masking is proposed. The idea is to regularize deep features with a label quality guided masking scheme, which adaptively modulates the input data and label simultaneously, preventing the model to overfit noisy samples. Further, an auxiliary task is designed to reconstruct input data, it naturally provides noise-free self-supervised signals to reinforce the generalization ability of deep models. The proposed method is simple and flexible, it is tested on both synthetic and real-world noisy datasets, where significant improvements are achieved over previous state-of-the-art methods

    Rethinking Mobile Block for Efficient Attention-based Models

    Full text link
    This paper focuses on developing modern, efficient, lightweight models for dense predictions while trading off parameters, FLOPs, and performance. Inverted Residual Block (IRB) serves as the infrastructure for lightweight CNNs, but no counterpart has been recognized by attention-based studies. This work rethinks lightweight infrastructure from efficient IRB and effective components of Transformer from a unified perspective, extending CNN-based IRB to attention-based models and abstracting a one-residual Meta Mobile Block (MMB) for lightweight model design. Following simple but effective design criterion, we deduce a modern Inverted Residual Mobile Block (iRMB) and build a ResNet-like Efficient MOdel (EMO) with only iRMB for down-stream tasks. Extensive experiments on ImageNet-1K, COCO2017, and ADE20K benchmarks demonstrate the superiority of our EMO over state-of-the-art methods, e.g., EMO-1M/2M/5M achieve 71.5, 75.1, and 78.4 Top-1 that surpass equal-order CNN-/Attention-based models, while trading-off the parameter, efficiency, and accuracy well: running 2.8-4.0x faster than EdgeNeXt on iPhone14

    AR-Miner: Mining informative reviews for developers from mobile app marketplace

    Get PDF
    Ministry of Education, Singapore under its Academic Research Funding Tier

    Deep learning with application to hashing

    No full text
    Deep Learning and Learning to Hash are two important research areas in machine learning, which have rapid improvements in recent years. What I mainly researched on is an inter-discipline field: deep learning for cross view hashing. Multiple layers of representation in deep learning has the property of abstracting representation from input data, while, in the cross view similarity search, the biggest difficulty is to represent items from one domain to another. Here, I want to take advantage of the latest deep learning technology to solve the cross view similarity search problem. Hashing is used to accelerate this process. This thesis mainly contains three parts. Chapter 2 is a literature survey. It contains a deep learning survey and a learning to hash survey. The deep learning survey briefly introduces fundamental technology of deep learning and its recent development including the latest technology. The Learning to Hash survey brief introduces some widely used learning to hash algorithms. Chapter 3 is an experiment about comparison of some state of the arts learning to hash algorithms. Chapter 4 is cross view hashing based on deep learning. I present a cross view feature hashing technique using deep learning and show some results. These three chapters are main chapters. Chapter 1 and Chapter 5 are introduction and conclusion.MASTER OF ENGINEERING (SCE

    Health Condition Assessment of Marine Systems Based on an Improved Radar Chart

    No full text
    Since health assessment plays a significant role in marine systems (MSs), it has caught the attention of researchers. In this study, a powerful evaluation method called an improved radar chart was developed as a means of reliability estimation. General evaluation methods applied in the comprehensive evaluation of MS are slightly insufficient in terms of considering index coordination. However, the application of a radar chart can solve this problem. To improving the shortcomings of a traditional radar chart, the fuzzy centralization statistical theory and the entropy weight were combined in this study to obtain the comprehensive weight. The weight could be converted into an angle, and it could reflect the influence degree of the indexes on the evaluation objects. Additionally, an angle bisector was introduced as an index axis, and the eigenvector was extracted to get the unique evaluation result. The result showed that the proposed method could achieve the continuous online monitoring of the system state, and the reliable and accurate assessment results were able to provide a reference for condition-based maintenance and decision-making

    PCViT: A Pre-Convolutional ViT Coal Gangue Identification Method

    No full text
    For the study of coal and gangue identification using near-infrared reflection spectroscopy, samples of anthracite coal and gangue with similar appearances were collected, and different dust concentrations (200 ug/m3, 500 ug/m3 and 800 ug/m3), detection distances (1.2 m, 1.5 m and 1.8 m) and mixing gangue rates (one-third coal, two-thirds coal, full coal) were collected in the laboratory by the reflection spectroscopy acquisition device and the gangue reflection spectral data. The spectral data were pre-processed using three methods, first-order differentiation, second-order differentiation and standard normal variable transformation, in order to enhance the absorption characteristics of the reflectance spectra and to eliminate the effects of changes in the experimental environment. The PCViT gangue identification model is established, and the disadvantages of the violent patch embedding of the ViT model are improved by using the stepwise convolution operation to extract features. Then, the interdependence of the features of the hyperspectral data is modeled by the self-attention module, and the learned features are optimized adaptively. The results of gangue recognition under nine working conditions show that the proposed recognition model can significantly improve the recognition accuracy, and this study can provide a reference value for gangue recognition using the near-infrared reflection spectra of gangue

    Single-Nucleotide Polymorphisms in XPO5 are Associated with Noise-Induced Hearing Loss in a Chinese Population

    No full text
    Objectives.The purpose of this study was to investigate the correlation between single-nucleotide polymorphism (SNP) in 3′UTR of XPO5 gene and the occurrence of noise-induced hearing loss (NIHL), and to further explore the regulatory mechanism of miRNAs in NIHL on XPO5 gene. Methods.We conducted a case-control study involving 1040 cases and 1060 controls. The effects of SNPs on XPO5 expression were studied by genotyping, real-time polymerase chain reaction (qPCR), cell transfection, and the dual-luciferase reporter assay. Results.We genotyped four SNPs (rs2257082, rs11077, rs7755135, and rs1106841) in the XPO5 gene. The rs2257082 AG/GG carriers have special connection to an increased risk of noise-induced hearing loss compared to the AA carriers. The rs11077TG/GG carriers had a significantly increased association with NIHL susceptibility than the TT carriers. There was a higher risk of NIHL in the XPO5 gene rs7755135 CC carriers than in the TT carriers. No statistically significant correlation was obtained with respect to SNPrs1106841. Functional experiments showed that the rs11077 change might inhibit the interaction between miRNAs (miRNA-4763-5p, miRNA-5002-3p, and miRNA-617) and XPO5, with rs11077G allele resulting in overexpression of XPO5. Conclusion. The genetic polymorphism, rs11077, within XPO5 is associated with the risk of noise-induced hearing loss in a Chinese population
    corecore