33 research outputs found

    Fairness-aware Vision Transformer via Debiased Self-Attention

    Full text link
    Vision Transformer (ViT) has recently gained significant interest in solving computer vision (CV) problems due to its capability of extracting informative features and modeling long-range dependencies through the self-attention mechanism. To fully realize the advantages of ViT in real-world applications, recent works have explored the trustworthiness of ViT, including its robustness and explainability. However, another desiderata, fairness has not yet been adequately addressed in the literature. We establish that the existing fairness-aware algorithms (primarily designed for CNNs) do not perform well on ViT. This necessitates the need for developing our novel framework via Debiased Self-Attention (DSA). DSA is a fairness-through-blindness approach that enforces ViT to eliminate spurious features correlated with the sensitive attributes for bias mitigation. Notably, adversarial examples are leveraged to locate and mask the spurious features in the input image patches. In addition, DSA utilizes an attention weights alignment regularizer in the training objective to encourage learning informative features for target prediction. Importantly, our DSA framework leads to improved fairness guarantees over prior works on multiple prediction tasks without compromising target prediction performance

    Learning Compact Features via In-Training Representation Alignment

    Full text link
    Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i.e., last hidden layer) and a linear classifier (i.e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e.g., cross-entropy). In each epoch, the true gradient of the loss function is estimated using a mini-batch sampled from the training set and model parameters are then updated with the mini-batch gradients. Although the latter provides an unbiased estimation of the former, they are subject to substantial variances derived from the size and number of sampled mini-batches, leading to noisy and jumpy updates. To stabilize such undesirable variance in estimating the true gradients, we propose In-Training Representation Alignment (ITRA) that explicitly aligns feature distributions of two different mini-batches with a matching loss in the SGD training process. We also provide a rigorous analysis of the desirable effects of the matching loss on feature representation learning: (1) extracting compact feature representation; (2) reducing over-adaption on mini-batches via an adaptive weighting mechanism; and (3) accommodating to multi-modalities. Finally, we conduct large-scale experiments on both image and text classifications to demonstrate its superior performance to the strong baselines.Comment: 11 pages, 4 figures, 6 tables. Accepted for publication by AAAI-23. arXiv admin note: text overlap with arXiv:2002.0991

    A multi-decadal analysis of river discharge and suspended sediment load in three Texas coastal rivers in relation to hurricanes, seasonal rainfall, and ENSO

    Get PDF
    Coastal river discharge and sediment load exert major influence on the sustainability of coastal systems. Controlled by various hydroclimatic/hydrometeorological agents, they exhibit distinct trend/variability at different time scales. Coastal Texas, while being a major target for tropical cyclones over the past 6 decades, has been experiencing drought and flood cycles associated with ENSO in the long term. However, it is still unclear the temporal variability of river discharge and the associated sediment delivery over this area at different time scales, and the controlling factors behind it. In this study, a 58-years (1960–2017) dataset is compiled to analyze the influence of ENSO, seasonal rainfall distribution and hurricanes event on the river discharge and suspended sediment load of three Texas coastal rivers-the San Bernard River, the Brazos River, and the Trinity River, at annual, seasonal and event scales, respectively. In the short-term, all three rivers attained the highest average daily discharge and sediment load during Hurricane Harvey. On a seasonal scale, the precipitation regime exerts more influence on the Texas watersheds than tropical storms and hurricanes. Over a multi-decadal scale, amplified rainstorms during the El Niño phases likely play an important role in the overall discharge and sediment transport in large rivers along the northern Gulf coast. Overall, it is reasonable to conclude that the magnitude of hurricane impacts on the overall discharge and suspended sediment load is regulated by the duration and intensity of the rainfall, as well as the coupled drought-flood cycle in relation to the intensity of ENSO

    Auto-Prompting SAM for Mobile Friendly 3D Medical Image Segmentation

    Full text link
    The Segment Anything Model (SAM) has rapidly been adopted for segmenting a wide range of natural images. However, recent studies have indicated that SAM exhibits subpar performance on 3D medical image segmentation tasks. In addition to the domain gaps between natural and medical images, disparities in the spatial arrangement between 2D and 3D images, the substantial computational burden imposed by powerful GPU servers, and the time-consuming manual prompt generation impede the extension of SAM to a broader spectrum of medical image segmentation applications. To address these challenges, in this work, we introduce a novel method, AutoSAM Adapter, designed specifically for 3D multi-organ CT-based segmentation. We employ parameter-efficient adaptation techniques in developing an automatic prompt learning paradigm to facilitate the transformation of the SAM model's capabilities to 3D medical image segmentation, eliminating the need for manually generated prompts. Furthermore, we effectively transfer the acquired knowledge of the AutoSAM Adapter to other lightweight models specifically tailored for 3D medical image analysis, achieving state-of-the-art (SOTA) performance on medical image segmentation tasks. Through extensive experimental evaluation, we demonstrate the AutoSAM Adapter as a critical foundation for effectively leveraging the emerging ability of foundation models in 2D natural image segmentation for 3D medical image segmentation.Comment: 9 pages, 4 figures, 4 table

    The relationship between Cho/NAA and glioma metabolism: implementation for margin delineation of cerebral gliomas

    Get PDF
    BACKGROUND: The marginal delineation of gliomas cannot be defined by conventional imaging due to their infiltrative growth pattern. Here we investigate the relationship between changes in glioma metabolism by proton magnetic resonance spectroscopic imaging ((1)H-MRSI) and histopathological findings in order to determine an optimal threshold value of choline/N-acetyl-aspartate (Cho/NAA) that can be used to define the extent of glioma spread. METHOD: Eighteen patients with different grades of glioma were examined using (1)H-MRSI. Needle biopsies were performed under the guidance of neuronavigation prior to craniotomy. Intraoperative magnetic resonance imaging (MRI) was performed to evaluate the accuracy of sampling. Haematoxylin and eosin, and immunohistochemical staining with IDH1, MIB-1, p53, CD34 and glial fibrillary acidic protein (GFAP) antibodies were performed on all samples. Logistic regression analysis was used to determine the relationship between Cho/NAA and MIB-1, p53, CD34, and the degree of tumour infiltration. The clinical threshold ratio distinguishing tumour tissue in high-grade (grades III and IV) glioma (HGG) and low-grade (grade II) glioma (LGG) was calculated. RESULTS: In HGG, higher Cho/NAA ratios were associated with a greater probability of higher MIB-1 counts, stronger CD34 expression, and tumour infiltration. Ratio threshold values of 0.5, 1.0, 1.5 and 2.0 appeared to predict the specimens containing the tumour with respective probabilities of 0.38, 0.60, 0.79, 0.90 in HGG and 0.16, 0.39, 0.67, 0.87 in LGG. CONCLUSIONS: HGG and LGG exhibit different spectroscopic patterns. Using (1)H-MRSI to guide the extent of resection has the potential to improve the clinical outcome of glioma surgery

    Revealing the source of Jupiter’s x-ray auroral flares

    Get PDF
    Jupiter’s rapidly rotating, strong magnetic field provides a natural laboratory that is key to understanding the dynamics of high-energy plasmas. Spectacular auroral x-ray flares are diagnostic of the most energetic processes governing magnetospheres but seemingly unique to Jupiter. Since their discovery 40 years ago, the processes that produce Jupiter’s x-ray flares have remained unknown. Here, we report simultaneous in situ satellite and space-based telescope observations that reveal the processes that produce Jupiter’s x-ray flares, showing surprising similarities to terrestrial ion aurora. Planetary-scale electromagnetic waves are observed to modulate electromagnetic ion cyclotron waves, periodically causing heavy ions to precipitate and produce Jupiter’s x-ray pulses. Our findings show that ion aurorae share common mechanisms across planetary systems, despite temporal, spatial, and energetic scales varying by orders of magnitude

    Comprehensive Benefit Evaluation of the Wind-PV-ES and Transmission Hybrid Power System Consideration of System Functionality and Proportionality

    No full text
    In the background of decreasing fossil fuels and increasing environmental pollution, the wind-photovoltaic energy storage and transmission hybrid power system (or called the wind-PV-ES and transmission hybrid system) has become a strategic choice to achieve energy sustainability. However, the comprehensive benefit evaluation of such a combined power system is in a relatively blank state in China, which will hinder the reasonable and orderly development of this station. Four parts, the technical performance, economic benefit, ecological impact and social benefit, are considered in this paper, and a multi-angle evaluation index system of the wind-PV-ES and transmission system is designed. The projection pursuit model is used to evaluated system functionality conventionally; relative entropy theory is used to evaluate the system functionality simultaneously; and a comprehensive benefit evaluation model of the technique for order preference by similar to ideal solution (TOPSIS) considering both system functionality and proportionality is constructed. Finally, the national demonstration station of the wind-PV-ES-transmission system is taken as an example to testify to the practicability and validity of the evaluation index system and model

    Negative Flux Aggregation to Estimate Feature Attributions

    Full text link
    There are increasing demands for understanding deep neural networks' (DNNs) behavior spurred by growing security and/or transparency concerns. Due to multi-layer nonlinearity of the deep neural network architectures, explaining DNN predictions still remains as an open problem, preventing us from gaining a deeper understanding of the mechanisms. To enhance the explainability of DNNs, we estimate the input feature's attributions to the prediction task using divergence and flux. Inspired by the divergence theorem in vector analysis, we develop a novel Negative Flux Aggregation (NeFLAG) formulation and an efficient approximation algorithm to estimate attribution map. Unlike the previous techniques, ours doesn't rely on fitting a surrogate model nor need any path integration of gradients. Both qualitative and quantitative experiments demonstrate a superior performance of NeFLAG in generating more faithful attribution maps than the competing methods. Our code is available at \url{https://github.com/xinli0928/NeFLAG}Comment: 14 pages, 4 figures, 2 table
    corecore