48 research outputs found

    RFAConv: Innovating Spatital Attention and Standard Convolutional Operation

    Full text link
    Spatial attention has been widely used to improve the performance of convolutional neural networks by allowing them to focus on important information. However, it has certain limitations. In this paper, we propose a new perspective on the effectiveness of spatial attention, which is that it can solve the problem of convolutional kernel parameter sharing. Despite this, the information contained in the attention map generated by spatial attention is not sufficient for large-size convolutional kernels. Therefore, we introduce a new attention mechanism called Receptive-Field Attention (RFA). While previous attention mechanisms such as the Convolutional Block Attention Module (CBAM) and Coordinate Attention (CA) only focus on spatial features, they cannot fully address the issue of convolutional kernel parameter sharing. In contrast, RFA not only focuses on the receptive-field spatial feature but also provides effective attention weights for large-size convolutional kernels. The Receptive-Field Attention convolutional operation (RFAConv), developed by RFA, represents a new approach to replace the standard convolution operation. It offers nearly negligible increment of computational cost and parameters, while significantly improving network performance. We conducted a series of experiments on ImageNet-1k, MS COCO, and VOC datasets, which demonstrated the superiority of our approach in various tasks including classification, object detection, and semantic segmentation. Of particular importance, we believe that it is time to shift focus from spatial features to receptive-field spatial features for current spatial attention mechanisms. By doing so, we can further improve network performance and achieve even better results. The code and pre-trained models for the relevant tasks can be found at https://github.com/Liuchen1997/RFAConv.Comment: 14 pages, 5 figure

    AKConv: Convolutional Kernel with Arbitrary Sampled Shapes and Arbitrary Number of Parameters

    Full text link
    Neural networks based on convolutional operations have achieved remarkable results in the field of deep learning, but there are two inherent flaws in standard convolutional operations. On the one hand, the convolution operation be confined to a local window and cannot capture information from other locations, and its sampled shapes is fixed. On the other hand, the size of the convolutional kernel is fixed to k ×\times k, which is a fixed square shape, and the number of parameters tends to grow squarely with size. It is obvious that the shape and size of targets are various in different datasets and at different locations. Convolutional kernels with fixed sample shapes and squares do not adapt well to changing targets. In response to the above questions, the Alterable Kernel Convolution (AKConv) is explored in this work, which gives the convolution kernel an arbitrary number of parameters and arbitrary sampled shapes to provide richer options for the trade-off between network overhead and performance. In AKConv, we define initial positions for convolutional kernels of arbitrary size by means of a new coordinate generation algorithm. To adapt to changes for targets, we introduce offsets to adjust the shape of the samples at each position. Moreover, we explore the effect of the neural network by using the AKConv with the same size and different initial sampled shapes. AKConv completes the process of efficient feature extraction by irregular convolutional operations and brings more exploration options for convolutional sampling shapes. Object detection experiments on representative datasets COCO2017, VOC 7+12 and VisDrone-DET2021 fully demonstrate the advantages of AKConv. AKConv can be used as a plug-and-play convolutional operation to replace convolutional operations to improve network performance. The code for the relevant tasks can be found at https://github.com/CV-ZhangXin/AKConv.Comment: 10 pages, 5 figure

    The Role of High Mobility Group Box 1 in Ischemic Stroke

    Get PDF
    High-mobility group box 1 protein (HMGB1) is a novel, cytokine-like, and ubiquitous, highly conserved, nuclear protein that can be actively secreted by microglia or passively released by necrotic neurons. Ischemic stroke is a leading cause of death and disability worldwide, and the outcome is dependent on the amount of hypoxia-related neuronal death in the cerebral ischemic region. Acting as an endogenous danger-associated molecular pattern (DAMP) protein, HMGB1 mediates cerebral inflammation and brain injury and participates in the pathogenesis of ischemic stroke. It is thought that HMGB1 signals via its presumed receptors, such as toll-like receptors (TLRs), matrix metalloproteinase (MMP) enzymes, and receptor for advanced glycation end products (RAGEs) during ischemic stroke. In addition, the release of HMGB1 from the brain into the bloodstream influences peripheral immune cells. However, the role of HMGB1 in ischemic stroke may be more complex than this and has not yet been clarified. Here, we summarize and review the research into HMGB1 in ischemic stroke

    Transmembrane and coiled-coil domains 3 is a diagnostic biomarker for predicting immune checkpoint blockade efficacy in hepatocellular carcinoma

    Get PDF
    Liver hepatocellular carcinoma (LIHC) is a malignancy with a high mortality and morbidity rate worldwide. However, the pathogenesis of LIHC has still not been thoroughly studied. Transmembrane and coiled-coil domains 3 (TMCO3) encodes a monovalent cation, a member of the proton transducer 2 (CPA2) family of transporter proteins. In the present study, TMCO3 expression and its relationship with cancer prognosis, as well as its immunological role in LIHC were studied by bioinformatic analysis. We found the significant overexpression of TMCO3 in LIHC in the TCGA, HCCDB, and GEO databases. In LIHC patients, high TMCO3 expression was related to poorer overall survival (OS) and TMCO3 had good predictive accuracy for prognosis. Moreover, TMCO3 was linked to the infiltrates of certain immune cells in LIHC. The correlation of TMCO3 with immune checkpoints was also revealed. Moreover, patients with LIHC with low TMCO3 expression showed a better response to immune checkpoint blockade (ICB) than those with LIHC with high TMCO3 expression. GO and KEGG enrichment analyses indicated that TMCO3 was probably involved in the microtubule cytoskeleton organization involved in mitosis, small GTPase mediated signal transduction, and TGF-β pathway. In conclusion, TMCO3 may be a potential biomarker for LIHC prognosis and immunotherapy

    Development and validation of a deep learning-based model to distinguish acetabular fractures on pelvic anteroposterior radiographs

    Get PDF
    Objective: To develop and test a deep learning (DL) model to distinguish acetabular fractures (AFs) on pelvic anteroposterior radiographs (PARs) and compare its performance to that of clinicians.Materials and methods: A total of 1,120 patients from a big level-I trauma center were enrolled and allocated at a 3:1 ratio for the DL model’s development and internal test. Another 86 patients from two independent hospitals were collected for external validation. A DL model for identifying AFs was constructed based on DenseNet. AFs were classified into types A, B, and C according to the three-column classification theory. Ten clinicians were recruited for AF detection. A potential misdiagnosed case (PMC) was defined based on clinicians’ detection results. The detection performance of the clinicians and DL model were evaluated and compared. The detection performance of different subtypes using DL was assessed using the area under the receiver operating characteristic curve (AUC).Results: The means of 10 clinicians’ sensitivity, specificity, and accuracy to identify AFs were 0.750/0.735, 0.909/0.909, and 0.829/0.822, in the internal test/external validation set, respectively. The sensitivity, specificity, and accuracy of the DL detection model were 0.926/0.872, 0.978/0.988, and 0.952/0.930, respectively. The DL model identified type A fractures with an AUC of 0.963 [95% confidence interval (CI): 0.927–0.985]/0.950 (95% CI: 0.867–0.989); type B fractures with an AUC of 0.991 (95% CI: 0.967–0.999)/0.989 (95% CI: 0.930–1.000); and type C fractures with an AUC of 1.000 (95% CI: 0.975–1.000)/1.000 (95% CI: 0.897–1.000) in the test/validation set. The DL model correctly recognized 56.5% (26/46) of PMCs.Conclusion: A DL model for distinguishing AFs on PARs is feasible. In this study, the DL model achieved a diagnostic performance comparable to or even superior to that of clinicians

    DeePMD-kit v2: A software package for Deep Potential models

    Full text link
    DeePMD-kit is a powerful open-source software package that facilitates molecular dynamics simulations using machine learning potentials (MLP) known as Deep Potential (DP) models. This package, which was released in 2017, has been widely used in the fields of physics, chemistry, biology, and material science for studying atomistic systems. The current version of DeePMD-kit offers numerous advanced features such as DeepPot-SE, attention-based and hybrid descriptors, the ability to fit tensile properties, type embedding, model deviation, Deep Potential - Range Correction (DPRc), Deep Potential Long Range (DPLR), GPU support for customized operators, model compression, non-von Neumann molecular dynamics (NVNMD), and improved usability, including documentation, compiled binary packages, graphical user interfaces (GUI), and application programming interfaces (API). This article presents an overview of the current major version of the DeePMD-kit package, highlighting its features and technical details. Additionally, the article benchmarks the accuracy and efficiency of different models and discusses ongoing developments.Comment: 51 pages, 2 figure

    An interlaboratory comparison of aerosol inorganic ion measurements by ion chromatography : Implications for aerosol pH estimate

    Get PDF
    Water-soluble inorganic ions such as ammonium, nitrate and sulfate are major components of fine aerosols in the atmosphere and are widely used in the estimation of aerosol acidity. However, different experimental practices and instrumentation may lead to uncertainties in ion concentrations. Here, an intercomparison experiment was conducted in 10 different laboratories (labs) to investigate the consistency of inorganic ion concentrations and resultant aerosol acidity estimates using the same set of aerosol filter samples. The results mostly exhibited good agreement for major ions Cl-, SO2-4, NO-3, NHC4 and KC. However, F-, Mg2C and Ca2C were observed with more variations across the different labs. The Aerosol Chemical Speciation Monitor (ACSM) data of nonrefractory SO2-4, NO-3 and NHC4 generally correlated very well with the filter-analysis-based data in our study, but the absolute concentrations differ by up to 42 %. Cl-from the two methods are correlated, but the concentration differ by more than a factor of 3. The analyses of certified reference materials (CRMs) generally showed a good detection accuracy (DA) of all ions in all the labs, the majority of which ranged between 90 % and 110 %. The DA was also used to correct the ion concentrations to showcase the importance of using CRMs for calibration check and quality control. Better agreements were found for Cl-, SO2-4, NO-3, NHC4 and KC across the labs after their concentrations were corrected with DA; the coefficient of variation (CV) of Cl-, SO2-4, NO-3, NHC4 and KC decreased by 1.7 %, 3.4 %, 3.4 %, 1.2 % and 2.6 %, respectively, after DA correction. We found that the ratio of anion to cation equivalent concentrations (AE/CE) and ion balance (anions-cations) are not good indicators for aerosol acidity estimates, as the results in different labs did not agree well with each other. In situ aerosol pH calculated from the ISORROPIA II thermodynamic equilibrium model with measured ion and ammonia concentrations showed a similar trend and good agreement across the 10 labs. Our results indicate that although there are important uncertainties in aerosol ion concentration measurements, the estimated aerosol pH from the ISORROPIA II model is more consistent

    Delaying Function Construction Based on Triple DES

    No full text
    corecore