48 research outputs found
RFAConv: Innovating Spatital Attention and Standard Convolutional Operation
Spatial attention has been widely used to improve the performance of
convolutional neural networks by allowing them to focus on important
information. However, it has certain limitations. In this paper, we propose a
new perspective on the effectiveness of spatial attention, which is that it can
solve the problem of convolutional kernel parameter sharing. Despite this, the
information contained in the attention map generated by spatial attention is
not sufficient for large-size convolutional kernels. Therefore, we introduce a
new attention mechanism called Receptive-Field Attention (RFA). While previous
attention mechanisms such as the Convolutional Block Attention Module (CBAM)
and Coordinate Attention (CA) only focus on spatial features, they cannot fully
address the issue of convolutional kernel parameter sharing. In contrast, RFA
not only focuses on the receptive-field spatial feature but also provides
effective attention weights for large-size convolutional kernels. The
Receptive-Field Attention convolutional operation (RFAConv), developed by RFA,
represents a new approach to replace the standard convolution operation. It
offers nearly negligible increment of computational cost and parameters, while
significantly improving network performance. We conducted a series of
experiments on ImageNet-1k, MS COCO, and VOC datasets, which demonstrated the
superiority of our approach in various tasks including classification, object
detection, and semantic segmentation. Of particular importance, we believe that
it is time to shift focus from spatial features to receptive-field spatial
features for current spatial attention mechanisms. By doing so, we can further
improve network performance and achieve even better results. The code and
pre-trained models for the relevant tasks can be found at
https://github.com/Liuchen1997/RFAConv.Comment: 14 pages, 5 figure
AKConv: Convolutional Kernel with Arbitrary Sampled Shapes and Arbitrary Number of Parameters
Neural networks based on convolutional operations have achieved remarkable
results in the field of deep learning, but there are two inherent flaws in
standard convolutional operations. On the one hand, the convolution operation
be confined to a local window and cannot capture information from other
locations, and its sampled shapes is fixed. On the other hand, the size of the
convolutional kernel is fixed to k k, which is a fixed square shape,
and the number of parameters tends to grow squarely with size. It is obvious
that the shape and size of targets are various in different datasets and at
different locations. Convolutional kernels with fixed sample shapes and squares
do not adapt well to changing targets. In response to the above questions, the
Alterable Kernel Convolution (AKConv) is explored in this work, which gives the
convolution kernel an arbitrary number of parameters and arbitrary sampled
shapes to provide richer options for the trade-off between network overhead and
performance. In AKConv, we define initial positions for convolutional kernels
of arbitrary size by means of a new coordinate generation algorithm. To adapt
to changes for targets, we introduce offsets to adjust the shape of the samples
at each position. Moreover, we explore the effect of the neural network by
using the AKConv with the same size and different initial sampled shapes.
AKConv completes the process of efficient feature extraction by irregular
convolutional operations and brings more exploration options for convolutional
sampling shapes. Object detection experiments on representative datasets
COCO2017, VOC 7+12 and VisDrone-DET2021 fully demonstrate the advantages of
AKConv. AKConv can be used as a plug-and-play convolutional operation to
replace convolutional operations to improve network performance. The code for
the relevant tasks can be found at https://github.com/CV-ZhangXin/AKConv.Comment: 10 pages, 5 figure
The Role of High Mobility Group Box 1 in Ischemic Stroke
High-mobility group box 1 protein (HMGB1) is a novel, cytokine-like, and ubiquitous, highly conserved, nuclear protein that can be actively secreted by microglia or passively released by necrotic neurons. Ischemic stroke is a leading cause of death and disability worldwide, and the outcome is dependent on the amount of hypoxia-related neuronal death in the cerebral ischemic region. Acting as an endogenous danger-associated molecular pattern (DAMP) protein, HMGB1 mediates cerebral inflammation and brain injury and participates in the pathogenesis of ischemic stroke. It is thought that HMGB1 signals via its presumed receptors, such as toll-like receptors (TLRs), matrix metalloproteinase (MMP) enzymes, and receptor for advanced glycation end products (RAGEs) during ischemic stroke. In addition, the release of HMGB1 from the brain into the bloodstream influences peripheral immune cells. However, the role of HMGB1 in ischemic stroke may be more complex than this and has not yet been clarified. Here, we summarize and review the research into HMGB1 in ischemic stroke
Transmembrane and coiled-coil domains 3 is a diagnostic biomarker for predicting immune checkpoint blockade efficacy in hepatocellular carcinoma
Liver hepatocellular carcinoma (LIHC) is a malignancy with a high mortality and morbidity rate worldwide. However, the pathogenesis of LIHC has still not been thoroughly studied. Transmembrane and coiled-coil domains 3 (TMCO3) encodes a monovalent cation, a member of the proton transducer 2 (CPA2) family of transporter proteins. In the present study, TMCO3 expression and its relationship with cancer prognosis, as well as its immunological role in LIHC were studied by bioinformatic analysis. We found the significant overexpression of TMCO3 in LIHC in the TCGA, HCCDB, and GEO databases. In LIHC patients, high TMCO3 expression was related to poorer overall survival (OS) and TMCO3 had good predictive accuracy for prognosis. Moreover, TMCO3 was linked to the infiltrates of certain immune cells in LIHC. The correlation of TMCO3 with immune checkpoints was also revealed. Moreover, patients with LIHC with low TMCO3 expression showed a better response to immune checkpoint blockade (ICB) than those with LIHC with high TMCO3 expression. GO and KEGG enrichment analyses indicated that TMCO3 was probably involved in the microtubule cytoskeleton organization involved in mitosis, small GTPase mediated signal transduction, and TGF-β pathway. In conclusion, TMCO3 may be a potential biomarker for LIHC prognosis and immunotherapy
Development and validation of a deep learning-based model to distinguish acetabular fractures on pelvic anteroposterior radiographs
Objective: To develop and test a deep learning (DL) model to distinguish acetabular fractures (AFs) on pelvic anteroposterior radiographs (PARs) and compare its performance to that of clinicians.Materials and methods: A total of 1,120 patients from a big level-I trauma center were enrolled and allocated at a 3:1 ratio for the DL model’s development and internal test. Another 86 patients from two independent hospitals were collected for external validation. A DL model for identifying AFs was constructed based on DenseNet. AFs were classified into types A, B, and C according to the three-column classification theory. Ten clinicians were recruited for AF detection. A potential misdiagnosed case (PMC) was defined based on clinicians’ detection results. The detection performance of the clinicians and DL model were evaluated and compared. The detection performance of different subtypes using DL was assessed using the area under the receiver operating characteristic curve (AUC).Results: The means of 10 clinicians’ sensitivity, specificity, and accuracy to identify AFs were 0.750/0.735, 0.909/0.909, and 0.829/0.822, in the internal test/external validation set, respectively. The sensitivity, specificity, and accuracy of the DL detection model were 0.926/0.872, 0.978/0.988, and 0.952/0.930, respectively. The DL model identified type A fractures with an AUC of 0.963 [95% confidence interval (CI): 0.927–0.985]/0.950 (95% CI: 0.867–0.989); type B fractures with an AUC of 0.991 (95% CI: 0.967–0.999)/0.989 (95% CI: 0.930–1.000); and type C fractures with an AUC of 1.000 (95% CI: 0.975–1.000)/1.000 (95% CI: 0.897–1.000) in the test/validation set. The DL model correctly recognized 56.5% (26/46) of PMCs.Conclusion: A DL model for distinguishing AFs on PARs is feasible. In this study, the DL model achieved a diagnostic performance comparable to or even superior to that of clinicians
DeePMD-kit v2: A software package for Deep Potential models
DeePMD-kit is a powerful open-source software package that facilitates
molecular dynamics simulations using machine learning potentials (MLP) known as
Deep Potential (DP) models. This package, which was released in 2017, has been
widely used in the fields of physics, chemistry, biology, and material science
for studying atomistic systems. The current version of DeePMD-kit offers
numerous advanced features such as DeepPot-SE, attention-based and hybrid
descriptors, the ability to fit tensile properties, type embedding, model
deviation, Deep Potential - Range Correction (DPRc), Deep Potential Long Range
(DPLR), GPU support for customized operators, model compression, non-von
Neumann molecular dynamics (NVNMD), and improved usability, including
documentation, compiled binary packages, graphical user interfaces (GUI), and
application programming interfaces (API). This article presents an overview of
the current major version of the DeePMD-kit package, highlighting its features
and technical details. Additionally, the article benchmarks the accuracy and
efficiency of different models and discusses ongoing developments.Comment: 51 pages, 2 figure
An interlaboratory comparison of aerosol inorganic ion measurements by ion chromatography : Implications for aerosol pH estimate
Water-soluble inorganic ions such as ammonium, nitrate and sulfate are major components of fine aerosols in the atmosphere and are widely used in the estimation of aerosol acidity. However, different experimental practices and instrumentation may lead to uncertainties in ion concentrations. Here, an intercomparison experiment was conducted in 10 different laboratories (labs) to investigate the consistency of inorganic ion concentrations and resultant aerosol acidity estimates using the same set of aerosol filter samples. The results mostly exhibited good agreement for major ions Cl-, SO2-4, NO-3, NHC4 and KC. However, F-, Mg2C and Ca2C were observed with more variations across the different labs. The Aerosol Chemical Speciation Monitor (ACSM) data of nonrefractory SO2-4, NO-3 and NHC4 generally correlated very well with the filter-analysis-based data in our study, but the absolute concentrations differ by up to 42 %. Cl-from the two methods are correlated, but the concentration differ by more than a factor of 3. The analyses of certified reference materials (CRMs) generally showed a good detection accuracy (DA) of all ions in all the labs, the majority of which ranged between 90 % and 110 %. The DA was also used to correct the ion concentrations to showcase the importance of using CRMs for calibration check and quality control. Better agreements were found for Cl-, SO2-4, NO-3, NHC4 and KC across the labs after their concentrations were corrected with DA; the coefficient of variation (CV) of Cl-, SO2-4, NO-3, NHC4 and KC decreased by 1.7 %, 3.4 %, 3.4 %, 1.2 % and 2.6 %, respectively, after DA correction. We found that the ratio of anion to cation equivalent concentrations (AE/CE) and ion balance (anions-cations) are not good indicators for aerosol acidity estimates, as the results in different labs did not agree well with each other. In situ aerosol pH calculated from the ISORROPIA II thermodynamic equilibrium model with measured ion and ammonia concentrations showed a similar trend and good agreement across the 10 labs. Our results indicate that although there are important uncertainties in aerosol ion concentration measurements, the estimated aerosol pH from the ISORROPIA II model is more consistent