89 research outputs found
APPT : Asymmetric Parallel Point Transformer for 3D Point Cloud Understanding
Transformer-based networks have achieved impressive performance in 3D point
cloud understanding. However, most of them concentrate on aggregating local
features, but neglect to directly model global dependencies, which results in a
limited effective receptive field. Besides, how to effectively incorporate
local and global components also remains challenging. To tackle these problems,
we propose Asymmetric Parallel Point Transformer (APPT). Specifically, we
introduce Global Pivot Attention to extract global features and enlarge the
effective receptive field. Moreover, we design the Asymmetric Parallel
structure to effectively integrate local and global information. Combined with
these designs, APPT is able to capture features globally throughout the entire
network while focusing on local-detailed features. Extensive experiments show
that our method outperforms the priors and achieves state-of-the-art on several
benchmarks for 3D point cloud understanding, such as 3D semantic segmentation
on S3DIS, 3D shape classification on ModelNet40, and 3D part segmentation on
ShapeNet
TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
Contrastive Language-Image Pre-training (CLIP) has demonstrated impressive
capabilities in open-vocabulary classification. The class token in the image
encoder is trained to capture the global features to distinguish different text
descriptions supervised by contrastive loss, making it highly effective for
single-label classification. However, it shows poor performance on multi-label
datasets because the global feature tends to be dominated by the most prominent
class and the contrastive nature of softmax operation aggravates it. In this
study, we observe that the multi-label classification results heavily rely on
discriminative local features but are overlooked by CLIP. As a result, we
dissect the preservation of patch-wise spatial information in CLIP and proposed
a local-to-global framework to obtain image tags. It comprises three steps: (1)
patch-level classification to obtain coarse scores; (2) dual-masking attention
refinement (DMAR) module to refine the coarse scores; (3) class-wise
reidentification (CWR) module to remedy predictions from a global perspective.
This framework is solely based on frozen CLIP and significantly enhances its
multi-label classification performance on various benchmarks without
dataset-specific training. Besides, to comprehensively assess the quality and
practicality of generated tags, we extend their application to the downstream
task, i.e., weakly supervised semantic segmentation (WSSS) with generated tags
as image-level pseudo labels. Experiments demonstrate that this
classify-then-segment paradigm dramatically outperforms other annotation-free
segmentation methods and validates the effectiveness of generated tags. Our
code is available at https://github.com/linyq2117/TagCLIP.Comment: Accepted by AAAI202
Automated interpretation of congenital heart disease from multi-view echocardiograms
Congenital heart disease (CHD) is the most common birth defect and the
leading cause of neonate death in China. Clinical diagnosis can be based on the
selected 2D key-frames from five views. Limited by the availability of
multi-view data, most methods have to rely on the insufficient single view
analysis. This study proposes to automatically analyze the multi-view
echocardiograms with a practical end-to-end framework. We collect the five-view
echocardiograms video records of 1308 subjects (including normal controls,
ventricular septal defect (VSD) patients and atrial septal defect (ASD)
patients) with both disease labels and standard-view key-frame labels.
Depthwise separable convolution-based multi-channel networks are adopted to
largely reduce the network parameters. We also approach the imbalanced class
problem by augmenting the positive training samples. Our 2D key-frame model can
diagnose CHD or negative samples with an accuracy of 95.4\%, and in negative,
VSD or ASD classification with an accuracy of 92.3\%. To further alleviate the
work of key-frame selection in real-world implementation, we propose an
adaptive soft attention scheme to directly explore the raw video data. Four
kinds of neural aggregation methods are systematically investigated to fuse the
information of an arbitrary number of frames in a video. Moreover, with a view
detection module, the system can work without the view records. Our video-based
model can diagnose with an accuracy of 93.9\% (binary classification), and
92.1\% (3-class classification) in a collected 2D video testing set, which does
not need key-frame selection and view annotation in testing. The detailed
ablation study and the interpretability analysis are provided.Comment: Published in Medical Image Analysi
Detection of neural connections with ex vivo MRI using a ferritin-encoding trans-synaptic virus
The elucidation of neural networks is essential to understanding the mechanisms of brain functions and brain disorders. Neurotropic virus-based trans-synaptic tracing tools have become an effective method for dissecting the structure and analyzing the function of neural-circuitry. However, these tracing systems rely on fluorescent signals, making it hard to visualize the panorama of the labeled networks in mammalian brain in vivo. One MRI method, Diffusion Tensor Imaging (DTI), is capable of imaging the networks of the whole brain in live animals but without information of anatomical connections through synapses. In this report, a chimeric gene coding for ferritin and enhanced green fluorescent protein (EGFP) was integrated into Vesicular stomatitis virus (VSV), a neurotropic virus that is able to spread anterogradely in synaptically connected networks. After the animal was injected with the recombinant VSV (rVSV), rVSV-Ferritin-EGFP, into the somatosensory cortex (SC) for four days, the labeled neural-network was visualized in the postmortem whole brain with a T2-weighted MRI sequence. The modified virus transmitted from SC to synaptically connected downstream regions. The results demonstrate that rVSV-Ferritin-EGFP could be used as a bimodal imaging vector for detecting synaptically connected neural-network with both ex vivo MRI and fluorescent imaging. The strategy in the current study has the potential to longitudinally monitor the global structure of a given neural-network in living animals
Meibomian Gland Dysfunction in Type 2 Diabetic Patients
Purpose. To investigate meibomian gland and tear film function in patients with type 2 diabetes. Methods. This prospective study compared changes in meibomian gland and tear film function in type 2 diabetic patients with nondiabetic patients. Meibomian gland function was evaluated by measuring lipid layer thickness (LLT), grading of meibomian gland loss, lid margin abnormalities, and expression of meibum. Tear film function was assessed by measuring tear breakup time (TBUT), the Schirmer I test, noninvasive breakup time (NIBUT), tear meniscus height (TMH), and corneal fluorescein staining. Results. Meibography scores were significantly higher in the diabetic group compared with the nondiabetic group (p=0.004). The number of expressible glands was significantly lower in the diabetic group in temporal, central, and nasal third of the lower eyelid (nasal: p=0.002; central: p=0.040; and temporal: p=0.039). The lid margin abnormality score was significantly higher in the diabetic group than in the nondiabetic group (p=0.04). There was no statistically significant difference in the tear film function parameters between the two groups. Conclusions. Meibomian gland dysfunction (MGD) in type 2 diabetic patients is more severe compared with nondiabetic patients. Overall, most of the diabetic patients manifest as having asymptomatic MGD
Integrative Analysis of Identifying Methylation-Driven Genes Signature Predicts Prognosis in Colorectal Carcinoma
BackgroundAberrant DNA methylation is a critical regulator of gene expression and plays a crucial role in the occurrence, progression, and prognosis of colorectal cancer (CRC). We aimed to identify methylation-driven genes by integrative epigenetic and transcriptomic analysis to predict the prognosis of CRC patients.MethodsMethylation-driven genes were selected for CRC using a MethylMix algorithm and LASSO regression screening strategy, and were further used to construct a prognostic risk-assessment model. The Cancer Genome Atlas (TCGA) database was obtained as the training set for both the screening of methylation-driven genes and the effect of genes signature on CRC prognosis. Then, the prognostic genes signature was validated in three independent expression arrays of CRC data from Gene Expression Omnibus (GEO).ResultsWe identified 143 methylation-driven genes, of which the combination of BATF, PHYHIPL, RBP1, and PNPLA4 expression levels was screened as a better prognostic model with the best area under the curve (AUC) (AUC = 0.876). Compared with patients in the low-risk group, CRC patients in the high-risk group had significantly poorer overall survival in the training set (HR = 2.184, 95% CI: 1.404–3.396, P < 0.001). Similar results were observed in the validation set. Moreover, VanderWeele’s mediation analysis indicated that the effect of methylation on prognosis was mediated by the levels of their expression (HRindirect = 1.473, P = 0.001, Proportion mediated, 69.10%).ConclusionsWe identified a four-gene prognostic signature by integrative analysis and developed a risk-assessment model that is significantly associated with patients’ survival. Methylation-driven genes might be a potential prognostic signature for CRC patients
Label-free immunoassay for porcine circovirus type 2 based on excessively tilted fiber grating modified with staphylococcal protein A
Using excessively tilted fiber grating (Ex-TFG) inscribed in standard single mode fiber, we developed a novel label-free immunoassay for specific detection of porcine circovirus type 2 (PCV2), which is a minim animal virus. Staphylococcal protein A (SPA) was used to modify the silanized fiber surface thus forming a SPA layer, which would greatly enhance the proportion of anti-PCV2 monoclonal antibody (MAb) bioactivity, thus improving the effectiveness of specific adsorption and binding events between anti-PCV2 MAbs and PCV2 antigens. Immunoassay experiments were carried out by monitoring the resonance wavelength shift of the proposed sensor under different PCV2 titer levels. Anti-PCV2 MAbs were thoroughly dissociated from the SPA layer by treatment with urea, and recombined to the SPA layer on the sensor surface for repeated immunoassay of PCV2. The specificity of the immunosensor was inspected by detecting porcine reproductive and respiratory syndrome virus (PRRSV) first, and PCV2 subsequently. The results showed a limit of detection (LOD) for the PCV2 immunosensor of ~9.371TCID50/mL, for a saturation value of ~4.801Ă—103TCID50/mL, with good repeatability and excellent specificity
- …