89 research outputs found

    APPT : Asymmetric Parallel Point Transformer for 3D Point Cloud Understanding

    Full text link
    Transformer-based networks have achieved impressive performance in 3D point cloud understanding. However, most of them concentrate on aggregating local features, but neglect to directly model global dependencies, which results in a limited effective receptive field. Besides, how to effectively incorporate local and global components also remains challenging. To tackle these problems, we propose Asymmetric Parallel Point Transformer (APPT). Specifically, we introduce Global Pivot Attention to extract global features and enlarge the effective receptive field. Moreover, we design the Asymmetric Parallel structure to effectively integrate local and global information. Combined with these designs, APPT is able to capture features globally throughout the entire network while focusing on local-detailed features. Extensive experiments show that our method outperforms the priors and achieves state-of-the-art on several benchmarks for 3D point cloud understanding, such as 3D semantic segmentation on S3DIS, 3D shape classification on ModelNet40, and 3D part segmentation on ShapeNet

    TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training

    Full text link
    Contrastive Language-Image Pre-training (CLIP) has demonstrated impressive capabilities in open-vocabulary classification. The class token in the image encoder is trained to capture the global features to distinguish different text descriptions supervised by contrastive loss, making it highly effective for single-label classification. However, it shows poor performance on multi-label datasets because the global feature tends to be dominated by the most prominent class and the contrastive nature of softmax operation aggravates it. In this study, we observe that the multi-label classification results heavily rely on discriminative local features but are overlooked by CLIP. As a result, we dissect the preservation of patch-wise spatial information in CLIP and proposed a local-to-global framework to obtain image tags. It comprises three steps: (1) patch-level classification to obtain coarse scores; (2) dual-masking attention refinement (DMAR) module to refine the coarse scores; (3) class-wise reidentification (CWR) module to remedy predictions from a global perspective. This framework is solely based on frozen CLIP and significantly enhances its multi-label classification performance on various benchmarks without dataset-specific training. Besides, to comprehensively assess the quality and practicality of generated tags, we extend their application to the downstream task, i.e., weakly supervised semantic segmentation (WSSS) with generated tags as image-level pseudo labels. Experiments demonstrate that this classify-then-segment paradigm dramatically outperforms other annotation-free segmentation methods and validates the effectiveness of generated tags. Our code is available at https://github.com/linyq2117/TagCLIP.Comment: Accepted by AAAI202

    Automated interpretation of congenital heart disease from multi-view echocardiograms

    Full text link
    Congenital heart disease (CHD) is the most common birth defect and the leading cause of neonate death in China. Clinical diagnosis can be based on the selected 2D key-frames from five views. Limited by the availability of multi-view data, most methods have to rely on the insufficient single view analysis. This study proposes to automatically analyze the multi-view echocardiograms with a practical end-to-end framework. We collect the five-view echocardiograms video records of 1308 subjects (including normal controls, ventricular septal defect (VSD) patients and atrial septal defect (ASD) patients) with both disease labels and standard-view key-frame labels. Depthwise separable convolution-based multi-channel networks are adopted to largely reduce the network parameters. We also approach the imbalanced class problem by augmenting the positive training samples. Our 2D key-frame model can diagnose CHD or negative samples with an accuracy of 95.4\%, and in negative, VSD or ASD classification with an accuracy of 92.3\%. To further alleviate the work of key-frame selection in real-world implementation, we propose an adaptive soft attention scheme to directly explore the raw video data. Four kinds of neural aggregation methods are systematically investigated to fuse the information of an arbitrary number of frames in a video. Moreover, with a view detection module, the system can work without the view records. Our video-based model can diagnose with an accuracy of 93.9\% (binary classification), and 92.1\% (3-class classification) in a collected 2D video testing set, which does not need key-frame selection and view annotation in testing. The detailed ablation study and the interpretability analysis are provided.Comment: Published in Medical Image Analysi

    Detection of neural connections with ex vivo MRI using a ferritin-encoding trans-synaptic virus

    Get PDF
    The elucidation of neural networks is essential to understanding the mechanisms of brain functions and brain disorders. Neurotropic virus-based trans-synaptic tracing tools have become an effective method for dissecting the structure and analyzing the function of neural-circuitry. However, these tracing systems rely on fluorescent signals, making it hard to visualize the panorama of the labeled networks in mammalian brain in vivo. One MRI method, Diffusion Tensor Imaging (DTI), is capable of imaging the networks of the whole brain in live animals but without information of anatomical connections through synapses. In this report, a chimeric gene coding for ferritin and enhanced green fluorescent protein (EGFP) was integrated into Vesicular stomatitis virus (VSV), a neurotropic virus that is able to spread anterogradely in synaptically connected networks. After the animal was injected with the recombinant VSV (rVSV), rVSV-Ferritin-EGFP, into the somatosensory cortex (SC) for four days, the labeled neural-network was visualized in the postmortem whole brain with a T2-weighted MRI sequence. The modified virus transmitted from SC to synaptically connected downstream regions. The results demonstrate that rVSV-Ferritin-EGFP could be used as a bimodal imaging vector for detecting synaptically connected neural-network with both ex vivo MRI and fluorescent imaging. The strategy in the current study has the potential to longitudinally monitor the global structure of a given neural-network in living animals

    Meibomian Gland Dysfunction in Type 2 Diabetic Patients

    Get PDF
    Purpose. To investigate meibomian gland and tear film function in patients with type 2 diabetes. Methods. This prospective study compared changes in meibomian gland and tear film function in type 2 diabetic patients with nondiabetic patients. Meibomian gland function was evaluated by measuring lipid layer thickness (LLT), grading of meibomian gland loss, lid margin abnormalities, and expression of meibum. Tear film function was assessed by measuring tear breakup time (TBUT), the Schirmer I test, noninvasive breakup time (NIBUT), tear meniscus height (TMH), and corneal fluorescein staining. Results. Meibography scores were significantly higher in the diabetic group compared with the nondiabetic group (p=0.004). The number of expressible glands was significantly lower in the diabetic group in temporal, central, and nasal third of the lower eyelid (nasal: p=0.002; central: p=0.040; and temporal: p=0.039). The lid margin abnormality score was significantly higher in the diabetic group than in the nondiabetic group (p=0.04). There was no statistically significant difference in the tear film function parameters between the two groups. Conclusions. Meibomian gland dysfunction (MGD) in type 2 diabetic patients is more severe compared with nondiabetic patients. Overall, most of the diabetic patients manifest as having asymptomatic MGD

    Integrative Analysis of Identifying Methylation-Driven Genes Signature Predicts Prognosis in Colorectal Carcinoma

    Get PDF
    BackgroundAberrant DNA methylation is a critical regulator of gene expression and plays a crucial role in the occurrence, progression, and prognosis of colorectal cancer (CRC). We aimed to identify methylation-driven genes by integrative epigenetic and transcriptomic analysis to predict the prognosis of CRC patients.MethodsMethylation-driven genes were selected for CRC using a MethylMix algorithm and LASSO regression screening strategy, and were further used to construct a prognostic risk-assessment model. The Cancer Genome Atlas (TCGA) database was obtained as the training set for both the screening of methylation-driven genes and the effect of genes signature on CRC prognosis. Then, the prognostic genes signature was validated in three independent expression arrays of CRC data from Gene Expression Omnibus (GEO).ResultsWe identified 143 methylation-driven genes, of which the combination of BATF, PHYHIPL, RBP1, and PNPLA4 expression levels was screened as a better prognostic model with the best area under the curve (AUC) (AUC = 0.876). Compared with patients in the low-risk group, CRC patients in the high-risk group had significantly poorer overall survival in the training set (HR = 2.184, 95% CI: 1.404–3.396, P < 0.001). Similar results were observed in the validation set. Moreover, VanderWeele’s mediation analysis indicated that the effect of methylation on prognosis was mediated by the levels of their expression (HRindirect = 1.473, P = 0.001, Proportion mediated, 69.10%).ConclusionsWe identified a four-gene prognostic signature by integrative analysis and developed a risk-assessment model that is significantly associated with patients’ survival. Methylation-driven genes might be a potential prognostic signature for CRC patients

    Label-free immunoassay for porcine circovirus type 2 based on excessively tilted fiber grating modified with staphylococcal protein A

    Get PDF
    Using excessively tilted fiber grating (Ex-TFG) inscribed in standard single mode fiber, we developed a novel label-free immunoassay for specific detection of porcine circovirus type 2 (PCV2), which is a minim animal virus. Staphylococcal protein A (SPA) was used to modify the silanized fiber surface thus forming a SPA layer, which would greatly enhance the proportion of anti-PCV2 monoclonal antibody (MAb) bioactivity, thus improving the effectiveness of specific adsorption and binding events between anti-PCV2 MAbs and PCV2 antigens. Immunoassay experiments were carried out by monitoring the resonance wavelength shift of the proposed sensor under different PCV2 titer levels. Anti-PCV2 MAbs were thoroughly dissociated from the SPA layer by treatment with urea, and recombined to the SPA layer on the sensor surface for repeated immunoassay of PCV2. The specificity of the immunosensor was inspected by detecting porcine reproductive and respiratory syndrome virus (PRRSV) first, and PCV2 subsequently. The results showed a limit of detection (LOD) for the PCV2 immunosensor of ~9.371TCID50/mL, for a saturation value of ~4.801Ă—103TCID50/mL, with good repeatability and excellent specificity
    • …
    corecore