394 research outputs found

    Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding

    Full text link
    To bridge the gap between supervised semantic segmentation and real-world applications that acquires one model to recognize arbitrary new concepts, recent zero-shot segmentation attracts a lot of attention by exploring the relationships between unseen and seen object categories, yet requiring large amounts of densely-annotated data with diverse base classes. In this paper, we propose a new open-world semantic segmentation pipeline that makes the first attempt to learn to segment semantic objects of various open-world categories without any efforts on dense annotations, by purely exploiting the image-caption data that naturally exist on the Internet. Our method, Vision-language-driven Semantic Segmentation (ViL-Seg), employs an image and a text encoder to generate visual and text embeddings for the image-caption data, with two core components that endow its segmentation ability: First, the image encoder is jointly trained with a vision-based contrasting and a cross-modal contrasting, which encourage the visual embeddings to preserve both fine-grained semantics and high-level category information that are crucial for the segmentation task. Furthermore, an online clustering head is devised over the image encoder, which allows to dynamically segment the visual embeddings into distinct semantic groups such that they can be classified by comparing with various text embeddings to complete our segmentation pipeline. Experiments show that without using any data with dense annotations, our method can directly segment objects of arbitrary categories, outperforming zero-shot segmentation methods that require data labeling on three benchmark datasets.Comment: Accepted to ECCV 202

    Rest–activity rhythm associated with depressive symptom severity and attention among patients with major depressive disorder: a 12-month follow-up study

    Get PDF
    IntroductionPatients with depressive disorder demonstrate rest–activity rhythm disturbances and cognitive function impairment. This study examined the association of individual rest–activity rhythm changes over time with mood symptoms and attention.MethodsWe recruited 15 adult outpatients with a diagnosis of major depressive disorder from a single medical center and observed them for 12 months. Weekly rest–activity parameters, including rhythm characteristics generated from nonparametric circadian rhythm analysis, were retrieved from actigraphy data. Attention was evaluated weekly with a smartphone-based psychomotor vigilance test upon awakening. Depressive symptom severity was evaluated using the Beck Depression Inventory (BDI) fortnightly. The association of rest–activity parameters with BDI score and attention was examined using generalized linear mixed regression. A fixed-effects analysis was used to examine the association between rest–activity parameters and depressive episodes.ResultsAn advanced bedtime and most active continuous 10 h starting time were associated with depressive symptom severity but also associated with higher vigilance test performance. A longer sleep duration, mainly due to an earlier bedtime, was associated with depressive symptom severity. Compared to remission, sleep duration was 27.8 min longer during depressive episodes, and bed time was 24 min earlier. A shorter sleep duration and increased activity during sleep were associated with poorer attention.DiscussionRest–activity rhythms change with mood symptoms among patients with depressive disorder. The circadian rhythms of rest–activity among patients with depressive disorder should be distinguished during various mood states in future studies

    AutoEncoding Tree for City Generation and Applications

    Full text link
    City modeling and generation have attracted an increased interest in various applications, including gaming, urban planning, and autonomous driving. Unlike previous works focused on the generation of single objects or indoor scenes, the huge volumes of spatial data in cities pose a challenge to the generative models. Furthermore, few publicly available 3D real-world city datasets also hinder the development of methods for city generation. In this paper, we first collect over 3,000,000 geo-referenced objects for the city of New York, Zurich, Tokyo, Berlin, Boston and several other large cities. Based on this dataset, we propose AETree, a tree-structured auto-encoder neural network, for city generation. Specifically, we first propose a novel Spatial-Geometric Distance (SGD) metric to measure the similarity between building layouts and then construct a binary tree over the raw geometric data of building based on the SGD metric. Next, we present a tree-structured network whose encoder learns to extract and merge spatial information from bottom-up iteratively. The resulting global representation is reversely decoded for reconstruction or generation. To address the issue of long-dependency as the level of the tree increases, a Long Short-Term Memory (LSTM) Cell is employed as a basic network element of the proposed AETree. Moreover, we introduce a novel metric, Overlapping Area Ratio (OAR), to quantitatively evaluate the generation results. Experiments on the collected dataset demonstrate the effectiveness of the proposed model on 2D and 3D city generation. Furthermore, the latent features learned by AETree can serve downstream urban planning applications

    Paeoniflorin has anti-inflammation and neurogenesis functions through nicotinic acetylcholine receptors in cerebral ischemia-reperfusion injury rats

    Get PDF
    Objective(s): Paeoniflorin (PF) has anti-oxidation, anti-inflammation, anti-apoptosis, and neuroprotection pharmacological effects against ischemic injury. The aim of the present study was to investigate the neuroprotection mechanisms of PF in cerebral ischemia-reperfusion injury rats.Materials and Methods: We established an animal model of cerebral infarct by occlusion of the middle cerebral artery for 15 min, followed by reperfusion, and PF was administered 24 hr later (20 mg/kg, intraperitoneally for 6 days) after reperfusionResults: Treatment with PF reduced the neurological deficit score, improved motor function, decreased cell counts of nicotinic acetylcholine receptor (nAChR) α4β2 immunoreactive cells, and increased cell counts of nAChR α7. Furthermore, PF administration suppressed neuronal apoptosis and promoted neurogenesis.Conclusion: PF rescued neurological deficit and underlying mechanisms were inhibition of neurological apoptosis and inflammation by nAChRs

    Berberine Ameliorates High Glucose-Induced Cardiomyocyte Injury via AMPK Signaling Activation to Stimulate Mitochondrial Biogenesis and Restore Autophagic Flux

    Get PDF
    Background: Type II diabetes (T2D)-induced cardiomyocyte hypertrophy is closely linked to the impairment of mitochondrial function. Berberine has been shown to be a promising effect for hypoglycemia in T2D models. High glucose-induced cardiomyocyte hypertrophy in vitro has been reported. The present study investigated the protective effect and the underlying mechanism of berberine on high glucose-induced H9C2 cell line.Methods: High glucose-induced H9C2 cell line was used to mimic the hyperglycemia resulting in cardiomyocyte hypertrophy. Berberine was used to rescue in this model and explore the mechanism in it. Confocal microscopy, immunofluorescence, RT-PCR, and western blot analysis were performed to evaluate the protective effects of berberine in high glucose-induced H9C2 cell line.Results: Berberine dramatically alleviated hypertrophy of H9C2 cell line and significantly ameliorated mitochondrial function by rectifying the imbalance of fusion and fission in mitochondrial dynamics. Furthermore, berberine further promoted mitogenesis and cleared the damaged mitochondria via mitophagy. In addition, berberine also restored autophagic flux in high glucose-induced cardiomyocyte injury via AMPK signaling pathway activation.Conclusion: Berberine ameliorates high glucose-induced cardiomyocyte injury via AMPK signaling pathway activation to stimulate mitochondrial biogenesis and restore autophagicflux in H9C2 cell line

    Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

    Full text link
    Neural Architecture Search (NAS) has demonstrated its efficacy in computer vision and potential for ranking systems. However, prior work focused on academic problems, which are evaluated at small scale under well-controlled fixed baselines. In industry system, such as ranking system in Meta, it is unclear whether NAS algorithms from the literature can outperform production baselines because of: (1) scale - Meta ranking systems serve billions of users, (2) strong baselines - the baselines are production models optimized by hundreds to thousands of world-class engineers for years since the rise of deep learning, (3) dynamic baselines - engineers may have established new and stronger baselines during NAS search, and (4) efficiency - the search pipeline must yield results quickly in alignment with the productionization life cycle. In this paper, we present Rankitect, a NAS software framework for ranking systems at Meta. Rankitect seeks to build brand new architectures by composing low level building blocks from scratch. Rankitect implements and improves state-of-the-art (SOTA) NAS methods for comprehensive and fair comparison under the same search space, including sampling-based NAS, one-shot NAS, and Differentiable NAS (DNAS). We evaluate Rankitect by comparing to multiple production ranking models at Meta. We find that Rankitect can discover new models from scratch achieving competitive tradeoff between Normalized Entropy loss and FLOPs. When utilizing search space designed by engineers, Rankitect can generate better models than engineers, achieving positive offline evaluation and online A/B test at Meta scale.Comment: Wei Wen and Kuang-Hung Liu contribute equall

    Automated Facial Recognition for Noonan Syndrome Using Novel Deep Convolutional Neural Network With Additive Angular Margin Loss

    Get PDF
    BackgroundNoonan syndrome (NS), a genetically heterogeneous disorder, presents with hypertelorism, ptosis, dysplastic pulmonary valve stenosis, hypertrophic cardiomyopathy, and small stature. Early detection and assessment of NS are crucial to formulating an individualized treatment protocol. However, the diagnostic rate of pediatricians and pediatric cardiologists is limited. To overcome this challenge, we propose an automated facial recognition model to identify NS using a novel deep convolutional neural network (DCNN) with a loss function called additive angular margin loss (ArcFace).MethodsThe proposed automated facial recognition models were trained on dataset that included 127 NS patients, 163 healthy children, and 130 children with several other dysmorphic syndromes. The photo dataset contained only one frontal face image from each participant. A novel DCNN framework with ArcFace loss function (DCNN-Arcface model) was constructed. Two traditional machine learning models and a DCNN model with cross-entropy loss function (DCNN-CE model) were also constructed. Transfer learning and data augmentation were applied in the training process. The identification performance of facial recognition models was assessed by five-fold cross-validation. Comparison of the DCNN-Arcface model to two traditional machine learning models, the DCNN-CE model, and six physicians were performed.ResultsAt distinguishing NS patients from healthy children, the DCNN-Arcface model achieved an accuracy of 0.9201 ± 0.0138 and an area under the receiver operator characteristic curve (AUC) of 0.9797 ± 0.0055. At distinguishing NS patients from children with several other genetic syndromes, it achieved an accuracy of 0.8171 ± 0.0074 and an AUC of 0.9274 ± 0.0062. In both cases, the DCNN-Arcface model outperformed the two traditional machine learning models, the DCNN-CE model, and six physicians.ConclusionThis study shows that the proposed DCNN-Arcface model is a promising way to screen NS patients and can improve the NS diagnosis rate
    • …
    corecore