146 research outputs found

    OMG-Seg: Is One Model Good Enough For All Segmentation?

    Full text link
    In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models. We propose OMG-Seg, One Model that is Good enough to efficiently and effectively handle all the segmentation tasks, including image semantic, instance, and panoptic segmentation, as well as their video counterparts, open vocabulary settings, prompt-driven, interactive segmentation like SAM, and video object segmentation. To our knowledge, this is the first model to handle all these tasks in one model and achieve satisfactory performance. We show that OMG-Seg, a transformer-based encoder-decoder architecture with task-specific queries and outputs, can support over ten distinct segmentation tasks and yet significantly reduce computational and parameter overhead across various tasks and datasets. We rigorously evaluate the inter-task influences and correlations during co-training. Code and models are available at https://github.com/lxtGH/OMG-Seg.Comment: Project Page: https://lxtgh.github.io/project/omg_seg

    Transformer-Based Visual Segmentation: A Survey

    Full text link
    Visual segmentation seeks to partition images, video frames, or point clouds into multiple segments or groups. This technique has numerous real-world applications, such as autonomous driving, image editing, robot sensing, and medical analysis. Over the past decade, deep learning-based methods have made remarkable strides in this area. Recently, transformers, a type of neural network based on self-attention originally designed for natural language processing, have considerably surpassed previous convolutional or recurrent approaches in various vision processing tasks. Specifically, vision transformers offer robust, unified, and even simpler solutions for various segmentation tasks. This survey provides a thorough overview of transformer-based visual segmentation, summarizing recent advancements. We first review the background, encompassing problem definitions, datasets, and prior convolutional methods. Next, we summarize a meta-architecture that unifies all recent transformer-based approaches. Based on this meta-architecture, we examine various method designs, including modifications to the meta-architecture and associated applications. We also present several closely related settings, including 3D point cloud segmentation, foundation model tuning, domain-aware segmentation, efficient segmentation, and medical segmentation. Additionally, we compile and re-evaluate the reviewed methods on several well-established datasets. Finally, we identify open challenges in this field and propose directions for future research. The project page can be found at https://github.com/lxtGH/Awesome-Segmenation-With-Transformer. We will also continually monitor developments in this rapidly evolving field.Comment: Work in progress. Github: https://github.com/lxtGH/Awesome-Segmenation-With-Transforme

    A token-based dynamic scheduled MAC protocol for health monitoring

    Get PDF
    Developments of wireless body area networks (WBANs) facilitate the pervasive health monitoring with mHealth applications. WBANs can support continuous health monitoring for the human body in convenience and high efficiency without any intervention. The monitoring data in health care have the characteristics of various data flows and heterogeneous data arrival rates, the transmission of which must be in timeliness and reliability, especially the burst data. Moreover, the energy-constraint nodes should be provident in energy consumption. Designing MAC protocols with high reliability and energy efficiency for WBANs is the prime consideration. In this paper, we propose a token-based two-round reservation MAC (TTR MAC) protocol based on IEEE 802.15.6 with considering the data features of health monitoring. With analyzing the characteristics of monitoring data, one-round reservation is conducted for periodic data and two-round reservation is generated adaptively for burst data to save energy. Besides, TTR MAC protocol assigns appropriate number of allocation slots to nodes in heterogeneous data arrival rates. Furthermore, a token is introduced on the basis of user priority and health severity index to indicate the transmission order of nodes with burst data, which highly decreases the average delay. In addition, a bit sequence scheduled algorithm is proposed for m-periodic (m>1) monitoring data for network capacity expansion. The simulation results show that TTR MAC protocol achieves higher energy efficiency and longer lifetime compared with IEEE 802.15.6 and other one-round reservation MAC (OR MAC) protocols for both 1-periodic and m-periodic data.info:eu-repo/semantics/publishedVersio

    Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation

    Get PDF
    Video segmentation aims to segment and track every pixel in diverse scenarios accurately. In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks of video segmentation with a unified architecture. Our framework is a near-online approach that takes a short subclip as input and outputs the corresponding spatial-temporal tube masks. To enhance the modeling of cross-tube relationships, we propose an effective way to perform tube-level linking via attention along the queries. In addition, we introduce temporal contrastive learning to instance-wise discriminative features for tube-level association. Our approach offers flexibility and efficiency for both short and long video inputs, as the length of each subclip can be varied according to the needs of datasets or scenarios. Tube-Link outperforms existing specialized architectures by a significant margin on five video segmentation datasets. Specifically, it achieves almost 13% relative improvements on VIPSeg and 4% improvements on KITTI-STEP over the strong baseline Video K-Net. When using a ResNet50 backbone on Youtube-VIS2019 and 2021, Tube-Link boosts IDOL by 3% and 4%, respectively. Code is available at https://github.com/lxtGH/Tube-Link

    TLR5 signaling enhances the proliferation of human allogeneic CD40-activated B cell induced CD4hiCD25+ regulatory T cells

    Get PDF
    Although diverse functions of different toll-like receptors (TLR) on human natural regulatory T cells have been demonstrated recently, the role of TLR-related signals on human induced regulatory T cells remain elusive. Previously our group developed an ex vivo high-efficient system in generating human alloantigen-specific CD4(hi)CD25(+) regulatory T cells from naive CD4(+)CD25(-) T cells using allogeneic CD40-activated B cells as stimulators. In this study, we investigated the role of TLR5-related signals on the generation and function of these novel CD4(hi)CD25(+) regulatory T cells. It was found that induced CD4(hi)CD25(+) regulatory T cells expressed an up-regulated level of TLR5 compared to their precursors. The blockade of TLR5 using anti-TLR5 antibodies during the co-culture decreased CD4(hi)CD25(+) regulatory T cells proliferation by induction of S phase arrest. The S phase arrest was associated with reduced ERK1/2 phosphorylation. However, TLR5 blockade did not decrease the CTLA-4, GITR and FOXP3 expressions, and the suppressive function of CD4(hi)CD25(+) regulatory T cells. In conclusion, we discovered a novel function of TLR5-related signaling in enhancing the proliferation of CD4(hi)CD25(+) regulatory T cells by promoting S phase progress but not involved in the suppressive function of human CD40-activated B cell-induced CD4(hi)CD25(+) regulatory T cells, suggesting a novel role of TLR5-related signals in the generation of induced regulatory T cells.published_or_final_versio

    Caste-specific RNA editomes in the leaf-cutting ant <i>Acromyrmex echinatior</i>

    Get PDF
    Eusocial insects have evolved the capacity to generate adults with distinct morphological, reproductive and behavioural phenotypes from the same genome. Recent studies suggest that RNA editing might enhance the diversity of gene products at the post-transcriptional level, particularly to induce functional changes in the nervous system. Using head samples from the leaf-cutting ant Acromyrmex echinatior, we compare RNA editomes across eusocial castes, identifying ca. 11,000 RNA editing sites in gynes, large workers and small workers. Those editing sites map to 800 genes functionally enriched for neurotransmission, circadian rhythm, temperature response, RNA splicing and carboxylic acid biosynthesis. Most A. echinatior editing sites are species specific, but 8–23% are conserved across ant subfamilies and likely to have been important for the evolution of eusociality in ants. The level of editing varies for the same site between castes, suggesting that RNA editing might be a general mechanism that shapes caste behaviour in ants

    Transformer-Based Visual Segmentation: A Survey.

    Get PDF
    Visual segmentation seeks to partition images, video frames, or point clouds into multiple segments or groups. This technique has numerous real-world applications, such as autonomous driving, image editing, robot sensing, and medical analysis. Over the past decade, deep learning-based methods have made remarkable strides in this area. Recently, transformers, a type of neural network based on self-attention originally designed for natural language processing, have considerably surpassed previous convolutional or recurrent approaches in various vision processing tasks. Specifically, vision transformers offer robust, unified, and even simpler solutions for various segmentation tasks. This survey provides a thorough overview of transformer-based visual segmentation, summarizing recent advancements. We first review the background, encompassing problem definitions, datasets, and prior convolutional methods. Next, we summarize a meta-architecture that unifies all recent transformer-based approaches. Based on this meta-architecture, we examine various method designs, including modifications to the meta-architecture and associated applications. We also present several specific subfields, including 3D point cloud segmentation, foundation model tuning, domain-aware segmentation, efficient segmentation, and medical segmentation. Additionally, we compile and re-evaluate the reviewed methods on several well-established datasets. Finally, we identify open challenges in this field and propose directions for future research. The project page can be found at https://github.com/lxtGH/Awesome-Segmentation-With-Transformer
    • …
    corecore