1,155 research outputs found

    SNR-Based Teachers-Student Technique for Speech Enhancement

    Full text link
    It is very challenging for speech enhancement methods to achieves robust performance under both high signal-to-noise ratio (SNR) and low SNR simultaneously. In this paper, we propose a method that integrates an SNR-based teachers-student technique and time-domain U-Net to deal with this problem. Specifically, this method consists of multiple teacher models and a student model. We first train the teacher models under multiple small-range SNRs that do not coincide with each other so that they can perform speech enhancement well within the specific SNR range. Then, we choose different teacher models to supervise the training of the student model according to the SNR of the training data. Eventually, the student model can perform speech enhancement under both high SNR and low SNR. To evaluate the proposed method, we constructed a dataset with an SNR ranging from -20dB to 20dB based on the public dataset. We experimentally analyzed the effectiveness of the SNR-based teachers-student technique and compared the proposed method with several state-of-the-art methods.Comment: Published in 2020 IEEE International Conference on Multimedia and Expo (ICME 2020

    UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition

    Full text link
    Speech enhancement at extremely low signal-to-noise ratio (SNR) condition is a very challenging problem and rarely investigated in previous works. This paper proposes a robust speech enhancement approach (UNetGAN) based on U-Net and generative adversarial learning to deal with this problem. This approach consists of a generator network and a discriminator network, which operate directly in the time domain. The generator network adopts a U-Net like structure and employs dilated convolution in the bottleneck of it. We evaluate the performance of the UNetGAN at low SNR conditions (up to -20dB) on the public benchmark. The result demonstrates that it significantly improves the speech quality and substantially outperforms the representative deep learning models, including SEGAN, cGAN fo SE, Bidirectional LSTM using phase-sensitive spectrum approximation cost function (PSA-BLSTM) and Wave-U-Net regarding Short-Time Objective Intelligibility (STOI) and Perceptual evaluation of speech quality (PESQ).Comment: Published in Interspeech 201

    Charge Measurement of Cosmic Ray Nuclei with the Plastic Scintillator Detector of DAMPE

    Full text link
    One of the main purposes of the DArk Matter Particle Explorer (DAMPE) is to measure the cosmic ray nuclei up to several tens of TeV or beyond, whose origin and propagation remains a hot topic in astrophysics. The Plastic Scintillator Detector (PSD) on top of DAMPE is designed to measure the charges of cosmic ray nuclei from H to Fe and serves as a veto detector for discriminating gamma-rays from charged particles. We propose in this paper a charge reconstruction procedure to optimize the PSD performance in charge measurement. Essentials of our approach, including track finding, alignment of PSD, light attenuation correction, quenching and equalization correction are described detailedly in this paper after a brief description of the structure and operational principle of the PSD. Our results show that the PSD works very well and almost all the elements in cosmic rays from H to Fe are clearly identified in the charge spectrum.Comment: 20 pages, 4 figure

    IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception

    Full text link
    Multi-agent collaborative perception has emerged as a widely recognized technology in the field of autonomous driving in recent years. However, current collaborative perception predominantly relies on LiDAR point clouds, with significantly less attention given to methods using camera images. This severely impedes the development of budget-constrained collaborative systems and the exploitation of the advantages offered by the camera modality. This work proposes an instance-level fusion transformer for visual collaborative perception (IFTR), which enhances the detection performance of camera-only collaborative perception systems through the communication and sharing of visual features. To capture the visual information from multiple agents, we design an instance feature aggregation that interacts with the visual features of individual agents using predefined grid-shaped bird eye view (BEV) queries, generating more comprehensive and accurate BEV features. Additionally, we devise a cross-domain query adaptation as a heuristic to fuse 2D priors, implicitly encoding the candidate positions of targets. Furthermore, IFTR optimizes communication efficiency by sending instance-level features, achieving an optimal performance-bandwidth trade-off. We evaluate the proposed IFTR on a real dataset, DAIR-V2X, and two simulated datasets, OPV2V and V2XSet, achieving performance improvements of 57.96%, 9.23% and 12.99% in AP@70 metrics compared to the previous SOTAs, respectively. Extensive experiments demonstrate the superiority of IFTR and the effectiveness of its key components. The code is available at https://github.com/wangsh0111/IFTR

    Wired Perspectives: Multi-View Wire Art Embraces Generative AI

    Full text link
    Creating multi-view wire art (MVWA), a static 3D sculpture with diverse interpretations from different viewpoints, is a complex task even for skilled artists. In response, we present DreamWire, an AI system enabling everyone to craft MVWA easily. Users express their vision through text prompts or scribbles, freeing them from intricate 3D wire organisation. Our approach synergises 3D B\'ezier curves, Prim's algorithm, and knowledge distillation from diffusion models or their variants (e.g., ControlNet). This blend enables the system to represent 3D wire art, ensuring spatial continuity and overcoming data scarcity. Extensive evaluation and analysis are conducted to shed insight on the inner workings of the proposed system, including the trade-off between connectivity and visual aesthetics.Comment: Project page: https://dreamwireart.github.i

    Case Report: A management strategy and clinical analysis of primary squamous cell carcinoma of the colon

    Get PDF
    Primary colorectal squamous cell carcinoma (CSCC) is a rare pathological subtype. Currently, clinical data with regards to its prognosis and treatment is limited, and there is no optimal treatment method. The case presented involves a proficient mismatch repair (pMMR) and microsatellite-stable (MSS) Colorectal cancer (CRC) patient with squamous cell carcinoma (SCC) located transversely in the colon. Based on the imaging assessment, the tumor infiltration depth is classified as T4. After receiving 4 cycles of neoadjuvant treatment with oxaliplatin and capecitabine (XELOX), the patients were evaluated for partial response (PR) in 2 cycles and stable disease (SD) in 4 cycles. The patient underwent a right hemicolectomy and received postoperative paclitaxel/cisplatin (TC) adjuvant chemotherapy. After 23 months, a systemic examination revealed abdominal metastasis. A needle biopsy was conducted on the detected abdominal metastases, with the resulting pathology indicating the presence of metastatic SCC. The individual exhibited expression of programmed cell death ligand 1 (PD-L1) and a mutation in the TP53 gene. Considering the patient’s disease recurrence based on medical history, a treatment plan was formulated. This involved Sintilimab plus Cetuximab and the combination of leucovorin, fluorouracil, and irinotecan (FOLFIRI) regimen. The patient received four cycles of treatment with an efficacy evaluation of SD- and seven cycles of treatment with an efficacy evaluation of SD+, which resulted in a progression-free survival (PFS) duration of 7 months. This case study presents the conventional XELOX chemotherapy protocol, which has shown limited effectiveness, and highlights the favorable results achieved by implementing the TC adjuvant chemotherapy regimen in individuals diagnosed with primary colonic SCC. Furthermore, combining immune checkpoint blockade (ICB) with other therapies for patients with advanced disease is anticipated to provide an extended duration of survival

    Effects of Forward and Reverse Shear Displacements on Geometric and Hydraulic Characteristics of Single Rough Fracture by the Finite Volume Method

    Get PDF
    AbstractShear displacement will lead to the change of rock fracture space and then affect seepage characteristics of the fracture, but for the same rock fracture, whether the spatial geometry and seepage characteristics of the fracture can be consistent under the forward and reverse shear displacements is a new question. In this paper, the 2D rough fracture profile was used to establish models of different shear displacements in the forward and reverse directions without contact zone, and the geometric distribution characteristics of the fracture space with shear displacements were analyzed. The FVM (finite volume method) was adopted to calculate and simulate the hydraulic characteristics of the relative seepage direction (forward and reverse flow) under different pressure gradients at various shear displacement models. The results showed that under the same shear displacement, the spatial geometry characteristics of forward and reverse shear displacements are consistent after the initial angle of the fracture profile is eliminated. The slope of equivalent hydraulic aperture decreases with the shear displacement, and the amplitude of the non-Darcy coefficient difference increases with the shear displacement, which are inconsistent in the forward and reverse directions, which are negatively correlated with the directional roughness of the initial fracture profile. It shows that the directional roughness inconsistency between the forward and reverse directions of fracture profile is the primary factor leading to the difference of seepage characteristic parameters under the forward and reverse shear displacements

    Slope-based shape cluster method for smart metering load profiles

    Get PDF
    Cluster analysis is used to study the group of load profiles from smart meters to improve the operability in distribution network. The traditional K-means clustering analysis method employs Euclidean distance as similarity measurement, which is insufficient in reflecting the shape similarities of load profiles. In this work, we propose a novel shape cluster method based on the segmented slope of load profiles. Compared with traditional K-means and two improved algorithms, the proposed method can improve the clustering accuracy and efficiency by capturing the shape features of smart metering load profiles
    corecore