101 research outputs found

    Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition

    Full text link
    Achieving high accuracy with low latency has always been a challenge in streaming end-to-end automatic speech recognition (ASR) systems. By attending to more future contexts, a streaming ASR model achieves higher accuracy but results in larger latency, which hurts the streaming performance. In the Mask-CTC framework, an encoder network is trained to learn the feature representation that anticipates long-term contexts, which is desirable for streaming ASR. Mask-CTC-based encoder pre-training has been shown beneficial in achieving low latency and high accuracy for triggered attention-based ASR. However, the effectiveness of this method has not been demonstrated for various model architectures, nor has it been verified that the encoder has the expected look-ahead capability to reduce latency. This study, therefore, examines the effectiveness of Mask-CTCbased pre-training for models with different architectures, such as Transformer-Transducer and contextual block streaming ASR. We also discuss the effect of the proposed pre-training method on obtaining accurate output spike timing.Comment: Accepted to EUSIPCO 202

    Conversation-oriented ASR with multi-look-ahead CBS architecture

    Full text link
    During conversations, humans are capable of inferring the intention of the speaker at any point of the speech to prepare the following action promptly. Such ability is also the key for conversational systems to achieve rhythmic and natural conversation. To perform this, the automatic speech recognition (ASR) used for transcribing the speech in real-time must achieve high accuracy without delay. In streaming ASR, high accuracy is assured by attending to look-ahead frames, which leads to delay increments. To tackle this trade-off issue, we propose a multiple latency streaming ASR to achieve high accuracy with zero look-ahead. The proposed system contains two encoders that operate in parallel, where a primary encoder generates accurate outputs utilizing look-ahead frames, and the auxiliary encoder recognizes the look-ahead portion of the primary encoder without look-ahead. The proposed system is constructed based on contextual block streaming (CBS) architecture, which leverages block processing and has a high affinity for the multiple latency architecture. Various methods are also studied for architecting the system, including shifting the network to perform as different encoders; as well as generating both encoders' outputs in one encoding pass.Comment: Submitted to ICASSP202

    Rheo-optical Raman study of microscopic deformation in high-density polyethylene under hot drawing

    Get PDF
    In situ observation of the microscopic structural changes in high-density polyethylene during hot drawing was performed by incorporating a temperature-controlled tensile machine into a Raman spectroscopy apparatus. It was found that the load sharing and molecular orientation during elongation drastically changed at 50°C. The microscopic stress of the crystalline chains decreased with increasing temperature and diminished around 50°C. Moreover, the orientation of the crystalline chains was greatly promoted above 50°C. These microscopic structural changes were caused by the thermal activation of the molecular motion within lamellar crystalline chains owing to the onset of relaxation of the crystalline phase. © 2015 Elsevier Ltd. All rights reserved.Embargo Period 24 month

    Deformation mechanism of high-density polyethylene probed by in situ Raman spectroscopy

    Get PDF
    The microscopic mechanism of high-density polyethylene under uniaxial drawing is investigated using in situ Raman spectroscopy. From the peak shifts of the symmetric and anti-symmetric C-C stretching modes, it is found that the load sharing on the polymer chain in the yielding region is anisotropic with stretching along the chain and compression perpendicular to the chain. The orientation functions (〈P2〉 and 〈P4〉) as well as the orientation distribution function (N(θ)) are determined from the polarized Raman spectra. The molecular orientation with cold drawing is found to proceed more effectively for lower crystallinity specimens. In the yielding region, it is also found that N(θ) has a maximum at the polar angle θ = 30-70°. This peculiar behavior in the microscopic scale is explained by the preferential collapse of spherulites and the existence of lamellar clusters as the bulky mobile units

    The reoperation rate after single-level ACDFs

    Get PDF
    Introduction : The plate fixation for anterior cervical discectomy and fusion (ACDF) has become increasingly widespread for facilitating early mobilization and improving fusion rate. However, apart from multilevel operations, there is still some controversy over its use for single-level ACDF. This retrospective study has compared the reoperation rates after single-level ACDFs performed at our institution between the procedures with and without plate fixation. Methods : This retrospective study included a total of 131 patients with ≥ 1-year of follow-up after a single-level ACDF, consisting of 100 patients without plating (conventional ACDF) and 31 patients with plate fixation (plated ACDF). Results : Eleven patients (8.4% of all patients) : four conventional ACDFs (4% of the conventional ACDFs) and seven plated ACDFs (22.6% of the plated ACDFs), had reoperation surgeries. The incidence of reoperation was significantly higher in the plated ACDFs than in the conventional ACDFs (P = 0.0037). The log-rank test revealed a significant difference (P = 0.00003) in 5-year reoperation-free survival rates between the conventional (96.9%) and the plated groups (68.3%). Conclusion : Anterior cervical plating may have a negative impact on the adjacent segment integrity, resulting in an increased reoperation rate after a single-level ACDF at relatively shorter postoperative time points

    Pediatric cardiorespiratory failure successfully managed with venoarterial-venous extracorporeal membrane oxygenation: a case report

    Get PDF
    Background: Venoarterial-venous extracorporeal membrane oxygenation (VAV ECMO) configuration is a combined procedure of extracorporeal membrane oxygenation (ECMO). The proportion of cardiac and respiratory support can be controlled by adjusting arterial and venous return. Therefore, VAV ECMO can be applicable as a bridging therapy in the transition from venoarterial (VA) to venovenous (VV) ECMO. Case presentation: We present an 11-year-old girl with chemotherapy-induced myocarditis requiring extracorporeal cardiorespiratory support. She showed progressive hypotension, tachycardia, hyperlactemia, and tachypnea under support of catecholamines. Echocardiography showed severe left ventricular hypokinesis with an ejection fraction of 30 %. She was placed on VA ECMO with a drainage catheter from the right femoral vein (19.5 Fr) and a return catheter to the right femoral artery (16.5 Fr). Extracorporeal circulation was initiated at a blood flow of 2.0 L/min (59 mL/kg/min). On day 31, although cardiac function had improved, persistent pulmonary failure made weaning from VA ECMO difficult. We planned transition from VA ECMO to VAV ECMO to ensure gradual tapering of extracorporeal cardiac support while evaluating cardiopulmonary function. An additional return cannula (13.5 Fr) was inserted from the right internal jugular vein, which was connected to the circuit branch from the original returning cannula. We then gradually shifted the blood from the femoral artery to the right internal jugular vein over 24 h. She was successfully switched from VA to VV ECMO via VAV ECMO. Conclusions: VAV ECMO might be an option in ensuring oxygenation to the coronary circulation and allowing time to adequately evaluate cardiac function during transition from VA to VV ECMO. Further investigations using larger cohorts are necessary to validate the efficacy of VAV ECMO as a bridging therapy in the transition from VA to VV ECMO.This work was supported by a JSPS KAKENHI Grant (Number JP 16K09541)
    corecore