141 research outputs found

    Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks

    Full text link
    Zazo R, Lozano-Diez A, Gonzalez-Dominguez J, T. Toledano D, Gonzalez-Rodriguez J (2016) Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks. PLoS ONE 11(1): e0146917. doi:10.1371/journal.pone.0146917Long Short Term Memory (LSTM) Recurrent Neural Networks (RNNs) have recently outperformed other state-of-the-art approaches, such as i-vector and Deep Neural Networks (DNNs), in automatic Language Identification (LID), particularly when dealing with very short utterances (similar to 3s). In this contribution we present an open-source, end-to-end, LSTM RNN system running on limited computational resources (a single GPU) that outperforms a reference i-vector system on a subset of the NIST Language Recognition Evaluation (8 target languages, 3s task) by up to a 26%. This result is in line with previously published research using proprietary LSTM implementations and huge computational resources, which made these former results hardly reproducible. Further, we extend those previous experiments modeling unseen languages (out of set, OOS, modeling), which is crucial in real applications. Results show that a LSTM RNN with OOS modeling is able to detect these languages and generalizes robustly to unseen OOS languages. Finally, we also analyze the effect of even more limited test data (from 2.25s to 0.1s) proving that with as little as 0.5s an accuracy of over 50% can be achieved.This work has been supported by project CMC-V2: Caracterizacion, Modelado y Compensacion de Variabilidad en la Señal de Voz (TEC2012-37585-C02-01), funded by Ministerio de Economia y Competitividad, Spain

    Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics

    Get PDF
    Total removal of the larynx may be required to treat laryngeal cancer: speech is lost. This article shows that it may be possible to restore speech by sensing movement of the remaining speech articulators and use machine learning algorithms to derive a transformation to convert this sensor data into an acoustic signal. The resulting “silent speech,” which may be delivered in real time, is intelligible and sounds natural. The identity of the speaker is recognisable. The sensing technique involves attaching small, unobtrusive magnets to the lips and tongue and monitoring changes in the magnetic field induced by their movement

    Star-forming cores embedded in a massive cold clump: Fragmentation, collapse and energetic outflows

    Full text link
    The fate of massive cold clumps, their internal structure and collapse need to be characterised to understand the initial conditions for the formation of high-mass stars, stellar systems, and the origin of associations and clusters. We explore the onset of star formation in the 75 M_sun SMM1 clump in the region ISOSS J18364-0221 using infrared and (sub-)millimetre observations including interferometry. This contracting clump has fragmented into two compact cores SMM1 North and South of 0.05 pc radius, having masses of 15 and 10 M_sun, and luminosities of 20 and 180 L_sun. SMM1 South harbours a source traced at 24 and 70um, drives an energetic molecular outflow, and appears supersonically turbulent at the core centre. SMM1 North has no infrared counterparts and shows lower levels of turbulence, but also drives an outflow. Both outflows appear collimated and parsec-scale near-infrared features probably trace the outflow-powering jets. We derived mass outflow rates of at least 4E-5 M_sun/yr and outflow timescales of less than 1E4 yr. Our HCN(1-0) modelling for SMM1 South yielded an infall velocity of 0.14 km/s and an estimated mass infall rate of 3E-5 M_sun/yr. Both cores may harbour seeds of intermediate- or high-mass stars. We compare the derived core properties with recent simulations of massive core collapse. They are consistent with the very early stages dominated by accretion luminosity.Comment: Accepted for publication in ApJ, 14 pages, 7 figure

    Ecological changes in historically polluted soils: Metal(loid) bioaccumulation in microarthropods and their impact on community structure

    Get PDF
    International audienceSoil pollution by persistent metal(loid)s present environmental and sanitary risks. While the effects of metal(loid)s on vegetation and macrofauna have been widely studied, their impact on microarthropods (millimetre scale) and their bioaccumulation capacity have been less investigated. However, microarthropods provide important ecosystem services, contributing in particular to soil organic matter dynamics. This study focussed on the impact of metal(loid) pollution on the structure and distribution of microarthropod communities and their potential to bioaccumulate lead (Pb). Soil samples were collected from a contaminated historical site with a strong horizontal and vertical gradient of Pb concentrations. Microarthropods were extracted using the Berlese method. The field experiments showed that microarthropods were present even in extremely polluted soils (30,000 mg Pb kg− 1). However, while microarthropod abundance increased with increasing soil C/N content (R2 = 0.79), richness decreased with increasing pollution. A shift in the community structure from an oribatid-to a springtail-dominated community was observed in less polluted soils (R2 = 0.68). In addition, Pb bioamplification occurred in microarthropods, with higher Pb concentrations in predators than in detritivorous microarthropods. Finally, the importance of feeding and reproductive ecological traits as potentially relevant descriptors of springtail community structures was highlighted. This study demonstrates the interest of microarthropod communities with different trophic levels and ecological features for evaluating the global environmental impact of metal(loid) pollution on soil biological quality

    Exploring spatial-frequency-sequential relationships for motor imagery classification with recurrent neural network

    Get PDF
    Abstract Background Conventional methods of motor imagery brain computer interfaces (MI-BCIs) suffer from the limited number of samples and simplified features, so as to produce poor performances with spatial-frequency features and shallow classifiers. Methods Alternatively, this paper applies a deep recurrent neural network (RNN) with a sliding window cropping strategy (SWCS) to signal classification of MI-BCIs. The spatial-frequency features are first extracted by the filter bank common spatial pattern (FB-CSP) algorithm, and such features are cropped by the SWCS into time slices. By extracting spatial-frequency-sequential relationships, the cropped time slices are then fed into RNN for classification. In order to overcome the memory distractions, the commonly used gated recurrent unit (GRU) and long-short term memory (LSTM) unit are applied to the RNN architecture, and experimental results are used to determine which unit is more suitable for processing EEG signals. Results Experimental results on common BCI benchmark datasets show that the spatial-frequency-sequential relationships outperform all other competing spatial-frequency methods. In particular, the proposed GRU-RNN architecture achieves the lowest misclassification rates on all BCI benchmark datasets. Conclusion By introducing spatial-frequency-sequential relationships with cropping time slice samples, the proposed method gives a novel way to construct and model high accuracy and robustness MI-BCIs based on limited trials of EEG signals

    HIV-1 Vpu Neutralizes the Antiviral Factor Tetherin/BST-2 by Binding It and Directing Its Beta-TrCP2-Dependent Degradation

    Get PDF
    Host cells impose a broad range of obstacles to the replication of retroviruses. Tetherin (also known as CD317, BST-2 or HM1.24) impedes viral release by retaining newly budded HIV-1 virions on the surface of cells. HIV-1 Vpu efficiently counteracts this restriction. Here, we show that HIV-1 Vpu induces the depletion of tetherin from cells. We demonstrate that this phenomenon correlates with the ability of Vpu to counteract the antiviral activity of both overexpressed and interferon-induced endogenous tetherin. In addition, we show that Vpu co-immunoprecipitates with tetherin and β-TrCP in a tri-molecular complex. This interaction leads to Vpu-mediated proteasomal degradation of tetherin in a β-TrCP2-dependent manner. Accordingly, in conditions where Vpu-β-TrCP2-tetherin interplay was not operative, including cells stably knocked down for β-TrCP2 expression or cells expressing a dominant negative form of β-TrCP, the ability of Vpu to antagonize the antiviral activity of tetherin was severely impaired. Nevertheless, tetherin degradation did not account for the totality of Vpu-mediated counteraction against the antiviral factor, as binding of Vpu to tetherin was sufficient for a partial relief of the restriction. Finally, we show that the mechanism used by Vpu to induce tetherin depletion implicates the cellular ER-associated degradation (ERAD) pathway, which mediates the dislocation of ER membrane proteins into the cytosol for subsequent proteasomal degradation. In conclusion, we show that Vpu interacts with tetherin to direct its β-TrCP2-dependent proteasomal degradation, thereby alleviating the blockade to the release of infectious virions. Identification of tetherin binding to Vpu provides a potential novel target for the development of drugs aimed at inhibiting HIV-1 replication

    Humanized Mice Recapitulate Key Features of HIV-1 Infection: A Novel Concept Using Long-Acting Anti-Retroviral Drugs for Treating HIV-1

    Get PDF
    BACKGROUND: Humanized mice generate a lymphoid system of human origin subsequent to transplantation of human CD34+ cells and thus are highly susceptible to HIV infection. Here we examined the efficacy of antiretroviral treatment (ART) when added to food pellets, and of long-acting (LA) antiretroviral compounds, either as monotherapy or in combination. These studies shall be inspiring for establishing a gold standard of ART, which is easy to administer and well supported by the mice, and for subsequent studies such as latency. Furthermore, they should disclose whether viral breakthrough and emergence of resistance occurs similar as in HIV-infected patients when ART is insufficient. METHODS/PRINCIPAL FINDINGS: NOD/shi-scid/γ(c)null (NOG) mice were used in all experimentations. We first performed pharmacokinetic studies of the drugs used, either added to food pellets (AZT, TDF, 3TC, RTV) or in a LA formulation that permitted once weekly subcutaneous administration (TMC278: non-nucleoside reverse transcriptase inhibitor, TMC181: protease inhibitor). A combination of 3TC, TDF and TMC278-LA or 3TC, TDF, TMC278-LA and TMC181-LA suppressed the viral load to undetectable levels in 15/19 (79%) and 14/14 (100%) mice, respectively. In successfully treated mice, subsequent monotherapy with TMC278-LA resulted in viral breakthrough; in contrast, the two LA compounds together prevented viral breakthrough. Resistance mutations matched the mutations most commonly observed in HIV patients failing therapy. Importantly, viral rebound after interruption of ART, presence of HIV DNA in successfully treated mice and in vitro reactivation of early HIV transcripts point to an existing latent HIV reservoir. CONCLUSIONS/SIGNIFICANCE: This report is a unique description of multiple aspects of HIV infection in humanized mice that comprised efficacy testing of various treatment regimens, including LA compounds, resistance mutation analysis as well as viral rebound after treatment interruption. Humanized mice will be highly valuable for exploring the antiviral potency of new compounds or compounds targeting the latent HIV reservoir

    Coordinating with the Future: The Anticipatory Nature of Representation

    Full text link

    Learning Context Sensitive Languages with LSTM Trained with Kalman Filters

    Get PDF
    Unlike traditional recurrent neural networks, the long shortterm memory (LSTM) model generalizes well when presented with training sequences derived from regular and also simple nonregular languages
    corecore