152 research outputs found

    Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

    Full text link
    Models of acoustic word embeddings (AWEs) learn to map variable-length spoken word segments onto fixed-dimensionality vector representations such that different acoustic exemplars of the same word are projected nearby in the embedding space. In addition to their speech technology applications, AWE models have been shown to predict human performance on a variety of auditory lexical processing tasks. Current AWE models are based on neural networks and trained in a bottom-up approach that integrates acoustic cues to build up a word representation given an acoustic or symbolic supervision signal. Therefore, these models do not leverage or capture high-level lexical knowledge during the learning process. In this paper, we propose a multi-task learning model that incorporates top-down lexical knowledge into the training procedure of AWEs. Our model learns a mapping between the acoustic input and a lexical representation that encodes high-level information such as word semantics in addition to bottom-up form-based supervision. We experiment with three languages and demonstrate that incorporating lexical knowledge improves the embedding space discriminability and encourages the model to better separate lexical categories.Comment: Accepted in INTERSPEECH 202

    An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech

    Full text link
    Self-supervised representation learning for speech often involves a quantization step that transforms the acoustic input into discrete units. However, it remains unclear how to characterize the relationship between these discrete units and abstract phonetic categories such as phonemes. In this paper, we develop an information-theoretic framework whereby we represent each phonetic category as a distribution over discrete units. We then apply our framework to two different self-supervised models (namely wav2vec 2.0 and XLSR) and use American English speech as a case study. Our study demonstrates that the entropy of phonetic distributions reflects the variability of the underlying speech sounds, with phonetically similar sounds exhibiting similar distributions. While our study confirms the lack of direct, one-to-one correspondence, we find an intriguing, indirect relationship between phonetic categories and discrete units.Comment: Accepted in Interspeech 202

    Rutin ameliorates carbon tetrachloride (CCl4)-induced hepatorenal toxicity and hypogonadism in male rats

    Get PDF
    Rutin, a food derived-polyphenolic bioflavonoid, has been acknowledged for several health benefits. This study aims to explore the ameliorative effects of rutin against carbon tetrachloride (CCl4) toxicity in male rats. Adult male rats were given either CCl4 (30% in olive oil, 3 ml/kg b.w. intraperitoneally) alone or in combination with rutin (70 mg/kg intragastrically) twice a week for 4 weeks. Our data showed that rutin mitigated CCl4 hepatorenal damage, as indicated by diagnostic markers (i.e., transaminases, alkaline phosphatase, total bilirubin, total protein, albumin, urea, uric acid and creatinine), and histopathological findings. In addition, CCl4 induced profound elevation of free radical generation and oxidative stress, as evidenced by increasing lipid peroxidation and reducing catalase, superoxide dismutase and glutathione peroxidase activities in liver, kidney and testicular tissues; these effects were suppressed by coexposure with rutin. Moreover, the increase in the levels of serum triglycerides, cholesterol, low-density lipoprotein cholesterol, and very-low-density lipoprotein cholesterol induced by CCl4 was effectively counteracted by rutin. The decrease in the level of high-density lipoprotein cholesterol in the CCl4 group was also counteracted by rutin treatment. Interestingly, the decreased levels of hormonal mediators associated with sperm production, including serum testosterone, luteinizing hormone and follicle-stimulating hormone, and the impaired sperm quality induced by CCl4 were reversed by rutin. Data from the current study clearly demonstrated that rutin supplementation could at least partly overcome CCl4-induced hepatotoxicity, nephrotoxicity and reproductive toxicity by antioxidant and antidyslipidemic effects

    Sequencing, Analysis, and Annotation of Expressed Sequence Tags for \u3ci\u3eCamelus dromedarius\u3c/i\u3e

    Get PDF
    Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF\u3e300 bp and ~40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism

    Homo sapiens in Arabia by 85,000 years ago.

    Get PDF
    Understanding the timing and character of the expansion of Homo sapiens out of Africa is critical for inferring the colonization and admixture processes that underpin global population history. It has been argued that dispersal out of Africa had an early phase, particularly ~130-90 thousand years ago (ka), that reached only the East Mediterranean Levant, and a later phase, ~60-50 ka, that extended across the diverse environments of Eurasia to Sahul. However, recent findings from East Asia and Sahul challenge this model. Here we show that H. sapiens was in the Arabian Peninsula before 85 ka. We describe the Al Wusta-1 (AW-1) intermediate phalanx from the site of Al Wusta in the Nefud desert, Saudi Arabia. AW-1 is the oldest directly dated fossil of our species outside Africa and the Levant. The palaeoenvironmental context of Al Wusta demonstrates that H. sapiens using Middle Palaeolithic stone tools dispersed into Arabia during a phase of increased precipitation driven by orbital forcing, in association with a primarily African fauna. A Bayesian model incorporating independent chronometric age estimates indicates a chronology for Al Wusta of ~95-86 ka, which we correlate with a humid episode in the later part of Marine Isotope Stage 5 known from various regional records. Al Wusta shows that early dispersals were more spatially and temporally extensive than previously thought. Early H. sapiens dispersals out of Africa were not limited to winter rainfall-fed Levantine Mediterranean woodlands immediately adjacent to Africa, but extended deep into the semi-arid grasslands of Arabia, facilitated by periods of enhanced monsoonal rainfall
    corecore