72 research outputs found

    Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health

    Modern smartphones have hardware for acquiring audio and for performing speech processing tasks such as speaker recognition and health assessment. However, energy consumption remains a concern, especially for resource-intensive DNNs. Prior work has improved DNN energy efficiency by using a compact model or by reducing the dimensions of the speech features. Both approaches reduce energy consumption during DNN inference but not during speech acquisition. This paper proposes integrating a masking kernel into gradient descent during DNN training to learn the most energy-efficient speech length and sampling rate for windowing, a common step in sample construction. To determine the energy-optimal parameters, a masking function with non-zero derivatives was combined with a low-pass filter. The proposed approach reduces the combined energy consumption of data collection and inference by 57%, and is competitive with speaker recognition and traumatic brain injury detection baselines.
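
As a rough illustration of why a mask with non-zero derivatives matters for gradient descent, the sketch below uses a sigmoid-shaped mask over sample indices. The function names, the sigmoid form, and the `cutoff` and `sharpness` parameters are assumptions chosen for illustration; the abstract does not specify the paper's actual masking function, and the low-pass filter is omitted here.

```python
import math


def soft_mask(num_samples, cutoff, sharpness=1.0):
    """Hypothetical sigmoid mask: near 1 before `cutoff`, near 0 after it."""
    return [
        1.0 / (1.0 + math.exp(sharpness * (i - cutoff)))
        for i in range(num_samples)
    ]


def soft_mask_grad_cutoff(num_samples, cutoff, sharpness=1.0):
    """Derivative of each mask value with respect to `cutoff`.

    For a sigmoid s, ds/d(cutoff) = sharpness * s * (1 - s), which is
    strictly positive, so a gradient step can always move the cutoff.
    """
    mask = soft_mask(num_samples, cutoff, sharpness)
    return [sharpness * v * (1.0 - v) for v in mask]


if __name__ == "__main__":
    print(soft_mask(10, cutoff=5.0))
    print(soft_mask_grad_cutoff(10, cutoff=5.0))
```

A hard 0/1 window would have zero gradient with respect to its length almost everywhere, which is why a smooth surrogate of this kind is a plausible fit for learning a window length by gradient descent.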

    A Genome-Wide Association Study of Total Bilirubin and Cholelithiasis Risk in Sickle Cell Anemia

    Serum bilirubin levels have been associated with polymorphisms in the UGT1A1 promoter in normal populations and in patients with hemolytic anemias, including sickle cell anemia. When hemolysis occurs, circulating heme increases, leading to elevated bilirubin levels and an increased incidence of cholelithiasis. We performed the first genome-wide association study (GWAS) of bilirubin levels and cholelithiasis risk in a discovery cohort of 1,117 sickle cell anemia patients. We found 15 single nucleotide polymorphisms (SNPs) associated with total bilirubin levels at the genome-wide significance level (p < 5×10⁻⁸). SNPs in UGT1A1, UGT1A3, UGT1A6, UGT1A8 and UGT1A10, different isoforms within the UGT1A locus, were identified (most significant: rs887829, p = 9.08×10⁻²⁵). All of these associations were validated in 4 independent sets of sickle cell anemia patients. We tested the association of the 15 SNPs with cholelithiasis in the discovery cohort and found a significant association (most significant p = 1.15×10⁻⁴). These results confirm that the UGT1A region is the major regulator of bilirubin metabolism in African Americans with sickle cell anemia, similar to what is observed in other ethnicities.
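
The genome-wide threshold of 5×10⁻⁸ is conventionally motivated as a Bonferroni correction of α = 0.05 for roughly one million independent tests. The sketch below reproduces that arithmetic and applies it to the two p-values quoted above. The function name and the use of a 15-test Bonferroni threshold for the cholelithiasis follow-up are assumptions for illustration; the abstract does not state which correction the authors applied.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


# Conventional genome-wide threshold: 0.05 spread over ~1e6 independent tests.
GENOME_WIDE = bonferroni_threshold(0.05, 1_000_000)

# Assumed threshold if only the 15 follow-up SNPs are tested.
FOLLOW_UP = bonferroni_threshold(0.05, 15)

p_bilirubin = 9.08e-25      # rs887829 vs. total bilirubin (from the abstract)
p_cholelithiasis = 1.15e-4  # best SNP vs. cholelithiasis (from the abstract)

if __name__ == "__main__":
    print(p_bilirubin < GENOME_WIDE)       # passes the genome-wide threshold
    print(p_cholelithiasis < GENOME_WIDE)  # does not pass the genome-wide threshold
    print(p_cholelithiasis < FOLLOW_UP)    # passes the assumed 15-test threshold
```

Under these assumptions, the cholelithiasis result is significant only because the test was restricted to 15 candidate SNPs instead of the whole genome.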

    "The relation of limitation and exception in present-day French: excepté, sauf and hormis as pivots of an algebraic relation"

    Analysis of the prepositional and conjunctive uses of "excepté", "sauf" and "hormis" makes it possible to view these three prepositions/conjunctions as the pivot of a binomial, the hub of a bipolar structure. Placed in the middle of the binomial, these prepositions are compelled by their original semantics, duly metaphorized, to act as markers of systematic inconsistency between the element to their left and the element to their right. The opposition that arises between the two elements is therefore not a natural, intrinsic incompatibility but an extrinsic, induced one. In most cases (limitative uses), this opposition takes the form of a relation between a "class" and the "(subtracted) member of the class", or between a "whole" and a "part"; in others (exceptive uses), it appears instead as a head-on attack by one "whole" against another "whole". Moreover, the induced inconsistency set up by the preposition/conjunction seems, in principle, entirely insurmountable. In the assertion « les écureuils vivent partout, sauf en Australie » ("squirrels live everywhere, except in Australia"), which can be spelled out as "squirrels live everywhere, except [that they do not live] in Australia", the preposition indeed seems able to imply the main predicate with inverted sign and to build on that implication a kind of sub-utterance that is, strictly speaking, wholly inconsistent with the one preceding it (if "squirrels do not live in Australia", then the claim that they "live everywhere" is false). Nevertheless, the analysis shows that while some of these oppositions can ultimately be overcome, others cannot. These are, respectively, the limitative relations and the exceptive relations.
    The limitative relation, which involves the "whole"-"part" relation, allows the conflict to be resolved as an algebraic sum of two sub-utterances with different informational weight and opposite sign. Because the numerical values of the terms of the sum are unbalanced, the result is always non-zero. The exceptive relation, by contrast, does not involve the "whole"-"part" relation and cannot resolve the conflict between two sub-utterances that have the same informational weight and opposite sign: because the numerical values of the terms of the sum are symmetric and equal, the result will always be zero.

    Morphological variation in the adult hard palate and posterior pharyngeal wall

    Purpose: Adult human vocal tracts display considerable morphological variation across individuals, but the nature and extent of this variation have not been extensively studied for many vocal tract structures. There is a need to analyze morphological variation and, even more basically, to develop a methodology for morphological analysis of the vocal tract. Such analysis will facilitate fundamental characterization of the speech production system, with broad implications from modeling to explaining interspeaker variability. Method: A data-driven methodology to automatically analyze the extent and variety of morphological variation is proposed and applied to a diverse subject pool of 36 adults. Analysis is focused on two key aspects of vocal tract structure: the midsagittal shape of the hard palate and the posterior pharyngeal wall. Result: Palatal morphology varies widely in its degree of concavity but also in anteriority and sharpness. Pharyngeal wall morphology, by contrast, varies mostly in terms of concavity alone. The distribution of morphological characteristics is complex, and analysis suggests that certain variations may be categorical in nature. Conclusion: Major modes of morphological variation are identified, including their relative magnitude, distribution, and categorical nature. Implications of these findings for speech articulation strategies and speech acoustics are discussed.

    Interspeaker variability in hard palate morphology and vowel production

    Purpose: Differences in vocal tract morphology have the potential to explain interspeaker variability in speech production. The potential acoustic impact of hard palate shape was examined in simulation, in addition to the interplay among morphology, articulation, and acoustics in real vowel production data. Method: High-front vowel production from 5 speakers of American English was examined using midsagittal real-time magnetic resonance imaging data with synchronized audio. Relationships among hard palate morphology, tongue shaping, and formant frequencies were analyzed. Simulations were performed to determine the acoustical properties of vocal tracts whose area functions are altered according to prominent hard palate variations. Results: Simulations revealed that altering the height and position of the palatal dome alters formant frequencies. Examinations of real speech data showed that palatal morphology is not significantly correlated with any formant frequency but is correlated with major aspects of lingual articulation. Conclusion: Certain differences in hard palate morphology can substantially affect vowel acoustics, but those effects are not noticeable in real speech. Speakers adapt their lingual articulation to accommodate palate shape differences with the potential to substantially affect formant frequencies, while ignoring palate shape differences with relatively little acoustic impact, lending support for acoustic goals of vowel production

    SEARCHING FOR BETTER LOGIC CIRCUITS: USING ARTIFICIAL INTELLIGENCE TECHNIQUES TO AUTOMATE DIGITAL DESIGN

    Logic circuits are at the core of modern computing. The process of designing circuits which are efficient is thus of critical importance. Usually, logic circuits are designed by human beings who have a specific repertoire of conventional design techniques. These techniques limit the solutions that may be considered during the design process in both form and quality. The limits guide designers through the immense realm of possible circuits, thus making the problem more manageable. Simultaneously, the limits sometimes eliminate from consideration circuits which are optimal in terms of size, depth, etc. By exploring the full range of possible solutions, circuits could be discovered which are superior to the best known human designs. Automated design techniques borrowed from artificial intelligence have allowed exactly that. Specifically, the application of genetic algorithms has allowed the creation of circuits which are substantially superior to the best known human designs. This paper expands on such previous research with a three-fold approach. This approach is comprised of (1) two distinct optimizations for the application of genetic algorithms to design, (2) the formulation and implementation of a systematic search technique to the problem and (3) a comparison of the relative merits of the optimized genetic algorithm an

    On Short-Time Estimation of Vocal Tract Length from Formant Frequencies.

    Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity
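
For orientation, the textbook starting point for relating formants to vocal tract length is the uniform tube closed at the glottis and open at the lips, whose resonances are F_n = (2n − 1)·c / (4L). The sketch below inverts that relation per formant and averages the results. This is not the estimator proposed in the paper, which the abstract describes only as emphasizing higher formants; the function names, the equal weighting, and the speed of sound of 35,000 cm/s are assumptions for illustration.

```python
SPEED_OF_SOUND_CM_S = 35000.0  # assumed value for warm, humid air


def tube_formants(length_cm, n_formants=4, c=SPEED_OF_SOUND_CM_S):
    """Resonances (Hz) of a uniform tube closed at one end, open at the other."""
    return [(2 * n - 1) * c / (4.0 * length_cm) for n in range(1, n_formants + 1)]


def estimate_length_cm(formants_hz, c=SPEED_OF_SOUND_CM_S):
    """Invert F_n = (2n - 1) c / (4 L) for each formant and average equally."""
    estimates = [
        (2 * n - 1) * c / (4.0 * f) for n, f in enumerate(formants_hz, start=1)
    ]
    return sum(estimates) / len(estimates)


if __name__ == "__main__":
    formants = tube_formants(17.5)
    print(formants)
    print(estimate_length_cm(formants))
```

For a perfectly uniform tube, every formant yields the same length. In real speech, articulation perturbs the lower formants most, which is consistent with the abstract's observation that higher formants are the more reliable cue.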

    Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis

    Real-Time Magnetic Resonance Imaging affords speech articulation data with good spatial and temporal resolution and complete midsagittal views of the moving vocal tract, but also brings many challenges in the domain of image processing and analysis. Region-of-interest analysis has previously been proposed for simple, efficient and robust extraction of linguistically-meaningful constriction degree information. However, the accuracy of such methods has not been rigorously evaluated, and no method has been proposed to calibrate the pixel intensity values or convert them into absolute measurements of length. This work provides such an evaluation, as well as insights into the placement of regions in the image plane and calibration of the resultant pixel intensity measurements. Measurement errors are shown to be generally at or below the spatial resolution of the imaging protocol with a high degree of consistency across time and overall vocal tract configuration, validating the utility of this method of image analysis.