138 research outputs found

    Analisi gerarchica degli inviluppi spettrali differenziali di una voce emotiva

    Get PDF
    .In questo articolo viene descritto un nuovo metodo di analisi del timbro vocale tramite lo studio delle variazioni di inviluppo spettrale utilizzato da uno stesso parlatore in situazioni emotiva neutra o espressiva. Il contesto dell\u27analisi riguarda un corpus di un solo parlatore istruito a leggere una serie di frasi utilizzando uno stile di lettura neutro e successivamente utilizzando due modalit? emotive: uno stile allegro e uno stile triste. Gli inviluppi spettrali relativi alle versioni allineate delle realizzazioni vocali neutre e espressive (allegra e triste) sono confrontati utilizzando un metodo differenziale. Le differenze sono state calcolate tra lo stato emotivo e quello neutro, di conseguenza le due categorie messe a confronto sono neutro-allegro e neutro-triste. La statistica degli inviluppi differenziali ? stata calcolata per ogni fono. I dati sono stati esaminati utilizzando un metodo di clustering gerarchico di tipo agglomerativo. I cluster risultanti sono avvalorati con diverse misure di distanza tra le distribuzioni statistiche ed esplorati visivamente per trovare similitudini e differenze tra le due categorie. I risultati mettono in evidenza sistematiche variazioni nel timbro vocale relative ai due insiemi di differenze di inviluppi spettrali. Questi tratti dipendono dalla valenza dell\u27emozione presa in considerazione (positiva, negativa) come dalle propriet? fonetiche del particolare fono come ad esempio sonorit? e luogo di articolazione

    Cluster Analysis of Differential Spectral Envelopes on Emotional Speech

    Get PDF
    This paper reports on the analysis of the spectral variation of emotional speech. Spectral envelopes of time aligned speech frames are compared between emotionally neutral and active utterances. Statistics are computed over the resulting differential spectral envelopes for each phoneme. Finally, these statistics are classified using agglomerative hierarchical clustering and a measure of dissimilarity between statistical distributions and the resulting clusters are analysed. The results show that there are systematic changes in spectral envelopes when going from neutral to sad or happy speech, and those changes depend on the valence of the emotional content (negative, positive) as well as on the phonetic properties of the sounds such as voicing and place of articulation

    Due tecniche di vocoding per la sintesi di parlato emotivo mediante trasformazione del timbro vocale

    Get PDF
    In questo articolo vengono descritte due tecniche di modifica del timbro vocale utilizzate in un esperimento di trasformazione della voce con l\u27obiettivo di riprodurre alcune caratteristiche del parlato emotivo. Il segnale vocale emesso da un parlatore con stile di lettura neutro viene convertito in modo da riprodurre l\u27inviluppo spettrale utilizzato dallo stesso parlatore in una situazione emotiva non neutra. La funzione di conversione tra gli inviluppi spettrali ? calcolata utilizzando un metodo ricavato con un addestramento su dati reali. Per questo motivo ? stato preso in considerazione un database contenente la voce di un parlatore registrato durante la lettura/recitazione di un corpus di testi con diversi stili emozionali: allegro, triste e uno stile neutro di riferimento. Le due tecniche di generazione della forma d\u27onda (vocoding) prese in considerazione sono il Phase Vocoder e il filtro MLSA (Mel Log Spectrum Approximation). I due prototipi implementati sono stati valutati con test di tipo percettivo, mentre valutazioni oggettive hanno convalidato l\u27efficacia della funzione di conversione

    Prognostic Impact of miR-224 and RAS Mutations in Medullary Thyroid Carcinoma

    Get PDF
    Little is known about the function of microRNA-224 (miR-224) in medullary thyroid cancer (MTC). This study investigated the role of miR-224 expression in MTC and correlated it with mutation status in sporadic MTCs. A consecutive series of 134 MTCs were considered. Patients had a sporadic form in 80% of cases (107/134). In this group, REarranged during transfection (RET) and rat sarcoma (RAS) mutation status were assessed by direct sequencing in the tumor tissues. Quantitative real-time polymerase chain reaction was used to quantify mature hsa-miR-224 in tumor tissue. RAS (10/107 cases, 9%) and RET (39/107 cases, 36%) mutations were mutually exclusive in sporadic cases. miR-224 expression was significantly downregulated in patients with the following: high calcitonin levels at diagnosis (p=0.03, r=−0.3); advanced stage (p=0.001); persistent disease (p=0.001); progressive disease (p=0.002); and disease-related death (p=0.0001). We found a significant positive correlation between miR-224 expression and somatic RAS mutations (p=0.007). Patients whose MTCs had a low miR-224 expression tended to have a shorter overall survival (log-rank test p=0.005). On multivariate analysis, miR-224 represented an independent prognostic marker. Our data indicate that miR-224 is upregulated in RAS-mutated MTCs and in patients with a better prognosis and could represent an independent prognostic marker in MTC patients

    An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

    Full text link
    We performed an experimental review of current diarization systems for the conversational telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms belonging to clustering-based, end-to-end neural diarization (EEND), and speech separation guided diarization (SSGD) paradigms. We studied the inference-time computational requirements and diarization accuracy on four CTS datasets with different characteristics and languages. We found that, among all methods considered, EEND-vector clustering (EEND-VC) offers the best trade-off in terms of computing requirements and performance. More in general, EEND models have been found to be lighter and faster in inference compared to clustering-based methods. However, they also require a large amount of diarization-oriented annotated data. In particular EEND-VC performance in our experiments degraded when the dataset size was reduced, whereas self-attentive EEND (SA-EEND) was less affected. We also found that SA-EEND gives less consistent results among all the datasets compared to EEND-VC, with its performance degrading on long conversations with high speech sparsity. Clustering-based diarization systems, and in particular VBx, instead have more consistent performance compared to SA-EEND but are outperformed by EEND-VC. The gap with respect to this latter is reduced when overlap-aware clustering methods are considered. SSGD is the most computationally demanding method, but it could be convenient if speech recognition has to be performed. Its performance is close to SA-EEND but degrades significantly when the training and inference data characteristics are less matched.Comment: 52 pages, 10 figure

    La sentiment analysis come strumento di studio del parlato emozionale?

    Get PDF
    Vari studi in letteratura hanno dimostrato che il parlato emozionale è caratterizzato da vari indici acustici. Tuttavia, tali studi hanno quasi sempre utilizzato parlato recitato, ignorando il parlato elicitato in maniera ecologica a causa della difficoltà nel reperire adeguate produzioni emozionali. In questo contributo, esploriamo la possibilità di utilizzare la sentiment analysis per selezionare produzioni emozionali da corpora orali. Abbiamo utilizzato il corpus LibriSpeech, da cui abbiamo estratto valori di sentiment analysis a livello di frase e di parola, nonché vari indici acustici e spettrali associati al parlato emozionale. L’analisi della relazione tra i livelli acustico e testuale ha rivelato effetti significativi ma di portata ridotta. Questo ci fa pensare che tali due livelli (acustico e lessicale) tendano a essere relativamente indipendenti, rendendo inappropriato l’utilizzo di metriche testuali per la selezione di materiale acusticamente emozionale.Abundant literature has shown that emotional speech is characterized by various acoustic cues. However, most studies focused on sentences produced by actors, disregarding ecologically elicited speech due to difficulties in finding suitable emotional data. In this contribution we explore the possibility of using sentiment analysis for the selection of emotional chunks from speech corpora. We used the LibriSpeech corpus and extracted sentiment analysis scores at word and sentence levels, as well as several acoustic and spectral parameters of emotional voice. The analysis of the relation between textual and acoustic indices revealed significant but small effects. This suggests that these two levels tend to be fairly independent, making it improper to use sentiment analysis for the selection of acoustically emotional speech

    Leveraging Speech Separation for Conversational Telephone Speaker Diarization

    Full text link
    Speech separation and speaker diarization have strong similarities. In particular with respect to end-to-end neural diarization (EEND) methods. Separation aims at extracting each speaker from overlapped speech, while diarization identifies time boundaries of speech segments produced by the same speaker. In this paper, we carry out an analysis of the use of speech separation guided diarization (SSGD) where diarization is performed simply by separating the speakers signals and applying voice activity detection. In particular we compare two speech separation (SSep) models, both in offline and online settings. In the online setting we consider both the use of continuous source separation (CSS) and causal SSep models architectures. As an additional contribution, we show a simple post-processing algorithm which reduces significantly the false alarm errors of a SSGD pipeline. We perform our experiments on Fisher Corpus Part 1 and CALLHOME datasets evaluating both separation and diarization metrics. Notably, without fine-tuning, our SSGD DPRNN-based online model achieves 12.7% DER on CALLHOME, comparable with state-of-the-art EEND models despite having considerably lower latency, i.e., 50 ms vs 1 s.Comment: Submitted to INTERSPEECH 202

    The role of procalcitonin in the follow-up of medullary thyroid cancer

    Get PDF
    Objective: Calcitonin (Ct) represents the most important biochemical marker of medullary thyroid cancer (MTC), but has certain limits. We analyzed the performance of procalcitonin (ProCt) in follow-up MTC patients. Methods: In this monocentric and retrospective study, we consecutively obtained ProCt and Ct values from all MTC patients that we visited during the period from April 2021 to May 2022. Patients were defined as having structural evidence of disease (29/90, 32.2%) irrespective of Ct values or, in its absence, as not evident disease (NED) if Ct was ≤10 ng/L (47/90, 52.2%), or minimal residual disease if Ct was >10 ng/L (14/90, 15.6%). Results: Ct and ProCt values were highly correlated (r = 0.883, P 0.12 ng/mL (P < 0.01, area under the curve: 0.963), with the following sensitivity, specificity, positive predictive value, and negative predictive value (NPV): 100%, 83.61%, 74.4%, and 100.0%. Conclusions: ProCt and Ct have a high correlation in MTC follow-up. ProCt may be useful as an adjunct to Ct, especially for its NPV concerning the structural disease

    Sunitinib for metastatic progressive phaeochromocytomas and paragangliomas: results from FIRSTMAPPP, an academic, multicentre, international, randomised, placebo-controlled, double-blind, phase 2 trial

    Get PDF
    Background: No randomised controlled trial has ever been done in patients with metastatic phaeochromocytomas and paragangliomas. Preclinical and first clinical evidence suggested beneficial effects of sunitinib. We aimed to evaluate the safety and efficacy of sunitinib in patients with metastatic phaeochromocytomas and paragangliomas. Methods: FIRSTMAPPP is a multicentre, international, randomised, placebo-controlled, double-blind, phase 2 trial done at 14 academic centres across four European countries. Eligible participants were adults (aged ≥18 years) with sporadic or inherited progressive metastatic phaeochromocytomas and paragangliomas. Patients were randomly assigned (1:1) to receive either oral sunitinib (37·5 mg per day) or placebo. Randomisation was stratified according to SDHB status (mutation present vs wild type) and number of previous systemic therapies (0 vs ≥1). Primary endpoint was the rate of progression-free survival at 12 months according to real-time central review (Response Evaluation Criteria in Solid Tumours version 1.1). On the basis of a two-step Simon model, we aimed for the accrual of 78 patients, assuming a 20% improvement of the 12-month progression-free survival rate from 20% to 40%, to conclude that sunitinib is effective. Crossover from the placebo group was allowed. This trial is registered with ClinicalTrials.gov, number NCT01371201, and is closed for enrolment. Findings: From Dec 1, 2011, to Jan 31, 2019, a total of 78 patients with progressive metastatic phaeochromocytomas and paragangliomas were enrolled (39 patients per group). 25 (32%) of 78 patients had germline SDHx variants and 54 (69%) had used previous therapies. The primary endpoint was met, with a 12-month progression-free survival in 14 of 39 patients (36% [90% CI 23-50]) in the sunitinib group. In the placebo group, the 12-month progression-free survival in seven of 39 patients was 19% (90% CI 11-31), validating the hypotheses of our study design. The most frequent grade 3 or 4 adverse events were asthenia (seven [18%] of 39 and one [3%] of 39), hypertension (five [13%] and four [10%]), and back or bone pain (one [3%] and three [8%]) in the sunitinib and placebo groups, respectively. Three deaths occurred in the sunitinib group: these deaths were due to respiratory insufficiency, amyotrophic lateral sclerosis, and rectal bleeding. Only the latter event was considered drug related. Two deaths occurred in the placebo group due to aspiration pneumonia and septic shock. Interpretation: This first randomised trial supports the use of sunitinib as the medical option with the highest level of evidence for anti-tumour efficacy in progressive metastatic phaeochromocytomas and paragangliomas. Funding: French Ministry of Health, through the National Institute for Cancer, German Ministry of Education and Research, and the German Research Foundation within the CRC/Transregio 205/2, EU Seventh Framework Programme, and a private donator grant
    • …
    corecore