846 research outputs found

    Vocoder-free End-to-End Voice Conversion with Transformer Network

    Mel-frequency filter bank (MFB) based approaches have an advantage over the raw spectrum for learning speech because MFB features are smaller. However, MFB-based speech generators require an additional vocoder, which incurs a large computational cost during training. Additional pre- and post-processing such as MFB and a vocoder is not essential for converting real human speech to other voices. It is possible to use only the raw spectrum along with the phase to generate voices of different styles with clear pronunciation. In this regard, we propose a fast and effective approach to convert realistic voices using the raw spectrum in a parallel manner. Our transformer-based model architecture, which has no CNN or RNN layers, learns quickly and removes the sequential-computation limitation of conventional RNNs. In this paper, we introduce a vocoder-free end-to-end voice conversion method using a transformer network. The presented conversion model can also be used for speaker adaptation in speech recognition. Our approach can convert the source voice to a target voice without using MFB or a vocoder. We can obtain an adapted MFB for speech recognition by multiplying the converted magnitude with the phase. We perform our voice conversion experiments on the TIDIGITS dataset, using mean opinion scores for naturalness, similarity, and clarity. Comment: Work in progress
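The step of multiplying a converted magnitude with the phase can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names `split_polar` and `combine_polar`, the three sample bins, and the doubling that stands in for a conversion model's output are all hypothetical. It only shows how a complex spectrum is split into magnitude and phase and then recombined.

```python
import cmath

def split_polar(spectrum):
    """Split complex spectrum bins into magnitudes and phases (radians)."""
    magnitudes = [abs(c) for c in spectrum]
    phases = [cmath.phase(c) for c in spectrum]
    return magnitudes, phases

def combine_polar(magnitudes, phases):
    """Rebuild complex bins as magnitude * exp(j * phase)."""
    return [cmath.rect(m, p) for m, p in zip(magnitudes, phases)]

# Hypothetical source bins; a real system would take these from an STFT frame.
source_bins = [3 + 4j, -1 + 2j, 0.5 - 0.5j]
magnitudes, phases = split_polar(source_bins)

# Stand-in for a model's converted magnitude (assumption: simple scaling).
converted = [2.0 * m for m in magnitudes]

# Recombine the converted magnitude with the original phase.
rebuilt = combine_polar(converted, phases)
```

Recombining the unmodified magnitudes with the phases returns the original bins, which is a quick sanity check that no information is lost by the split itself.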

    Nasopharynx as a Microbiologic Reservoir in Chronic Suppurative Otitis Media: Preliminary Study

    Objectives: The present study was designed to identify correlations between the bacterial strains of the middle ear and the nasopharynx in chronic suppurative otitis media (CSOM) patients who were scheduled for operations. Methods: Sixty-three patients with CSOM were enrolled in the study. Culture specimens were collected from the middle ear and nasopharynx of patients who were admitted for operation. Samples were collected three times: from the middle ear and nasopharynx on the day of admission, from the middle ear during the operation, and from the external auditory canal post-operatively. Bacteria were identified by gram staining and biochemical tests. The correspondence rate of organisms that were simultaneously present in the middle ear and the nasopharynx was measured. Results: Sixty-eight organisms were isolated from the middle ear and 57 organisms from the nasopharynx among 63 patients. Of 68 bacteria identified in the middle ear, 26.52% (18 bacteria) corresponded with those of the nasopharynx. Methicillin-resistant Staphylococcus aureus (MRSA) had a high correspondence rate: of 18 MRSA isolates from the middle ear, 33.3% (6 bacteria) corresponded with those of the nasopharynx. Meanwhile, 3 MRSA organisms were detected in the external auditory canal post-operatively, although they were found only in the nasopharynx pre-operatively. Conclusion: The current practice of relying on a middle ear swab alone for bacterial detection would be insufficient to identify potential MRSA and would impede early antibiotic intervention for effective middle ear surgery. Therefore, it is necessary to perform nasopharynx cultures together with the conventional middle ear culture to control the potential risk of infection pre-operatively.

    Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification

    Despite the remarkable advances in deep learning technology, achieving satisfactory performance in lung sound classification remains a challenge due to the scarcity of available data. Moreover, respiratory sound samples are collected from a variety of electronic stethoscopes, which could introduce biases into the trained models. When a significant distribution shift occurs within the test dataset or in a practical scenario, it can substantially decrease performance. To tackle this issue, we introduce cross-domain adaptation techniques, which transfer knowledge from a source domain to a distinct target domain. In particular, by considering different stethoscope types as individual domains, we propose a novel stethoscope-guided supervised contrastive learning approach. This method can mitigate domain-related disparities and thus enables the model to distinguish respiratory sounds despite recording variation across stethoscopes. The experimental results on the ICBHI dataset demonstrate that the proposed methods are effective in reducing the domain dependency and achieve an ICBHI Score of 61.71%, which is a significant improvement of 2.16% over the baseline. Comment: accepted to ICASSP 202

    Estimated dietary isoflavone intake among Korean adults

    This study estimated the isoflavone intake level in Koreans using a Food Frequency Questionnaire and analyzed related variables. The average daily intake of isoflavone in adults was 23.1 mg. The isoflavone intake level at the 50th percentile was 16.9 mg (0~190 mg); 10% of adults consumed almost 50 mg of isoflavone a day and 10% consumed about 5 mg a day. The major food sources of isoflavone in Koreans were, in order, soybean, soybean paste, soy milk, soybean curd (tofu), and bean sprouts; intake differed depending on age, educational background, occupation, economic standard, and family type. Isoflavone intake was higher in the group over 30 years old and highest in subjects working in farming/fishery, followed by homemakers. By family type, families with elderly members showed 50% higher isoflavone intake than young families with friends or siblings. Therefore, nutrition education programs tailored to the related ecological variables should be developed to encourage intake of a variety of soybean foods, along with easy and simple cooking methods, as part of continuing research.

    Drug Resistance via Feedback Activation of Stat3 in Oncogene-Addicted Cancer Cells

    Summary: Pathway-targeted cancer drugs can produce dramatic responses that are invariably limited by the emergence of drug-resistant cells. We found that many drug-treated “oncogene-addicted” cancer cells engage a positive feedback loop leading to Stat3 activation, consequently promoting cell survival and limiting overall drug response. This was observed in cancer cells driven by diverse activated kinases, including EGFR, HER2, ALK, and MET, as well as mutant KRAS. Specifically, MEK inhibition led to autocrine activation of Stat3 via the FGF receptor and JAK kinases, and pharmacological inhibition of MEK together with JAK and FGFR enhanced tumor regression. These findings suggest that inhibition of a Stat3 feedback loop may augment the response to a broad spectrum of drugs that target pathways of oncogene addiction.

    Predicting COVID-19 transmission in a student population in Seoul, South Korea, 2020–2021

    Background: As coronavirus disease 2019 (COVID-19) transmission depends on factors such as demography, comorbidity, and patterns of daily activity, a better understanding of the societal factors of the infection among students would be useful in planning prevention strategies. However, no studies to date have focused on societal factors associated with COVID-19 transmission among students. Purpose: This study aimed to characterize the factors of a student population associated with COVID-19 transmission in the metropolitan city of Seoul, South Korea. Methods: We analyzed the epidemiological data for laboratory-confirmed (reverse transcription polymerase chain reaction) COVID-19 cases collected by the Korea Disease Control and Prevention Agency and the Ministry of Education from January 2020 to October 2021. We calculated the global Moran’s index, local Moran’s index, and Getis-Ord’s index. A spatial regression analysis was performed to identify sociodemographic predictors of COVID-19 at the district level. Results: The global spatial correlation estimated by Moran’s index was 0.082 for the community population and 0.064 for the student population. The attack rate of adults aged 30–59 years (P=0.049) was associated with an increased risk of COVID-19 attack rates in students, whereas the number of students per primary- (P=0.003) and middle- (P=0.030) school class was inversely associated with the risk of COVID-19 attack among students. Conclusion: We found that COVID-19 transmission was more attributable to the community-level burden in students than in adults. We recommend that public health initiatives focus on protecting students from COVID-19 when the community carries a high burden of infection.
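For readers unfamiliar with the global Moran's index mentioned in the abstract, the following is a minimal sketch of the standard formula, I = (n / S0) * (sum over i, j of w_ij z_i z_j) / (sum over i of z_i^2), where z_i is the deviation from the mean and S0 is the sum of all weights. The function name `morans_i`, the four-district chain, and the rate values are made up for illustration; the study's actual weights matrix and data are not given in the abstract.

```python
def morans_i(values, weights):
    """Global Moran's I for a list of values and a square spatial weights matrix."""
    n = len(values)
    mean = sum(values) / n
    z = [v - mean for v in values]                       # deviations from the mean
    s0 = sum(sum(row) for row in weights)                # total weight S0
    cross = sum(weights[i][j] * z[i] * z[j]
                for i in range(n) for j in range(n))     # weighted cross-products
    variance_sum = sum(d * d for d in z)
    return (n / s0) * (cross / variance_sum)

# Hypothetical example: four districts in a row, each adjacent to its neighbours.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
smooth_rates = [1.0, 2.0, 3.0, 4.0]         # similar values sit next to each other
alternating_rates = [1.0, -1.0, 1.0, -1.0]  # neighbours always differ

clustered = morans_i(smooth_rates, adjacency)
dispersed = morans_i(alternating_rates, adjacency)
```

Positive values indicate that neighbouring districts tend to have similar rates, and negative values indicate that neighbours tend to differ, so small positive values such as those reported in the abstract suggest weak spatial clustering.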

    Electron−hole separation in ferroelectric oxides for efficient photovoltaic responses

    Despite their potential to exceed the theoretical Shockley−Queisser limit, ferroelectric photovoltaics (FPVs) have performed inefficiently due to their extremely low photocurrents. Incorporating Bi₂FeCrO₆ (BFCO) as the light absorber in FPVs has recently led to impressively high and record photocurrents [Nechache R, et al. (2015) Nat Photonics 9:61–67], which has revived the FPV field. However, our understanding of this remarkable phenomenon is far from satisfactory. Here, we use first-principles calculations to determine that such excellent performance mainly lies in the efficient separation of electron−hole (e-h) pairs. We show that photoexcited electrons and holes in BFCO are spatially separated on the Fe and Cr sites, respectively. This separation is much more pronounced in disordered BFCO phases, which adequately explains the observed exceptional PV responses. We further establish a design strategy to discover next-generation FPV materials. By exploring 44 additional Bi-based double-perovskite oxides, we suggest five active-layer materials that offer a combination of strong e-h separations and visible-light absorptions for FPV applications. Our work indicates that charge separation is the most important issue to be addressed for FPVs to compete with conventional devices. Keywords: ferroelectrics; double perovskites; photovoltaics; e-h separation; density functional theory