4,295 research outputs found

    Vocoder-free End-to-End Voice Conversion with Transformer Network

    Full text link
    Mel-frequency filter bank (MFB) based approaches have the advantage of learning speech compared to raw spectrum since MFB has less feature size. However, speech generator with MFB approaches require additional vocoder that needs a huge amount of computation expense for training process. The additional pre/post processing such as MFB and vocoder is not essential to convert real human speech to others. It is possible to only use the raw spectrum along with the phase to generate different style of voices with clear pronunciation. In this regard, we propose a fast and effective approach to convert realistic voices using raw spectrum in a parallel manner. Our transformer-based model architecture which does not have any CNN or RNN layers has shown the advantage of learning fast and solved the limitation of sequential computation of conventional RNN. In this paper, we introduce a vocoder-free end-to-end voice conversion method using transformer network. The presented conversion model can also be used in speaker adaptation for speech recognition. Our approach can convert the source voice to a target voice without using MFB and vocoder. We can get an adapted MFB for speech recognition by multiplying the converted magnitude with phase. We perform our voice conversion experiments on TIDIGITS dataset using the metrics such as naturalness, similarity, and clarity with mean opinion score, respectively.Comment: Work in progres

    The Light and Period Variations of the Eclipsing Binary BX Draconis

    Full text link
    New CCD photometric observations of BX Dra were obtained for 26 nights from 2009 April to 2010 June. The long-term photometric behaviors of the system are presented from detailed studies of the period and light variations, based on the historical data and our new observations. All available light curves display total eclipses at secondary minima and inverse O'Connell effects with Max I fainter than Max II, which are satisfactorily modeled by adding the slightly time-varying hot spot on the primary star. A total of 87 times of minimum light spanning over about 74 yrs, including our 22 timing measurements, were used for ephemeris computations. Detailed analysis of the O-C diagram showed that the orbital period has changed in combinations with an upward parabola and a sinusoidal variation. The continuous period increase with a rate of +5.65 \times 10^-7 d yr^-1 is consistent with that calculated from the Wilson-Devinney synthesis code. It can be interpreted as a mass transfer from the secondary to the primary star at a rate of 2.74 \times 10^-7 M\odot yr^-1, which is one of the largest rates for contact systems. The most likely explanation of the sinusoidal variation with a period of 30.2 yrs and a semi-amplitude of 0.0062 d is a light-traveltime effect due to the existence of a circumbinary object. We suggest that BX Dra is probably a triple system, consisting of a primary star with a spectral type of F0, its secondary component of spectral type F1-2, and an unseen circumbinary object with a minimum mass of M3 = 0.23 M\odot.Comment: 24 pages, including 5 figures and 9 tables, accepted for publication in PAS

    Determinants Of Enforcement Action By The Financial Supervisory Service Of Korea From The Perspective Of Audit Firms

    Get PDF
    In this study, we examine the determinants of enforcement action by the Financial Supervisory Service of Korea from the perspective of audit firms. Enforcement action is an indication of audit failure. Both client- and audit firm-specific factors are involved in its occurrence. Most published studies of enforcement after audit failure focus on client characteristics because details about audit firms from financial statements and information about organizational structure are not publicly available. However, examining the issues surrounding enforcement from the perspective of audit firms may also be valuable in elucidating the potential determinants of audit failure resulting in enforcement action. Utilizing publicly available data from audit firms in South Korea, we identify several audit firm characteristics as determinants of enforcement action. The results of our empirical analysis reveal that the likelihood of audit failure is positively associated with the ratio of accounts receivable to total assets, the ratio of audit fees to total revenue, the ratio of partners to the total number of CPAs, CEO ownership, and age of audit firms. In addition, the likelihood of audit failure is negatively associated with ownership concentration and profitability. These associations are more pronounced in non-affiliated audit firms than affiliated audit firms. Several useful implications for regulators are described for improving audit quality by means of enforcement action

    Miuraea migitae, a new record of the order Bangiales (Bangiophyceae, Rhodophyta) from Korea

    Get PDF
    Abstract We found specimens of foliose Bangiales from the subtidal zone of Udo, Jeju Island, Korea. In molecular analyses of rbcL sequences, these Korean specimens were almost identical to Miuraea migitae from Osaka, Japan. In the morphological comparison, Korean specimens were consistent with habitat, color, and vegetative characteristics with the description of M. migitae. This is the first record of M. migitae outside the type locality and Nagasaki in Japan. This study confirms that new or unrecorded species of the order Bangiales may be discovered from subtidal habitats

    Experimental Study on Coordinated Heading Control of Four Vessels Moored Side by Side

    Get PDF
    A floating type liquefied natural gas (LNG) bunkering terminal has been under development in Korea since 2014; the terminal is designed to receive LNG from an LNG carrier (LNGC) and transfer it to two other LNG bunkering shuttles (LNGBS) simultaneously. The operational feasibility of the LNG loading and unloading processes has been confirmed. When four vessels are moored side by side with mooring ropes and fenders, their positions must be maintained within the designed allowable criteria. In addition, the floating bunkering terminal (FLBT) has its own mooring system, an internal turret with catenary mooring lines and stern tunnel thrusters to maintain its own position and control the vessel heading. In this study, we investigated the operational feasibility of the FLBT during the LNG loading and unloading operations with four vessel mooring configurations and heading controls. A series of model tests was done in the ocean engineering basin of the Korea Research Institute of Ships and Ocean engineering. The motion responses of the four vessels were determined using an optical measurement system, and the tensile loads on ship-to-ship mooring ropes and the compressive loads on ship-to-ship fenders were measured using one-axis load cells. A white noise test was done and the results were compared with the numerical results for the purpose of validation. Then, four combined environmental conditions were presented both without heading control and with several heading control cases. Finally, we determined the available safe bunkering heading ranges taking into account the tensile loads on the mooring ropes

    Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance

    Full text link
    Deep generative models have emerged as a promising approach in the medical image domain to address data scarcity. However, their use for sequential data like respiratory sounds is less explored. In this work, we propose a straightforward approach to augment imbalanced respiratory sound data using an audio diffusion model as a conditional neural vocoder. We also demonstrate a simple yet effective adversarial fine-tuning method to align features between the synthetic and real respiratory sound samples to improve respiratory sound classification performance. Our experimental results on the ICBHI dataset demonstrate that the proposed adversarial fine-tuning is effective, while only using the conventional augmentation method shows performance degradation. Moreover, our method outperforms the baseline by 2.24% on the ICBHI Score and improves the accuracy of the minority classes up to 26.58%. For the supplementary material, we provide the code at https://github.com/kaen2891/adversarial_fine-tuning_using_generated_respiratory_sound.Comment: accepted in NeurIPS 2023 Workshop on Deep Generative Models for Health (DGM4H
    corecore