20 research outputs found

    Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis

    Full text link
    Neural Text-to-speech (TTS) synthesis is a powerful technology that can generate speech using neural networks. One of the most remarkable features of TTS synthesis is its capability to produce speech in the voice of different speakers. This paper introduces voice cloning and speech synthesis https://pypi.org/project/voice-cloning/ an open-source python package for helping speech disorders to communicate more effectively as well as for professionals seeking to integrate voice cloning or speech synthesis capabilities into their projects. This package aims to generate synthetic speech that sounds like the natural voice of an individual, but it does not replace the natural human voice. The architecture of the system comprises a speaker verification system, a synthesizer, a vocoder, and noise reduction. Speaker verification system trained on a varied set of speakers to achieve optimal generalization performance without relying on transcriptions. Synthesizer is trained using both audio and transcriptions that generate Mel spectrogram from a text and vocoder which converts the generated Mel Spectrogram into corresponding audio signal. Then the audio signal is processed by a noise reduction algorithm to eliminate unwanted noise and enhance speech clarity. The performance of synthesized speech from seen and unseen speakers are then evaluated using subjective and objective evaluation such as Mean Opinion Score (MOS), Gross Pitch Error (GPE), and Spectral distortion (SD). The model can create speech in distinct voices by including speaker characteristics that are chosen randomly

    Convex Optimization Algorithm for Product Recommendation Using Microblogging Information

    Get PDF
    The ecommerce and the social media is connected together where the products advertisements can be given by the ecommerce website which can be viewed by the social media users that is facebook users can  view the reviews given by the facebook friends or friends of friends of the users. The user may get an idea about the product features and also can be able to decide whether to buy the product or not. And also the user can give the review when the user knows about the product. In facebook database, all the reviews of the users or friends or friends of friends are stored. Among all the reviews, the overall count of the users who have given the positive comments and the negative comments are displayed .From this, the users can easily decide whether the product is useful or not. In the proposed system, convolutional neural network is used for more advanced learning
    corecore