963 research outputs found

    RawNet: Fast End-to-End Neural Vocoder

    Full text link
    Neural networks based vocoders have recently demonstrated the powerful ability to synthesize high quality speech. These models usually generate samples by conditioning on some spectrum features, such as Mel-spectrum. However, these features are extracted by using speech analysis module including some processing based on the human knowledge. In this work, we proposed RawNet, a truly end-to-end neural vocoder, which use a coder network to learn the higher representation of signal, and an autoregressive voder network to generate speech sample by sample. The coder and voder together act like an auto-encoder network, and could be jointly trained directly on raw waveform without any human-designed features. The experiments on the Copy-Synthesis tasks show that RawNet can achieve the comparative synthesized speech quality with LPCNet, with a smaller model architecture and faster speech generation at the inference step.Comment: Submitted to Interspeech 2019, Graz, Austri

    Information Environment and Gains from Corporate Takeovers

    Get PDF
    Motivated by the inadequate research in understanding the determinants of takeover wealth creation, as well as the theoretical and practical importance of information environment in the takeover market, this thesis examines the wealth effects of information environment on UK takeovers. It regards information dissemination as a process inherent in takeover announcements, along which, factors capturing the characteristics of information sender, information content, information recipient and market condition, are addressed to form three key research issues. First considered are the wealth effects of misvaluation conditional on information signalled by payment and financing methods of takeovers. The results indicate that a price run-up via an upward revaluation follows undervalued bidders releasing good news (non-equity financed cash deals). Secondly, this research is concerned with the wealth effects of investor sentiment, towards the information released, at a whole market and individual firm level. The results show that high investor sentiment drives up target firms’ announcement returns and further causes an increase in takeover premium. The last issue addressed is the relation between information asymmetry and gains to frequent bidders. The results suggest that information asymmetry declines in a merger series while serial non-equity financed cash deals generate decreasing bidders’ announcement returns since the scale of their upward revaluations continually decreases with subsequent announcements. These three groups of results form a mechanism of information environment’s wealth effect as follows. Takeover announcements release new information. With the arrival of new information investors update their assessments of firm value. The scale of revaluation is determined by a firm’s information asymmetry, the direction of it depends on firm misvaluation, information signalled by takeover announcements and the investor sentiment in interpreting this information

    Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor

    Full text link
    To facilitate the application of motion prediction in practice, recently, the few-shot motion prediction task has attracted increasing research attention. Yet, in existing few-shot motion prediction works, a specific model that is dedicatedly trained over human motions is generally required. In this work, rather than tackling this task through training a specific human motion prediction model, we instead propose a novel FMP-OC framework. In FMP-OC, in a totally training-free manner, we enable Few-shot Motion Prediction, which is a non-language task, to be performed directly via utilizing the Off-the-shelf language model ChatGPT. Specifically, to lead ChatGPT as a language model to become an accurate motion predictor, in FMP-OC, we first introduce several novel designs to facilitate extracting implicit knowledge from ChatGPT. Moreover, we also incorporate our framework with a motion-in-context learning mechanism. Extensive experiments demonstrate the efficacy of our proposed framework

    FlexScatter: Predictive Scheduling and Adaptive Rateless Coding for Wi-Fi Backscatter Communications in Dynamic Traffic Conditions

    Get PDF
    Abstract The potential of Wi-Fi backscatter communications systems is immense, yet challenges such as signal instability and energy constraints impose performance limits. This paper introduces FlexScatter, a Wi-Fi backscatter system featuring a designed scheduling strategy based on excitation prediction and rateless coding to enhance system performance. Initially, a Wi-Fi traffic prediction model is constructed by analyzing the variability of the excitation source. Then, an adaptive transmission scheduling algorithm is proposed to address the low energy consumption demands of backscatter tags, adjusting the transmission strategy according to predictive analytics and taming channel conditions. Furthermore, leveraging the benefits of low-density parity-check (LDPC) and fountain codes, a novel coding and decoding algorithm is developed, which is tailored for dynamic channel conditions. Experimental validation shows that FlexScatter reduces bit error rates (BER) by up to 30%, enhances energy efficiency by 7%, and overall system utility by 11%, compared to conventional methods. FlexScatter’s ability to balance energy consumption and communication efficiency makes it a robust solution for future IoT applications that rely on unpredictable Wi-Fi traffic.Abstract The potential of Wi-Fi backscatter communications systems is immense, yet challenges such as signal instability and energy constraints impose performance limits. This paper introduces FlexScatter, a Wi-Fi backscatter system featuring a designed scheduling strategy based on excitation prediction and rateless coding to enhance system performance. Initially, a Wi-Fi traffic prediction model is constructed by analyzing the variability of the excitation source. Then, an adaptive transmission scheduling algorithm is proposed to address the low energy consumption demands of backscatter tags, adjusting the transmission strategy according to predictive analytics and taming channel conditions. Furthermore, leveraging the benefits of low-density parity-check (LDPC) and fountain codes, a novel coding and decoding algorithm is developed, which is tailored for dynamic channel conditions. Experimental validation shows that FlexScatter reduces bit error rates (BER) by up to 30%, enhances energy efficiency by 7%, and overall system utility by 11%, compared to conventional methods. FlexScatter’s ability to balance energy consumption and communication efficiency makes it a robust solution for future IoT applications that rely on unpredictable Wi-Fi traffic

    Short-term PV power prediction based on the 24 traditional Chinese solar terms and adaboost-GA-BP model

    Get PDF
    High-precision, short-term power forecasting for photovoltaic systems not only reduces unnecessary energy consumption but also provides power grid security. To this end, in this paper we propose a photovoltaic short-term power forecasting model based on the division of data of the 24 traditional Chinese solar terms and the Adaboost-GA-BP model. The 24 solar terms were condensed from the laws of meteorology, phenology, and seasonal changes to adapt to agricultural times in ancient China and have become intangible cultural heritage. This article first analyzes the numerical characteristics of meteorological factors and demonstrates their close correlation with the turning points of the 24 solar terms. Second, using Standardized Euclidean Distance and Spearman’s Correlation Coefficients to analyze data similarity between the Gregorian half-months and the 24 solar terms divisions for comparative analysis purposes, it is shown that the intragroup data under the division of the 24 solar terms have a higher similarity, leading to an average decrease of 15.68%, 40.57%, 14.68%, and 14.64% in the MAE, MSE, RMSE, and WMAPE of the predicted results, respectively. Finally, based on the data derived from the 24 solar terms, the combined algorithm was compared with the Adaboost-GA-BP model and then was verified. The genetic algorithm and Adaboost were used to optimize the BP neural network algorithm in initial value assignment and neural network structure, resulting in a 23.42%, 18.12%, and 22.28% reduction in the mean values of the MAE, RMSE, and WMAPE of the predicted results, respectively. Analysis of the results show that using the Adaboost-GA-BP model based on the 24 solar terms for short-term photovoltaic power forecasting can improve the accuracy of photovoltaic power forecasting and significantly improve the predictive performance of the model

    4K4D: Real-Time 4D View Synthesis at 4K Resolution

    Full text link
    This paper targets high-fidelity and real-time view synthesis of dynamic 3D scenes at 4K resolution. Recently, some methods on dynamic view synthesis have shown impressive rendering quality. However, their speed is still limited when rendering high-resolution images. To overcome this problem, we propose 4K4D, a 4D point cloud representation that supports hardware rasterization and enables unprecedented rendering speed. Our representation is built on a 4D feature grid so that the points are naturally regularized and can be robustly optimized. In addition, we design a novel hybrid appearance model that significantly boosts the rendering quality while preserving efficiency. Moreover, we develop a differentiable depth peeling algorithm to effectively learn the proposed model from RGB videos. Experiments show that our representation can be rendered at over 400 FPS on the DNA-Rendering dataset at 1080p resolution and 80 FPS on the ENeRF-Outdoor dataset at 4K resolution using an RTX 4090 GPU, which is 30x faster than previous methods and achieves the state-of-the-art rendering quality. Our project page is available at https://zju3dv.github.io/4k4d/.Comment: Project Page: https://zju3dv.github.io/4k4
    corecore