977 research outputs found

    Homophone Reveals the Truth: A Reality Check for Speech2Vec

    Full text link
    Generating spoken word embeddings that possess semantic information is a fascinating topic. Compared with text-based embeddings, they cover both phonetic and semantic characteristics, which can provide richer information and are potentially helpful for improving ASR and speech translation systems. In this paper, we review and examine the authenticity of a seminal work in this field: Speech2Vec. First, a homophone-based inspection method is proposed to check the speech embeddings released by the author of Speech2Vec. There is no indication that these embeddings are generated by the Speech2Vec model. Moreover, through further analysis of the vocabulary composition, we suspect that a text-based model fabricates these embeddings. Finally, we reproduce the Speech2Vec model, referring to the official code and optimal settings in the original paper. Experiments showed that this model failed to learn effective semantic embeddings. In word similarity benchmarks, it gets a correlation score of 0.08 in MEN and 0.15 in WS-353-SIM tests, which is over 0.5 lower than those described in the original paper. Our data and code are available.Comment: Corrected typo

    The enhanced soliton propagation and energy transfer in the coupled drift wave and energetic-particle-induced geodesic acoustic mode system

    Full text link
    The evolution of the coupled drift wave (DW) and energetic-particle-induced geodesic acoustic mode (EGAM) nonlinear system is investigated using the fully nonlinear coupled DW-EGAM two-field equations, with emphasis on the turbulence spreading in the form of soliton and the nonlinear energy transfer between DW and EGAM. Four scenarios with different combinations of EGAM initial amplitudes and linear EGAM growth rates are designed to delineate the effects of linear EGAM drive and finite EGAM amplitude on DW nonlinear dynamic evolution. In presence of the linear EPs drive, the soliton propagation is enhanced, due to the generation of small radial scale structures. Two conservation laws of the nonlinear system are derived, including the energy conservation law. It is found that the energy of DW always decreases and that of EGAM always increases, leading to regulation of DW by EGAM.Comment: 19 figures, 9 page

    Exploring Parameter-Efficient Fine-tuning for Improving Communication Efficiency in Federated Learning

    Full text link
    Federated learning (FL) has emerged as a promising paradigm for enabling the collaborative training of models without centralized access to the raw data on local devices. In the typical FL paradigm (e.g., FedAvg), model weights are sent to and from the server each round to participating clients. However, this can quickly put a massive communication burden on the system, especially if more capable models beyond very small MLPs are employed. Recently, the use of pre-trained models has been shown effective in federated learning optimization and improving convergence. This opens the door for new research questions. Can we adjust the weight-sharing paradigm in federated learning, leveraging strong and readily-available pre-trained models, to significantly reduce the communication burden while simultaneously achieving excellent performance? To this end, we investigate the use of parameter-efficient fine-tuning in federated learning. Specifically, we systemically evaluate the performance of several parameter-efficient fine-tuning methods across a variety of client stability, data distribution, and differential privacy settings. By only locally tuning and globally sharing a small portion of the model weights, significant reductions in the total communication overhead can be achieved while maintaining competitive performance in a wide range of federated learning scenarios, providing insight into a new paradigm for practical and effective federated systems

    Time-Varying Discrete-Time Wavelet Transforms

    Get PDF

    The Aerobic Cyclase Involved in (Bacterio)chlorophyll Biosynthesis

    Get PDF

    FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning

    Full text link
    Personalized Federated Learning (PFL) represents a promising solution for decentralized learning in heterogeneous data environments. Partial model personalization has been proposed to improve the efficiency of PFL by selectively updating local model parameters instead of aggregating all of them. However, previous work on partial model personalization has mainly focused on Convolutional Neural Networks (CNNs), leaving a gap in understanding how it can be applied to other popular models such as Vision Transformers (ViTs). In this work, we investigate where and how to partially personalize a ViT model. Specifically, we empirically evaluate the sensitivity to data distribution of each type of layer. Based on the insights that the self-attention layer and the classification head are the most sensitive parts of a ViT, we propose a novel approach called FedPerfix, which leverages plugins to transfer information from the aggregated model to the local client as a personalization. Finally, we evaluate the proposed approach on CIFAR-100, OrganAMNIST, and Office-Home datasets and demonstrate its effectiveness in improving the model's performance compared to several advanced PFL methods.Comment: 2023 IEEE/CVF International Conference on Computer Vision (ICCV

    Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding

    Full text link
    Minimum Bayes Risk (MBR) decoding can significantly improve translation performance of Multilingual Large Language Models (MLLMs). However, MBR decoding is computationally expensive and in this paper, we show how recently developed Reinforcement Learning (RL) technique, Direct Preference Optimization (DPO) can be used to fine-tune MLLMs so that we get the gains from MBR without the additional computation in inference. Our fine-tuned models have significantly improved performance on multiple NMT test sets compared to base MLLMs without preference optimization. Our method boosts the translation performance of MLLMs using relatively small monolingual fine-tuning sets

    Measuring compactness of the urban landscape within a city territory for environmental capabilities: the case of 50 cities in eastern China

    Full text link
    [EN] When a compact city is pursued as the strategy for urban sustainability the understanding of compactness is varied from the developed countries to the developing countries. In China the historical cities are characterized with high density and mixed land use. After a short time of motorization they still show compact forms in the central city. A large amount of newly developed areas are distributed in clusters near or far from the center. The crop land and natural habitat are encroached to different degrees. This paper developed an approach to measure the structural compactness of urban landscape patches within a city territory. It included six spatial metrics to measure the shape and density of the central agglomeration, the area configuration and distant relationship between the central agglomeration and the other clusters, and the distribution of all urban patches. By this approach the 50 cities in eastern China were categorized into five classes of forms: one center, multi-centers, centralized groups, cluster groups and scatter. Then the vegetation biomass loss with urban expansion was calculated based on remote sensing data, and used to assess the environmental capability of the five types of urban form. The suggestions of urban form optimization could be put forward for the five categories of cities.Shuang, CS.; Tong, Z.; Guangyu, L.; Yue, Y. (2018). Measuring compactness of the urban landscape within a city territory for environmental capabilities: the case of 50 cities in eastern China. En 24th ISUF International Conference. Book of Papers. Editorial Universitat Politècnica de València. 13-20. https://doi.org/10.4995/ISUF2017.2017.5094OCS132

    An Algorithm for Finding Functional Modules and Protein Complexes in Protein-Protein Interaction Networks

    Get PDF
    Biological processes are often performed by a group of proteins rather than by individual proteins, and proteins in a same biological group form a densely connected subgraph in a protein-protein interaction network. Therefore, finding a densely connected subgraph provides useful information to predict the function or protein complex of uncharacterized proteins in the highly connected subgraph. We have developed an efficient algorithm and program for finding cliques and near-cliques in a protein-protein interaction network. Analysis of the interaction network of yeast proteins using the algorithm demonstrates that 59% of the near-cliques identified by our algorithm have at least one function shared by all the proteins within a near-clique, and that 56% of the near-cliques show a good agreement with the experimentally determined protein complexes catalogued in MIPS
    corecore