    Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances

    Currently, the most widely used approach for speaker verification is deep speaker embedding learning. In this approach, we obtain a speaker embedding vector by pooling single-scale features that are extracted from the last layer of a speaker feature extractor. Multi-scale aggregation (MSA), which utilizes multi-scale features from different layers of the feature extractor, has recently been introduced and shows superior performance for variable-duration utterances. To increase robustness in dealing with utterances of arbitrary duration, this paper improves MSA by using a feature pyramid module. The module enhances the speaker-discriminative information of features from multiple layers via a top-down pathway and lateral connections. We extract speaker embeddings using the enhanced features, which contain rich speaker information at different time scales. Experiments on the VoxCeleb dataset show that the proposed module improves on previous MSA methods with a smaller number of parameters. It also achieves better performance than state-of-the-art approaches for both short and long utterances. Comment: Accepted to Interspeech 202

    I'll Be Your Friend

    In this piece, Min-Jung Kim chronicles her struggles as a young Korean-American girl trying to pursue her American Dream to be the first-generation college student in her family.

    Flashlight

    This poem illustrates the struggle of an undergraduate first-generation college student who knew little about the first-gen identity or the experiences she would encounter until she became a First To Go Scholar at Loyola Marymount University. The poet represents the First To Go Program as a flashlight that has helped her to navigate a once dark and unfamiliar environment.

    Asymmetric-valued Spectrum Auction and Competition in Wireless Broadband Services

    We study bidding and pricing competition between two spiteful mobile network operators (MNOs), taking their existing spectrum holdings into account. Given that asymmetric-valued spectrum blocks are auctioned off to them via a first-price sealed-bid auction, we model the interactions between the two spiteful MNOs and users as a three-stage dynamic game and characterize the game's equilibria. We show an asymmetric pricing structure and different market shares between the two spiteful MNOs. Perhaps counter-intuitively, our results show that the MNO who acquires the less-valued spectrum block always lowers his service price despite providing double-speed LTE service to users. We also show that the MNO who acquires the high-valued spectrum block, despite charging a higher price, still achieves more market share than the other MNO. We further show that the competition between the two MNOs leads to some loss of their revenues. Finally, we investigate a cross-over point at which the MNOs' profits are switched, which serves as a benchmark for practical auction designs.

    Highly efficient source for frequency-entangled photon pairs generated in a 3rd order periodically poled MgO-doped stoichiometric LiTaO3 crystal

    We present a highly efficient source for discrete frequency-entangled photon pairs based on spontaneous parametric down-conversion using 3rd order type-0 quasi-phase matching in a periodically poled MgO-doped stoichiometric LiTaO3 crystal pumped by a 355.66 nm laser. Correlated two-photon states were generated with automatic conservation of energy and momentum in two given spatial modes. These states have a wide spectral range, even under small variations in crystal temperature, which consequently results in higher discreteness. Frequency entanglement was confirmed by measuring two-photon quantum interference fringes without any spectral filtering. Comment: 4 pages, 4 figures, to be published in Optics Letters