19,717 research outputs found
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances
Currently, the most widely used approach for speaker verification is the deep
speaker embedding learning. In this approach, we obtain a speaker embedding
vector by pooling single-scale features that are extracted from the last layer
of a speaker feature extractor. Multi-scale aggregation (MSA), which utilizes
multi-scale features from different layers of the feature extractor, has
recently been introduced and shows superior performance for variable-duration
utterances. To increase the robustness dealing with utterances of arbitrary
duration, this paper improves the MSA by using a feature pyramid module. The
module enhances speaker-discriminative information of features from multiple
layers via a top-down pathway and lateral connections. We extract speaker
embeddings using the enhanced features that contain rich speaker information
with different time scales. Experiments on the VoxCeleb dataset show that the
proposed module improves previous MSA methods with a smaller number of
parameters. It also achieves better performance than state-of-the-art
approaches for both short and long utterances.Comment: Accepted to Interspeech 202
I\u27ll Be Your Friend
In this piece, Min-Jung Kim chronicles her struggles as a young Korean-American girl trying to pursue her American Dream to be the first-generation college student in her family
Flashlight
This poem illustrates the struggle of an undergraduate first-generation college student who knew little about the first-gen identity or the experiences she would encounter until she became a First To Go Scholar at Loyola Marymount University. The poet represents the First To Go Program as a flashlight that has helped her to navigate a once dark and unfamiliar environment
Asymmetric-valued Spectrum Auction and Competition in Wireless Broadband Services
We study bidding and pricing competition between two spiteful mobile network
operators (MNOs) with considering their existing spectrum holdings. Given
asymmetric-valued spectrum blocks are auctioned off to them via a first-price
sealed-bid auction, we investigate the interactions between two spiteful MNOs
and users as a three-stage dynamic game and characterize the dynamic game's
equilibria. We show an asymmetric pricing structure and different market share
between two spiteful MNOs. Perhaps counter-intuitively, our results show that
the MNO who acquires the less-valued spectrum block always lowers his service
price despite providing double-speed LTE service to users. We also show that
the MNO who acquires the high-valued spectrum block, despite charing a higher
price, still achieves more market share than the other MNO. We further show
that the competition between two MNOs leads to some loss of their revenues. By
investigating a cross-over point at which the MNOs' profits are switched, it
serves as the benchmark of practical auction designs
Highly efficient source for frequency-entangled photon pairs generated in a 3rd order periodically poled MgO-doped stoichiometric LiTaO3 crystal
We present a highly efficient source for discrete frequency-entangled photon
pairs based on spontaneous parametric down-conversion using 3rd order type-0
quasi-phase matching in a periodically poled MgO-doped stoichiometric LiTaO3
crystal pumped by a 355.66 nm laser. Correlated two-photon states were
generated with automatic conservation of energy and momentum in two given
spatial modes. These states have a wide spectral range, even under small
variations in crystal temperature, which consequently results in higher
discreteness. Frequency entanglement was confirmed by measuring two-photon
quantum interference fringes without any spectral filtering.Comment: 4 pages, 4 figures, to be published in Optics Letter
- …