5,678 research outputs found
A Deterministic Equivalent for the Analysis of Non-Gaussian Correlated MIMO Multiple Access Channels
Large dimensional random matrix theory (RMT) has provided an efficient
analytical tool to understand multiple-input multiple-output (MIMO) channels
and to aid the design of MIMO wireless communication systems. However, previous
studies based on large dimensional RMT rely on the assumption that the transmit
correlation matrix is diagonal or the propagation channel matrix is Gaussian.
There is an increasing interest in the channels where the transmit correlation
matrices are generally nonnegative definite and the channel entries are
non-Gaussian. This class of channel models appears in several applications in
MIMO multiple access systems, such as small cell networks (SCNs). To address
these problems, we use the generalized Lindeberg principle to show that the
Stieltjes transforms of this class of random matrices with Gaussian or
non-Gaussian independent entries coincide in the large dimensional regime. This
result permits to derive the deterministic equivalents (e.g., the Stieltjes
transform and the ergodic mutual information) for non-Gaussian MIMO channels
from the known results developed for Gaussian MIMO channels, and is of great
importance in characterizing the spectral efficiency of SCNs.Comment: This paper is the revision of the original manuscript titled "A
Deterministic Equivalent for the Analysis of Small Cell Networks". We have
revised the original manuscript and reworked on the organization to improve
the presentation as well as readabilit
IT Diffusion Stages in the Network Era of Chinese Companies
One of our recent studies revisited Nolan’s stages theory on IT growth in the context of Chinese companies and concluded that IT spending and learning with regard to IT management are not as tightly linked as is implied in Nolan\u27s stages theory. The study also revealed that aggregate IT spending can be better explained by a technology diffusion process on the population level. Based on the findings, in this paper, we re-consider Chinese companies’ IT spending growth pattern, and in turn, their IT application history over the past 20 years from a perspective of IT diffusion stages, highlighting the special characteristics of the network/Internet era. We project that the IT diffusion process in China’s Network Era would experience four stages and has currently reached the third. Also, we propose some important issues to be further addressed
Fused Text Segmentation Networks for Multi-oriented Scene Text Detection
In this paper, we introduce a novel end-end framework for multi-oriented
scene text detection from an instance-aware semantic segmentation perspective.
We present Fused Text Segmentation Networks, which combine multi-level features
during the feature extracting as text instance may rely on finer feature
expression compared to general objects. It detects and segments the text
instance jointly and simultaneously, leveraging merits from both semantic
segmentation task and region proposal based object detection task. Not
involving any extra pipelines, our approach surpasses the current state of the
art on multi-oriented scene text detection benchmarks: ICDAR2015 Incidental
Scene Text and MSRA-TD500 reaching Hmean 84.1% and 82.0% respectively. Morever,
we report a baseline on total-text containing curved text which suggests
effectiveness of the proposed approach.Comment: Accepted by ICPR201
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
Although current neural text-to-speech (TTS) models are able to generate
high-quality speech, intensity controllable emotional TTS is still a
challenging task. Most existing methods need external optimizations for
intensity calculation, leading to suboptimal results or degraded quality. In
this paper, we propose EmoDiff, a diffusion-based TTS model where emotion
intensity can be manipulated by a proposed soft-label guidance technique
derived from classifier guidance. Specifically, instead of being guided with a
one-hot vector for the specified emotion, EmoDiff is guided with a soft label
where the value of the specified emotion and \textit{Neutral} is set to
and respectively. The here represents the emotion
intensity and can be chosen from 0 to 1. Our experiments show that EmoDiff can
precisely control the emotion intensity while maintaining high voice quality.
Moreover, diverse speech with specified emotion intensity can be generated by
sampling in the reverse denoising process.Comment: Accepted to ICASSP202
Prediction Approach of Critical Node Based on Multiple Attribute Decision Making for Opportunistic Sensor Networks
Predicting critical nodes of Opportunistic Sensor Network (OSN) can help us not only to improve network performance but also to decrease the cost in network maintenance. However, existing ways of predicting critical nodes in static network are not suitable for OSN. In this paper, the conceptions of critical nodes, region contribution, and cut-vertex in multiregion OSN are defined. We propose an approach to predict critical node for OSN, which is based on multiple attribute decision making (MADM). It takes RC to present the dependence of regions on Ferry nodes. TOPSIS algorithm is employed to find out Ferry node with maximum comprehensive contribution, which is a critical node. The experimental results show that, in different scenarios, this approach can predict the critical nodes of OSN better
Concise and Efficient Total Syntheses of Virenamides A and D
Concise total syntheses of linear thiazole-containing peptides virenamides A (1) and D (4), isolated from Australian ascidian Diplosoma virens have been accomplished from Boc-L-valine (6) in 7 steps. A cyclization between thioamide and bromoacetaldehyde was applied to form thiazole ring as a key step
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
The mainstream neural text-to-speech(TTS) pipeline is a cascade system,
including an acoustic model(AM) that predicts acoustic feature from the input
transcript and a vocoder that generates waveform according to the given
acoustic feature. However, the acoustic feature in current TTS systems is
typically mel-spectrogram, which is highly correlated along both time and
frequency axes in a complicated way, leading to a great difficulty for the AM
to predict. Although high-fidelity audio can be generated by recent neural
vocoders from ground-truth(GT) mel-spectrogram, the gap between the GT and the
predicted mel-spectrogram from AM degrades the performance of the entire TTS
system. In this work, we propose VQTTS, consisting of an AM txt2vec and a
vocoder vec2wav, which uses self-supervised vector-quantized(VQ) acoustic
feature rather than mel-spectrogram. We redesign both the AM and the vocoder
accordingly. In particular, txt2vec basically becomes a classification model
instead of a traditional regression model while vec2wav uses an additional
feature encoder before HifiGAN generator for smoothing the discontinuous
quantized feature. Our experiments show that vec2wav achieves better
reconstruction performance than HifiGAN when using self-supervised VQ acoustic
feature. Moreover, our entire TTS system VQTTS achieves state-of-the-art
performance in terms of naturalness among all current publicly available TTS
systems.Comment: This version has been removed by arXiv administrators because the
submitter did not have the authority to assign the license at the time of
submissio
- …