14,305 research outputs found
Breaking Sticks and Ambiguities with Adaptive Skip-gram
Recently proposed Skip-gram model is a powerful method for learning
high-dimensional word representations that capture rich semantic relationships
between words. However, Skip-gram as well as most prior work on learning word
representations does not take into account word ambiguity and maintain only
single representation per word. Although a number of Skip-gram modifications
were proposed to overcome this limitation and learn multi-prototype word
representations, they either require a known number of word meanings or learn
them using greedy heuristic approaches. In this paper we propose the Adaptive
Skip-gram model which is a nonparametric Bayesian extension of Skip-gram
capable to automatically learn the required number of representations for all
words at desired semantic resolution. We derive efficient online variational
learning algorithm for the model and empirically demonstrate its efficiency on
word-sense induction task
How long, O Bayesian network, will I sample thee? A program analysis perspective on expected sampling times
Bayesian networks (BNs) are probabilistic graphical models for describing
complex joint probability distributions. The main problem for BNs is inference:
Determine the probability of an event given observed evidence. Since exact
inference is often infeasible for large BNs, popular approximate inference
methods rely on sampling.
We study the problem of determining the expected time to obtain a single
valid sample from a BN. To this end, we translate the BN together with
observations into a probabilistic program. We provide proof rules that yield
the exact expected runtime of this program in a fully automated fashion. We
implemented our approach and successfully analyzed various real-world BNs taken
from the Bayesian network repository
Adaptive imputation of missing values for incomplete pattern classification
In classification of incomplete pattern, the missing values can either play a
crucial role in the class determination, or have only little influence (or
eventually none) on the classification results according to the context. We
propose a credal classification method for incomplete pattern with adaptive
imputation of missing values based on belief function theory. At first, we try
to classify the object (incomplete pattern) based only on the available
attribute values. As underlying principle, we assume that the missing
information is not crucial for the classification if a specific class for the
object can be found using only the available information. In this case, the
object is committed to this particular class. However, if the object cannot be
classified without ambiguity, it means that the missing values play a main role
for achieving an accurate classification. In this case, the missing values will
be imputed based on the K-nearest neighbor (K-NN) and self-organizing map (SOM)
techniques, and the edited pattern with the imputation is then classified. The
(original or edited) pattern is respectively classified according to each
training class, and the classification results represented by basic belief
assignments are fused with proper combination rules for making the credal
classification. The object is allowed to belong with different masses of belief
to the specific classes and meta-classes (which are particular disjunctions of
several single classes). The credal classification captures well the
uncertainty and imprecision of classification, and reduces effectively the rate
of misclassifications thanks to the introduction of meta-classes. The
effectiveness of the proposed method with respect to other classical methods is
demonstrated based on several experiments using artificial and real data sets
Computationally Efficient Optimization of a Five-Phase Flux-Switching PM Machine Under Different Operating Conditions
This paper investigates the comparative design optimizations of a five-phase outer-rotor flux-switching permanent magnet (FSPM) machine for in-wheel traction applications. To improve the comprehensive performance of the motor, two kinds of large-scale design optimizations under different operating conditions are performed and compared, including the traditional optimization performed at the rated operating point and the optimization targeting the whole driving cycles. Three driving cycles are taken into account, namely, the urban dynamometer driving schedule (UDDS), the highway fuel economy driving schedule (HWFET), and the combined UDDS/HWFET, representing the city, highway, and combined city/highway driving, respectively. Meanwhile, the computationally efficient finite-element analysis (CE-FEA) method, the cyclic representative operating points extraction technique, as well as the response surface methodology (in order to minimize the number of experiments when establishing the inverse machine model), are presented to reduce the computational effort and cost. From the results and discussion, it will be found that the optimization results against different operating conditions exhibit distinct characteristics in terms of geometry, efficiency, and energy loss distributions. For the traditional optimization performed at the rated operating point, the optimal design tends to reduce copper losses but suffer from high core losses; for UDDS, the optimal design tends to minimize both copper losses and PM eddy-current losses in the low-speed region; for HWFET, the optimal design tends to minimize core losses in the high-speed region; for the combined UDDS/HWFET, the optimal design tends to balance/compromise the loss components in both the low-speed and high-speed regions. Furthermore, the advantages of the adopted optimization methodologies versus the traditional procedure are highlighted
- …