51,328 research outputs found
Multi-Agent Simulation of Emergence of Schwa Deletion Pattern in Hindi
Recently, there has been a revival of interest in multi-agent simulation techniques for exploring the nature of language change. However, a lack of appropriate validation of simulation experiments against real language data often calls into question the general applicability of these methods in modeling realistic language change. We try to address this issue here by making an attempt to model the phenomenon of schwa deletion in Hindi through a multi-agent simulation framework. The pattern of Hindi schwa deletion and its diachronic nature are well studied, not only out of general linguistic inquiry, but also to facilitate Hindi grapheme-to-phoneme conversion, which is a preprocessing step to text-to-speech synthesis. We show that under certain conditions, the schwa deletion pattern observed in modern Hindi emerges in the system from an initial state of no deletion. The simulation framework described in this work can be extended to model other phonological changes as well.Language Change, Linguistic Agent, Language Game, Multi-Agent Simulation, Schwa Deletion
Dynamically Reconfigurable Online Self-organising Fuzzy Neural Network with Variable Number of Inputs for Smart Home Application
A self-organising fuzzy-neural network (SOFNN) adapts its structure based on variations of the input data. Conventionally in such self-organising networks, the number of inputs providing the data is fixed. In this paper, we consider the situation where the number of inputs to a network changes dynamically during its online operation. We extend our existing work on a SOFNN such that the SOFNN can self-organise its structure based not only on its input data, but also according to the changes in the number of its inputs. We apply the approach to a smart home application, where there are certain situations when some of the existing events may be removed or new events emerge, and illustrate that our approach enhances cognitive reasoning in a dynamic smart home environment. In this case, the network identifies the removed and/or added events from the received information over time, and reconfigures its structure dynamically. We present results for different combinations of training and testing phases of the dynamic reconfigurable SOFNN using a set of realistic synthesized data. The results show the potential of the proposed method
Experience versus Talent Shapes the Structure of the Web
We use sequential large-scale crawl data to empirically investigate and
validate the dynamics that underlie the evolution of the structure of the web.
We find that the overall structure of the web is defined by an intricate
interplay between experience or entitlement of the pages (as measured by the
number of inbound hyperlinks a page already has), inherent talent or fitness of
the pages (as measured by the likelihood that someone visiting the page would
give a hyperlink to it), and the continual high rates of birth and death of
pages on the web. We find that the web is conservative in judging talent and
the overall fitness distribution is exponential, showing low variability. The
small variance in talent, however, is enough to lead to experience
distributions with high variance: The preferential attachment mechanism
amplifies these small biases and leads to heavy-tailed power-law (PL) inbound
degree distributions over all pages, as well as over pages that are of the same
age. The balancing act between experience and talent on the web allows newly
introduced pages with novel and interesting content to grow quickly and surpass
older pages. In this regard, it is much like what we observe in high-mobility
and meritocratic societies: People with entitlement continue to have access to
the best resources, but there is just enough screening for fitness that allows
for talented winners to emerge and join the ranks of the leaders. Finally, we
show that the fitness estimates have potential practical applications in
ranking query results
Measuring the Initial Transient: Reflected Brownian Motion
We analyze the convergence to equilibrium of one-dimensional reflected
Brownian motion (RBM) and compute a number of related initial transient
formulae. These formulae are of interest as approximations to the initial
transient for queueing systems in heavy traffic, and help us to identify
settings in which initialization bias is significant. We conclude with a
discussion of mean square error for RBM. Our analysis supports the view that
initial transient effects for RBM and related models are typically of modest
size relative to the intrinsic stochastic variability, unless one chooses an
especially poor initialization.Comment: 14 pages, 3 figure
Robust Beam Search for Encoder-Decoder Attention Based Speech Recognition without Length Bias
As one popular modeling approach for end-to-end speech recognition,
attention-based encoder-decoder models are known to suffer the length bias and
corresponding beam problem. Different approaches have been applied in simple
beam search to ease the problem, most of which are heuristic-based and require
considerable tuning. We show that heuristics are not proper modeling
refinement, which results in severe performance degradation with largely
increased beam sizes. We propose a novel beam search derived from
reinterpreting the sequence posterior with an explicit length modeling. By
applying the reinterpreted probability together with beam pruning, the obtained
final probability leads to a robust model modification, which allows reliable
comparison among output sequences of different lengths. Experimental
verification on the LibriSpeech corpus shows that the proposed approach solves
the length bias problem without heuristics or additional tuning effort. It
provides robust decision making and consistently good performance under both
small and very large beam sizes. Compared with the best results of the
heuristic baseline, the proposed approach achieves the same WER on the 'clean'
sets and 4% relative improvement on the 'other' sets. We also show that it is
more efficient with the additional derived early stopping criterion.Comment: accepted at INTERSPEECH202
Mutual information in random Boolean models of regulatory networks
The amount of mutual information contained in time series of two elements
gives a measure of how well their activities are coordinated. In a large,
complex network of interacting elements, such as a genetic regulatory network
within a cell, the average of the mutual information over all pairs is a
global measure of how well the system can coordinate its internal dynamics. We
study this average pairwise mutual information in random Boolean networks
(RBNs) as a function of the distribution of Boolean rules implemented at each
element, assuming that the links in the network are randomly placed. Efficient
numerical methods for calculating show that as the number of network nodes
N approaches infinity, the quantity N exhibits a discontinuity at parameter
values corresponding to critical RBNs. For finite systems it peaks near the
critical value, but slightly in the disordered regime for typical parameter
variations. The source of high values of N is the indirect correlations
between pairs of elements from different long chains with a common starting
point. The contribution from pairs that are directly linked approaches zero for
critical networks and peaks deep in the disordered regime.Comment: 11 pages, 6 figures; Minor revisions for clarity and figure format,
one reference adde
A practical illustration of the importance of realistic individualized treatment rules in causal inference
The effect of vigorous physical activity on mortality in the elderly is
difficult to estimate using conventional approaches to causal inference that
define this effect by comparing the mortality risks corresponding to
hypothetical scenarios in which all subjects in the target population engage in
a given level of vigorous physical activity. A causal effect defined on the
basis of such a static treatment intervention can only be identified from
observed data if all subjects in the target population have a positive
probability of selecting each of the candidate treatment options, an assumption
that is highly unrealistic in this case since subjects with serious health
problems will not be able to engage in higher levels of vigorous physical
activity. This problem can be addressed by focusing instead on causal effects
that are defined on the basis of realistic individualized treatment rules and
intention-to-treat rules that explicitly take into account the set of treatment
options that are available to each subject. We present a data analysis to
illustrate that estimators of static causal effects in fact tend to
overestimate the beneficial impact of high levels of vigorous physical activity
while corresponding estimators based on realistic individualized treatment
rules and intention-to-treat rules can yield unbiased estimates. We emphasize
that the problems encountered in estimating static causal effects are not
restricted to the IPTW estimator, but are also observed with the
-computation estimator, the DR-IPTW estimator, and the targeted MLE. Our
analyses based on realistic individualized treatment rules and
intention-to-treat rules suggest that high levels of vigorous physical activity
may confer reductions in mortality risk on the order of 15-30%, although in
most cases the evidence for such an effect does not quite reach the 0.05 level
of significance.Comment: Published in at http://dx.doi.org/10.1214/07-EJS105 the Electronic
Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of
Mathematical Statistics (http://www.imstat.org
- …