729 research outputs found
MACOC: a medoid-based ACO clustering algorithm
The application of ACO-based algorithms in data mining is growing over the last few years and several supervised and unsupervised learning algorithms have been developed using this bio-inspired approach. Most recent works concerning unsupervised learning have been focused on clustering, showing great potential of ACO-based techniques. This work presents an ACO-based clustering algorithm inspired by the ACO Clustering (ACOC) algorithm. The proposed approach restructures ACOC from a centroid-based technique to a medoid-based technique, where the properties of the search space are not necessarily known. Instead, it only relies on the information about the distances amongst data. The new algorithm, called MACOC, has been compared against well-known algorithms (K-means and Partition Around Medoids) and with ACOC. The experiments measure the accuracy of the algorithm for both synthetic datasets and real-world datasets extracted from the UCI Machine Learning Repository
Atomistic simulations of self-trapped exciton formation in silicon nanostructures: The transition from quantum dots to nanowires
Using an approximate time-dependent density functional theory method, we
calculate the absorption and luminescence spectra for hydrogen passivated
silicon nanoscale structures with large aspect ratio. The effect of electron
confinement in axial and radial directions is systematically investigated.
Excited state relaxation leads to significant Stokes shifts for short nanorods
with lengths less than 2 nm, but has little effect on the luminescence
intensity. The formation of self-trapped excitons is likewise observed for
short nanostructures only; longer wires exhibit fully delocalized excitons with
neglible geometrical distortion at the excited state minimum.Comment: 10 pages, 4 figure
Randomized Reference Classifier with Gaussian Distribution and Soft Confusion Matrix Applied to the Improving Weak Classifiers
In this paper, an issue of building the RRC model using probability
distributions other than beta distribution is addressed. More precisely, in
this paper, we propose to build the RRR model using the truncated normal
distribution. Heuristic procedures for expected value and the variance of the
truncated-normal distribution are also proposed. The proposed approach is
tested using SCM-based model for testing the consequences of applying the
truncated normal distribution in the RRC model. The experimental evaluation is
performed using four different base classifiers and seven quality measures. The
results showed that the proposed approach is comparable to the RRC model built
using beta distribution. What is more, for some base classifiers, the
truncated-normal-based SCM algorithm turned out to be better at discovering
objects coming from minority classes.Comment: arXiv admin note: text overlap with arXiv:1901.0882
Global Networks of Trade and Bits
Considerable efforts have been made in recent years to produce detailed
topologies of the Internet. Although Internet topology data have been brought
to the attention of a wide and somewhat diverse audience of scholars, so far
they have been overlooked by economists. In this paper, we suggest that such
data could be effectively treated as a proxy to characterize the size of the
"digital economy" at country level and outsourcing: thus, we analyse the
topological structure of the network of trade in digital services (trade in
bits) and compare it with that of the more traditional flow of manufactured
goods across countries. To perform meaningful comparisons across networks with
different characteristics, we define a stochastic benchmark for the number of
connections among each country-pair, based on hypergeometric distribution.
Original data are thus filtered by means of different thresholds, so that we
only focus on the strongest links, i.e., statistically significant links. We
find that trade in bits displays a sparser and less hierarchical network
structure, which is more similar to trade in high-skill manufactured goods than
total trade. Lastly, distance plays a more prominent role in shaping the
network of international trade in physical goods than trade in digital
services.Comment: 25 pages, 6 figure
Lattice-gas simulations of Domain Growth, Saturation and Self-Assembly in Immiscible Fluids and Microemulsions
We investigate the dynamical behavior of both binary fluid and ternary
microemulsion systems in two dimensions using a recently introduced
hydrodynamic lattice-gas model of microemulsions. We find that the presence of
amphiphile in our simulations reduces the usual oil-water interfacial tension
in accord with experiment and consequently affects the non-equilibrium growth
of oil and water domains. As the density of surfactant is increased we observe
a crossover from the usual two-dimensional binary fluid scaling laws to a
growth that is {\it slow}, and we find that this slow growth can be
characterized by a logarithmic time scale. With sufficient surfactant in the
system we observe that the domains cease to grow beyond a certain point and we
find that this final characteristic domain size is inversely proportional to
the interfacial surfactant concentration in the system.Comment: 28 pages, latex, embedded .eps figures, one figure is in colour, all
in one uuencoded gzip compressed tar file, submitted to Physical Review
Mechanical and Electronic Properties of MoS Nanoribbons and Their Defects
We present our study on atomic, electronic, magnetic and phonon properties of
one dimensional honeycomb structure of molybdenum disulfide (MoS) using
first-principles plane wave method. Calculated phonon frequencies of bare
armchair nanoribbon reveal the fourth acoustic branch and indicate the
stability. Force constant and in-plane stiffness calculated in the harmonic
elastic deformation range signify that the MoS nanoribbons are stiff quasi
one dimensional structures, but not as strong as graphene and BN nanoribbons.
Bare MoS armchair nanoribbons are nonmagnetic, direct band gap
semiconductors. Bare zigzag MoS nanoribbons become half-metallic as a
result of the (2x1) reconstruction of edge atoms and are semiconductor for
minority spins, but metallic for the majority spins. Their magnetic moments and
spin-polarizations at the Fermi level are reduced as a result of the
passivation of edge atoms by hydrogen. The functionalization of MoS
nanoribbons by adatom adsorption and vacancy defect creation are also studied.
The nonmagnetic armchair nanoribbons attain net magnetic moment depending on
where the foreign atoms are adsorbed and what kind of vacancy defect is
created. The magnetization of zigzag nanoribbons due to the edge states is
suppressed in the presence of vacancy defects.Comment: 11 pages, 5 figures, first submitted at November 23th, 200
Algorithms for learning parsimonious context trees
Parsimonious context trees, PCTs, provide a sparse parameterization of conditional probability distributions. They are particularly powerful for modeling context-specific independencies in sequential discrete data. Learning PCTs from data is computationally hard due to the combinatorial explosion of the space of model structures as the number of predictor variables grows. Under the score-and-search paradigm, the fastest algorithm for finding an optimal PCT, prior to the present work, is based on dynamic programming. While the algorithm can handle small instances fast, it becomes infeasible already when there are half a dozen four-state predictor variables. Here, we show that common scoring functions enable the use of new algorithmic ideas, which can significantly expedite the dynamic programming algorithm on typical data. Specifically, we introduce a memoization technique, which exploits regularities within the predictor variables by equating different contexts associated with the same data subset, and a bound-and-prune technique, which exploits regularities within the response variable by pruning parts of the search space based on score upper bounds. On real-world data from recent applications of PCTs within computational biology the ideas are shown to reduce the traversed search space and the computation time by several orders of magnitude in typical cases.Peer reviewe
What is a Cool-Core Cluster? A Detailed Analysis of the Cores of the X-ray Flux-Limited HIFLUGCS Cluster Sample
We use the largest complete sample of 64 galaxy clusters (HIghest X-ray FLUx
Galaxy Cluster Sample) with available high-quality X-ray data from Chandra, and
apply 16 cool-core diagnostics to them, some of them new. We also correlate
optical properties of brightest cluster galaxies (BCGs) with X-ray properties.
To segregate cool core and non-cool-core clusters, we find that central cooling
time, t_cool, is the best parameter for low redshift clusters with high quality
data, and that cuspiness is the best parameter for high redshift clusters. 72%
of clusters in our sample have a cool core (t_cool < 7.7 h_{71}^{-1/2} Gyr) and
44% have strong cool cores (t_cool <1.0 h_{71}^{-1/2} Gyr). For the first time
we show quantitatively that the discrepancy in classical and spectroscopic mass
deposition rates can not be explained with a recent formation of the cool
cores, demonstrating the need for a heating mechanism to explain the cooling
flow problem. [Abridged]Comment: 45 pages, 19 figures, 7 tables. Accepted for publication in A&A.
Contact Person: Rupal Mittal ([email protected]
- …