729 research outputs found

    MACOC: a medoid-based ACO clustering algorithm

    Get PDF
    The application of ACO-based algorithms in data mining is growing over the last few years and several supervised and unsupervised learning algorithms have been developed using this bio-inspired approach. Most recent works concerning unsupervised learning have been focused on clustering, showing great potential of ACO-based techniques. This work presents an ACO-based clustering algorithm inspired by the ACO Clustering (ACOC) algorithm. The proposed approach restructures ACOC from a centroid-based technique to a medoid-based technique, where the properties of the search space are not necessarily known. Instead, it only relies on the information about the distances amongst data. The new algorithm, called MACOC, has been compared against well-known algorithms (K-means and Partition Around Medoids) and with ACOC. The experiments measure the accuracy of the algorithm for both synthetic datasets and real-world datasets extracted from the UCI Machine Learning Repository

    Atomistic simulations of self-trapped exciton formation in silicon nanostructures: The transition from quantum dots to nanowires

    Full text link
    Using an approximate time-dependent density functional theory method, we calculate the absorption and luminescence spectra for hydrogen passivated silicon nanoscale structures with large aspect ratio. The effect of electron confinement in axial and radial directions is systematically investigated. Excited state relaxation leads to significant Stokes shifts for short nanorods with lengths less than 2 nm, but has little effect on the luminescence intensity. The formation of self-trapped excitons is likewise observed for short nanostructures only; longer wires exhibit fully delocalized excitons with neglible geometrical distortion at the excited state minimum.Comment: 10 pages, 4 figure

    Randomized Reference Classifier with Gaussian Distribution and Soft Confusion Matrix Applied to the Improving Weak Classifiers

    Full text link
    In this paper, an issue of building the RRC model using probability distributions other than beta distribution is addressed. More precisely, in this paper, we propose to build the RRR model using the truncated normal distribution. Heuristic procedures for expected value and the variance of the truncated-normal distribution are also proposed. The proposed approach is tested using SCM-based model for testing the consequences of applying the truncated normal distribution in the RRC model. The experimental evaluation is performed using four different base classifiers and seven quality measures. The results showed that the proposed approach is comparable to the RRC model built using beta distribution. What is more, for some base classifiers, the truncated-normal-based SCM algorithm turned out to be better at discovering objects coming from minority classes.Comment: arXiv admin note: text overlap with arXiv:1901.0882

    Global Networks of Trade and Bits

    Get PDF
    Considerable efforts have been made in recent years to produce detailed topologies of the Internet. Although Internet topology data have been brought to the attention of a wide and somewhat diverse audience of scholars, so far they have been overlooked by economists. In this paper, we suggest that such data could be effectively treated as a proxy to characterize the size of the "digital economy" at country level and outsourcing: thus, we analyse the topological structure of the network of trade in digital services (trade in bits) and compare it with that of the more traditional flow of manufactured goods across countries. To perform meaningful comparisons across networks with different characteristics, we define a stochastic benchmark for the number of connections among each country-pair, based on hypergeometric distribution. Original data are thus filtered by means of different thresholds, so that we only focus on the strongest links, i.e., statistically significant links. We find that trade in bits displays a sparser and less hierarchical network structure, which is more similar to trade in high-skill manufactured goods than total trade. Lastly, distance plays a more prominent role in shaping the network of international trade in physical goods than trade in digital services.Comment: 25 pages, 6 figure

    Lattice-gas simulations of Domain Growth, Saturation and Self-Assembly in Immiscible Fluids and Microemulsions

    Full text link
    We investigate the dynamical behavior of both binary fluid and ternary microemulsion systems in two dimensions using a recently introduced hydrodynamic lattice-gas model of microemulsions. We find that the presence of amphiphile in our simulations reduces the usual oil-water interfacial tension in accord with experiment and consequently affects the non-equilibrium growth of oil and water domains. As the density of surfactant is increased we observe a crossover from the usual two-dimensional binary fluid scaling laws to a growth that is {\it slow}, and we find that this slow growth can be characterized by a logarithmic time scale. With sufficient surfactant in the system we observe that the domains cease to grow beyond a certain point and we find that this final characteristic domain size is inversely proportional to the interfacial surfactant concentration in the system.Comment: 28 pages, latex, embedded .eps figures, one figure is in colour, all in one uuencoded gzip compressed tar file, submitted to Physical Review

    Mechanical and Electronic Properties of MoS2_2 Nanoribbons and Their Defects

    Get PDF
    We present our study on atomic, electronic, magnetic and phonon properties of one dimensional honeycomb structure of molybdenum disulfide (MoS2_2) using first-principles plane wave method. Calculated phonon frequencies of bare armchair nanoribbon reveal the fourth acoustic branch and indicate the stability. Force constant and in-plane stiffness calculated in the harmonic elastic deformation range signify that the MoS2_2 nanoribbons are stiff quasi one dimensional structures, but not as strong as graphene and BN nanoribbons. Bare MoS2_2 armchair nanoribbons are nonmagnetic, direct band gap semiconductors. Bare zigzag MoS2_2 nanoribbons become half-metallic as a result of the (2x1) reconstruction of edge atoms and are semiconductor for minority spins, but metallic for the majority spins. Their magnetic moments and spin-polarizations at the Fermi level are reduced as a result of the passivation of edge atoms by hydrogen. The functionalization of MoS2_2 nanoribbons by adatom adsorption and vacancy defect creation are also studied. The nonmagnetic armchair nanoribbons attain net magnetic moment depending on where the foreign atoms are adsorbed and what kind of vacancy defect is created. The magnetization of zigzag nanoribbons due to the edge states is suppressed in the presence of vacancy defects.Comment: 11 pages, 5 figures, first submitted at November 23th, 200

    Algorithms for learning parsimonious context trees

    Get PDF
    Parsimonious context trees, PCTs, provide a sparse parameterization of conditional probability distributions. They are particularly powerful for modeling context-specific independencies in sequential discrete data. Learning PCTs from data is computationally hard due to the combinatorial explosion of the space of model structures as the number of predictor variables grows. Under the score-and-search paradigm, the fastest algorithm for finding an optimal PCT, prior to the present work, is based on dynamic programming. While the algorithm can handle small instances fast, it becomes infeasible already when there are half a dozen four-state predictor variables. Here, we show that common scoring functions enable the use of new algorithmic ideas, which can significantly expedite the dynamic programming algorithm on typical data. Specifically, we introduce a memoization technique, which exploits regularities within the predictor variables by equating different contexts associated with the same data subset, and a bound-and-prune technique, which exploits regularities within the response variable by pruning parts of the search space based on score upper bounds. On real-world data from recent applications of PCTs within computational biology the ideas are shown to reduce the traversed search space and the computation time by several orders of magnitude in typical cases.Peer reviewe

    What is a Cool-Core Cluster? A Detailed Analysis of the Cores of the X-ray Flux-Limited HIFLUGCS Cluster Sample

    Full text link
    We use the largest complete sample of 64 galaxy clusters (HIghest X-ray FLUx Galaxy Cluster Sample) with available high-quality X-ray data from Chandra, and apply 16 cool-core diagnostics to them, some of them new. We also correlate optical properties of brightest cluster galaxies (BCGs) with X-ray properties. To segregate cool core and non-cool-core clusters, we find that central cooling time, t_cool, is the best parameter for low redshift clusters with high quality data, and that cuspiness is the best parameter for high redshift clusters. 72% of clusters in our sample have a cool core (t_cool < 7.7 h_{71}^{-1/2} Gyr) and 44% have strong cool cores (t_cool <1.0 h_{71}^{-1/2} Gyr). For the first time we show quantitatively that the discrepancy in classical and spectroscopic mass deposition rates can not be explained with a recent formation of the cool cores, demonstrating the need for a heating mechanism to explain the cooling flow problem. [Abridged]Comment: 45 pages, 19 figures, 7 tables. Accepted for publication in A&A. Contact Person: Rupal Mittal ([email protected]
    corecore