    Faster K-Means Cluster Estimation

    There has been considerable work on improving the popular clustering algorithm K-means in terms of both mean squared error (MSE) and speed. However, most K-means variants compute the distance from each data point to every cluster centroid in every iteration. We propose a fast heuristic to overcome this bottleneck with only a marginal increase in MSE. We observe that across all iterations of K-means, a data point changes its membership only among a small subset of clusters. Our heuristic predicts such clusters for each data point by looking at nearby clusters after the first iteration of K-means. We augment well-known variants of K-means with our heuristic to demonstrate its effectiveness. For various synthetic and real-world datasets, our heuristic achieves a speed-up of up to 3 times when compared to efficient variants of K-means. Comment: 6 pages, Accepted at ECIR 201
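
The idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the "nearby clusters" are chosen, so the sketch assumes each point keeps the `n_candidates` centroids closest to it after the first iteration. The names `kmeans_with_candidates`, `n_candidates`, and `max_iter` are hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def update_centroids(points, labels, centroids):
    """Recompute each centroid as the mean of the points assigned to it."""
    k, dim = len(centroids), len(points[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for p, c in zip(points, labels):
        counts[c] += 1
        for d in range(dim):
            sums[c][d] += p[d]
    # Keep the old centroid if a cluster has no points.
    return [
        [s / counts[c] for s in sums[c]] if counts[c] else list(centroids[c])
        for c in range(k)
    ]


def kmeans_with_candidates(points, init_centroids, n_candidates=2, max_iter=50):
    """K-means that restricts later iterations to a per-point candidate list.

    Assumption (not from the abstract): the candidate list of a point is the
    n_candidates centroids nearest to it after the first full iteration.
    """
    centroids = [list(c) for c in init_centroids]
    k = len(centroids)

    # Iteration 1: ordinary assignment against all k centroids.
    labels = [min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points]
    centroids = update_centroids(points, labels, centroids)

    # Fix a short candidate list per point, computed once.
    candidates = [
        sorted(range(k), key=lambda c: sq_dist(p, centroids[c]))[:n_candidates]
        for p in points
    ]

    # Later iterations: only n_candidates distances per point instead of k.
    for _ in range(max_iter - 1):
        new_labels = [
            min(cands, key=lambda c: sq_dist(p, centroids[c]))
            for p, cands in zip(points, candidates)
        ]
        if new_labels == labels:
            break
        labels = new_labels
        centroids = update_centroids(points, labels, centroids)
    return labels, centroids
```

With `n_candidates` equal to the number of clusters the sketch reduces to ordinary Lloyd-style K-means; smaller values trade a possible increase in MSE for fewer distance computations per iteration.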

    A Multi-Objective Optimization for Supply Chain Network Using the Bees Algorithm

    A supply chain is a complex network involving the flows of products, services and information between suppliers and customers. A typical supply chain is composed of different levels; hence, there is a need to optimize it by finding the network configuration that gives a good compromise between multiple objectives such as cost minimization and lead-time minimization. Several multi-objective optimization methods have been applied to find the set of optimum solutions based on the Pareto front. In this study, a swarm-based optimization method, namely the bees algorithm, is proposed for the multi-objective supply chain model to find the configuration of a given supply chain problem that minimizes the total cost and the total lead-time. The supply chain problem used in this study is taken from the literature, and several experiments have been conducted to show the performance of the proposed model; in addition, the results have been compared to those achieved by the ant colony optimization method. The results show that the proposed bees algorithm is able to achieve better Pareto solutions for the supply chain problem.
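
As a small illustration of what "Pareto solutions" means when both total cost and total lead-time are minimized, the sketch below filters a list of candidate configurations down to the non-dominated ones. It does not implement the bees algorithm; the function names and the `(cost, lead_time)` tuples are made up for the example.

```python
def dominates(a, b):
    """True if a is no worse than b in every objective and strictly better
    in at least one (all objectives are minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(solutions):
    """Return the solutions that no other solution dominates."""
    return [s for s in solutions if not any(dominates(o, s) for o in solutions)]


# Hypothetical (total cost, total lead-time) pairs for five configurations.
candidates = [(100, 12), (120, 8), (110, 12), (150, 8), (90, 20)]
front = pareto_front(candidates)
# (110, 12) is dominated by (100, 12); (150, 8) is dominated by (120, 8).
```

Any multi-objective optimizer, swarm-based or otherwise, can be compared on the quality of the front it returns for the same problem.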

    Initial motion of a rectangular object being pushed or pulled

    Techniques are described for determining the location of the initial center of rotation (COR) of a rectangular bar being pushed or pulled. The initial COR is the point about which the bar first rotates when the pushing or pulling force is applied. This point characterizes the initial motion of the bar. Also investigated is how the location of the initial COR varies with the magnitude of the exerted force. The minimum effort criterion is proved to be able to predict the quasi-static center of rotation. It is found that the initial COR always lies between the quasi-static and the impulsive CORs and that it will move towards the impulsive COR as the magnitude of the applied force increases. It is shown that there exists a point on an object such that, when the force is applied at that point, the object will start to rotate about a known point.

    Cognitive support for older people from multimedia options

    If older users of multimedia displays could select among presentation options, would they choose display combinations that supported their performance? After completing three short touch-screen tasks that measured their perceptual and cognitive abilities, 50 older adults answered questions about a route on an online map that could be accompanied by written and/or spoken text. Half the participants saw animated routes, and they answered questions less accurately than those who saw static routes, but this did not affect people’s multimedia choices which, although diverse, were systematic. Spoken text was selected more often by people with lower scores on the spatial working memory task than by those with higher scores. This suggests that older people with cognitive limitations recognise ways in which multimedia information can be supportive.

    Highly accurate step counting at various walking states using low-cost inertial measurement unit support indoor positioning system

    © 2018 by the authors. Licensee MDPI, Basel, Switzerland. Accurate step counting is essential for indoor positioning, health monitoring systems, and other indoor positioning services. There are several publications and commercial applications for step counting. Nevertheless, these methods still suffer from over-counting, under-counting, and false walking problems. In this paper, we propose a highly accurate step counting method that addresses these limitations with four features: minimal peak distance, minimal peak prominence, dynamic thresholding, and vibration elimination. These features adapt to the user’s states. Our proposed features are combined with periodicity and similarity features to solve the false walking problem. The proposed method shows a significant improvement, with average accuracy of 99.42% and 96.47% in the free walking and false walking problems, respectively, on our datasets. Furthermore, our proposed method achieves an average accuracy of 97.04% on public datasets and better accuracy than three commercial step counting applications: Pedometer and Weight Loss Coach installed on Lenovo P780, Health apps in iPhone 5s (iOS 10.3.3), and S-health in Samsung Galaxy S5 (Android 6.01).
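
A rough sketch of how peak-based step counting with a minimal peak distance and a dynamic threshold might look is given below. It is a simplification under stated assumptions, not the method from the paper: the threshold is taken as the local mean plus `k` standard deviations, and `min_rise` (the peak height above the local minimum) is only a crude stand-in for peak prominence and vibration elimination. All parameter names and default values are hypothetical.

```python
import math


def count_steps(accel, min_distance=3, window=10, k=0.5, min_rise=1.0):
    """Count steps as peaks in a 1-D acceleration-magnitude signal.

    A sample counts as a step when it is a local maximum, exceeds a dynamic
    threshold computed from the surrounding window, rises at least min_rise
    above the window minimum, and lies at least min_distance samples after
    the previously accepted peak.
    """
    steps = 0
    last_peak = -min_distance  # lets the very first peak pass the distance check
    n = len(accel)
    for i in range(1, n - 1):
        # Local maximum test.
        if not (accel[i] > accel[i - 1] and accel[i] >= accel[i + 1]):
            continue
        lo, hi = max(0, i - window), min(n, i + window + 1)
        seg = accel[lo:hi]
        mean = sum(seg) / len(seg)
        std = math.sqrt(sum((v - mean) ** 2 for v in seg) / len(seg))
        # Dynamic threshold: adapts to the local signal level.
        if accel[i] < mean + k * std:
            continue
        # Reject small vibrations (simplified prominence check).
        if accel[i] - min(seg) < min_rise:
            continue
        # Minimal peak distance.
        if i - last_peak < min_distance:
            continue
        steps += 1
        last_peak = i
    return steps
```

For example, `count_steps([0, 1, 3, 1, 0, 0] * 5)` counts one step per repetition of the pattern, while a low-amplitude jitter such as `[0, 0.1] * 20` is rejected by the `min_rise` check.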

    Crowdsourcing Dialect Characterization through Twitter

    We perform a large-scale analysis of diatopic language variation using geotagged microblogging datasets. By collecting all Twitter messages written in Spanish over more than two years, we build a corpus from which a carefully selected list of concepts allows us to characterize Spanish varieties on a global scale. A cluster analysis proves the existence of well-defined macroregions sharing common lexical properties. Remarkably enough, we find that the Spanish language is split into two superdialects, namely, an urban speech used across major American and Spanish cities and a diverse form that encompasses rural areas and small towns. The latter can be further clustered into smaller varieties with a stronger regional character. Comment: 10 pages, 5 figures

    Development and validation of a prognostic model for predicting 30-day mortality risk in medical patients in emergency department (ED)

    © 2017 The Author(s). The primary aim of this prospective study is to develop and validate a new prognostic model for predicting the risk of mortality in Emergency Department (ED) patients. The study involved 1765 patients in the development cohort and 1728 in the validation cohort. The main outcome was mortality up to 30 days after admission. Potential risk factors included clinical characteristics, vital signs, and routine haematological and biochemistry tests. The Bayesian Model Averaging method within the Cox regression model was used to identify independent risk factors for mortality. In the development cohort, the incidence of 30-day mortality was 9.8%, and the following factors were associated with a greater risk of mortality: male gender, increased respiratory rate and serum urea, decreased peripheral oxygen saturation and serum albumin, lower Glasgow Coma Score, and admission to the intensive care unit. The area under the receiver operating characteristic curve for the model with the listed factors was 0.871 (95% CI, 0.844-0.898) in the development cohort and 0.783 (95% CI, 0.743-0.823) in the validation cohort. Calibration analysis found close agreement between predicted and observed mortality risk. We conclude that the risk of mortality among ED patients can be accurately predicted using common clinical signs and biochemical tests.
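
The area under the receiver operating characteristic curve reported above can be read as the probability that a randomly chosen patient who died receives a higher predicted risk than a randomly chosen survivor. The sketch below computes it directly from that pairwise definition; the function name and the toy scores are illustrative only and unrelated to the study data.

```python
def roc_auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    labels use 1 for the event (e.g. death within 30 days) and 0 otherwise.
    Tied scores count as half a correct ranking.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy predicted risks: three of the four (positive, negative) pairs are ordered correctly.
auc = roc_auc([0.9, 0.4, 0.6, 0.1], [1, 1, 0, 0])
```

The pairwise form is quadratic in the number of patients, which is fine for a sketch; rank-based formulas give the same value more efficiently on large cohorts.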

    Fast Ensemble Smoothing

    Smoothing is essential to many oceanographic, meteorological and hydrological applications. The interval smoothing problem updates all desired states within a time interval using all available observations. The fixed-lag smoothing problem updates only a fixed number of states prior to the observation at the current time. The fixed-lag smoothing problem is, in general, thought to be computationally faster than a fixed-interval smoother, and can be an appropriate approximation for long interval-smoothing problems. In this paper, we use an ensemble-based approach to fixed-interval and fixed-lag smoothing, and synthesize two algorithms. The first algorithm produces a linear time solution to the interval smoothing problem with a fixed factor, and the second one produces a fixed-lag solution that is independent of the lag length. Identical-twin experiments conducted with the Lorenz-95 model show that for lag lengths approximately equal to the error doubling time, or for long intervals, the proposed methods can provide significant computational savings. These results suggest that ensemble methods yield both fixed-interval and fixed-lag smoothing solutions that cost little additional effort over filtering and model propagation, in the sense that in practical ensemble applications the additional increment is a small fraction of either filtering or model propagation costs. We also show that fixed-interval smoothing can perform as fast as fixed-lag smoothing and may be advantageous when memory is not an issue.

    A novel ensemble artificial intelligence approach for gully erosion mapping in a semi-arid watershed (Iran)

    © 2019 by the authors. Licensee MDPI, Basel, Switzerland. In this study, we introduced a novel hybrid artificial intelligence approach, called RF-ADTree, that uses rotation forest (RF) as a meta/ensemble classifier with the alternating decision tree (ADTree) as a base classifier, in order to spatially predict gully erosion at the Klocheh watershed of Kurdistan province, Iran. A total of 915 gully erosion locations along with 22 gully conditioning factors were used to construct a database. Some soft computing benchmark models (SCBM), including the ADTree, the Support Vector Machine with two kernel functions, Polynomial and Radial Basis Function (SVM-Polynomial and SVM-RBF), the Logistic Regression (LR), and the Naïve Bayes Multinomial Updatable (NBMU) models, were used for comparison with the designed model. Results indicated that 19 conditioning factors were effective, among which distance to river, geomorphology, land use, hydrological group, lithology and slope angle were the most important factors for the gully modeling process. Additionally, the modeling results showed that the RF-ADTree ensemble model could significantly improve (area under the curve (AUC) = 0.906) the prediction accuracy of the ADTree model (AUC = 0.882). The newly proposed model also had the highest performance (AUC = 0.913) in comparison to the SVM-Polynomial model (AUC = 0.879), the SVM-RBF model (AUC = 0.867), the LR model (AUC = 0.75), the ADTree model (AUC = 0.861) and the NBMU model (AUC = 0.811).