154 research outputs found

    Identifying network communities with a high resolution

    Full text link
    Community structure is an important property of complex networks. An automatic discovery of such structure is a fundamental task in many disciplines, including sociology, biology, engineering, and computer science. Recently, several community discovery algorithms have been proposed based on the optimization of a quantity called modularity (Q). However, the problem of modularity optimization is NP-hard, and the existing approaches often suffer from prohibitively long running time or poor quality. Furthermore, it has been recently pointed out that algorithms based on optimizing Q will have a resolution limit, i.e., communities below a certain scale may not be detected. In this research, we first propose an efficient heuristic algorithm, Qcut, which combines spectral graph partitioning and local search to optimize Q. Using both synthetic and real networks, we show that Qcut can find higher modularities and is more scalable than the existing algorithms. Furthermore, using Qcut as an essential component, we propose a recursive algorithm, HQcut, to solve the resolution limit problem. We show that HQcut can successfully detect communities at a much finer scale and with a higher accuracy than the existing algorithms. Finally, we apply Qcut and HQcut to study a protein-protein interaction network, and show that the combination of the two algorithms can reveal interesting biological results that may be otherwise undetectable.Comment: 14 pages, 5 figures. 1 supplemental file at http://cic.cs.wustl.edu/qcut/supplemental.pd

    CSNL: A cost-sensitive non-linear decision tree algorithm

    Get PDF
    This article presents a new decision tree learning algorithm called CSNL that induces Cost-Sensitive Non-Linear decision trees. The algorithm is based on the hypothesis that nonlinear decision nodes provide a better basis than axis-parallel decision nodes and utilizes discriminant analysis to construct nonlinear decision trees that take account of costs of misclassification. The performance of the algorithm is evaluated by applying it to seventeen datasets and the results are compared with those obtained by two well known cost-sensitive algorithms, ICET and MetaCost, which generate multiple trees to obtain some of the best results to date. The results show that CSNL performs at least as well, if not better than these algorithms, in more than twelve of the datasets and is considerably faster. The use of bagging with CSNL further enhances its performance showing the significant benefits of using nonlinear decision nodes. The performance of the algorithm is evaluated by applying it to seventeen data sets and the results are compared with those obtained by two well known cost-sensitive algorithms, ICET and MetaCost, which generate multiple trees to obtain some of the best results to date. The results show that CSNL performs at least as well, if not better than these algorithms, in more than twelve of the data sets and is considerably faster. The use of bagging with CSNL further enhances its performance showing the significant benefits of using non-linear decision nodes

    Inhibition in multiclass classification

    Get PDF
    The role of inhibition is investigated in a multiclass support vector machine formalism inspired by the brain structure of insects. The so-called mushroom bodies have a set of output neurons, or classification functions, that compete with each other to encode a particular input. Strongly active output neurons depress or inhibit the remaining outputs without knowing which is correct or incorrect. Accordingly, we propose to use a classification function that embodies unselective inhibition and train it in the large margin classifier framework. Inhibition leads to more robust classifiers in the sense that they perform better on larger areas of appropriate hyperparameters when assessed with leave-one-out strategies. We also show that the classifier with inhibition is a tight bound to probabilistic exponential models and is Bayes consistent for 3-class problems. These properties make this approach useful for data sets with a limited number of labeled examples. For larger data sets, there is no significant comparative advantage to other multiclass SVM approaches

    Inhibition in multiclass classification

    Get PDF
    The role of inhibition is investigated in a multiclass support vector machine formalism inspired by the brain structure of insects. The so-called mushroom bodies have a set of output neurons, or classification functions, that compete with each other to encode a particular input. Strongly active output neurons depress or inhibit the remaining outputs without knowing which is correct or incorrect. Accordingly, we propose to use a classification function that embodies unselective inhibition and train it in the large margin classifier framework. Inhibition leads to more robust classifiers in the sense that they perform better on larger areas of appropriate hyperparameters when assessed with leave-one-out strategies. We also show that the classifier with inhibition is a tight bound to probabilistic exponential models and is Bayes consistent for 3-class problems. These properties make this approach useful for data sets with a limited number of labeled examples. For larger data sets, there is no significant comparative advantage to other multiclass SVM approaches

    Inducing safer oblique trees without costs

    Get PDF
    Decision tree induction has been widely studied and applied. In safety applications, such as determining whether a chemical process is safe or whether a person has a medical condition, the cost of misclassification in one of the classes is significantly higher than in the other class. Several authors have tackled this problem by developing cost-sensitive decision tree learning algorithms or have suggested ways of changing the distribution of training examples to bias the decision tree learning process so as to take account of costs. A prerequisite for applying such algorithms is the availability of costs of misclassification. Although this may be possible for some applications, obtaining reasonable estimates of costs of misclassification is not easy in the area of safety. This paper presents a new algorithm for applications where the cost of misclassifications cannot be quantified, although the cost of misclassification in one class is known to be significantly higher than in another class. The algorithm utilizes linear discriminant analysis to identify oblique relationships between continuous attributes and then carries out an appropriate modification to ensure that the resulting tree errs on the side of safety. The algorithm is evaluated with respect to one of the best known cost-sensitive algorithms (ICET), a well-known oblique decision tree algorithm (OC1) and an algorithm that utilizes robust linear programming

    A survey of cost-sensitive decision tree induction algorithms

    Get PDF
    The past decade has seen a significant interest on the problem of inducing decision trees that take account of costs of misclassification and costs of acquiring the features used for decision making. This survey identifies over 50 algorithms including approaches that are direct adaptations of accuracy based methods, use genetic algorithms, use anytime methods and utilize boosting and bagging. The survey brings together these different studies and novel approaches to cost-sensitive decision tree learning, provides a useful taxonomy, a historical timeline of how the field has developed and should provide a useful reference point for future research in this field

    In vivo proton magnetic resonance spectroscopy reveals region specific metabolic responses to SIV infection in the macaque brain

    Get PDF
    <p>Abstract</p> <p>Background</p> <p><it>In vivo </it>proton magnetic resonance spectroscopy (<sup>1</sup>H-MRS) studies of HIV-infected humans have demonstrated significant metabolic abnormalities that vary by brain region, but the causes are poorly understood. Metabolic changes in the frontal cortex, basal ganglia and white matter in 18 SIV-infected macaques were investigated using MRS during the first month of infection.</p> <p>Results</p> <p>Changes in the N-acetylaspartate (NAA), choline (Cho), <it>myo</it>-inositol (MI), creatine (Cr) and glutamine/glutamate (Glx) resonances were quantified both in absolute terms and relative to the creatine resonance. Most abnormalities were observed at the time of peak viremia, 2 weeks post infection (wpi). At that time point, significant decreases in NAA and NAA/Cr, reflecting neuronal injury, were observed only in the frontal cortex. Cr was significantly elevated only in the white matter. Changes in Cho and Cho/Cr were similar across the brain regions, increasing at 2 wpi, and falling below baseline levels at 4 wpi. MI and MI/Cr levels were increased across all brain regions.</p> <p>Conclusion</p> <p>These data best support the hypothesis that different brain regions have variable intrinsic vulnerabilities to neuronal injury caused by the AIDS virus.</p

    Feasibility and effects of preventive home visits for at-risk older people: Design of a randomized controlled trial

    Get PDF
    Abstract Background The search for preventive methods to mitigate functional decline and unwanted relocation by older adults living in the community is important. Preventive home visit (PHV) models use infrequent but regular visits to older adults by trained practitioners with the goal of maintaining function and quality of life. Evidence about PHV efficacy is mixed but generally supportive. Yet interventions have rarely combined a comprehensive (biopsychosocial) occupational therapy intervention protocol with a home visit to older adults. There is a particular need in the USA to create and examine such a protocol. Methods/Design The study is a single-blind randomized controlled pilot trial designed to assess the feasibility, and to obtain preliminary efficacy estimates, of an intervention consisting of preventive home visits to community-dwelling older adults. An occupational therapy-based preventive home visit (PHV) intervention was developed and is being implemented and evaluated using a repeated measures design. We recruited a sample of 110 from a population of older adults (75+) who were screened and found to be at-risk for functional decline. Participants are currently living in the community (not in assisted living or a skilled nursing facility) in one of three central North Carolina counties. After consent, participants were randomly assigned into experimental and comparison groups. The experimental group receives the intervention 4 times over a 12 month follow-up period while the comparison group receives a minimal intervention of mailed printed materials. Pre- and post-intervention measures are being gathered by questionnaires administered face-to-face by a treatment-blinded research associate. Key outcome measures include functional ability, participation, life satisfaction, self-rated health, and depression. Additional information is collected from participants in the experimental group during the intervention to assess the feasibility of the intervention and potential modifiers. Fidelity is being addressed and measured across several domains. Discussion Feasibility indications to date are positive. Although the protocol has some limitations, we expect to learn enough about the intervention, delivery and effects to support a larger trial with a more stringent design and enhanced statistical power. Trial Registration ClinicalTrials.gov ID NCT0098528

    Low level constraints on dynamic contour path integration

    Get PDF
    Contour integration is a fundamental visual process. The constraints on integrating discrete contour elements and the associated neural mechanisms have typically been investigated using static contour paths. However, in our dynamic natural environment objects and scenes vary over space and time. With the aim of investigating the parameters affecting spatiotemporal contour path integration, we measured human contrast detection performance of a briefly presented foveal target embedded in dynamic collinear stimulus sequences (comprising five short 'predictor' bars appearing consecutively towards the fovea, followed by the 'target' bar) in four experiments. The data showed that participants' target detection performance was relatively unchanged when individual contour elements were separated by up to 2° spatial gap or 200ms temporal gap. Randomising the luminance contrast or colour of the predictors, on the other hand, had similar detrimental effect on grouping dynamic contour path and subsequent target detection performance. Randomising the orientation of the predictors reduced target detection performance greater than introducing misalignment relative to the contour path. The results suggest that the visual system integrates dynamic path elements to bias target detection even when the continuity of path is disrupted in terms of spatial (2°), temporal (200ms), colour (over 10 colours) and luminance (-25% to 25%) information. We discuss how the findings can be largely reconciled within the functioning of V1 horizontal connections
    corecore