2,092 research outputs found

    Probabilistic Clustering of Time-Evolving Distance Data

    Full text link
    We present a novel probabilistic clustering model for objects that are represented via pairwise distances and observed at different time points. The proposed method utilizes the information given by adjacent time points to find the underlying cluster structure and obtain a smooth cluster evolution. This approach allows the number of objects and clusters to differ at every time point, and no identification on the identities of the objects is needed. Further, the model does not require the number of clusters being specified in advance -- they are instead determined automatically using a Dirichlet process prior. We validate our model on synthetic data showing that the proposed method is more accurate than state-of-the-art clustering methods. Finally, we use our dynamic clustering model to analyze and illustrate the evolution of brain cancer patients over time

    A procedure for the change point problem in parametric models based on phi-divergence test-statistics

    Full text link
    This paper studies the change point problem for a general parametric, univariate or multivariate family of distributions. An information theoretic procedure is developed which is based on general divergence measures for testing the hypothesis of the existence of a change. For comparing the accuracy of the new test-statistic a simulation study is performed for the special case of a univariate discrete model. Finally, the procedure proposed in this paper is illustrated through a classical change-point example

    Document Image Binarization and Segmentation

    Get PDF
    Conceptually the Binarization of the chronicled archives is NP-difficult issue since the picture contains commotion, source debasements, and enlightenment. The point of binarization is to locate the best possible picture pixels' limit to enhance the general execution of the framework. This paper presents another half and half meta-heuristic calculation to decide the best edge an incentive for picture archives binarization. The point of Binarization is to locate the correct picture pixels' limit to enhance the general execution of the framework. Record division is a strategy for ripping the archive into unmistakable areas. In this proposed framework at first we displaying Wavelet deterioration and to binarize the record picture, and furthermore utilizes the projection profile to section lines and associated part investigation to fragment the characters. The normal result will be the binarized and fragmented characters, these character can be bolster to OCR for acknowledgement

    An Analytical Study of Rumoured Tweets by Using Twitter Data

    Get PDF
    Earlier when the internet was not there, rumours were spread by word of mouth technique but in this era of technology where we have social networking sites like twitter, rumours can be spread easily and quickly and a situation of panic can arise. Twitter is an American online news and social networking service on which users finds the latest news and world events faster. It is used for communication, interaction withpeople, announcement of event etc. from breaking news to sports, politics and everyday interests, one can find this service very addictive and an easy way to gather information about a certain event. Businesses can also use it to build their own brands and for marketing. But the founders of twitter like jack Dorsey forgot one thing that every coin has two sides. While twitter is a great way to interact with the masses, it is also a home of spammers. Spamming is a very common thing on twitter. Spammers create twitter accounts to perform a variety of tasks like posting links with unrelated tweets and the speed at which these fake and malicious misinformation spread on twitter in a real-time emergencies always causing a huge flood of tweets on twitter. In this paper, we demonstrated an analytical study of those rumoured tweets by twitter data. Using some of the rumoured tweets posted during the Chennai flood in 2015 and some non-rumoured tweets, we trained a classifier. The ability to track rumours and predict their outcomes have many applications for journalists, emergency services, and thereforehelp in minimizing the impact of false and fake information on this twitter platform

    Gallbladder Cancer Predisposition: A Multigenic Approach to DNA-Repair, Apoptotic and Inflammatory Pathway Genes

    Get PDF
    Gallbladder cancer (GBC) is a multifactorial disease with complex interplay between multiple genetic variants. We performed Classification and Regression Tree Analysis (CART) and Grade of Membership (GoM) analysis to identify combinations of alleles among the DNA repair, inflammatory and apoptotic pathway genetic variants in modifying the risk for GBC. We analyzed 16 polymorphisms in 8 genes involved in DNA repair, apoptotic and inflammatory pathways to find out combinations of genetic variants contributing to GBC risk. The genes included in the study were XRCC1, OGG1, ERCC2, MSH2, CASP8, TLR2, TLR4 and PTGS2. Single locus analysis by logistic regression showed association of MSH2 IVS1+9G>C (rs2303426), ERCC2 Asp312Asn (rs1799793), OGG1 Ser326Cys (rs1052133), OGG1 IVS4-15C>G (rs2072668), CASP8 -652 6N ins/del (rs3834129), PTGS2 -1195G>A (rs689466), PTGS2 -765G>C (rs20417), TLR4 Ex4+936C>T (rs4986791) and TLR2 –196 to –174del polymorphisms with GBC risk. The CART analysis revealed OGG1 Ser326Cys, and OGG1 IVS4-15C>G polymorphisms as the best polymorphic signature for discriminating between cases and controls. In the GoM analysis, the data was categorized into six sets representing risk for GBC with respect to the investigated polymorphisms. Sets I, II and III described low intrinsic risk (controls) characterized by multiple protective alleles while sets IV, V and VI represented high intrinsic risk groups (GBC cases) characterized by the presence of multiple risk alleles. The CART and GoM analyses also showed the importance of PTGS2 -1195G>A polymorphism in susceptibility to GBC risk. In conclusion, the present multigenic approach can be used to define individual risk profiles for gallbladder cancer in North Indian population

    Forecasting Player Behavioral Data and Simulating in-Game Events

    Full text link
    Understanding player behavior is fundamental in game data science. Video games evolve as players interact with the game, so being able to foresee player experience would help to ensure a successful game development. In particular, game developers need to evaluate beforehand the impact of in-game events. Simulation optimization of these events is crucial to increase player engagement and maximize monetization. We present an experimental analysis of several methods to forecast game-related variables, with two main aims: to obtain accurate predictions of in-app purchases and playtime in an operational production environment, and to perform simulations of in-game events in order to maximize sales and playtime. Our ultimate purpose is to take a step towards the data-driven development of games. The results suggest that, even though the performance of traditional approaches such as ARIMA is still better, the outcomes of state-of-the-art techniques like deep learning are promising. Deep learning comes up as a well-suited general model that could be used to forecast a variety of time series with different dynamic behaviors

    The conserved C-terminus of the PcrA/UvrD helicase interacts directly with RNA polymerase

    Get PDF
    Copyright: © 2013 Gwynn et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This work was supported by a Wellcome Trust project grant to MD (Reference: 077368), an ERC starting grant to MD (Acronym: SM-DNA-REPAIR) and a BBSRC project grant to PM, NS and MD (Reference: BB/I003142/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Peer reviewedPublisher PD

    The use of medicinal plants in health care practices by Rohingya refugees in a degraded forest and conservation area of Bangladesh

    Get PDF
    People in developing countries traditionally rely on plants for their primary healthcare. This dependence is relatively higher in forests in remote areas due to the lack of access to modern health facilities and easy availability of the plant products.We carried out an ethno-medicinal survey in Teknaf Game Reserve (TGR), a heavily degraded forest and conservation area in southern Bangladesh, to explore the diversity of plants used by Rohingya refugees for treating various ailments. The study also documented the traditional utilization, collection and perceptions of medicinal plants by the Rohingyas residing on the edges of this conservation area. We collected primary information through direct observation and by interviewing older respondents using a semi-structured questionnaire. A total of 34 plant species in 28 families were frequently used by the Rohingyas to treat 45 ailments, ranging from simple headaches to highly complex eye and heart diseases. For medicinal preparations and treating various ailments, aboveground plant parts were used more than belowground parts. The collection of medicinal plants was mostly from the TGR. © 2009 Taylor & Francis
    corecore