20 research outputs found

    Using Twitter to learn about the autism community

    Full text link
    Considering the raising socio-economic burden of autism spectrum disorder (ASD), timely and evidence-driven public policy decision making and communication of the latest guidelines pertaining to the treatment and management of the disorder is crucial. Yet evidence suggests that policy makers and medical practitioners do not always have a good understanding of the practices and relevant beliefs of ASD-afflicted individuals' carers who often follow questionable recommendations and adopt advice poorly supported by scientific data. The key goal of the present work is to explore the idea that Twitter, as a highly popular platform for information exchange, could be used as a data-mining source to learn about the population affected by ASD -- their behaviour, concerns, needs etc. To this end, using a large data set of over 11 million harvested tweets as the basis for our investigation, we describe a series of experiments which examine a range of linguistic and semantic aspects of messages posted by individuals interested in ASD. Our findings, the first of their nature in the published scientific literature, strongly motivate additional research on this topic and present a methodological basis for further work.Comment: Social Network Analysis and Mining, 201

    Discovering topic structures of a temporally evolving document corpus

    Get PDF
    In this paper we describe a novel framework for the discovery of the topical content of a data corpus, and the tracking of its complex structural changes across the temporal dimension. In contrast to previous work our model does not impose a prior on the rate at which documents are added to the corpus nor does it adopt the Markovian assumption which overly restricts the type of changes that the model can capture. Our key technical contribution is a framework based on (i) discretization of time into epochs, (ii) epoch-wise topic discovery using a hierarchical Dirichlet process-based model, and (iii) a temporal similarity graph which allows for the modelling of complex topic changes: emergence and disappearance, evolution, splitting, and merging. The power of the proposed framework is demonstrated on two medical literature corpora concerned with the autism spectrum disorder (ASD) and the metabolic syndrome (MetS)—both increasingly important research subjects with significant social and healthcare consequences. In addition to the collected ASD and metabolic syndrome literature corpora which we made freely available, our contribution also includes an extensive empirical analysis of the proposed framework. We describe a detailed and careful examination of the effects that our algorithms’s free parameters have on its output and discuss the significance of the findings both in the context of the practical application of our algorithm as well as in the context of the existing body of work on temporal topic analysis. Our quantitative analysis is followed by several qualitative case studies highly relevant to the current research on ASD and MetS, on which our algorithm is shown to capture well the actual developments in these fields.Publisher PDFPeer reviewe

    Analysing the history of autism spectrum disorder using topic models

    Get PDF
    We describe a novel framework for the discovery of underlying topics of a longitudinal collection of scholarly data,and the tracking of their lifetime and popularity over time. Unlike the social media or news data, as the topic nuances in science result in new scientific directions to emerge, a new approach to model the longitudinal literature data is using topics which remain identifiable over the course of time. Current studies either disregard the time dimension or treat it as an exchangeable covariate when they fix the topics over time or do not share the topics over epochs when they model the time naturally. We address these issues by adopting a non-parametric Bayesian approach. We assume the data is partially exchangeable and divided it into consecutive epochs. Then, by fixing the topics in a recurrent Chinese restaurant franchise, we impose a static topical structure on the corpus such that the they are shared across epochs and the documents within epochs. We demonstrate the effectiveness of the proposed framework on a collection of medical literature related to autism spectrum disorder. We collect a large corpus of publications and carefully examining two important research issues of the domain as case studies. Moreover, we make the results of our experiment and the source code of the model, freely available to aid other researchers by analysing the results or applying the model to their data collections.Postprin

    Monitoring protected areas using remote sensing technology

    Full text link
    Due to irrational use of natural resources, human society is facing unprecedented threats. Remote sensing is one of the essential tools to determine changes in various forms of biological diversity over time. There are many methods to determine changes in protected areas, using satellite images. In this paper after introducing different change detection methods and their advantages and disadvantages, a hybrid method is used to analyse changes in forests and protected areas in a national park. Two Landsat images of Golestan National Park in Iran (taken in 1998 and 2010) were used. This hybrid approach combines Change Vector Analysis (CVA) for flagging the occurrence of changes, followed by signature extension to assign labels to changedpixels. The main objective of this paper is to propose a method for discovering and assessing environmental threats to natural treasures

    Overcoming data scarcity of Twitter:Using tweets as bootstrap with application to autism-related topic content analysis

    No full text
    Notwithstanding recent work which has demonstrated the potential of using Twitter messages for content-specific data mining and analysis, the depth of such analysis is inherently limited by the scarcity of data imposed by the 140 character tweet limit. In this paper we describe a novel approach for targeted knowledge exploration which uses tweet content analysis as a preliminary step. This step is used to bootstrap more sophisticated data collection from directly related but much richer content sources. In particular we demonstrate that valuable information can be collected by following URLs included in tweets. We automatically extract content from the corresponding web pages and treating each web page as a document linked to the original tweet show how a temporal topic model based on a hierarchical Dirichlet process can be used to track the evolution of a complex topic structure of a Twitter community. Using autism-related tweets we demonstrate that our method is capable of capturing a much more meaningful picture of information exchange than user-chosen hashtags.</p

    Evaluating coverage changes in national parks using a hybrid change detection algorithm and remote sensing

    Full text link
    Remote sensing is a useful tool for detecting change over time.We introduce a hybrid change-detection method for forest and protected-area vegetation and demonstrate its use with two satellite images of Golestan National Park in northern Iran (1998 and 2010). We report on the advantages and disadvantages of the hybrid method relative to the standard change-detection method. In the proposed hybrid algorithm, the change vector analysis technique was used to determine changes in vegetation. Following this, we used postclassification comparison to determine the nature of the changes observed and their accuracy and to evaluate the effects of different parameters on the performance of the proposed method. We determined 85% accuracy for the proposed hybrid change-detection method, thus demonstrating a method for discovering and assessing environmental threats to natural treasures. &copy; 2014 Society of Photo-Optical Instrumentation Engineers
    corecore