2 research outputs found

    Topic segmentation model based on ATNLDA and co-occurrence theory and its application in stem cell field

    No full text
    Natural Science Foundation of Fujian Province of China [2011J01360]; National Key Technology RD Program [2011BAH10B06-02]; Humanity and Social Science Youth foundation of Ministry of Education [11YJC870001, 11YJC870027]This paper describes the application of co-occurrence and latent Dirichlet allocation (LDA)-based topic analyses in stem cell-related literature research. On account of the deficiency of parameter estimation in LDA, this study integrated co-occurrence theory and clustering judgement indicators and constructed an ATNLDA (Auto Topic Number LDA) model for topic segmentation. Next, ATNLDA was used to determine the optimal topic number of stem cell research literatures from 2006 to 2011 in PubMed, which was then used for topic segmentation of research content in stem cell data set. After stem cell research topics were obtained, they were analysed in terms of topic label, topic research content and interrelation between topics. The results verified that application of ATNLDA in topic segmentation in stem cell literature research is effective and feasible. Current deficiencies of ATNLDA and future study plan were also discussed
    corecore