Search CORE

17,507 research outputs found

Memory-Efficient Topic Modeling

Author: Cao Xiao-Qin
Liu Zhi-Qiang
Zeng Jia
Publication venue
Publication date: 08/06/2012
Field of study

As one of the simplest probabilistic topic modeling techniques, latent Dirichlet allocation (LDA) has found many important applications in text mining, computer vision and computational biology. Recent training algorithms for LDA can be interpreted within a unified message passing framework. However, message passing requires storing previous messages with a large amount of memory space, increasing linearly with the number of documents or the number of topics. Therefore, the high memory usage is often a major problem for topic modeling of massive corpora containing a large number of topics. To reduce the space complexity, we propose a novel algorithm without storing previous messages for training LDA: tiny belief propagation (TBP). The basic idea of TBP relates the message passing algorithms with the non-negative matrix factorization (NMF) algorithms, which absorb the message updating into the message passing process, and thus avoid storing previous messages. Experimental results on four large data sets confirm that TBP performs comparably well or even better than current state-of-the-art training algorithms for LDA but with a much less memory consumption. TBP can do topic modeling when massive corpora cannot fit in the computer memory, for example, extracting thematic topics from 7 GB PUBMED corpora on a common desktop computer with 2GB memory.Comment: 20 pages, 7 figure

arXiv.org e-Print Archive

CiteSeerX

The relation between the Toda hierarchy and the KdV hierarchy

Author: Case
Case
Gieseker
Kupershmidt
Morosi
Morosi
Morosi
Newell
Runliang Lin
Toda
Tu
Xin Cao
Yunbo Zeng
Zeng
Zeng
Publication venue: 'Elsevier BV'
Publication date: 28/12/1998
Field of study

Under three relations connecting the field variables of Toda flows and that of KdV flows, we present three new sequences of combination of the equations in the Toda hierarchy which have the KdV hierarchy as a continuous limit. The relation between the Poisson structures of the KdV hierarchy and the Toda hierarchy in continuous limit is also studied.Comment: 11 pages, Tex, no figures, to be published in Physics Letters

arXiv.org e-Print Archive

CiteSeerX

Crossref