
    Steering time-dependent estimation of posteriors with hyperparameter indexing in Bayesian topic models

    This paper provides a new approach to topical trend analysis. Our aim is to improve the generalization power of latent Dirichlet allocation (LDA) by using document timestamps. Many previous works model topical trends by making latent topic distributions time-dependent. We propose a more straightforward approach: preparing a different word multinomial distribution for each time point. Since this approach increases the number of parameters, overfitting becomes a critical issue. Our contribution to this issue is two-fold. First, we propose an effective way of defining Dirichlet priors over the word multinomials. Second, we propose a special scheduling of variational Bayesian (VB) inference. Comprehensive experiments with six datasets show that our approach improves both LDA and Topics over Time, a well-known variant of LDA, in terms of test-data perplexity in the framework of VB inference.
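The core idea, one word multinomial per time point regularized by a shared Dirichlet prior, can be sketched as follows. This is a minimal illustration with hypothetical names, not the paper's actual estimator or its VB scheduling:

```python
import numpy as np

def per_time_word_dists(counts, prior):
    """Smooth per-time-point word counts with a shared Dirichlet prior.

    counts : (T, V) array of word counts observed at each time point
    prior  : (V,) Dirichlet pseudo-counts shared across all time points
    Returns a (T, V) array of word multinomials, one row per time point.
    """
    smoothed = counts + prior                      # add shared pseudo-counts
    return smoothed / smoothed.sum(axis=1, keepdims=True)

# Two time points, two-word vocabulary: the shared prior keeps the
# sparse second time point from collapsing onto a single word.
counts = np.array([[3.0, 1.0], [0.0, 4.0]])
prior = np.array([1.0, 1.0])
dists = per_time_word_dists(counts, prior)
# dists[0] → [2/3, 1/3]; each row sums to 1
```

The shared prior is what ties the per-time-point multinomials together; without it, each time slice would be estimated independently and overfit, which is exactly the issue the abstract raises.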

    Bag of Timestamps: A Simple and Efficient Bayesian Chronological Mining

    In this paper, we propose a new probabilistic model, Bag of Timestamps (BoT), for chronological text mining. BoT is an extension of latent Dirichlet allocation (LDA), and has two remarkable features when compared with the previously proposed Topics over Time (ToT), which is also an extension of LDA. First, we can avoid overfitting to temporal data, because temporal data are modeled in a Bayesian manner similar to word frequencies. Second, BoT has a conditional probability in which no functions requiring time-consuming computations appear. Experiments using newswire documents show that BoT achieves a more moderate fit to temporal data in a shorter execution time than ToT.
    Advances in Data and Web Management: Joint International Conferences, APWeb/WAIM 2009, Suzhou, China, April 2-4, 2009, Proceedings
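The second feature can be sketched concretely. If timestamps are treated as tokens drawn from per-topic multinomials, the collapsed-Gibbs conditional for a timestamp token involves only count ratios; the form below is a standard collapsed-Gibbs expression with hypothetical variable names, shown for contrast with ToT, which must evaluate Beta densities at this step:

```python
def timestamp_topic_conditional(n_dk, n_ks, n_k, alpha, gamma, S):
    """Unnormalized collapsed-Gibbs conditional for assigning one
    timestamp token to each of K topics, using counts only.

    n_dk  : per-topic counts in the current document   (length K)
    n_ks  : per-topic counts of this timestamp value   (length K)
    n_k   : per-topic totals of timestamp tokens       (length K)
    alpha : document-topic Dirichlet hyperparameter
    gamma : topic-timestamp Dirichlet hyperparameter
    S     : number of distinct timestamp values
    """
    return [(n_dk[k] + alpha) * (n_ks[k] + gamma) / (n_k[k] + S * gamma)
            for k in range(len(n_dk))]

weights = timestamp_topic_conditional(
    n_dk=[1, 0], n_ks=[2, 0], n_k=[4, 0],
    alpha=0.5, gamma=0.1, S=2)
# → [0.75, 0.25]: only additions, multiplications, and divisions,
# no special-function evaluations
```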

    Modeling topical trends over continuous time with priors

    In this paper, we propose a new method for topical trend analysis. We model topical trends by per-topic Beta distributions, as in Topics over Time (TOT), proposed as an extension of latent Dirichlet allocation (LDA). However, TOT is likely to overfit to timestamp data when extracting latent topics. Therefore, we apply prior distributions to the Beta distributions in TOT. Since the Beta distribution has no conjugate prior, we devise a trick: we set one of the two parameters of each per-topic Beta distribution to one, based on a Bernoulli trial, and apply a Gamma distribution as a conjugate prior. Consequently, we can marginalize out the parameters of the Beta distributions and thus treat timestamp data in a Bayesian fashion. In the evaluation experiment, we compare our method with LDA and TOT in a link detection task on the TDT4 dataset. We use word predictive probabilities as term weights and estimate document similarities by using those weights in a TFIDF-like scheme. The results show that our method achieves a moderate fit to timestamp data.
    Advances in Neural Networks - ISNN 2010: 7th International Symposium on Neural Networks, ISNN 2010, Shanghai, China, June 6-9, 2010, Proceedings, Part II. The original publication is available at www.springerlink.com
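The conjugacy trick can be made explicit. Fixing the second Beta parameter to $b = 1$ (the case $a = 1$ is symmetric) reduces the density over a timestamp $t \in (0,1)$ to a power law, which is Gamma-conjugate in the remaining parameter; the derivation below is a standard reconstruction of that observation, not a transcript of the paper's notation:

```latex
% Beta density with b = 1 reduces to a power law in t:
p(t \mid a, b{=}1) \;=\; \frac{t^{a-1}(1-t)^{0}}{B(a,1)} \;=\; a\, t^{a-1}

% Likelihood of n timestamps t_1, \dots, t_n assigned to the topic:
\prod_{i=1}^{n} a\, t_i^{a-1}
  \;=\; a^{n} \exp\!\Big((a-1)\sum_{i=1}^{n}\ln t_i\Big)

% Hence a Gamma prior a \sim \mathrm{Gamma}(k,\theta) is conjugate:
p(a \mid t_{1:n})
  \;\propto\; a^{k+n-1}
  \exp\!\Big(-a\big(\theta - \textstyle\sum_{i}\ln t_i\big)\Big)
% i.e. a posterior \mathrm{Gamma}\big(k+n,\; \theta - \sum_i \ln t_i\big).
```

Since $t_i \in (0,1)$ implies $\ln t_i < 0$, the posterior rate stays positive, and $a$ can be integrated out in closed form, which is what allows the timestamps to be treated in a fully Bayesian fashion.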

    Dynamic hyperparameter optimization for Bayesian topical trend analysis

    This paper presents a new Bayesian topical trend analysis. We regard the parameters of the topic Dirichlet priors in latent Dirichlet allocation as a function of document timestamps and optimize these parameters by a gradient-based algorithm. Since our method gives similar hyperparameters to documents having similar timestamps, topic assignment in collapsed Gibbs sampling is affected by timestamp similarities. We compute TFIDF-based document similarities by using a result of collapsed Gibbs sampling and evaluate our proposal on the link detection task of Topic Detection and Tracking.
    Proceedings of the 18th ACM conference: Hong Kong, China, 2009.11.02-2009.11.0

    Critical profiles of chiral diether-mediated asymmetric conjugate aminolithiation of enoate with lithium amide as a key to the total synthesis of (−)-kopsinine

    Chiral diether-mediated asymmetric aminolithiation of indolylpropenoate with lithium amide in toluene at -78 °C for 15 min gave, after an aqueous ammonium chloride quench, the corresponding conjugate addition product with 97% ee in 89% yield. If hydrogen chloride in methanol was selected as the quencher, however, aminolithiation at -78 °C for 3 h gave the corresponding adduct with 97% ee in 54% yield, along with recovery of the starting enoate in 39% yield. Based on this finding of an incomplete and slow reaction at -78 °C, the aminolithiation conditions were optimized to -60 °C for 15 h, and a subsequent enolate trap with an alkyl halide upon addition of DMPU afforded the desired aminoalkylation product with 98% ee in 89% yield. A further approach toward the total synthesis of (−)-kopsinine was carried out by examining asymmetric aminolithiation with an N-hydroxyethylamine equivalent, one-pot piperidine formation, and Claisen condensation.

    Semi-supervised bibliographic element segmentation with latent permutations

    This paper proposes a semi-supervised bibliographic element segmentation method. Our input data is a large-scale set of bibliographic references, each given as an unsegmented sequence of word tokens. Our problem is to segment each reference into bibliographic elements, e.g., authors, title, journal, pages, etc. We solve this problem with an LDA-like topic model by assigning each word token to a topic, so that the word tokens assigned to the same topic refer to the same bibliographic element. Topic assignments should satisfy the contiguity constraint, i.e., the constraint that the word tokens assigned to the same topic should be contiguous. Therefore, in our preceding work [8], we proposed a topic model based on the topic model devised by Chen et al. [3]. Our model extends LDA and realizes unsupervised topic assignments satisfying the contiguity constraint. The main contribution of this paper is the proposal of a semi-supervised learning method for our model. We assume that at most one third of the word tokens are already labeled. In addition, we assume that a few percent of the labels may be incorrect. The experiment showed that our semi-supervised learning improved on the unsupervised learning by a large margin and achieved an over 90% segmentation accuracy.
    13th International Conference on Asia-Pacific Digital Libraries, ICADL 2011; Beijing; 24 October 2011 through 27 October 2011
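The contiguity constraint itself is simple to state in code: a topic assignment over a token sequence is valid only if each topic occupies a single contiguous run. The checker below is an illustrative helper, not part of the paper's model:

```python
def satisfies_contiguity(assignments):
    """Return True iff every topic (bibliographic element) in the
    token-level assignment sequence forms one contiguous run.
    """
    seen = set()   # topics whose run has already started
    prev = None
    for topic in assignments:
        if topic != prev:
            if topic in seen:      # topic reappears after a gap
                return False
            seen.add(topic)
            prev = topic
    return True

satisfies_contiguity(["author", "author", "title", "pages"])  # valid
satisfies_contiguity(["author", "title", "author"])           # violates
```

A topic model restricted to assignments passing this check segments the reference into contiguous fields, which is exactly the segmentation the abstract describes.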
