
    Cyclic Directed Probabilistic Graphical Model: A Proposal Based on Structured Outcomes

    In the process of building (structural learning) a probabilistic graphical model from a set of observed data, directed cyclic dependencies between the random variables of the model are often found. Existing graphical models such as Bayesian and Markov networks can reflect such dependencies, but only at the cost of complicating the model, for example by adding auxiliary variables or dividing the model graph into separate subgraphs. Herein, we describe a probabilistic graphical model, the probabilistic relation network, that allows directed cyclic dependencies to be captured directly during structural learning. The model is based on the simple idea that each sample of the observed data can be represented by an arbitrary graph (a structured outcome), which reflects the dependency structure of the variables included in the sample. Each outcome contains only part of the graphical model structure; a complete graph of the probabilistic model is obtained by combining different outcomes. Such a graph, unlike Bayesian and Markov networks, can be directed and can have cycles. We explored the full joint distribution, conditional distributions, and conditional independence properties of variables in the proposed model. We defined algorithms for constructing the model from the dataset and for calculating the conditional and full joint distributions, and we performed a numerical comparison with Bayesian and Markov networks. The model does not violate the probability axioms, and it supports learning from observed data. Notably, it supports probabilistic inference, making it a promising tool for data analysis and for expert and decision-making applications.
    Comment: 41 pages, 11 figures, arXiv:2206.06089v
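    The combination step lends itself to a short illustration. The Python sketch below shows one plausible reading of the structured-outcomes idea under stated assumptions: each sample contributes a small set of directed edges, and merging the samples yields a model graph that may be directed and cyclic. All names and data here are illustrative; the abstract does not specify the paper's actual data structures or merging algorithm.

```python
from collections import Counter

# Hypothetical representation (not the paper's API): each observed
# sample is a small directed graph -- a "structured outcome" -- given
# here as a set of (parent, child) edges over the variables it mentions.
outcomes = [
    {("A", "B"), ("B", "C")},               # sample 1
    {("C", "A")},                           # sample 2
    {("A", "B"), ("B", "C"), ("C", "A")},   # sample 3
]

def merge_outcomes(outcomes):
    """Combine per-sample graphs into one model graph with edge counts.

    Unlike a Bayesian network, the merged graph may be directed *and*
    cyclic: different outcomes can contribute edges that close a cycle.
    """
    edge_counts = Counter()
    for outcome in outcomes:
        edge_counts.update(outcome)
    return edge_counts

def has_cycle(edges):
    """DFS check that the merged directed graph contains a cycle."""
    graph = {}
    for u, v in edges:
        graph.setdefault(u, []).append(v)
        graph.setdefault(v, [])
    state = {node: "unvisited" for node in graph}

    def dfs(node):
        state[node] = "in-progress"
        for nxt in graph[node]:
            if state[nxt] == "in-progress":
                return True                 # back edge => cycle
            if state[nxt] == "unvisited" and dfs(nxt):
                return True
        state[node] = "done"
        return False

    return any(state[n] == "unvisited" and dfs(n) for n in list(graph))

edges = merge_outcomes(outcomes)
print(edges)             # edge frequencies across outcomes
print(has_cycle(edges))  # True: A -> B -> C -> A closes a cycle
```

    The edge counts give the raw material for estimating conditional distributions on the merged graph, which is exactly the structure a DAG-constrained model would have to break apart.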

    Exploiting Syntactic, Semantic and Lexical Regularities in Language Modeling via Directed Markov Random Fields

    We present a directed Markov random field (MRF) model that combines n-gram models, probabilistic context-free grammars (PCFGs), and probabilistic latent semantic analysis (PLSA) for the purpose of statistical language modeling. Even though the composite directed MRF model potentially has an exponential number of loops and becomes a context-sensitive grammar, we are nevertheless able to estimate its parameters in cubic time using an efficient modified EM method, the generalized inside-outside algorithm, which extends the inside-outside algorithm to incorporate the effects of the n-gram and PLSA language models. We generalize various smoothing techniques to alleviate the sparseness of n-gram counts in cases where there are hidden variables. We also derive an analogous algorithm to calculate the probability of an initial subsequence of a sentence generated by the composite language model. Our experimental results on the Wall Street Journal corpus show significant reductions in perplexity compared to the state-of-the-art baseline trigram model with Good-Turing and Kneser-Ney smoothing.
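    The cubic-time estimation rests on an inside-outside-style dynamic program. For reference, the Python sketch below implements the plain inside pass for a toy PCFG in Chomsky normal form, the O(n^3) recursion that the generalized inside-outside algorithm extends. The grammar and function names are illustrative assumptions; the paper's composite model additionally folds n-gram and PLSA factors into these scores.

```python
from collections import defaultdict

# Toy CNF grammar (an assumption for illustration only).
# Rules: binary A -> B C with probability p, and lexical A -> w.
binary_rules = {
    "S": [("NP", "VP", 1.0)],
    "VP": [("V", "NP", 1.0)],
}
lexical_rules = {
    "NP": {"she": 0.5, "fish": 0.5},
    "V": {"eats": 1.0},
}

def inside_probability(words, start="S"):
    """Inside pass: inside[i][j][A] = P(A derives words[i:j])."""
    n = len(words)
    inside = [[defaultdict(float) for _ in range(n + 1)] for _ in range(n + 1)]
    # Base case: lexical rules cover single-word spans.
    for i, w in enumerate(words):
        for A, lex in lexical_rules.items():
            if w in lex:
                inside[i][i + 1][A] = lex[w]
    # Recursion over spans and split points: O(n^3) per grammar rule.
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for A, rules in binary_rules.items():
                    for B, C, p in rules:
                        inside[i][j][A] += p * inside[i][k][B] * inside[k][j][C]
    return inside[0][n][start]

print(inside_probability(["she", "eats", "fish"]))  # 0.25
```

    The same chart, paired with an outside pass, yields the expected rule counts used in the EM updates; the generalization described above reweights the chart entries with n-gram and PLSA probabilities.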

    Getting Past the Language Gap: Innovations in Machine Translation

    In this chapter, we review state-of-the-art machine translation systems and discuss innovative methods for machine translation, highlighting the most promising techniques and applications. Machine translation (MT) has benefited from a revitalization in the last 10 years or so, after a period of relatively slow activity. In 2005 the field received a jumpstart when a powerful, complete experimental package for building MT systems from scratch became freely available as a result of the unified efforts of the MOSES international consortium. Around the same time, hierarchical methods were introduced by Chinese researchers, which allowed the introduction and use of syntactic information in translation modeling. Furthermore, advances in the related field of computational linguistics, which made off-the-shelf taggers and parsers readily available, gave MT an additional boost. Yet there is still more progress to be made. For example, MT will be enhanced greatly when both syntax and semantics are on board: this remains a major challenge, though many advanced research groups are currently pursuing ways to meet it head-on. The next generation of MT will consist of a collection of hybrid systems. The outlook is also good for the mobile environment, as we look forward to more advanced and improved technologies, i.e. speech recognition and speech synthesis, that enable speech-to-speech machine translation on hand-held devices. We review all of these developments and point out in the final section some of the most promising research avenues for the future of MT.