Search CORE

2 research outputs found

Dependency Parsing for Weibo: An Efficient Probabilistic Logic Programming Approach

Author: Kathryn Mazaitis (3888793)
Lingpeng Kong (5361833)
William W. Cohen (5359088)
William Yang Wang (5358836)
Publication venue
Publication date: 29/06/2018
Field of study

<p>Dependency parsing is a core task in NLP, and it is widely used by many applications such as information extraction, question answering, and machine translation. In the era of social media, a big challenge is that parsers trained on traditional newswire corpora typically suffer from the domain mismatch issue, and thus perform poorly on social media data. We present a new GFL/FUDG-annotated Chinese treebank with more than 18K tokens from Sina Weibo (the Chinese equivalent of Twitter). We formulate the dependency parsing problem as many small and parallelizable arc prediction tasks: for each task, we use a programmable probabilistic firstorder logic to infer the dependency arc of a token in the sentence. In experiments, we show that the proposed model outperforms an off-the-shelf Stanford Chinese parser, as well as a strong MaltParser baseline that is trained on the same in-domain data.</p

FigShare

A Tale of Two Entity Linking and Discovery Systems

Author: Bhavana Dalvi (5362277)
Frank Lin (5363810)
Jakob Bauer (5363807)
Kathryn Mazaitis (3888793)
Richard C. Wang (252512)
William W. Cohen (5359088)
Publication venue
Publication date
Field of study

<p>The long-term research agenda of our group is to evaluate the potential of probabilistic logics for complex, large-scale problems which require data resources naturally encoded as relations. In pursuit of this goal, we compared two systems for performing automated entity discovery and linking in English-language text, as submitted to the 2014 TAC Knowledge Base Population Entity Discovery and Linking (EDL) track. Both systems are based on random-walk strategies for measuring similarity within graphs. The first system is PageReactor, a hand-engineering system originally designed for task of wikification. The second is based on ProPPR, a probabilistic logic programming language</p

FigShare