Taxonomy Induction using Hypernym Subsequences

Biemann Chris; Cram Damien; Grefenstette Gregory; Gupta Amit; Kozareva Zornitsa; Nastase Vivi; Oakes Michael P; Ponzetto S.; Ponzetto Simone Paolo; Snow Rion

research

Taxonomy Induction using Hypernym Subsequences

Authors: Biemann Chris
Cram Damien
Grefenstette Gregory
Gupta Amit
Kozareva Zornitsa
Nastase Vivi
Oakes Michael P
Ponzetto S.
Ponzetto Simone Paolo
Snow Rion
Publication date: 5 May 2017
Publisher
Doi

Abstract

We propose a novel, semi-supervised approach towards domain taxonomy induction from an input vocabulary of seed terms. Unlike all previous approaches, which typically extract direct hypernym edges for terms, our approach utilizes a novel probabilistic framework to extract hypernym subsequences. Taxonomy induction from extracted subsequences is cast as an instance of the minimumcost flow problem on a carefully designed directed graph. Through experiments, we demonstrate that our approach outperforms stateof- the-art taxonomy induction approaches across four languages. Importantly, we also show that our approach is robust to the presence of noise in the input vocabulary. To the best of our knowledge, no previous approaches have been empirically proven to manifest noise-robustness in the input vocabulary