29,769 research outputs found
Relation Extraction : A Survey
With the advent of the Internet, large amount of digital text is generated
everyday in the form of news articles, research publications, blogs, question
answering forums and social media. It is important to develop techniques for
extracting information automatically from these documents, as lot of important
information is hidden within them. This extracted information can be used to
improve access and management of knowledge hidden in large text corpora.
Several applications such as Question Answering, Information Retrieval would
benefit from this information. Entities like persons and organizations, form
the most basic unit of the information. Occurrences of entities in a sentence
are often linked through well-defined relations; e.g., occurrences of person
and organization in a sentence may be linked through relations such as employed
at. The task of Relation Extraction (RE) is to identify such relations
automatically. In this paper, we survey several important supervised,
semi-supervised and unsupervised RE techniques. We also cover the paradigms of
Open Information Extraction (OIE) and Distant Supervision. Finally, we describe
some of the recent trends in the RE techniques and possible future research
directions. This survey would be useful for three kinds of readers - i)
Newcomers in the field who want to quickly learn about RE; ii) Researchers who
want to know how the various RE techniques evolved over time and what are
possible future research directions and iii) Practitioners who just need to
know which RE technique works best in various settings
X-View: Graph-Based Semantic Multi-View Localization
Global registration of multi-view robot data is a challenging task.
Appearance-based global localization approaches often fail under drastic
view-point changes, as representations have limited view-point invariance. This
work is based on the idea that human-made environments contain rich semantics
which can be used to disambiguate global localization. Here, we present X-View,
a Multi-View Semantic Global Localization system. X-View leverages semantic
graph descriptor matching for global localization, enabling localization under
drastically different view-points. While the approach is general in terms of
the semantic input data, we present and evaluate an implementation on visual
data. We demonstrate the system in experiments on the publicly available
SYNTHIA dataset, on a realistic urban dataset recorded with a simulator, and on
real-world StreetView data. Our findings show that X-View is able to globally
localize aerial-to-ground, and ground-to-ground robot data of drastically
different view-points. Our approach achieves an accuracy of up to 85 % on
global localizations in the multi-view case, while the benchmarked baseline
appearance-based methods reach up to 75 %
Morphology-based Entity and Relational Entity Extraction Framework for Arabic
Rule-based techniques to extract relational entities from documents allow
users to specify desired entities with natural language questions, finite state
automata, regular expressions and structured query language. They require
linguistic and programming expertise and lack support for Arabic morphological
analysis. We present a morphology-based entity and relational entity extraction
framework for Arabic (MERF). MERF requires basic knowledge of linguistic
features and regular expressions, and provides the ability to interactively
specify Arabic morphological and synonymity features, tag types associated with
regular expressions, and relations and code actions defined over matches of
subexpressions. MERF constructs entities and relational entities from matches
of the specifications. We evaluated MERF with several case studies. The results
show that MERF requires shorter development time and effort compared to
existing application specific techniques and produces reasonably accurate
results within a reasonable overhead in run time
UHop: An Unrestricted-Hop Relation Extraction Framework for Knowledge-Based Question Answering
In relation extraction for knowledge-based question answering, searching from
one entity to another entity via a single relation is called "one hop". In
related work, an exhaustive search from all one-hop relations, two-hop
relations, and so on to the max-hop relations in the knowledge graph is
necessary but expensive. Therefore, the number of hops is generally restricted
to two or three. In this paper, we propose UHop, an unrestricted-hop framework
which relaxes this restriction by use of a transition-based search framework to
replace the relation-chain-based search one. We conduct experiments on
conventional 1- and 2-hop questions as well as lengthy questions, including
datasets such as WebQSP, PathQuestion, and Grid World. Results show that the
proposed framework enables the ability to halt, works well with
state-of-the-art models, achieves competitive performance without exhaustive
searches, and opens the performance gap for long relation paths.Comment: To appear in NAACL-HLT 201
The Actias system: supervised multi-strategy learning paradigm using categorical logic
One of the most difficult problems in the development of intelligent systems
is the construction of the underlying knowledge base. As a consequence, the
rate of progress in the development of this type of system is directly related
to the speed with which knowledge bases can be assembled, and on its quality.
We attempt to solve the knowledge acquisition problem, for a Business
Information System, developing a supervised multistrategy learning paradigm.
This paradigm is centred on a collaborative data mining strategy, where groups
of experts collaborate using data-mining process on the supervised acquisition
of new knowledge extracted from heterogeneous machine learning data models.
The Actias system is our approach to this paradigm. It is the result of
applying the graphic logic based language of sketches to knowledge integration.
The system is a data mining collaborative workplace, where the Information
System knowledge base is an algebraic structure. It results from the
integration of background knowledge with new insights extracted from data
models, generated for specific data modelling tasks, and represented as rules
using the sketches language.Comment: 9 pages, 6 figures, conference ICKEDS'0
More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction
Relational facts are an important component of human knowledge, which are
hidden in vast amounts of text. In order to extract these facts from text,
people have been working on relation extraction (RE) for years. From early
pattern matching to current neural networks, existing RE methods have achieved
significant progress. Yet with explosion of Web text and emergence of new
relations, human knowledge is increasing drastically, and we thus require
"more" from RE: a more powerful RE system that can robustly utilize more data,
efficiently learn more relations, easily handle more complicated context, and
flexibly generalize to more open domains. In this paper, we look back at
existing RE methods, analyze key challenges we are facing nowadays, and show
promising directions towards more powerful RE. We hope our view can advance
this field and inspire more efforts in the community
Knowledge Extraction and Knowledge Integration governed by {\L}ukasiewicz Logics
The development of machine learning in particular and artificial intelligent
in general has been strongly conditioned by the lack of an appropriate
interface layer between deduction, abduction and induction. In this work we
extend traditional algebraic specification methods in this direction. Here we
assume that such interface for AI emerges from an adequate Neural-Symbolic
integration. This integration is made for universe of discourse described on a
Topos governed by a many-valued {\L}ukasiewicz logic. Sentences are integrated
in a symbolic knowledge base describing the problem domain, codified using a
graphic-based language, wherein every logic connective is defined by a neuron
in an artificial network. This allows the integration of first-order formulas
into a network architecture as background knowledge, and simplifies symbolic
rule extraction from trained networks. For the train of such neural networks we
changed the Levenderg-Marquardt algorithm, restricting the knowledge
dissemination in the network structure using soft crystallization. This
procedure reduces neural network plasticity without drastically damaging the
learning performance, allowing the emergence of symbolic patterns. This makes
the descriptive power of produced neural networks similar to the descriptive
power of {\L}ukasiewicz logic language, reducing the information lost on
translation between symbolic and connectionist structures. We tested this
method on the extraction of knowledge from specified structures. For it, we
present the notion of fuzzy state automata, and we use automata behaviour to
infer its structure. We use this type of automata on the generation of models
for relations specified as symbolic background knowledge.Comment: 38 page
Thematic Analysis and Visualization of Textual Corpus
The semantic analysis of documents is a domain of intense research at
present. The works in this domain can take several directions and touch several
levels of granularity. In the present work we are exactly interested in the
thematic analysis of the textual documents. In our approach, we suggest
studying the variation of the theme relevance within a text to identify the
major theme and all the minor themes evoked in the text. This allows us at the
second level of analysis to identify the relations of thematic associations in
a textual corpus. Through the identification and the analysis of these
association relations we suggest generating thematic paths allowing users,
within the frame work of information search system, to explore the corpus
according to their themes of interest and to discover new knowledge by
navigating in the thematic association relations.Comment: 16 pages,9 figure
Global Relation Embedding for Relation Extraction
We study the problem of textual relation embedding with distant supervision.
To combat the wrong labeling problem of distant supervision, we propose to
embed textual relations with global statistics of relations, i.e., the
co-occurrence statistics of textual and knowledge base relations collected from
the entire corpus. This approach turns out to be more robust to the training
noise introduced by distant supervision. On a popular relation extraction
dataset, we show that the learned textual relation embedding can be used to
augment existing relation extraction models and significantly improve their
performance. Most remarkably, for the top 1,000 relational facts discovered by
the best existing model, the precision can be improved from 83.9% to 89.3%.Comment: Accepted to NAACL HLT 201
Deep Learning for LiDAR Point Clouds in Autonomous Driving: A Review
Recently, the advancement of deep learning in discriminative feature learning
from 3D LiDAR data has led to rapid development in the field of autonomous
driving. However, automated processing uneven, unstructured, noisy, and massive
3D point clouds is a challenging and tedious task. In this paper, we provide a
systematic review of existing compelling deep learning architectures applied in
LiDAR point clouds, detailing for specific tasks in autonomous driving such as
segmentation, detection, and classification. Although several published
research papers focus on specific topics in computer vision for autonomous
vehicles, to date, no general survey on deep learning applied in LiDAR point
clouds for autonomous vehicles exists. Thus, the goal of this paper is to
narrow the gap in this topic. More than 140 key contributions in the recent
five years are summarized in this survey, including the milestone 3D deep
architectures, the remarkable deep learning applications in 3D semantic
segmentation, object detection, and classification; specific datasets,
evaluation metrics, and the state of the art performance. Finally, we conclude
the remaining challenges and future researches.Comment: 21 pages, submitted to IEEE Transactions on Neural Networks and
Learning System
- …