Search CORE

1,648 research outputs found

NLP and the Humanities: The Revival of an Old Liaison

Author: Jong F.M.G. de
Publication venue: Association for Computational Linguistics
Publication date: 01/01/2009
Field of study

This paper presents an overview of some\ud emerging trends in the application of NLP\ud in the domain of the so-called Digital Humanities\ud and discusses the role and nature\ud of metadata, the annotation layer that is so\ud characteristic of documents that play a role\ud in the scholarly practises of the humanities.\ud It is explained how metadata are the\ud key to the added value of techniques such\ud as text and link mining, and an outline is\ud given of what measures could be taken to\ud increase the chances for a bright future for\ud the old ties between NLP and the humanities.\ud There is no data like metadata

University of Twente Research Information

Ontologies and Information Extraction

Author: Nazarenko Adeline
Nédellec Claire
Publication venue
Publication date: 01/01/2005
Field of study

This report argues that, even in the simplest cases, IE is an ontology-driven process. It is not a mere text filtering method based on simple pattern matching and keywords, because the extracted pieces of texts are interpreted with respect to a predefined partial domain model. This report shows that depending on the nature and the depth of the interpretation to be done for extracting the information, more or less knowledge must be involved. This report is mainly illustrated in biology, a domain in which there are critical needs for content-based exploration of the scientific literature and which becomes a major application domain for IE

arXiv.org e-Print Archive

HAL Descartes

HAL-Paris 13

Leveraging Indexical Pragmatics (OFIP) for Search Engine: An Ontology- based Approach

Author: Liu Dapeng
Yoon Victoria Y
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2019
Field of study

The relevance of search results is an important indicator of information retrieval performance. A domain-specific Search Engine (SE), distinct from a general web SE, focuses on a specific segment of online content and may increase search results relevance. Traditional methods to improve domain-specific SE precision heavily depend on query expansion, lexical analysis of texts, and large amounts of training data. These methods suffer from limited effectiveness and efficiency because expanded query terms and coarse language features bring in uncontrollable complexity and increase dimensionality. Our design, leveraging the integrated power of computational syntax, semantics, and indexical pragmatics, proposes an ontology-driven framework that is tailored to work in a dynamic Internet environment without large amounts of manually annotated training data. This article presents our design, that is essential for building a domain-specific SE, and its instantiation in the terrorism domain

Crossref

ScholarSpace at University of Hawai'i at Manoa

AIS Electronic Library (AISeL)

Improving Ontology Recommendation and Reuse in WebCORE by Collaborative Assessments

Author: Cantador I.
Castells P.
Fernandez M.
Publication venue
Publication date: 01/01/2007
Field of study

In this work, we present an extension of CORE [8], a tool for Collaborative Ontology Reuse and Evaluation. The system receives an informal description of a specific semantic domain and determines which ontologies from a repository are the most appropriate to describe the given domain. For this task, the environment is divided into three modules. The first component receives the problem description as a set of terms, and allows the user to refine and enlarge it using WordNet. The second module applies multiple automatic criteria to evaluate the ontologies of the repository, and determines which ones fit best the problem description. A ranked list of ontologies is returned for each criterion, and the lists are combined by means of rank fusion techniques. Finally, the third component uses manual user evaluations in order to incorporate a human, collaborative assessment of the ontologies. The new version of the system incorporates several novelties, such as its implementation as a web application; the incorporation of a NLP module to manage the problem definitions; modifications on the automatic ontology retrieval strategies; and a collaborative framework to find potential relevant terms according to previous user queries. Finally, we present some early experiments on ontology retrieval and evaluation, showing the benefits of our system

CiteSeerX

Open Research Online (The Open University)

Biblos-e Archivo

Text–to–Video: Image Semantics and NLP

Author: Schwarz Katharina
Publication venue: Universität Tübingen
Publication date: 01/01/2018
Field of study

When aiming at automatically translating an arbitrary text into a visual story, the main challenge consists in finding a semantically close visual representation whereby the displayed meaning should remain the same as in the given text. Besides, the appearance of an image itself largely influences how its meaningful information is transported towards an observer. This thesis now demonstrates that investigating in both, image semantics as well as the semantic relatedness between visual and textual sources enables us to tackle the challenging semantic gap and to find a semantically close translation from natural language to a corresponding visual representation. Within the last years, social networking became of high interest leading to an enormous and still increasing amount of online available data. Photo sharing sites like Flickr allow users to associate textual information with their uploaded imagery. Thus, this thesis exploits this huge knowledge source of user generated data providing initial links between images and words, and other meaningful data. In order to approach visual semantics, this work presents various methods to analyze the visual structure as well as the appearance of images in terms of meaningful similarities, aesthetic appeal, and emotional effect towards an observer. In detail, our GPU-based approach efficiently finds visual similarities between images in large datasets across visual domains and identifies various meanings for ambiguous words exploring similarity in online search results. Further, we investigate in the highly subjective aesthetic appeal of images and make use of deep learning to directly learn aesthetic rankings from a broad diversity of user reactions in social online behavior. To gain even deeper insights into the influence of visual appearance towards an observer, we explore how simple image processing is capable of actually changing the emotional perception and derive a simple but effective image filter. To identify meaningful connections between written text and visual representations, we employ methods from Natural Language Processing (NLP). Extensive textual processing allows us to create semantically relevant illustrations for simple text elements as well as complete storylines. More precisely, we present an approach that resolves dependencies in textual descriptions to arrange 3D models correctly. Further, we develop a method that finds semantically relevant illustrations to texts of different types based on a novel hierarchical querying algorithm. Finally, we present an optimization based framework that is capable of not only generating semantically relevant but also visually coherent picture stories in different styles.Bei der automatischen Umwandlung eines beliebigen Textes in eine visuelle Geschichte, besteht die größte Herausforderung darin eine semantisch passende visuelle Darstellung zu finden. Dabei sollte die Bedeutung der Darstellung dem vorgegebenen Text entsprechen. Darüber hinaus hat die Erscheinung eines Bildes einen großen Einfluß darauf, wie seine bedeutungsvollen Inhalte auf einen Betrachter übertragen werden. Diese Dissertation zeigt, dass die Erforschung sowohl der Bildsemantik als auch der semantischen Verbindung zwischen visuellen und textuellen Quellen es ermöglicht, die anspruchsvolle semantische Lücke zu schließen und eine semantisch nahe Übersetzung von natürlicher Sprache in eine entsprechend sinngemäße visuelle Darstellung zu finden. Des Weiteren gewann die soziale Vernetzung in den letzten Jahren zunehmend an Bedeutung, was zu einer enormen und immer noch wachsenden Menge an online verfügbaren Daten geführt hat. Foto-Sharing-Websites wie Flickr ermöglichen es Benutzern, Textinformationen mit ihren hochgeladenen Bildern zu verknüpfen. Die vorliegende Arbeit nutzt die enorme Wissensquelle von benutzergenerierten Daten welche erste Verbindungen zwischen Bildern und Wörtern sowie anderen aussagekräftigen Daten zur Verfügung stellt. Zur Erforschung der visuellen Semantik stellt diese Arbeit unterschiedliche Methoden vor, um die visuelle Struktur sowie die Wirkung von Bildern in Bezug auf bedeutungsvolle Ähnlichkeiten, ästhetische Erscheinung und emotionalem Einfluss auf einen Beobachter zu analysieren. Genauer gesagt, findet unser GPU-basierter Ansatz effizient visuelle Ähnlichkeiten zwischen Bildern in großen Datenmengen quer über visuelle Domänen hinweg und identifiziert verschiedene Bedeutungen für mehrdeutige Wörter durch die Erforschung von Ähnlichkeiten in Online-Suchergebnissen. Des Weiteren wird die höchst subjektive ästhetische Anziehungskraft von Bildern untersucht und "deep learning" genutzt, um direkt ästhetische Einordnungen aus einer breiten Vielfalt von Benutzerreaktionen im sozialen Online-Verhalten zu lernen. Um noch tiefere Erkenntnisse über den Einfluss des visuellen Erscheinungsbildes auf einen Betrachter zu gewinnen, wird erforscht, wie alleinig einfache Bildverarbeitung in der Lage ist, tatsächlich die emotionale Wahrnehmung zu verändern und ein einfacher aber wirkungsvoller Bildfilter davon abgeleitet werden kann. Um bedeutungserhaltende Verbindungen zwischen geschriebenem Text und visueller Darstellung zu ermitteln, werden Methoden des "Natural Language Processing (NLP)" verwendet, die der Verarbeitung natürlicher Sprache dienen. Der Einsatz umfangreicher Textverarbeitung ermöglicht es, semantisch relevante Illustrationen für einfache Textteile sowie für komplette Handlungsstränge zu erzeugen. Im Detail wird ein Ansatz vorgestellt, der Abhängigkeiten in Textbeschreibungen auflöst, um 3D-Modelle korrekt anzuordnen. Des Weiteren wird eine Methode entwickelt die, basierend auf einem neuen hierarchischen Such-Anfrage Algorithmus, semantisch relevante Illustrationen zu Texten verschiedener Art findet. Schließlich wird ein optimierungsbasiertes Framework vorgestellt, das nicht nur semantisch relevante, sondern auch visuell kohärente Bildgeschichten in verschiedenen Bildstilen erzeugen kann

Publikationsserver der Universität Tübingen

Econometrics meets sentiment : an overview of methodology and applications

Author: Algaba Andres
Ardia David
Bluteau Keven
Borms Samuel
Boudt Kris
Publication venue: 'Wiley'
Publication date: 01/01/2020
Field of study

The advent of massive amounts of textual, audio, and visual data has spurred the development of econometric methodology to transform qualitative sentiment data into quantitative sentiment variables, and to use those variables in an econometric analysis of the relationships between sentiment and other variables. We survey this emerging research field and refer to it as sentometrics, which is a portmanteau of sentiment and econometrics. We provide a synthesis of the relevant methodological approaches, illustrate with empirical results, and discuss useful software

VU Research Portal

Crossref

Ghent University Academic Bibliography

Biomedical informatics and translational medicine

Author: A Berlin
A Brazma
A Burgun
A Ebidia
A Ikekawa
A Kundaje
A Mangalampalli
A Ruttenberg
A Rzhetsky
A Wright
AH Peden
AJ Butte
AJ Butte
AJ Butte
AJ Cawsey
AK Smith
AM McDaniel
AS N
AX Garg
B Chaudhry
B Honigman
B Kaplan
B Louie
B Mollon
B Williams-Jones
BC Choi
BC Choi
BC Choi
BG Blobel
BJ Liu
BJ Liu
BL Humphreys
BM Costa
C Fomous
C Ohmann
CD Manning
CE Kahn Jr
CP Friedman
CS Ledbetter
D Detmer
D Johnston
D Jurafsky
D Lorence
D Rebholz-Schuhman
D Revere
D Short
DA Jordan
DA Lindberg
DB Keator
DC Balfour
DF Sittig
DJ Persell
DJ Severtson
DK Manley
DL Buckeridge
DL Heymann
DL Hunt
DL Rubin
DL Rubin
DM Bravata
DR Masys
DR Swanson
E Barclay
E Cadag
E Reiter
EA Zerhouni
EA Zerhouni
EG Poon
EH Shortliffe
EJ Hovenga
ER Weitzman
EV Bernstam
EV Bernstam
FT de Dombal
FT De Dombal
G Eysenbach
G Hripcsak
G Wade
GA Thorisson
GJ Downing
GO Barnett
GO Klein
GO Klein
GS Butler
GS Omenn
H Eriksson
H Muller
HJ Lee
HP Lehmann
HU Prokosch
Indra Neil Sarkar
IS Vizirianakis
J Allen
J Blake
J Cimino
J Lahteenmaki
J Lombardo
J Lyon
J Mantas
J Mykkanen
J Pathak
J Pearl
J Quackenbush
JA Osheroff
JD Halamka
JE Allen
JH van Bemmel
JJ Cimino
JJ Cimino
JK Iglehart
JM Marchibroda
JM Westfall
JS Brownstein
K Hayrinen
K Kawamoto
K Kawamoto
K Wasson
KA Kuhn
KB Cohen
KD Mandl
L Ohno-Machado
L Poissant
L Stein
LM Prevedello
M Dalal
M Fieschi
M Gerstein
M Musen
M Scherf
M Weeber
MA Harris
MA Hoffman
MA Musen
MD Kane
MF Collen
MF Collen
MJ Ball
MJ Ball
MJ Khoury
MS Siadaty
MS Watson
MY Galperin
MY Law
O Bodenreider
O Ratib
O Ratib
P Baxter
P De Clercq
P Durieux
P Jacquemart
P Mirhaji
PA Dang
PA Dang
PC Tang
PF Brennan
PG Shekelle
PH Gesteland
PJ Embi
PJ Embi
PL Reichertz
PM Kuzmak
PR Payne
PR Payne
PW O'Carroll
QT Zeng
R Feldman
R Haux
R Khorasani
R Kukafka
R Kukafka
R Mattheus
RA Greenes
RA Greenes
RA Greenes
RA Pagon
RB Altman
RL Arenson
RL Richesson
RO Duda
RS Dick
S Oster
S Xu
SB Johnson
SB King
SC Kirkwood
SF Altschul
SH Woolf
SM Huff
SM Maviglia
SM Meystre
SS Furuie
ST Rosenbloom
TH Payne
TK Houston
TR Frieden
U Rajcevic
U Sax
V Kashyap
V Maojo
V Maojo
VL Patel
W Clancey
W Hersh
W Hersh
W Hsu
WA Yasnoff
WD Bidgood Jr
WE Evans
WE Hammond
WE Hammond
WE Schreiber
WJ Bug
WR Hersh
WR Hersh
WR Hersh
WW Chapman
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Biomedical informatics involves a core set of methodologies that can provide a foundation for crossing the "translational barriers" associated with translational medicine. To this end, the fundamental aspects of biomedical informatics (e.g., bioinformatics, imaging informatics, clinical informatics, and public health informatics) may be essential in helping improve the ability to bring basic research findings to the bedside, evaluate the efficacy of interventions across communities, and enable the assessment of the eventual impact of translational medicine innovations on health policies. Here, a brief description is provided for a selection of key biomedical informatics topics (Decision Support, Natural Language Processing, Standards, Information Retrieval, and Electronic Health Records) and their relevance to translational medicine. Based on contributions and advancements in each of these topic areas, the article proposes that biomedical informatics practitioners ("biomedical informaticians") can be essential members of translational medicine teams

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Arabic Information Retrieval: A Relevancy Assessment Survey

Author: Ababneh Ahmad
Lu Joan
Xu Qiang
Publication venue: 'Omer Halisdemir Universitesi Iktisadi ve Idari Bilimler Fakultesi Dergisi'
Publication date: 01/01/2016
Field of study

The paper presents a research in Arabic Information Retrieval (IR). It surveys the impact of statistical and morphological analysis of Arabic text in improving Arabic IR relevancy. We investigated the contributions of Stemming, Indexing, Query Expansion, Text Summarization (TS), Text Translation, and Named Entity Recognition (NER) in enhancing the relevancy of Arabic IR. Our survey emphasizing on the quantitative relevancy measurements provided in the surveyed publications. The paper shows that the researchers achieved significant enhancements especially in building accurate stemmers, with accuracy reaches 97%, and in measuring the impact of different indexing strategies. Query expansion and Text Translation showed positive relevancy effect. However, other tasks such as NER and TS still need more research to realize their impact on Arabic IR

University of Huddersfield Repository

AIS Electronic Library (AISeL)

Huddersfield Research Portal

Performance Analysis of Machine Learning Approaches in Automatic Classification of Arabic Language

Author: S. Alharithi Fahd
Publication venue: Arab Journals Platform
Publication date: 29/04/2023
Field of study

Text classification (TC) is a crucial subject. The number of digital files available on the internet is enormous. The goal of TC is to categorize texts into a series of predetermined groups. The number of studies conducted on the English database is significantly higher than the number of studies conducted on the Arabic database. Therefore, this research analyzes the performance of automatic TC of the Arabic language using Machine Learning (ML) approaches. Further, Single-label Arabic News Articles Datasets (SANAD) are introduced, which contain three different datasets, namely Akhbarona, Khaleej, and Arabiya. Initially, the collected texts are pre-processed in which tokenization and stemming occur. In this research, three kinds of stemming are employed, namely light stemming, Khoja stemming, and no- stemming, to evaluate the effect of the pre-processing technique on Arabic TC performance. Moreover, feature extraction and feature weighting are performed; in feature weighting, the term weighting process is completed by the term frequency- inverse document frequency (tf-idf) method. In addition, this research selects C4.5, Support Vector Machine (SVM), and Naïve Bayes (NB) as a classification algorithm. The results indicated that the SVM and NB methods had attained higher accuracy than the C4.5 method. NB achieved the maximum accuracy with a performance of 99.9%

Arab Journals Platform