Detecting Large Concept Extensions for Conceptual Analysis

Author: C Dutilh Novaes
DJ Chalmers
DM Blei
F Jackson
KL Gwet
S Deerwester
S Haslanger
S Laurence
TL Griffiths
U Fayyad
Publication date: 18/06/2017

When performing a conceptual analysis of a concept, philosophers are interested in all forms of expression of that concept in a text, whether direct or indirect, explicit or implicit. In this paper, we experiment with topic-based methods of automating the detection of concept expressions in order to facilitate philosophical conceptual analysis. We propose six methods based on LDA and evaluate them on a new corpus of court decisions that we had annotated by experts and non-experts. Our results indicate that these methods can yield important improvements over the keyword heuristic, which is often used for concept detection in many contexts. While more work remains to be done, this indicates that detecting concepts through topics can serve as a general-purpose method for at least some forms of concept expression that are not captured by naive keyword approaches.

arXiv.org e-Print Archive

Crossref

Effect of Tuned Parameters on a LSA MCQ Answering Model

Author: A. C. Graesser
Alain Lifchitz
C. H. Q. Ding
D. I. Martin
G. Denhière
G. Salton
G. Salton
Guy Denhière
J. Diaz
J. Diaz
J. Quesada
M. Efron
M. F. Porter
M. W. Berry
S. Deerwester
S. T. Dumais
S. T. Dumais
Sandra Jhean-Larose
W. Kintsch
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

This paper presents the current state of a work in progress whose objective is to better understand the effects of the factors that significantly influence the performance of Latent Semantic Analysis (LSA). A difficult task, answering (French) biology Multiple Choice Questions, is used to test the semantic properties of the truncated singular space and to study the relative influence of the main parameters. Dedicated software has been designed to fine-tune the LSA semantic space for the Multiple Choice Questions task. With optimal parameters, the performance of our simple model is, quite surprisingly, equal or superior to that of 7th and 8th grade students. This indicates that the semantic spaces were quite good despite their low dimensions and the small sizes of the training data sets. In addition, we present an original entropy global weighting of the answers' terms for each question of the Multiple Choice Questions, which was necessary to achieve the model's success. Comment: 9 pages
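As a rough illustration of the ingredients this abstract names (a weighted term-document matrix, a truncated singular space, and similarity-based answer selection), here is a minimal sketch. It uses the standard log-entropy weighting from the LSA literature, not the paper's own answer-term weighting, which the abstract does not specify. The function names, the toy matrix, and the choice of projecting with the left singular vectors are all assumptions made for this example.

```python
import numpy as np

def log_entropy_weight(counts):
    """Standard log-entropy weighting of a term-by-document count matrix.

    Assumes at least two documents (columns), so that log(n_docs) > 0.
    """
    counts = np.asarray(counts, dtype=float)
    n_docs = counts.shape[1]
    row_sums = counts.sum(axis=1, keepdims=True)
    # p[i, j]: share of term i's occurrences that fall in document j
    p = np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)
    plogp = np.zeros_like(p)
    mask = p > 0
    plogp[mask] = p[mask] * np.log(p[mask])
    # Global weight: 1 for a term confined to one document, 0 for an evenly spread term
    global_w = 1.0 + plogp.sum(axis=1) / np.log(n_docs)
    return np.log1p(counts) * global_w[:, None]

def truncated_space(weighted, k):
    """Keep the k largest singular triplets of the weighted matrix."""
    u, s, vt = np.linalg.svd(weighted, full_matrices=False)
    return u[:, :k], s[:k], vt[:k, :]

def project(term_vector, u_k):
    """Project a weighted term vector onto the top-k left singular vectors."""
    return np.asarray(term_vector, dtype=float) @ u_k

def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def pick_answer(question_vec, answer_vecs, u_k):
    """Return the index of the answer closest to the question in the reduced space."""
    q = project(question_vec, u_k)
    scores = [cosine(q, project(a, u_k)) for a in answer_vecs]
    return int(np.argmax(scores))
```

In a setup like this, the number of retained dimensions `k` and the weighting scheme are the kind of parameters one would tune against a held-out set of questions.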

arXiv.org e-Print Archive

Decentralized learning with budgeted network load using Gaussian copulas and classifier ensembles

Author: AP Dawid
C Genest
DH Wolpert
ED Sontag
F Pedregosa
GB Giannakis
I Zezula
J Kittler
J Kittler
L Breiman
L Xu
LK Hansen
M Wozniak
OP Faugeras
S Deerwester
TK Ho
V Tresp
Y Freund
Y Koren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/03/2019

We examine a network of learners that address the same classification task but must learn from different data sets. The learners cannot share data; instead, they share their models. Models are shared only once in order to limit the network load. We introduce DELCO (Decentralized Ensemble Learning with COpulas), a new approach for aggregating the predictions of the classifiers trained by each learner. The proposed method aggregates the base classifiers using a probabilistic model that relies on Gaussian copulas. Experiments on ensembles of logistic regressors demonstrate competitive accuracy and increased robustness when the classifiers are dependent. A companion Python implementation can be downloaded at https://github.com/john-klein/DELC

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

UCL Discovery

Hal-Diderot

Looking at Vector Space and Language Models for IR using Density Matrices

Author: A Gleason
AI Lvovsky
B Piwowarski
C Carpineto
ChX Zhai
G Birkhoff
G Salton
G Zuccon
G Zuccon
J Rocchio
J Zobel
K Rijsbergen van
K Tsuda
M Melucci
M Melucci
M Melucci
M Melucci
MA Nielsen
MK Warmuth
S Deerwester
SKM Wong
T Hofmann
X Zhao
Publication date: 08/01/2014

In this work, we conduct a joint analysis of Vector Space Models (VSM) and Language Models (LM) for IR using the mathematical framework of Quantum Theory. We shed light on how both models allocate the space of density matrices. A density matrix is shown to be a general representational tool capable of leveraging the capabilities of both VSM and LM representations, thus paving the way for a new generation of retrieval models. We analyze the possible implications suggested by our findings. Comment: In Proceedings of Quantum Interaction 201

arXiv.org e-Print Archive

Crossref

Error threshold in optimal coding, numerical criteria and classes of universalities for complexity

Author: A. E. Allahverdyan
A. E. Allakhverdyan
C. H. Bennet
C. Tsallis
D. B. Saakian
D. B. Saakian
D. B. Saakian
D. Dhar
D. MacKay
E. J. Chaisson
E. T. Jayenes
H. Nishimori
I. Chisar
J. L. Cardy
J. P. Crutchfield
L. D. Landau
M. Eigen
M. Eigen
M. Eigen
M. Gell-Mann
M. Gell-Mann
M. Mitchell
N. Skantzos
R. Benzi
R. S. Ingarden
S. A. Kauffmann
S. Deerwester
T. K. Landauer
T. Koski
T. M. Cover
V. S. Pande
W. Hilberg
Publication venue: 'American Physical Society (APS)'
Publication date: 05/09/2004

The free energy of the Random Energy Model at the transition point between the ferromagnetic and spin glass phases is calculated. At this point, which is equivalent to the decoding error threshold in optimal codes, the free energy has finite-size corrections proportional to the square root of the number of degrees. The response of the magnetization to the ferromagnetic couplings is maximal when the magnetization equals one half. We give several criteria of complexity and define different universality classes. According to our classification, the lowest class of complexity contains random graphs, Markov models and hidden Markov models. At the next level is the Sherrington-Kirkpatrick spin glass, which is connected with neural network models. On a higher level are critical theories, the spin glass phase of the Random Energy Model, percolation, and self-organized criticality (SOC). The top-level class involves HOT design, the error threshold in optimal coding, language, and, possibly, financial markets. Living systems are also related to the last class. A concept of anti-resonance is suggested for complex systems. Comment: 17 pages

arXiv.org e-Print Archive

Crossref

Quantum Aspects of Semantic Analysis and Symbolic Artificial Intelligence

Author: Aerts D
Aerts D
Aerts D Czachor M
Bell J S
Bennett C H Brassard G
Bettelli S
Blei D M
Bush P
Deerwester S
Diederik Aerts
Griffiths T L
Hampton J
Hofmann T
Landauer T K
Landauer T K
Landauer T K
Lund K
Marek Czachor
Oemer B
Penrose R
Penrose R
Penrose R
Plate T A
Selinger P
Steyvers M Shiffrin R M Nelson D L
Widdows D Peters S R T Oehrle J Rogers
Publication venue: 'IOP Publishing'
Publication date: 19/02/2004

Modern approaches to semantic analysis, if reformulated as Hilbert-space problems, reveal formal structures known from quantum mechanics. A similar situation is found in distributed representations of cognitive structures developed for the purposes of neural networks. We take a closer look at the similarities and differences between these two fields and quantum information theory. Comment: version accepted in J. Phys. A (Letter to the Editor)

arXiv.org e-Print Archive

Crossref

The Emerging Scholarly Brain

Author: A Josang
A Lancichinetti
A Szalay
A-L Barabasi
A-L Barabasi
AJ Connolly
B Höldobler
C Alexander
CG Jung
CL Borgman
DO Hebb
EA Henneken
F Murtagh
J Bollen
J Bollen
J West
JA Baldwin
JD West
K Frisch von
KS Fu
L Leydesdorff
LL Thurstone
M Girvan
M Golay
M Rosvall
MEJ Newman
MJ Kurtz
MJ Kurtz
MJ Kurtz
MJ Kurtz
MJ Kurtz
O Stapledon
P Bonacich
P Ginsparg
PG Ossorio
PG Ossorio
PG Ossorio
PM Davis
PM Fitts
RJ Hanisch
S Brin
S Deerwester
S Fortunato
S Pinker
S Pinker
Y Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/08/2010

It is now a commonplace observation that human society is becoming a coherent super-organism, and that the information infrastructure forms its emerging brain. Perhaps, as the underlying technologies are likely to become billions of times more powerful than those we have today, we could say that we are now building the lizard brain for the future organism. Comment: to appear in Future Professional Communication in Astronomy-II (FPCA-II) editors A. Heck and A. Accomazz

arXiv.org e-Print Archive

Crossref

Semantic Structuring and Visual Querying of Document Abstracts in Digital Libraries

Author: C. Tresp
C.J. Rijsbergen van
K. Lagus
L.A. Zadeh
M. Schröder
S. Deerwester
T. Kohonen
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Kernel Spectral Clustering and applications

In this chapter we review the main literature related to kernel spectral clustering (KSC), an approach to clustering cast within a kernel-based optimization setting. KSC represents a least-squares support vector machine based formulation of spectral clustering described by a weighted kernel PCA objective. Just as in the classifier case, the binary clustering model is expressed by a hyperplane in a high dimensional space induced by a kernel. In addition, the multi-way clustering can be obtained by combining a set of binary decision functions via an Error Correcting Output Codes (ECOC) encoding scheme. Because of its model-based nature, the KSC method encompasses three main steps: training, validation, testing. In the validation stage model selection is performed to obtain tuning parameters, like the number of clusters present in the data. This is a major advantage compared to classical spectral clustering where the determination of the clustering parameters is unclear and relies on heuristics. Once a KSC model is trained on a small subset of the entire data, it is able to generalize well to unseen test points. Beyond the basic formulation, sparse KSC algorithms based on the Incomplete Cholesky Decomposition (ICD) and L_0, L_1, L_0 + L_1, Group Lasso regularization are reviewed. In that respect, we show how it is possible to handle large scale data. Also, two possible ways to perform hierarchical clustering and a soft clustering method are presented. Finally, real-world applications such as image segmentation, power load time-series clustering, document clustering and big data learning are considered. Comment: chapter contribution to the book "Unsupervised Learning Algorithms"

arXiv.org e-Print Archive

Crossref

Tag-Aware Recommender Systems: A State-of-the-art Survey

Author: A Capocci
A Clauset
A Gunawardana
A Hotho
AE Gelfand
AP Dempster
B Pittel
C Cattuto
C Cattuto
C Cattuto
C Liu
DM Blei
G Adomavicius
G Cimini
G Ghoshal
G Koutrika
G Linden
G Salton
GQ Zhang
J Scott
JA Hanley
JB Schafer
JL Herlocker
JM Kleinberg
JW Wang
K Tso
L Lathauwer De
L Lü
L Spiteri
LdaF Costa
M Dubinko
M Girvan
M Medo
MEJ Newman
MJ Pazzani
MS Shang
MS Shang
MS Shang
O Nov
P Kazienko
P Mika
P Resnick
P Resnick
P Wu
R Albert
R Lambiotte
S Boccaletti
S Brin
S Deerwester
SN Dorogovtsev
T Zhou
T Zhou
T Zhou
Tao Zhou
TG Kolda
V Zlatić
X Si
Y Ding
YC Zhang
Yi-Cheng Zhang
Z Huang
Zi-Ke Zhang
ZK Zhang
ZK Zhang
ZK Zhang
ZK Zhang
ZK Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/02/2012

In the past decade, Social Tagging Systems have attracted increasing attention from both the physical and computer science communities. Besides the underlying structure and dynamics of tagging systems, much effort has been devoted to unifying tagging information in order to reveal user behaviors and preferences, extract the latent semantic relations among items, make recommendations, and so on. Specifically, this article summarizes recent progress on tag-aware recommender systems, emphasizing the contributions of three mainstream perspectives and approaches: network-based methods, tensor-based methods, and topic-based methods. Finally, we outline some other tag-related work and the future challenges of tag-aware recommendation algorithms. Comment: 19 pages, 3 figures

arXiv.org e-Print Archive

Crossref

RERO DOC Digital Library