110,301 research outputs found
Using Ontologies for the Design of Data Warehouses
Obtaining an implementation of a data warehouse is a complex task that forces
designers to acquire wide knowledge of the domain, thus requiring a high level
of expertise and becoming it a prone-to-fail task. Based on our experience, we
have detected a set of situations we have faced up with in real-world projects
in which we believe that the use of ontologies will improve several aspects of
the design of data warehouses. The aim of this article is to describe several
shortcomings of current data warehouse design approaches and discuss the
benefit of using ontologies to overcome them. This work is a starting point for
discussing the convenience of using ontologies in data warehouse design.Comment: 15 pages, 2 figure
Image Semantics in the Description and Categorization of Journalistic Photographs
This paper reports a study on the description and categorization of images. The aim of the study was to evaluate existing indexing frameworks in the context of reportage photographs and to find out how the use of this particular image genre influences the results. The effect of different tasks on image description and categorization was also studied. Subjects performed keywording and free description tasks and the elicited terms were classified using the most extensive one of the reviewed frameworks. Differences were found in the terms used in constrained and unconstrained descriptions. Summarizing terms such as abstract concepts, themes, settings and emotions were
used more frequently in keywording than in free description. Free descriptions included more terms referring to locations within the images, people and descriptive terms due to the narrative form the subjects used without prompting. The evaluated framework was found to lack some syntactic and semantic classes present in the data and modifications were suggested. According to the results of this study image categorization is based on high-level interpretive concepts,
including affective and abstract themes. The results indicate that image genre influences categorization and keywording modifies and truncates natural image description
On multidimensional item response theory -- a coordinate free approach
A coordinate system free definition of complex structure multidimensional
item response theory (MIRT) for dichotomously scored items is presented. The
point of view taken emphasizes the possibilities and subtleties of
understanding MIRT as a multidimensional extension of the ``classical''
unidimensional item response theory models. The main theorem of the paper is
that every monotonic MIRT model looks the same; they are all trivial extensions
of univariate item response theory.Comment: Published at http://dx.doi.org/10.1214/07-EJS016 in the Electronic
Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Multidimensional Urban Segregation - Toward A Neural Network Measure
We introduce a multidimensional, neural-network approach to reveal and
measure urban segregation phenomena, based on the Self-Organizing Map algorithm
(SOM). The multidimensionality of SOM allows one to apprehend a large number of
variables simultaneously, defined on census or other types of statistical
blocks, and to perform clustering along them. Levels of segregation are then
measured through correlations between distances on the neural network and
distances on the actual geographical map. Further, the stochasticity of SOM
enables one to quantify levels of heterogeneity across census blocks. We
illustrate this new method on data available for the city of Paris.Comment: NCAA S.I. WSOM+ 201
An Unsupervised Approach for Automatic Activity Recognition based on Hidden Markov Model Regression
Using supervised machine learning approaches to recognize human activities
from on-body wearable accelerometers generally requires a large amount of
labelled data. When ground truth information is not available, too expensive,
time consuming or difficult to collect, one has to rely on unsupervised
approaches. This paper presents a new unsupervised approach for human activity
recognition from raw acceleration data measured using inertial wearable
sensors. The proposed method is based upon joint segmentation of
multidimensional time series using a Hidden Markov Model (HMM) in a multiple
regression context. The model is learned in an unsupervised framework using the
Expectation-Maximization (EM) algorithm where no activity labels are needed.
The proposed method takes into account the sequential appearance of the data.
It is therefore adapted for the temporal acceleration data to accurately detect
the activities. It allows both segmentation and classification of the human
activities. Experimental results are provided to demonstrate the efficiency of
the proposed approach with respect to standard supervised and unsupervised
classification approache
- …