2,856 research outputs found
Role of Matrix Factorization Model in Collaborative Filtering Algorithm: A Survey
Recommendation Systems apply Information Retrieval techniques to select the
online information relevant to a given user. Collaborative Filtering is
currently most widely used approach to build Recommendation System. CF
techniques uses the user behavior in form of user item ratings as their
information source for prediction. There are major challenges like sparsity of
rating matrix and growing nature of data which is faced by CF algorithms. These
challenges are been well taken care by Matrix Factorization. In this paper we
attempt to present an overview on the role of different MF model to address the
challenges of CF algorithms, which can be served as a roadmap for research in
this area.Comment: 8 pages, 1 figure in IJAFRC, Vol.1, Issue 12, December 201
Iterative Residual Rescaling: An Analysis and Generalization of LSI
We consider the problem of creating document representations in which
inter-document similarity measurements correspond to semantic similarity. We
first present a novel subspace-based framework for formalizing this task. Using
this framework, we derive a new analysis of Latent Semantic Indexing (LSI),
showing a precise relationship between its performance and the uniformity of
the underlying distribution of documents over topics. This analysis helps
explain the improvements gained by Ando's (2000) Iterative Residual Rescaling
(IRR) algorithm: IRR can compensate for distributional non-uniformity. A
further benefit of our framework is that it provides a well-motivated,
effective method for automatically determining the rescaling factor IRR depends
on, leading to further improvements. A series of experiments over various
settings and with several evaluation metrics validates our claims.Comment: To appear in the proceedings of SIGIR 2001. 11 page
Cross-Lingual Adaptation using Structural Correspondence Learning
Cross-lingual adaptation, a special case of domain adaptation, refers to the
transfer of classification knowledge between two languages. In this article we
describe an extension of Structural Correspondence Learning (SCL), a recently
proposed algorithm for domain adaptation, for cross-lingual adaptation. The
proposed method uses unlabeled documents from both languages, along with a word
translation oracle, to induce cross-lingual feature correspondences. From these
correspondences a cross-lingual representation is created that enables the
transfer of classification knowledge from the source to the target language.
The main advantages of this approach over other approaches are its resource
efficiency and task specificity.
We conduct experiments in the area of cross-language topic and sentiment
classification involving English as source language and German, French, and
Japanese as target languages. The results show a significant improvement of the
proposed method over a machine translation baseline, reducing the relative
error due to cross-lingual adaptation by an average of 30% (topic
classification) and 59% (sentiment classification). We further report on
empirical analyses that reveal insights into the use of unlabeled data, the
sensitivity with respect to important hyperparameters, and the nature of the
induced cross-lingual correspondences
- …