Search CORE

3,283 research outputs found

Accurate Online Video Tagging via Probabilistic Hybrid Modeling

Author: CHUA Tat-Seng
SHEN Jialie
WANG Meng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2016
Field of study

Ministry of Education, Singapore under its Academic Research Funding Tier

Institutional Knowledge at Singapore Management University

Tag-Aware Recommender Systems: A State-of-the-art Survey

Author: A Capocci
A Clauset
A Gunawardana
A Hotho
AE Gelfand
AP Dempster
B Pittel
C Cattuto
C Cattuto
C Cattuto
C Liu
DM Blei
G Adomavicius
G Cimini
G Ghoshal
G Koutrika
G Linden
G Salton
GQ Zhang
J Scott
JA Hanley
JB Schafer
JL Herlocker
JM Kleinberg
JW Wang
K Tso
L Lathauwer De
L Lü
L Spiteri
LdaF Costa
M Dubinko
M Girvan
M Medo
MEJ Newman
MJ Pazzani
MS Shang
MS Shang
MS Shang
O Nov
P Kazienko
P Mika
P Resnick
P Resnick
P Wu
R Albert
R Lambiotte
S Boccaletti
S Brin
S Deerwester
SN Dorogovtsev
T Zhou
T Zhou
T Zhou
Tao Zhou
TG Kolda
V Zlatić
X Si
Y Ding
YC Zhang
Yi-Cheng Zhang
Z Huang
Zi-Ke Zhang
ZK Zhang
ZK Zhang
ZK Zhang
ZK Zhang
ZK Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/02/2012
Field of study

In the past decade, Social Tagging Systems have attracted increasing attention from both physical and computer science communities. Besides the underlying structure and dynamics of tagging systems, many efforts have been addressed to unify tagging information to reveal user behaviors and preferences, extract the latent semantic relations among items, make recommendations, and so on. Specifically, this article summarizes recent progress about tag-aware recommender systems, emphasizing on the contributions from three mainstream perspectives and approaches: network-based methods, tensor-based methods, and the topic-based methods. Finally, we outline some other tag-related works and future challenges of tag-aware recommendation algorithms.Comment: 19 pages, 3 figure

arXiv.org e-Print Archive

Crossref

RERO DOC Digital Library

A Survey of Location Prediction on Twitter

Author: Han Jialong
Sun Aixin
Zheng Xin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

Locations, e.g., countries, states, cities, and point-of-interests, are central to news, emergency events, and people's daily lives. Automatic identification of locations associated with or mentioned in documents has been explored for decades. As one of the most popular online social network platforms, Twitter has attracted a large number of users who send millions of tweets on daily basis. Due to the world-wide coverage of its users and real-time freshness of tweets, location prediction on Twitter has gained significant attention in recent years. Research efforts are spent on dealing with new challenges and opportunities brought by the noisy, short, and context-rich nature of tweets. In this survey, we aim at offering an overall picture of location prediction on Twitter. Specifically, we concentrate on the prediction of user home locations, tweet locations, and mentioned locations. We first define the three tasks and review the evaluation metrics. By summarizing Twitter network, tweet content, and tweet context as potential inputs, we then structurally highlight how the problems depend on these inputs. Each dependency is illustrated by a comprehensive review of the corresponding strategies adopted in state-of-the-art approaches. In addition, we also briefly review two related problems, i.e., semantic location prediction and point-of-interest recommendation. Finally, we list future research directions.Comment: Accepted to TKDE. 30 pages, 1 figur

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Integration of Computer Vision and Natural Language Processing in Multimedia Robotics Application

Author: El-Komy Amir
I. Taloba Ahmed
M. Abd El-Aziz Rasha
R. Shahin Osama
Publication venue: Arab Journals Platform
Publication date: 08/10/2022
Field of study

Computer vision and natural language processing (NLP) are two active machine learning research areas. However, the integration of these two areas gives rise to a new interdisciplinary field, which is currently attracting more attention of researchers. Research has been carried out to extract the text associated with an image or a video that can assist in making computer vision effective. Moreover, researchers focus on utilizing NLP to extract the meaning of words through the use of computer vision. This concept is widely used in robotics. Although robots should observe the surroundings from different ways of interactions, natural gestures and spoken languages are the most convenient way for humans to interact with the robots. This would be possible only if the robots can understand such types of interactions. In the present paper, the proposed integrated application is utilized for guiding vision-impaired people. As vision is the most essential in the life of a human being, an alternative source that helps in guiding the blind in their movements is highly important. For this purpose, the current paper uses a smartphone with the capabilities of vision, language, and intelligence which has been attached to the blind person to capture the images of their surroundings, and it is associated with a Faster Region Convolutional Neural Network (F-RCNN) based central server to detect the objects in the image to inform the person about them and avoid obstacles in their way. These results are passed to the smartphone which produces a speech output for the guidance of the blinds

Arab Journals Platform

Current Challenges and Visions in Music Recommender Systems Research

Author: Chen Ching-Wei
Deldjoo Yashar
Elahi Mehdi
Schedl Markus
Zamani Hamed
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/03/2018
Field of study

Music recommender systems (MRS) have experienced a boom in recent years, thanks to the emergence and success of online streaming services, which nowadays make available almost all music in the world at the user's fingertip. While today's MRS considerably help users to find interesting music in these huge catalogs, MRS research is still facing substantial challenges. In particular when it comes to build, incorporate, and evaluate recommendation strategies that integrate information beyond simple user--item interactions or content-based descriptors, but dig deep into the very essence of listener needs, preferences, and intentions, MRS research becomes a big endeavor and related publications quite sparse. The purpose of this trends and survey article is twofold. We first identify and shed light on what we believe are the most pressing challenges MRS research is facing, from both academic and industry perspectives. We review the state of the art towards solving these challenges and discuss its limitations. Second, we detail possible future directions and visions we contemplate for the further evolution of the field. The article should therefore serve two purposes: giving the interested reader an overview of current challenges in MRS research and providing guidance for young researchers by identifying interesting, yet under-researched, directions in the field

arXiv.org e-Print Archive

JKU | ePub

What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision

Author: Huang Jonathan
Johnston Nick
Malmaud Jonathan
Murphy Kevin
Rabinovich Andrew
Rathod Vivek
Publication venue
Publication date: 01/01/2015
Field of study

We present a novel method for aligning a sequence of instructions to a video of someone carrying out a task. In particular, we focus on the cooking domain, where the instructions correspond to the recipe. Our technique relies on an HMM to align the recipe steps to the (automatically generated) speech transcript. We then refine this alignment using a state-of-the-art visual food detector, based on a deep convolutional neural network. We show that our technique outperforms simpler techniques based on keyword spotting. It also enables interesting applications, such as automatically illustrating recipes with keyframes, and searching within a video for events of interest.Comment: To appear in NAACL 201

arXiv.org e-Print Archive

CiteSeerX

Crossref