701 research outputs found
Literature Survey on Interplay of Topics, Information Diffusion and Connections on Social Networks
Researchers have attempted to model information diffusion and topic trends
and lifecycle on online social networks. They have investigated the role of
content, social connections and communities, familiarity and behavioral
similarity in this context. The current article presents a survey of
representative models that perform topic analysis, capture information
diffusion, and explore the properties of social connections in the context of
online social networks. The article concludes with a set of outlines of open
problems and possible directions of future research interest. This article is
intended for researchers to identify the current literature, and explore
possibilities to improve the art
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
Deep learning has led to tremendous advancements in the field of Artificial
Intelligence. One caveat however is the substantial amount of compute needed to
train these deep learning models. Training a benchmark dataset like ImageNet on
a single machine with a modern GPU can take upto a week, distributing training
on multiple machines has been observed to drastically bring this time down.
Recent work has brought down ImageNet training time to a time as low as 4
minutes by using a cluster of 2048 GPUs. This paper surveys the various
algorithms and techniques used to distribute training and presents the current
state of the art for a modern distributed training framework. More
specifically, we explore the synchronous and asynchronous variants of
distributed Stochastic Gradient Descent, various All Reduce gradient
aggregation strategies and best practices for obtaining higher throughout and
lower latency over a cluster such as mixed precision training, large batch
training and gradient compression.Comment: 14 page
Topic Lifecycle on Social Networks: Analyzing the Effects of Semantic Continuity and Social Communities
Topic lifecycle analysis on Twitter, a branch of study that investigates
Twitter topics from their birth through lifecycle to death, has gained immense
mainstream research popularity. In the literature, topics are often treated as
one of (a) hashtags (independent from other hashtags), (b) a burst of keywords
in a short time span or (c) a latent concept space captured by advanced text
analysis methodologies, such as Latent Dirichlet Allocation (LDA). The first
two approaches are not capable of recognizing topics where different users use
different hashtags to express the same concept (semantically related), while
the third approach misses out the user's explicit intent expressed via
hashtags. In our work, we use a word embedding based approach to cluster
different hashtags together, and the temporal concurrency of the hashtag
usages, thus forming topics (a semantically and temporally related group of
hashtags).We present a novel analysis of topic lifecycles with respect to
communities. We characterize the participation of social communities in the
topic clusters, and analyze the lifecycle of topic clusters with respect to
such participation. We derive first-of-its-kind novel insights with respect to
the complex evolution of topics over communities and time: temporal morphing of
topics over hashtags within communities, how the hashtags die in some
communities but morph into some other hashtags in some other communities (that,
it is a community-level phenomenon), and how specific communities adopt to
specific hashtags. Our work is fundamental in the space of topic lifecycle
modeling and understanding in communities: it redefines our understanding of
topic lifecycles and shows that the social boundaries of topic lifecycles are
deeply ingrained with community behavior.Comment: 12 pages, 5 figures (13 figures if sub-figures are counted
separately), To Appear in ECIR 201
Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures
This paper explores the social quality (goodness) of community structures
formed across Twitter users, where social links within the structures are
estimated based upon semantic properties of user-generated content (corpus). We
examined the overlap of the community structures of the constructed graphs, and
followership-based social communities, to find the social goodness of the links
constructed. Unigram, bigram and LDA content models were empirically
investigated for evaluation of effectiveness, as approximators of underlying
social graphs, such that they maintain the {\it community} social property.
Impact of content at varying granularities, for the purpose of predicting links
while retaining the social community structures, was investigated. 100
discussion topics, spanning over 10 Twitter events, were used for experiments.
The unigram language model performed the best, indicating strong similarity of
word usage within deeply connected social communities. This observation agrees
with the phenomenon of evolution of word usage behavior, that transform
individuals belonging to the same community tending to choose the same words,
made by Danescu et al. (2013), and raises a question on the literature that
use, without validation, LDA for content-based social link prediction over
other content models. Also, semantically finer-grained content was observed to
be more effective compared to coarser-grained content
EmTaggeR: A Word Embedding Based Novel Method for Hashtag Recommendation on Twitter
The hashtag recommendation problem addresses recommending (suggesting) one or
more hashtags to explicitly tag a post made on a given social network platform,
based upon the content and context of the post. In this work, we propose a
novel methodology for hashtag recommendation for microblog posts, specifically
Twitter. The methodology, EmTaggeR, is built upon a training-testing framework
that builds on the top of the concept of word embedding. The training phase
comprises of learning word vectors associated with each hashtag, and deriving a
word embedding for each hashtag. We provide two training procedures, one in
which each hashtag is trained with a separate word embedding model applicable
in the context of that hashtag, and another in which each hashtag obtains its
embedding from a global context. The testing phase constitutes computing the
average word embedding of the test post, and finding the similarity of this
embedding with the known embeddings of the hashtags. The tweets that contain
the most-similar hashtag are extracted, and all the hashtags that appear in
these tweets are ranked in terms of embedding similarity scores. The top-K
hashtags that appear in this ranked list, are recommended for the given test
post. Our system produces F1 score of 50.83%, improving over the LDA baseline
by around 6.53 times, outperforming the best-performing system known in the
literature that provides a lift of 6.42 times. EmTaggeR is a fast, scalable and
lightweight system, which makes it practical to deploy in real-life
applications.Comment: Accepted at the IEEE International Conference on Data Mining (ICDM)
2017 ACUMEN Worksho
A Survey of Modern Object Detection Literature using Deep Learning
Object detection is the identification of an object in the image along with
its localisation and classification. It has wide spread applications and is a
critical component for vision based software systems. This paper seeks to
perform a rigorous survey of modern object detection algorithms that use deep
learning. As part of the survey, the topics explored include various
algorithms, quality metrics, speed/size trade offs and training methodologies.
This paper focuses on the two types of object detection algorithms- the SSD
class of single step detectors and the Faster R-CNN class of two step
detectors. Techniques to construct detectors that are portable and fast on low
powered devices are also addressed by exploring new lightweight convolutional
base architectures. Ultimately, a rigorous review of the strengths and
weaknesses of each detector leads us to the present state of the art
Automated Test Generation to Detect Individual Discrimination in AI Models
Dependability on AI models is of utmost importance to ensure full acceptance
of the AI systems. One of the key aspects of the dependable AI system is to
ensure that all its decisions are fair and not biased towards any individual.
In this paper, we address the problem of detecting whether a model has an
individual discrimination. Such a discrimination exists when two individuals
who differ only in the values of their protected attributes (such as,
gender/race) while the values of their non-protected ones are exactly the same,
get different decisions. Measuring individual discrimination requires an
exhaustive testing, which is infeasible for a non-trivial system. In this
paper, we present an automated technique to generate test inputs, which is
geared towards finding individual discrimination. Our technique combines the
well-known technique called symbolic execution along with the local
explainability for generation of effective test cases. Our experimental results
clearly demonstrate that our technique produces 3.72 times more successful test
cases than the existing state-of-the-art across all our chosen benchmarks
Topical Stance Detection for Twitter: A Two-Phase LSTM Model Using Attention
The topical stance detection problem addresses detecting the stance of the
text content with respect to a given topic: whether the sentiment of the given
text content is in FAVOR of (positive), is AGAINST (negative), or is NONE
(neutral) towards the given topic. Using the concept of attention, we develop a
two-phase solution. In the first phase, we classify subjectivity - whether a
given tweet is neutral or subjective with respect to the given topic. In the
second phase, we classify sentiment of the subjective tweets (ignoring the
neutral tweets) - whether a given subjective tweet has a FAVOR or AGAINST
stance towards the topic. We propose a Long Short-Term memory (LSTM) based deep
neural network for each phase, and embed attention at each of the phases. On
the SemEval 2016 stance detection Twitter task dataset, we obtain a best-case
macro F-score of 68.84% and a best-case accuracy of 60.2%, outperforming the
existing deep learning based solutions. Our framework, T-PAN, is the first in
the topical stance detection literature, that uses deep learning within a
two-phase architecture.Comment: Accepted at the 40th European Conference on Information Retrieval
(ECIR), 201
Gaze-based Autism Detection for Adolescents and Young Adults using Prosaic Videos
Autism often remains undiagnosed in adolescents and adults. Prior research
has indicated that an autistic individual often shows atypical fixation and
gaze patterns. In this short paper, we demonstrate that by monitoring a user's
gaze as they watch commonplace (i.e., not specialized, structured or coded)
video, we can identify individuals with autism spectrum disorder. We recruited
35 autistic and 25 non-autistic individuals, and captured their gaze using an
off-the-shelf eye tracker connected to a laptop. Within 15 seconds, our
approach was 92.5% accurate at identifying individuals with an autism
diagnosis. We envision such automatic detection being applied during e.g., the
consumption of web media, which could allow for passive screening and
adaptation of user interfaces
AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias
Fairness is an increasingly important concern as machine learning models are
used to support decision making in high-stakes applications such as mortgage
lending, hiring, and prison sentencing. This paper introduces a new open source
Python toolkit for algorithmic fairness, AI Fairness 360 (AIF360), released
under an Apache v2.0 license {https://github.com/ibm/aif360). The main
objectives of this toolkit are to help facilitate the transition of fairness
research algorithms to use in an industrial setting and to provide a common
framework for fairness researchers to share and evaluate algorithms.
The package includes a comprehensive set of fairness metrics for datasets and
models, explanations for these metrics, and algorithms to mitigate bias in
datasets and models. It also includes an interactive Web experience
(https://aif360.mybluemix.net) that provides a gentle introduction to the
concepts and capabilities for line-of-business users, as well as extensive
documentation, usage guidance, and industry-specific tutorials to enable data
scientists and practitioners to incorporate the most appropriate tool for their
problem into their work products. The architecture of the package has been
engineered to conform to a standard paradigm used in data science, thereby
further improving usability for practitioners. Such architectural design and
abstractions enable researchers and developers to extend the toolkit with their
new algorithms and improvements, and to use it for performance benchmarking. A
built-in testing infrastructure maintains code quality.Comment: 20 page
- …