Sentiment Lexicon Adaptation with Context and Semantics for the Social Web
Sentiment analysis over social streams offers governments and organisations a fast and effective way to monitor the public's feelings towards policies, brands, business, etc. General-purpose sentiment lexicons have been used to compute sentiment from social streams, since they are simple and effective. They calculate the overall sentiment of texts by using a general collection of words with predetermined sentiment orientation and strength. However, words' sentiment often varies with the contexts in which they appear, and new words might be encountered that are not covered by the lexicon, particularly in social media environments where content emerges and changes rapidly and constantly. In this paper, we propose a lexicon adaptation approach that uses contextual as well as semantic information extracted from DBPedia to update the words' weighted sentiment orientations and to add new words to the lexicon. We evaluate our approach on three different Twitter datasets, and show that enriching the lexicon with contextual and semantic information improves sentiment computation by 3.4% in average accuracy, and by 2.8% in average F1 measure.
An Approach to Web-Scale Named-Entity Disambiguation
We present a multi-pass clustering approach to large-scale, wide-scope named-entity disambiguation (NED) on collections of web pages. Our approach uses name co-occurrence information to cluster and hence disambiguate entities, and is designed to handle NED on the entire web. We show that on web collections, NED becomes increasingly difficult as the corpus size increases, not only because of the challenge of scaling the NED algorithm, but also because new and surprising facets of entities become visible in the data. This effect limits the potential benefits for data-driven approaches of processing larger data-sets, and suggests that efficient clustering-based disambiguation methods for the web will require extracting more specialized information from documents.
A Conversation with Shoutir Kishore Chatterjee
Shoutir Kishore Chatterjee was born in Ranchi, a small hill station in India,
on November 6, 1934. He received his B.Sc. in statistics from the Presidency
College, Calcutta, in 1954, and M.Sc. and Ph.D. degrees in statistics from the
University of Calcutta in 1956 and 1962, respectively. He was appointed a
lecturer in the Department of Statistics, University of Calcutta, in 1960 and
was a member of its faculty until his retirement as a professor in 1997.
Indeed, from the 1970s he steered the teaching and research activities of the
department for the next three decades. Professor Chatterjee was the National
Lecturer in Statistics (1985--1986) of the University Grants Commission, India,
the President of the Section of Statistics of the Indian Science Congress
(1989) and an Emeritus Scientist (1997--2000) of the Council of Scientific and
Industrial Research, India. Professor Chatterjee, affectionately known as SKC
to his students and admirers, is a truly exceptional person who embodies the
spirit of eternal India. He firmly believes that ``fulfillment in man's life
does not come from amassing a lot of money, after the threshold of what is
required for achieving a decent living is crossed. It does not come even from
peer recognition for intellectual achievements. Of course, one has to work and
toil a lot before one realizes these facts.''
Comment: Published in at http://dx.doi.org/10.1214/088342306000000565 the
Statistical Science (http://www.imstat.org/sts/) by the Institute of
Mathematical Statistics (http://www.imstat.org