Search CORE

9,572 research outputs found

More blogging features for author identification

Author: Ahmed Amr
Mohtasseb Haytham
Publication venue
Publication date: 01/01/2009
Field of study

In this paper we present a novel improvement in the field of authorship identification in personal blogs. The improvement in authorship identification, in our work, is by utilizing a hybrid collection of linguistic features that best capture the style of users in diaries blogs. The features sets contain LIWC with its psychology background, a collection of syntactic features & part-of-speech (POS), and the misspelling errors features. Furthermore, we analyze the contribution of each feature set on the final result and compare the outcome of using different combination from the selected feature sets. Our new categorization of misspelling words which are mapped into numerical features, are noticeably enhancing the classification results. The paper also confirms the best ranges of several parameters that affect the final result of authorship identification such as the author numbers, words number in each post, and the number of documents/posts for each author/user. The results and evaluation show that the utilized features are compact, while their performance is highly comparable with other much larger feature sets

University of Lincoln Institutional Repository

CiteSeerX

Edge Hill University Research Information Repository

Hybrid Approach for Emotion Classification of Audio Conversation Based on Text and Speech Mining

Author: Bhaskar Jasmine
Nedungadi Prema
Sruthi K.
Publication venue: Published by Elsevier B.V.
Publication date: 31/12/2015
Field of study

AbstractOne of the greatest challenges in speech technology is estimating the speaker's emotion. Most of the existing approaches concentrate either on audio or text features. In this work, we propose a novel approach for emotion classification of audio conversation based on both speech and text. The novelty in this approach is in the choice of features and the generation of a single feature vector for classification. Our main intention is to increase the accuracy of emotion classification of speech by considering both audio and text features. In this work we use standard methods such as Natural Language Processing, Support Vector Machines, WordNet Affect and SentiWordNet. The dataset for this work have been taken from Semval -2007 and eNTERFACE’05 EMOTION Database

Elsevier - Publisher Connector

Role of sentiment classification in sentiment analysis: a survey

Author: J Prabhu
M R Pavan Kumar
Publication venue: Annals of Library and Information Studies (ALIS)
Publication date: 06/11/2018
Field of study

Through a survey of literature, the role of sentiment classification in sentiment analysis has been reviewed. The review identifies the research challenges involved in tackling sentiment classification. A total of 68 articles during 2015 – 2017 have been reviewed on six dimensions viz., sentiment classification, feature extraction, cross-lingual sentiment classification, cross-domain sentiment classification, lexica and corpora creation and multi-label sentiment classification. This study discusses the prominence and effects of sentiment classification in sentiment evaluation and a lot of further research needs to be done for productive results

Online Publishing @ NISCAIR

Line graphs as social networks

Author: A. Mańka-Krasoń
Apolloni
Balakrishnan
Barabasi
Bearman
Cohen
Dorogovtsev
Evans
Fortunato
Freeman
Girvan
Granovetter
Guimerá
Herring
K. Kułakowski
Kendall
Kitsak
L. Muchnik
Liben-Nowell
Liljeros
M.J. Krawczyk
Mańka-Krasoń
Mańka-Krasoń
Morris
Nacher
Nacher
Newman
Newman
Newman
Newman
Pastor-Satorras
Scott
Tyler
Wasserman
Watts
Whitney
Zhang
Publication venue: 'Elsevier BV'
Publication date: 12/10/2010
Field of study

The line graphs are clustered and assortative. They share these topological features with some social networks. We argue that this similarity reveals the cliquey character of the social networks. In the model proposed here, a social network is the line graph of an initial network of families, communities, interest groups, school classes and small companies. These groups play the role of nodes, and individuals are represented by links between these nodes. The picture is supported by the data on the LiveJournal network of about 8 x 10^6 people. In particular, sharp maxima of the observed data of the degree dependence of the clustering coefficient C(k) are associated with cliques in the social network.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

Crossref

Are Emotions Enumerable or Decomposable? And its Implications for Emotion Processing

Author: Chen Ying
Huang Chu-Ren
Lee Sophia Y. M.
Publication venue: City University of Hong Kong
Publication date: 01/01/2009
Field of study

PACLIC 23 / City University of Hong Kong / 3-5 December 200

Waseda University Repository

A survey of data mining techniques for social media analysis

Author: Adedoyin-Olowe Mariam
Gaber Mohamed Medhat
Stahl Frederic
Publication venue: Episciences
Publication date: 16/04/2014
Field of study

Social network has gained remarkable attention in the last decade. Accessing social network sites such as Twitter, Facebook LinkedIn and Google+ through the internet and the web 2.0 technologies has become more affordable. People are becoming more interested in and relying on social network for information, news and opinion of other users on diverse subject matters. The heavy reliance on social network sites causes them to generate massive data characterised by three computational issues namely; size, noise and dynamism. These issues often make social network data very complex to analyse manually, resulting in the pertinent use of computational means of analysing them. Data mining provides a wide range of techniques for detecting useful knowledge from massive datasets like trends, patterns and rules [44]. Data mining techniques are used for information retrieval, statistical modelling and machine learning. These techniques employ data pre-processing, data analysis, and data interpretation processes in the course of data analysis. This survey discusses different data mining techniques used in mining diverse aspects of the social network over decades going from the historical techniques to the up-to-date models, including our novel technique named TRCM. All the techniques covered in this survey are listed in the Table.1 including the tools employed as well as names of their authors

arXiv.org e-Print Archive

Central Archive at the University of Reading

Crossref

Social software for music

Author: Costa Cláudio Miguel Teixeira da
Publication venue
Publication date: 01/01/2009
Field of study

Tese de mestrado integrado. Engenharia Informática e Computação. Faculdade de Engenharia. Universidade do Porto. 200

Repositório Aberto da Universidade do Porto

Early Warning Analysis for Social Diffusion Events

Author: Colbaugh Richard
Glass Kristin
Publication venue
Publication date: 30/12/2012
Field of study

There is considerable interest in developing predictive capabilities for social diffusion processes, for instance to permit early identification of emerging contentious situations, rapid detection of disease outbreaks, or accurate forecasting of the ultimate reach of potentially viral ideas or behaviors. This paper proposes a new approach to this predictive analytics problem, in which analysis of meso-scale network dynamics is leveraged to generate useful predictions for complex social phenomena. We begin by deriving a stochastic hybrid dynamical systems (S-HDS) model for diffusion processes taking place over social networks with realistic topologies; this modeling approach is inspired by recent work in biology demonstrating that S-HDS offer a useful mathematical formalism with which to represent complex, multi-scale biological network dynamics. We then perform formal stochastic reachability analysis with this S-HDS model and conclude that the outcomes of social diffusion processes may depend crucially upon the way the early dynamics of the process interacts with the underlying network's community structure and core-periphery structure. This theoretical finding provides the foundations for developing a machine learning algorithm that enables accurate early warning analysis for social diffusion events. The utility of the warning algorithm, and the power of network-based predictive metrics, are demonstrated through an empirical investigation of the propagation of political memes over social media networks. Additionally, we illustrate the potential of the approach for security informatics applications through case studies involving early warning analysis of large-scale protests events and politically-motivated cyber attacks

arXiv.org e-Print Archive

Springer - Publisher Connector