6 research outputs found

    Sensor de reputação online: técnicas de aprendizagem automática para a deteção e classificação de opiniões na Web

    Get PDF
    As redes sociais são plataformas em larga escala onde pessoas de todo o mundo se podem conhecer, partilhar imagens e vídeos ou trocar opiniões. Saber as opiniões dos utilizadores que podem afetar a reputação de um produto ou serviço é uma das vantagens que as empresas podem retirar deste tipo de plataformas. O objetivo deste trabalho é apresentar um sistema com a capacidade de determinar, através de técnicas de aprendizagem automática, o sentimento de uma frase e respetivo impacto na afetação da reputação da entidade mencionada, classificando-o como positivo, negativo ou neutro. Este sistema foi desenvolvido na linguagem Python e utiliza recursos da ferramenta NLTK, como o reconhecimento de entidades (NE Chunk), o classificador gramatical (pos-tag) e os algoritmos para o classificador da polaridade de sentimentos (Naive Bayes, Decision Trees e SVM); Online Reputation Sensor: machine learning techniques for detection and classification of opinions in Web textual sources #### abstract: The social networks are large scale platforms where people around the world meet, share photos and videos and share opinions. Knowing people's opinions about a product or service is one of the advantages that companies can benefit from these type of plataforms. The purpose of this work is to present a system with the ability to predict, through machine learning techniques, the sense of a sentence and the respective reputation impact on the target entity, classifying it as negative, positive or neutral. This system was developed in Python and uses resources from NLTK framework, such as entity recognition (NE Chunk), the grammar classifier (pos-tag) and the algorithms used in system development (Naive Bayes, Decision Trees, and SVM)

    Learning domain-specific sentiment lexicons with applications to recommender systems

    Get PDF
    Search is now going beyond looking for factual information, and people wish to search for the opinions of others to help them in their own decision-making. Sentiment expressions or opinion expressions are used by users to express their opinion and embody important pieces of information, particularly in online commerce. The main problem that the present dissertation addresses is how to model text to find meaningful words that express a sentiment. In this context, I investigate the viability of automatically generating a sentiment lexicon for opinion retrieval and sentiment classification applications. For this research objective we propose to capture sentiment words that are derived from online users’ reviews. In this approach, we tackle a major challenge in sentiment analysis which is the detection of words that express subjective preference and domain-specific sentiment words such as jargon. To this aim we present a fully generative method that automatically learns a domain-specific lexicon and is fully independent of external sources. Sentiment lexicons can be applied in a broad set of applications, however popular recommendation algorithms have somehow been disconnected from sentiment analysis. Therefore, we present a study that explores the viability of applying sentiment analysis techniques to infer ratings in a recommendation algorithm. Furthermore, entities’ reputation is intrinsically associated with sentiment words that have a positive or negative relation with those entities. Hence, is provided a study that observes the viability of using a domain-specific lexicon to compute entities reputation. Finally, a recommendation system algorithm is improved with the use of sentiment-based ratings and entities reputation

    Understanding the topics and opinions from social media content

    Get PDF
    Social media has become one indispensable part of people’s daily life, as it records and reflects people’s opinions and events of interest, as well as influences people’s perceptions. As the most commonly employed and easily accessed data format on social media, a great deal of the social media textual content is not only factual and objective, but also rich in opinionated information. Thus, besides the topics Internet users are talking about in social media textual content, it is also of great importance to understand the opinions they are expressing. In this thesis, I present my broadly applicable text mining approaches, in order to understand the topics and opinions of user-generated texts on social media, to provide insights about the thoughts of Internet users on entities, events, etc. Specifically, I develop approaches to understand the semantic differences between language-specific editions of Wikipedia, when discussing certain entities from the related topical aspects perspective and the aggregated sentiment bias perspective. Moreover, I employ effective features to detect the reputation-influential sentences for person and company entities in Wikipedia articles, which lead to the detected sentiment bias. Furthermore, I propose neural network models with different levels of attention mechanism, to detect the stances of tweets towards any given target. I also introduce an online timeline generation approach, to detect and summarise the relevant sub-topics in the tweet stream, in order to provide Internet users with some insights about the evolution of major events they are interested in
    corecore