Precursors and Laggards: An Analysis of Semantic Temporal Relationships on a Blog Network
We explore the hypothesis that it is possible to obtain information about the
dynamics of a blog network by analysing the temporal relationships between
blogs at a semantic level, and that this type of analysis adds to the knowledge
that can be extracted by studying the network only at the structural level of
URL links. We present an algorithm to automatically detect fine-grained
discussion topics, characterized by n-grams and time intervals. We then propose
a probabilistic model to estimate the temporal relationships that blogs have
with one another. We define the precursor score of blog A in relation to blog B
as the probability that A enters a new topic before B, discounting the effect
created by asymmetric posting rates. Network-level metrics of precursor and
laggard behavior are derived from these dyadic precursor score estimations.
This model is used to analyze a network of French political blogs. The scores
are compared to traditional link degree metrics. We obtain insights into the
dynamics of topic participation on this network, as well as the relationship
between precursor/laggard and linking behaviors. We validate and analyze
results with the help of an expert on the French blogosphere. Finally, we
propose possible applications to the improvement of search engine ranking
algorithms.
Survey on Link Prediction and Page Ranking In Blogs S.Geetha
This paper presents a study of the various aspects of link prediction and page ranking in blogs. Social networks have gained new prominence through social network analysis, a recent area of research that grew out of the social sciences as well as the exact sciences, aided by computing capacity for mathematical calculations and modelling that were previously impossible. An essential element of social media, particularly blogs, is the hyperlink graph that connects various pieces of content. Link prediction has many applications, including recommending new items in online networks (e.g., products on eBay and Amazon, and friends on Facebook), monitoring and preventing criminal activities in a criminal network, predicting the next web page users will visit, and completing missing links in automatic web data crawlers. PageRank is the technique used by Google to determine the importance of a page on the web. It treats every incoming link to a page as a vote for that page. Our findings provide an overview of social relations, and we address the problem of page ranking and link prediction in networked data, which appears in many applications such as network analysis and recommender systems. Keywords: web log, social network analysis, readership, link prediction, page ranking.
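The "incoming links as votes" idea in the abstract above can be illustrated with a minimal power-iteration sketch. This is not code from the surveyed paper: the function name `pagerank`, the damping factor of 0.85, the fixed iteration count, and the four-blog link graph are all assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Approximate PageRank by power iteration.

    `links` maps each page to the list of pages it links to.
    The damping factor and iteration count are illustrative assumptions.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page starts each round with the same small baseline score.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its vote evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its vote over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical blog link graph, invented for this example.
blog_links = {
    "blog_a": ["blog_b", "blog_c"],
    "blog_b": ["blog_c"],
    "blog_c": ["blog_a"],
    "blog_d": ["blog_c"],
}
ranks = pagerank(blog_links)
for name in sorted(ranks, key=ranks.get, reverse=True):
    print(name, round(ranks[name], 3))
```

In this toy graph, `blog_c` receives links from three blogs and ends up with the highest score, while `blog_d`, which nobody links to, keeps only the baseline share.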
Semi-Supervised Learning For Identifying Opinions In Web Content
Thesis (Ph.D.) - Indiana University, Information Science, 2011. Opinions published on the World Wide Web (Web) offer opportunities for detecting personal attitudes regarding topics, products, and services. The opinion detection literature indicates that both a large body of opinions and a wide variety of opinion features are essential for capturing subtle opinion information. Although a large amount of opinion-labeled data is preferable for opinion detection systems, opinion-labeled data is often limited, especially at sub-document levels, and manual annotation is tedious, expensive and error-prone. This shortage of opinion-labeled data is less challenging in some domains (e.g., movie reviews) than in others (e.g., blog posts). While a simple method for improving accuracy in challenging domains is to borrow opinion-labeled data from a non-target data domain, this approach often fails because of the domain transfer problem: Opinion detection strategies designed for one data domain generally do not perform well in another domain. However, while it is difficult to obtain opinion-labeled data, unlabeled user-generated opinion data are readily available. Semi-supervised learning (SSL) requires only limited labeled data to automatically label unlabeled data and has achieved promising results in various natural language processing (NLP) tasks, including traditional topic classification; but SSL has been applied in only a few opinion detection studies. This study investigates application of four different SSL algorithms in three types of Web content: edited news articles, semi-structured movie reviews, and the informal and unstructured content of the blogosphere. SSL algorithms are also evaluated for their effectiveness in sparse data situations and domain adaptation. Research findings suggest that, when there is limited labeled data, SSL is a promising approach for opinion detection in Web content.
Although the contributions of SSL varied across data domains, significant improvement was demonstrated for the most challenging data domain--the blogosphere--when a domain transfer-based SSL strategy was implemented.
BlogForever D2.6: Data Extraction Methodology
This report outlines an inquiry into the area of web data extraction, conducted within the context of blog preservation. The report reviews theoretical advances and practical developments for implementing data extraction. The inquiry is extended through an experiment that demonstrates the effectiveness and feasibility of implementing some of the suggested approaches. More specifically, the report discusses an approach based on unsupervised machine learning that employs the RSS feeds and HTML representations of blogs. It outlines the possibilities of extracting semantics available in blogs and demonstrates the benefits of exploiting available standards such as microformats and microdata. The report proceeds to propose a methodology for extracting and processing blog data to further inform the design and development of the BlogForever platform.
Opinion mining and sentiment analysis in marketing communications: a science mapping analysis in Web of Science (1998–2018)
Opinion mining and sentiment analysis has become ubiquitous in our society, with
applications in online searching, computer vision, image understanding, artificial intelligence and
marketing communications (MarCom). Within this context, opinion mining and sentiment analysis
in marketing communications (OMSAMC) has a strong role in the development of the field by
allowing us to understand whether people are satisfied or dissatisfied with our service or product
in order to subsequently analyze the strengths and weaknesses of those consumer experiences. To
the best of our knowledge, there is no science mapping analysis covering the research about opinion
mining and sentiment analysis in the MarCom ecosystem. In this study, we perform a science
mapping analysis on the OMSAMC research, in order to provide an overview of the scientific work
during the last two decades in this interdisciplinary area and to show trends that could be the basis
for future developments in the field. This study was carried out using VOSviewer, CitNetExplorer
and InCites based on results from Web of Science (WoS). The results of this analysis show the
evolution of the field, by highlighting the most notable authors, institutions, keywords,
publications, countries, categories and journals. The research was funded by Programa Operativo FEDER Andalucía 2014–2020, grant number “La
reputación de las organizaciones en una sociedad digital. Elaboración de una Plataforma Inteligente para la
Localización, Identificación y Clasificación de Influenciadores en los Medios Sociales Digitales
(UMA18-FEDERJA-148)”, and the APC was funded by the same research grant.
A META-ANALYTIC REVIEW OF SOCIAL MEDIA STUDIES
Social media such as social networking sites, blogs, micro-blogs, and wikis are increasingly and widely used in our daily lives. In the information systems (IS) discipline, social media have become a hot research area and have drawn the attention of many scholars. The paper systematically reviews social media studies published in the top 20 journals listed by the Association for Information Systems (AIS) from 2009 to 2013. The publication time, journal preferences, research objects and research topics are discussed. Generally, current social media studies cover four areas, namely user, management, technology and information. Each area has distinct focuses and topics. By thoroughly analyzing the research topics, the authors formulate projections and recommendations for future social media studies.
Early Prediction of Movie Box Office Success based on Wikipedia Activity Big Data
Use of socially generated "big data" to access information about collective
states of the minds in human societies has become a new paradigm in the
emerging field of computational social science. A natural application of this
would be the prediction of the society's reaction to a new product in the sense
of popularity and adoption rate. However, bridging the gap between "real time
monitoring" and "early predicting" remains a big challenge. Here we report on
an endeavor to build a minimalistic predictive model for the financial success
of movies based on collective activity data of online users. We show that the
popularity of a movie can be predicted much before its release by measuring and
analyzing the activity level of editors and viewers of the corresponding entry
to the movie in Wikipedia, the well-known online encyclopedia. Comment: 13 pages, including Supporting Information, 7 figures. Download the
dataset from: http://wwm.phy.bme.hu/SupplementaryDataS1.zi