345 research outputs found
Trustworthiness in Social Big Data Incorporating Semantic Analysis, Machine Learning and Distributed Data Processing
This thesis presents several state-of-the-art approaches constructed for the purpose of (i) studying the trustworthiness of users in Online Social Network platforms, (ii) deriving concealed knowledge from their textual content, and (iii) classifying and predicting the domain knowledge of users and their content. The developed approaches are refined through proof-of-concept experiments, several benchmark comparisons, and appropriate and rigorous evaluation metrics to verify and validate their effectiveness and efficiency, and hence, those of the applied frameworks
Giving meaning to tweets in emergency situations: a semantic approach for filtering and visualizing social data
In this paper, we propose a semantic approach for monitoring information publishedon social networks about a specific event. In the era of Big Data, when an emergencyoccurs information posted on social networks becomes more and more helpful foremergency operators. As direct witnesses of the situation, people share photos, videosor text messages about events that call their attention. In the emergency operationcenter, these data can be collected and integrated within the management processto improve the overall understanding of the situation and in particular of the citizenreactions. To support the tracking and analyzing of social network activities, there arealready monitoring tools that combine visualization techniques with geographicalmaps. However, tweets are written from the perspective of citizens and the informationthey provide might be inaccurate, irrelevant or false. Our approach tries to dealwith data relevance proposing an innovative ontology-based method for filteringtweets and extracting meaningful topics depending on their semantic content. In thisway data become relevant for the operators to make decisions. Two real cases used totest its applicability showed that different visualization techniques might be neededto support situation awareness. This ontology-based approach can be generalizedfor analyzing the information flow about other domains of application changing theunderlying knowledge base.This work is supported by the project emerCien grant funded by the Spanish Ministry of Economy and Competitivity (TIN2012-09687)
Novel platform for topic group mining, crowd opinion analysis and opinion leader identification in on-line social network platforms
In recent years, topic group mining and massive crowd opinion analysis from on-line social network platforms have become some of the most important tasks not only in research area but also in industry. Systems of this sort can identify similar topics from a very large dataset, group them together based on the topic, and analyse the inclination of the content's owner. To solve this problem, which involves research from a number of different areas, an integrated platform needs to be proposed.
Most community mining techniques treat the network as a graph where nodes represent users and edges reflect user relationship between two users. One obvious drawback of these approaches is that it can only utilise the explicit user relationships provided by on-line social network platforms. All other possible relationships will be ignored. Some on-line social network platforms restrict the length of content a user can publish. This causes traditional document clustering methods to perform poorly. Meanwhile, the restriction of content length also affects opinion mining performance since most content lacks contextual features. Hence, other context features that are not immediately or obviously related need to be investigated to improve performance in user inclination classification.
This research proposes a novel three layered platform. Two core technologies of the platform are topic group mining and user inclination analysis. The integrated approach was evaluated by a series of experiments to examine each core technology. The results indicate that the proposed integrated platform is able to produce the following results. 1) Scores up to 0.82 by V-measure evaluation function in topic group mining. 2) High accuracy rate in inclination mining. 3) A flexible and adaptable platform design which can accommodate different on-line social networks easily
Recommended from our members
Crowdsourced Data Mining for Urban Activity: A Review of Data Sources, Applications and Methods
The penetration of devices integrated with location-based services and internet services has generated massive data about the everyday life of citizens and tracked their activities happening in cities. Crowdsourced data, such as social media data, POIs data and collaborative websites, generated by the crowd, has become fine-grained proxy data of urban activity and widely used in research in urban studies. However, due to the heterogeneity of data types of crowdsourced data and the limitation of previous studies mainly focusing on a specific application, a systematic review of crowdsourced data mining for urban activity is still lacking. In order to fill the gap, this paper conducts a literature search in the Web of Science database, selecting 226 highly related papers published between 2013 and 2019. Based on those papers, the review firstly conducts a bibliometric analysis identifying underpinning domains, pivot scholars and papers around this topic. The review also synthesises previous research into three parts: main applications of different data sources and data fusion; application of spatial analysis in mobility patterns, functional areas and event detection; application of socio-demographic and perception analysis in city attractiveness, demographic characteristics and sentiment analysis. The challenges of this type of data are also discussed in the end. This study provides a systematic and current review for both researchers and practitioners interested in the applications of crowdsourced data mining for urban activity.This research is funded by a scholarship from the China Scholarship Counci
- …