427 research outputs found

    What Can The Study Of Representation Tell Us About Learning?

    Get PDF

    Representation And Learning

    Get PDF

    Using Linguistic Features to Estimate Suicide Probability of Chinese Microblog Users

    Full text link
    If people with high risk of suicide can be identified through social media like microblog, it is possible to implement an active intervention system to save their lives. Based on this motivation, the current study administered the Suicide Probability Scale(SPS) to 1041 weibo users at Sina Weibo, which is a leading microblog service provider in China. Two NLP (Natural Language Processing) methods, the Chinese edition of Linguistic Inquiry and Word Count (LIWC) lexicon and Latent Dirichlet Allocation (LDA), are used to extract linguistic features from the Sina Weibo data. We trained predicting models by machine learning algorithm based on these two types of features, to estimate suicide probability based on linguistic features. The experiment results indicate that LDA can find topics that relate to suicide probability, and improve the performance of prediction. Our study adds value in prediction of suicidal probability of social network users with their behaviors

    Building data warehouses in the era of big data: an approach for scalable and flexible big data warehouses

    Get PDF
    During the last few years, the concept of Big Data Warehousing gained significant attention from the scientific community, highlighting the need to make design changes to the traditional Data Warehouse (DW) due to its limitations, in order to achieve new characteristics relevant in Big Data contexts (e.g., scalability on commodity hardware, real-time performance, and flexible storage). The state-of-the-art in Big Data Warehousing reflects the young age of the concept, as well as ambiguity and the lack of common approaches to build Big Data Warehouses (BDWs). Consequently, an approach to design and implement these complex systems is of major relevance to business analytics researchers and practitioners. In this tutorial, the design and implementation of BDWs is targeted, in order to present a general approach that researchers and practitioners can follow in their Big Data Warehousing projects, exploring several demonstration cases focusing on system design and data modelling examples in areas like smart cities, retail, finance, manufacturing, among others

    Towards a Social Trust-Aware Recommender for Teachers

    Get PDF
    Fazeli, S., Drachsler, H., Brouns, F., & Sloep, P. B. (2014). Towards a Social Trust-aware Recommender for Teachers. In N. Manouselis, H. Drachsler, K. Verbert & O. C. Santos (Eds.), Recommender Systems for Technology Enhanced Learning (pp. 177-194): Springer New York.Online communities and networked learning provide teachers with social learning opportunities, allowing them to interact and collaborate with others in order to develop their personal and professional skills. However, with the large number of learning resources produced everyday, teachers need to find out what are the most suitable ones for them. In this paper, we introduce recommender systems as a potential solution to this . The setting is the Open Discovery Space (ODS) project. Unfortunately, due to the sparsity of the educational datasets most educational recommender systems cannot make accurate recommendations. To overcome this problem, we propose to enhance a trust-based recommender algorithm with social data obtained from monitoring the activities of teachers within the ODS platform. In this article, we outline the re-quirements of the ODS recommender system based on experiences reported in related TEL recommender system studies. In addition, we provide empirical ev-idence from a survey study with stakeholders of the ODS project to support the requirements identified from a literature study. Finally, we present an agenda for further research intended to find out which recommender system should ul-timately be deployed in the ODS platform.NELLL, EU 7th framework Open Discovery Spac

    Network Archaeology: Uncovering Ancient Networks from Present-day Interactions

    Get PDF
    Often questions arise about old or extinct networks. What proteins interacted in a long-extinct ancestor species of yeast? Who were the central players in the Last.fm social network 3 years ago? Our ability to answer such questions has been limited by the unavailability of past versions of networks. To overcome these limitations, we propose several algorithms for reconstructing a network's history of growth given only the network as it exists today and a generative model by which the network is believed to have evolved. Our likelihood-based method finds a probable previous state of the network by reversing the forward growth model. This approach retains node identities so that the history of individual nodes can be tracked. We apply these algorithms to uncover older, non-extant biological and social networks believed to have grown via several models, including duplication-mutation with complementarity, forest fire, and preferential attachment. Through experiments on both synthetic and real-world data, we find that our algorithms can estimate node arrival times, identify anchor nodes from which new nodes copy links, and can reveal significant features of networks that have long since disappeared.Comment: 16 pages, 10 figure

    Beyond User-to-User Access Control for Online Social Networks

    Full text link

    A meta-analysis of state-of-the-art electoral prediction from Twitter data

    Full text link
    Electoral prediction from Twitter data is an appealing research topic. It seems relatively straightforward and the prevailing view is overly optimistic. This is problematic because while simple approaches are assumed to be good enough, core problems are not addressed. Thus, this paper aims to (1) provide a balanced and critical review of the state of the art; (2) cast light on the presume predictive power of Twitter data; and (3) depict a roadmap to push forward the field. Hence, a scheme to characterize Twitter prediction methods is proposed. It covers every aspect from data collection to performance evaluation, through data processing and vote inference. Using that scheme, prior research is analyzed and organized to explain the main approaches taken up to date but also their weaknesses. This is the first meta-analysis of the whole body of research regarding electoral prediction from Twitter data. It reveals that its presumed predictive power regarding electoral prediction has been rather exaggerated: although social media may provide a glimpse on electoral outcomes current research does not provide strong evidence to support it can replace traditional polls. Finally, future lines of research along with a set of requirements they must fulfill are provided.Comment: 19 pages, 3 table

    Uncertainty estimation for operational ocean forecast products-a multi-model ensemble for the North Sea and the Baltic Sea

    Get PDF
    Multi-model ensembles for sea surface temperature (SST), sea surface salinity (SSS), sea surface currents (SSC), and water transports have been developed for the North Sea and the Baltic Sea using outputs from several operational ocean forecasting models provided by different institutes. The individual models differ in model code, resolution, boundary conditions, atmospheric forcing, and data assimilation. The ensembles are produced on a daily basis. Daily statistics are calculated for each parameter giving information about the spread of the forecasts with standard deviation, ensemble mean and median, and coefficient of variation. High forecast uncertainty, i.e., for SSS and SSC, was found in the Skagerrak, Kattegat (Transition Area between North Sea and Baltic Sea), and the Norwegian Channel. Based on the data collected, longer-term statistical analyses have been done, such as a comparison with satellite data for SST and evaluation of the deviation between forecasts in temporal and spatial scale. Regions of high forecast uncertainty for SSS and SSC have been detected in the Transition Area and the Norwegian Channel where a large spread between the models might evolve due to differences in simulating the frontal structures and their movements. A distinct seasonal pattern could be distinguished for SST with high uncertainty between the forecasts during summer. Forecasts with relatively high deviation from the multi-model ensemble (MME) products or the other individual forecasts were detected for each region and each parameter. The comparison with satellite data showed that the error of the MME products is lowest compared to those of the ensemble members

    The First Provenance Challenge

    No full text
    The first Provenance Challenge was set up in order to provide a forum for the community to help understand the capabilities of different provenance systems and the expressiveness of their provenance representations. To this end, a Functional Magnetic Resonance Imaging workflow was defined, which participants had to either simulate or run in order to produce some provenance representation, from which a set of identified queries had to be implemented and executed. Sixteen teams responded to the challenge, and submitted their inputs. In this paper, we present the challenge workflow and queries, and summarise the participants contributions
    corecore