128 research outputs found

    Identifying Retweetable Tweets with a Personalized Global Classifier

    Full text link
    In this paper we present a method to identify tweets that a user may find interesting enough to retweet. The method is based on a global, but personalized classifier, which is trained on data from several users, represented in terms of user-specific features. Thus, the method is trained on a sufficient volume of data, while also being able to make personalized decisions, i.e., the same post received by two different users may lead to different classification decisions. Experimenting with a collection of approx.\ 130K tweets received by 122 journalists, we train a logistic regression classifier, using a wide variety of features: the content of each tweet, its novelty, its text similarity to tweets previously posted or retweeted by the recipient or sender of the tweet, the network influence of the author and sender, and their past interactions. Our system obtains F1 approx. 0.9 using only 10 features and 5K training instances.Comment: This is a long paper version of the extended abstract titled "A Personalized Global Filter To Predict Retweets", of the same authors, which was published in the 25th ACM UMAP conference in Bratislava, Slovakia, in July 201

    A HMM POS Tagger for Micro-blogging Type Texts

    Get PDF
    The high volume of communication via micro-blogging type messages has created an increased demand for text processing tools customised the unstructured text genre. The available text processing tools developed on structured texts has been shown to deteriorate significantly when used on unstructured, micro-blogging type texts. In this paper, we present the results of testing a HMM based POS (Part-Of-Speech) tagging model customized for unstructured texts. We also evaluated the tagger against published CRF based state-of-the-art POS tagging models customized for Tweet messages using three publicly available Tweet corpora. Finally, we did cross-validation tests with both the taggers by training them on one Tweet corpus and testing them on another one

    Deep Memory Networks for Attitude Identification

    Full text link
    We consider the task of identifying attitudes towards a given set of entities from text. Conventionally, this task is decomposed into two separate subtasks: target detection that identifies whether each entity is mentioned in the text, either explicitly or implicitly, and polarity classification that classifies the exact sentiment towards an identified entity (the target) into positive, negative, or neutral. Instead, we show that attitude identification can be solved with an end-to-end machine learning architecture, in which the two subtasks are interleaved by a deep memory network. In this way, signals produced in target detection provide clues for polarity classification, and reversely, the predicted polarity provides feedback to the identification of targets. Moreover, the treatments for the set of targets also influence each other -- the learned representations may share the same semantics for some targets but vary for others. The proposed deep memory network, the AttNet, outperforms methods that do not consider the interactions between the subtasks or those among the targets, including conventional machine learning methods and the state-of-the-art deep learning models.Comment: Accepted to WSDM'1

    Feasibility of detecting single atoms using photonic bandgap cavities

    Get PDF
    We propose an atom-cavity chip that combines laser cooling and trapping of neutral atoms with magnetic microtraps and waveguides to deliver a cold atom to the mode of a fiber taper coupled photonic bandgap (PBG) cavity. The feasibility of this device for detecting single atoms is analyzed using both a semi-classical treatment and an unconditional master equation approach. Single-atom detection seems achievable in an initial experiment involving the non-deterministic delivery of weakly trapped atoms into the mode of the PBG cavity.Comment: 11 pages, 5 figure

    Statistical Inference for Valued-Edge Networks: Generalized Exponential Random Graph Models

    Get PDF
    Across the sciences, the statistical analysis of networks is central to the production of knowledge on relational phenomena. Because of their ability to model the structural generation of networks, exponential random graph models are a ubiquitous means of analysis. However, they are limited by an inability to model networks with valued edges. We solve this problem by introducing a class of generalized exponential random graph models capable of modeling networks whose edges are valued, thus greatly expanding the scope of networks applied researchers can subject to statistical analysis

    Refining Kidney Survival in 383 Genetically Characterized Patients With Nephronophthisis

    Get PDF
    Introduction: Nephronophthisis (NPH) comprises a group of rare disorders accounting for up to 10% of end-stage kidney disease (ESKD) in children. Prediction of kidney prognosis poses a major challenge. We assessed differences in kidney survival, impact of variant type, and the association of clinical characteristics with declining kidney function. Methods: Data was obtained from 3 independent sources, namely the network for early onset cystic kidney diseases clinical registry (n = 105), an online survey sent out to the European Reference Network for Rare Kidney Diseases (n = 60), and a literature search (n = 218). Results: A total of 383 individuals were available for analysis: 116 NPHP1, 101 NPHP3, 81 NPHP4 and 85 NPHP11/TMEM67 patients. Kidney survival differed between the 4 cohorts with a highly variable median age at onset of ESKD as follows: NPHP3, 4.0 years (interquartile range 0.3–12.0); NPHP1, 13.5 years (interquartile range 10.5–16.5); NPHP4, 16.0 years (interquartile range 11.0–25.0); and NPHP11/TMEM67, 19.0 years (interquartile range 8.7–28.0). Kidney survival was significantly associated with the underlying variant type for NPHP1, NPHP3, and NPHP4. Multivariate analysis for the NPHP1 cohort revealed growth retardation (hazard ratio 3.5) and angiotensin-converting enzyme inhibitor (ACEI) treatment (hazard ratio 2.8) as 2 independent factors associated with an earlier onset of ESKD, whereas arterial hypertension was linked to an accelerated glomerular filtration rate (GFR) decline. Conclusion: The presented data will enable clinicians to better estimate kidney prognosis of distinct patients with NPH and thereby allow personalized counseling

    Guidance on a better integration of aquaculture, fisheries, and other activities in the coastal zone: from tools to practical examples

    Get PDF
    This guidance document provides a comprehensive assessment of the conflicts and synergies between fisheries, aquaculture and other activities in the coastal zone in six COEXIST case study areas. It forms deliverable D5.2 of the COEXIST project and synthesises deliverable D5.1, which provides a more detailed description of the methods used and results. This document also accounts for the views and expectations of stakeholders that were raised at the COEXIST stakeholder workshop held in Bergen, Norway, parallel to the ICES (International Council for the Exploration of the Sea) Annual Science Conference 2012. Over 30 stakeholders representing a variety of sectors, including aquaculture, fisheries, coastal zone management, tourism and energy, as well as 20 members from the COEXIST project and ICES representatives, attended this event. The stakeholders and COEXIST members were from Denmark, Finland, France, Germany, Ireland, Italy, Norway, Portugal, Spain, the Netherlands and the United Kingdom. The workshop aims were firstly to communicate the COEXIST project results and progress to stakeholders and the second major aim was to receive stakeholder feedback on the development of best practice guidance for spatial planning to integrate fisheries, aquaculture and further demands in the coastal zone

    Author Correction: Federated learning enables big data for rare cancer boundary detection.

    Get PDF
    • …
    corecore