259 research outputs found

    Followers Are Not Enough: A Question-Oriented Approach to Community Detection in Online Social Networks

    Full text link
    Community detection in online social networks is typically based on the analysis of the explicit connections between users, such as "friends" on Facebook and "followers" on Twitter. But online users often have hundreds or even thousands of such connections, and many of these connections do not correspond to real friendships or more generally to accounts that users interact with. We claim that community detection in online social networks should be question-oriented and rely on additional information beyond the simple structure of the network. The concept of 'community' is very general, and different questions such as "whom do we interact with?" and "with whom do we share similar interests?" can lead to the discovery of different social groups. In this paper we focus on three types of communities beyond structural communities: activity-based, topic-based, and interaction-based. We analyze a Twitter dataset using three different weightings of the structural network meant to highlight these three community types, and then infer the communities associated with these weightings. We show that the communities obtained in the three weighted cases are highly different from each other, and from the communities obtained by considering only the unweighted structural network. Our results confirm that asking a precise question is an unavoidable first step in community detection in online social networks, and that different questions can lead to different insights about the network under study.Comment: 22 pages, 4 figures, 1 table

    Statistical Methods for Analyzing Time Series Data Drawn from Complex Social Systems

    Get PDF
    The rise of human interaction in digital environments has lead to an abundance of behavioral traces. These traces allow for model-based investigation of human-human and human-machine interaction `in the wild.' Stochastic models allow us to both predict and understand human behavior. In this thesis, we present statistical procedures for learning such models from the behavioral traces left in digital environments. First, we develop a non-parametric method for smoothing time series data corrupted by serially correlated noise. The method determines the simplest smoothing of the data that simultaneously gives the simplest residuals, where simplicity of the residuals is measured by their statistical complexity. We find that complexity regularized regression outperforms generalized cross validation in the presence of serially correlated noise. Next, we cast the task of modeling individual-level user behavior on social media into a predictive framework. We demonstrate the performance of two contrasting approaches, computational mechanics and echo state networks, on a heterogeneous data set drawn from user behavior on Twitter. We demonstrate that the behavior of users can be well-modeled as processes with self-feedback. We find that the two modeling approaches perform very similarly for most users, but that users where the two methods differ in performance highlight the challenges faced in applying predictive models to dynamic social data. We then expand the predictive problem of the previous work to modeling the aggregate behavior of large collections of users. We use three models, corresponding to seasonal, aggregate autoregressive, and aggregation-of-individual approaches, and find that the performance of the methods at predicting times of high activity depends strongly on the tradeoff between true and false positives, with no method dominating. Our results highlight the challenges and opportunities involved in modeling complex social systems, and demonstrate how influencers interested in forecasting potential user engagement can use complexity modeling to make better decisions. Finally, we turn from a predictive to a descriptive framework, and investigate how well user behavior can be attributed to time of day, self-memory, and social inputs. The models allow us to describe how a user processes their past behavior and their social inputs. We find that despite the diversity of observed user behavior, most models inferred fall into a small subclass of all possible finitary processes. Thus, our work demonstrates that user behavior, while quite complex, belies simple underlying computational structures

    SbcCD regulation and localization in Escherichia coli

    Get PDF
    The SbcCD complex and its homologues play important roles in DNA repair and in the maintenance of genome stability. In Escherichia coli, the in vitro functions of SbcCD have been well characterized, but its exact cellular role remains elusive. This work investigates the regulation of the sbcDC operon and the cellular localization of the SbcC and SbcD proteins. Transcription of the sbcDC operon is shown to be dependent on starvation and RpoS protein. Overexpressed SbcC protein forms foci that colocalize with the replication factory, while overexpressed SbcD protein is distributed through the cytoplasm

    Injection of Automatically Selected DBpedia Subjects in Electronic Medical Records to boost Hospitalization Prediction

    Get PDF
    International audienceAlthough there are many medical standard vocabularies available, it remains challenging to properly identify domain concepts in electronic medical records. Variations in the annotations of these texts in terms of coverage and abstraction may be due to the chosen annotation methods and the knowledge graphs, and may lead to very different performances in the automated processing of these annotations. We propose a semi-supervised approach based on DBpedia to extract medical subjects from EMRs and evaluate the impact of augmenting the features used to represent EMRs with these subjects in the task of predicting hospitalization. We compare the impact of subjects selected by experts vs. by machine learning methods through feature selection. Our approach was experimented on data from the database PRIMEGE PACA that contains more than 600,000 consultations carried out by 17 general practitioners (GPs)

    Projet BoxOffice : une initiative de modernisation des cabinets médicaux

    Get PDF
    National audienceRésumé non disponibl

    Prescription of antibiotics and anxiolytics/hypnotics to asthmatic patients in general practice: a cross-sectional study based on French and Italian prescribing data

    No full text
    BACKGROUND Asthma is often poorly controlled and guidelines are often inadequately followed in medical practice. In particular, the prescription of non-asthma-specific drugs can affect the quality of care. The goal of this study was to measure the frequency of the prescription of antibiotics and anxiolytics/hypnotics to asthmatic patients and to look for associations between sex or age and the prescription of these drugs. METHODS A cross-sectional study was conducted using computerised medical records from French and Italian general practitioners' networks. Patients were selected according to criteria adapted from the HEDIS (Healthcare Effectiveness Data and Information Set) criteria. The outcome measure was the number of antibiotics or anxiolytics/hypnotics prescriptions per patient in 1 year. Parallel multivariate models were developed. RESULTS The final sample included 3,093 French patients (mean age 27.6 years, 49.7% women) and 3,872 Italian patients (mean age 29.1 years, 48.7% women). In the univariate analysis, the French patients were prescribed fewer antibiotics than the Italian patients (37.1% vs. 42.2%, p < 0.00001) but more anxiolytics/hypnotics (17.8% vs. 6.9%, p < 0.0001). In the multivariate models, the female patients were more likely to receive antibiotics (odds ratio: 1.5 [1.3-1.7]) and anxiolytics/hypnotics (odds ratio: 1.8 [1.5-2.1]). CONCLUSIONS The prescription of antibiotics and anxiolytics/hypnotics to asthmatic patients is frequent, especially in women. Asthma guidelines should address this issue by referring to other guidelines covering the prescription of non-asthma-specific drugs, and alternative non-pharmacological interventions should be considered
    • 

    corecore