318 research outputs found
Followers Are Not Enough: A Question-Oriented Approach to Community Detection in Online Social Networks
Community detection in online social networks is typically based on the
analysis of the explicit connections between users, such as "friends" on
Facebook and "followers" on Twitter. But online users often have hundreds or
even thousands of such connections, and many of these connections do not
correspond to real friendships or more generally to accounts that users
interact with. We claim that community detection in online social networks
should be question-oriented and rely on additional information beyond the
simple structure of the network. The concept of 'community' is very general,
and different questions such as "whom do we interact with?" and "with whom do
we share similar interests?" can lead to the discovery of different social
groups. In this paper we focus on three types of communities beyond structural
communities: activity-based, topic-based, and interaction-based. We analyze a
Twitter dataset using three different weightings of the structural network
meant to highlight these three community types, and then infer the communities
associated with these weightings. We show that the communities obtained in the
three weighted cases are highly different from each other, and from the
communities obtained by considering only the unweighted structural network. Our
results confirm that asking a precise question is an unavoidable first step in
community detection in online social networks, and that different questions can
lead to different insights about the network under study.Comment: 22 pages, 4 figures, 1 table
Statistical Methods for Analyzing Time Series Data Drawn from Complex Social Systems
The rise of human interaction in digital environments has lead to an abundance of behavioral traces. These traces allow for model-based investigation of human-human and human-machine interaction `in the wild.' Stochastic models allow us to both predict and understand human behavior. In this thesis, we present statistical procedures for learning such models from the behavioral traces left in digital environments.
First, we develop a non-parametric method for smoothing time series data corrupted by serially correlated noise. The method determines the simplest smoothing of the data that simultaneously gives the simplest residuals, where simplicity of the residuals is measured by their statistical complexity. We find that complexity regularized regression outperforms generalized cross validation in the presence of serially correlated noise.
Next, we cast the task of modeling individual-level user behavior on social media into a predictive framework. We demonstrate the performance of two contrasting approaches, computational mechanics and echo state networks, on a heterogeneous data set drawn from user behavior on Twitter. We demonstrate that the behavior of users can be well-modeled as processes with self-feedback. We find that the two modeling approaches perform very similarly for most users, but that users where the two methods differ in performance highlight the challenges faced in applying predictive models to dynamic social data.
We then expand the predictive problem of the previous work to modeling the aggregate behavior of large collections of users. We use three models, corresponding to seasonal, aggregate autoregressive, and aggregation-of-individual approaches, and find that the performance of the methods at predicting times of high activity depends strongly on the tradeoff between true and false positives, with no method dominating. Our results highlight the challenges and opportunities involved in modeling complex social systems, and demonstrate how influencers interested in forecasting potential user engagement can use complexity modeling to make better decisions.
Finally, we turn from a predictive to a descriptive framework, and investigate how well user behavior can be attributed to time of day, self-memory, and social inputs. The models allow us to describe how a user processes their past behavior and their social inputs. We find that despite the diversity of observed user behavior, most models inferred fall into a small subclass of all possible finitary processes. Thus, our work demonstrates that user behavior, while quite complex, belies simple underlying computational structures
SbcCD regulation and localization in Escherichia coli
The SbcCD complex and its homologues play important roles in DNA repair and in the maintenance of genome stability. In Escherichia coli, the in vitro functions of SbcCD have been well characterized, but its exact cellular role remains elusive. This work investigates the regulation of the sbcDC operon and the cellular localization of the SbcC and SbcD proteins. Transcription of the sbcDC operon is shown to be dependent on starvation and RpoS protein. Overexpressed SbcC protein forms foci that colocalize with the replication factory, while overexpressed SbcD protein is distributed through the cytoplasm
Injection of Automatically Selected DBpedia Subjects in Electronic Medical Records to boost Hospitalization Prediction
International audienceAlthough there are many medical standard vocabularies available, it remains challenging to properly identify domain concepts in electronic medical records. Variations in the annotations of these texts in terms of coverage and abstraction may be due to the chosen annotation methods and the knowledge graphs, and may lead to very different performances in the automated processing of these annotations. We propose a semi-supervised approach based on DBpedia to extract medical subjects from EMRs and evaluate the impact of augmenting the features used to represent EMRs with these subjects in the task of predicting hospitalization. We compare the impact of subjects selected by experts vs. by machine learning methods through feature selection. Our approach was experimented on data from the database PRIMEGE PACA that contains more than 600,000 consultations carried out by 17 general practitioners (GPs)
Projet BoxOffice : une initiative de modernisation des cabinets médicaux
National audienceRésumé non disponibl
DNA double strand break repair in Escherichia coli perturbs cell division and chromosome dynamics
To prevent the transmission of damaged genomic material between generations, cells require a system for accommodating DNA repair within their cell cycles. We have previously shown that Escherichia coli cells subject to a single, repairable site-specific DNA double-strand break (DSB) per DNA replication cycle reach a new average cell length, with a negligible effect on population growth rate. We show here that this new cell size distribution is caused by a DSB repair-dependent delay in completion of cell division. This delay occurs despite unperturbed cell size regulated initiation of both chromosomal DNA replication and cell division. Furthermore, despite DSB repair altering the profile of DNA replication across the genome, the time required to complete chromosomal duplication is invariant. The delay in completion of cell division is accompanied by a DSB repair-dependent delay in individualization of sister nucleoids. We suggest that DSB repair events create inter-sister connections that persist until those chromosomes are separated by a closing septum
« La socialisation secondaire ne s’exerce pas sur une page blanche mais sur une page déjà écrite et déjà froissée par les expériences antérieures »
International audienc
Prescription of antibiotics and anxiolytics/hypnotics to asthmatic patients in general practice: a cross-sectional study based on French and Italian prescribing data
BACKGROUND Asthma is often poorly controlled and guidelines are often inadequately followed in medical practice. In particular, the prescription of non-asthma-specific drugs can affect the quality of care. The goal of this study was to measure the frequency of the prescription of antibiotics and anxiolytics/hypnotics to asthmatic patients and to look for associations between sex or age and the prescription of these drugs. METHODS A cross-sectional study was conducted using computerised medical records from French and Italian general practitioners' networks. Patients were selected according to criteria adapted from the HEDIS (Healthcare Effectiveness Data and Information Set) criteria. The outcome measure was the number of antibiotics or anxiolytics/hypnotics prescriptions per patient in 1 year. Parallel multivariate models were developed. RESULTS The final sample included 3,093 French patients (mean age 27.6 years, 49.7% women) and 3,872 Italian patients (mean age 29.1 years, 48.7% women). In the univariate analysis, the French patients were prescribed fewer antibiotics than the Italian patients (37.1% vs. 42.2%, p < 0.00001) but more anxiolytics/hypnotics (17.8% vs. 6.9%, p < 0.0001). In the multivariate models, the female patients were more likely to receive antibiotics (odds ratio: 1.5 [1.3-1.7]) and anxiolytics/hypnotics (odds ratio: 1.8 [1.5-2.1]). CONCLUSIONS The prescription of antibiotics and anxiolytics/hypnotics to asthmatic patients is frequent, especially in women. Asthma guidelines should address this issue by referring to other guidelines covering the prescription of non-asthma-specific drugs, and alternative non-pharmacological interventions should be considered
- …