Search CORE

20,090 research outputs found

Start Time and Duration Distribution Estimation in Semi-Structured Processes

Author: Iacob Maria-Eugenia
Wombacher Andreas
Publication venue: Centre for Telematics and Information Technology, University of Twente
Publication date: 01/01/2012
Field of study

Semi-structured processes are business workflows, where the execution of the workflow is not completely controlled by a workflow engine, i.e., an implementation of a formal workflow model. Examples are workflows where actors potentially have interaction with customers reporting the result of the interaction in a process aware information system. Building a performance model for resource management in these processes is difficult since the required information is only partially recorded. In this paper we propose a systematic approach for the creation of an event log that is suitable for available process mining tools. This event log is created by an incrementally cleansing of data. The proposed approach is evaluated in an experiment

University of Twente Research Information

The Role of Interactivity in Local Differential Privacy

Author: Joseph Matthew
Mao Jieming
Neel Seth
Roth Aaron
Publication venue
Publication date: 08/11/2019
Field of study

We study the power of interactivity in local differential privacy. First, we focus on the difference between fully interactive and sequentially interactive protocols. Sequentially interactive protocols may query users adaptively in sequence, but they cannot return to previously queried users. The vast majority of existing lower bounds for local differential privacy apply only to sequentially interactive protocols, and before this paper it was not known whether fully interactive protocols were more powerful. We resolve this question. First, we classify locally private protocols by their compositionality, the multiplicative factor

k \geq 1

by which the sum of a protocol's single-round privacy parameters exceeds its overall privacy guarantee. We then show how to efficiently transform any fully interactive

k

-compositional protocol into an equivalent sequentially interactive protocol with an

O(k)

blowup in sample complexity. Next, we show that our reduction is tight by exhibiting a family of problems such that for any

k

, there is a fully interactive

k

-compositional protocol which solves the problem, while no sequentially interactive protocol can solve the problem without at least an

\tilde \Omega(k)

factor more examples. We then turn our attention to hypothesis testing problems. We show that for a large class of compound hypothesis testing problems --- which include all simple hypothesis testing problems as a special case --- a simple noninteractive test is optimal among the class of all (possibly fully interactive) tests

arXiv.org e-Print Archive

Crossref

Diffusion of Lexical Change in Social Media

Author: Eisenstein Jacob
O'Connor Brendan
Smith Noah A.
Xing Eric P.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 19/11/2014
Field of study

Computer-mediated communication is driving fundamental changes in the nature of written language. We investigate these changes by statistical analysis of a dataset comprising 107 million Twitter messages (authored by 2.7 million unique user accounts). Using a latent vector autoregressive model to aggregate across thousands of words, we identify high-level patterns in diffusion of linguistic change over the United States. Our model is robust to unpredictable changes in Twitter's sampling rate, and provides a probabilistic characterization of the relationship of macro-scale linguistic influence to a set of demographic and geographic predictors. The results of this analysis offer support for prior arguments that focus on geographical proximity and population size. However, demographic similarity -- especially with regard to race -- plays an even more central role, as cities with similar racial demographics are far more likely to share linguistic influence. Rather than moving towards a single unified "netspeak" dialect, language evolution in computer-mediated communication reproduces existing fault lines in spoken American English.Comment: preprint of PLOS-ONE paper from November 2014; PLoS ONE 9(11) e11311

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

FigShare

Log file analysis for disengagement detection in e-Learning environments

Author: Cocea Mihaela
Weibelzahl S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2009
Field of study

Portsmouth University Research Portal (Pure)