Search CORE

64 research outputs found

Representatively Memorable: Sampling the Right Phrase Set to Get the Text Entry Experiment Right

Author: Danescu-Niculescu-Mizil C.
Keller F.
Nelder J. A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/05/2014
Field of study

[EN] In text entry experiments, memorability is a desired property of the phrases used as stimuli. Unfortunately, to date there is no automated method to achieve this effect. As a result, researchers have to use either manually curated Englishonly phrase sets or sampling procedures that do not guarantee phrases being memorable. In response to this need, we present a novel sampling method based on two core ideas: a multiple regression model over language-independent features, and the statistical analysis of the corpus from which phrases will be drawn. Our results show that researchers can finally use a method to successfully curate their own stimuli targeting potentially any language or domain. The source code as well as our phrase sets are publicly available.This work is supported by the 7th Framework Program of the European Commision (FP7/2007-13) under grant agreements 287576 (CASMACAT) and 600707 (tranScriptorium)Leiva, LA.; Sanchis-Trilles, G. (2014). Representatively Memorable: Sampling the Right Phrase Set to Get the Text Entry Experiment Right. ACM. 1709-1712. https://doi.org/10.1145/2556288.2557024S1709171

Crossref

RiuNet

Cascades: A view from Audience

Author: Bakshy E.
Berger J.
Danescu-Niculescu-Mizil C.
Dow P. A.
Goel S.
Goel S.
Lin Y.-R. R.
Romero D. M.
Ross S. M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 26/03/2017
Field of study

Cascades on online networks have been a popular subject of study in the past decade, and there is a considerable literature on phenomena such as diffusion mechanisms, virality, cascade prediction, and peer network effects. However, a basic question has received comparatively little attention: how desirable are cascades on a social media platform from the point of view of users? While versions of this question have been considered from the perspective of the producers of cascades, any answer to this question must also take into account the effect of cascades on their audience. In this work, we seek to fill this gap by providing a consumer perspective of cascade. Users on online networks play the dual role of producers and consumers. First, we perform an empirical study of the interaction of Twitter users with retweet cascades. We measure how often users observe retweets in their home timeline, and observe a phenomenon that we term the "Impressions Paradox": the share of impressions for cascades of size k decays much slower than frequency of cascades of size k. Thus, the audience for cascades can be quite large even for rare large cascades. We also measure audience engagement with retweet cascades in comparison to non-retweeted content. Our results show that cascades often rival or exceed organic content in engagement received per impression. This result is perhaps surprising in that consumers didn't opt in to see tweets from these authors. Furthermore, although cascading content is widely popular, one would expect it to eventually reach parts of the audience that may not be interested in the content. Motivated by our findings, we posit a theoretical model that focuses on the effect of cascades on the audience. Our results on this model highlight the balance between retweeting as a high-quality content selection mechanism and the role of network users in filtering irrelevant content

arXiv.org e-Print Archive

Crossref

PAV and the ROC convex hull

Author: A. Niculescu-Mizil
Alexandru Niculescu-Mizil
B. Zadrozny
F. Provost
G. W. Brier
J. A. Swets
J. Platt
J. Swets
M. Ayer
P. A. Flach
R. Caruana
Tom Fawcett
W. J. Wilbur
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Competition and Selection Among Conventions

Author: Adamic L. A.
Bakshy E.
Bendersky M.
Berger J.
Danescu-Niculescu-Mizil C.
Deutschmann P.
Eisenstein J.
Eisenstein J.
Goel S.
Kooti F.
Krackhardt D.
Labov W.
Labov W. L.
Livne A.
Rogers E.
Romero D. M.
Rotabi R.
Tahmasebi N.
Tsur O.
Valente T.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 26/03/2017
Field of study

In many domains, a latent competition among different conventions determines which one will come to dominate. One sees such effects in the success of community jargon, of competing frames in political rhetoric, or of terminology in technical contexts. These effects have become widespread in the online domain, where the data offers the potential to study competition among conventions at a fine-grained level. In analyzing the dynamics of conventions over time, however, even with detailed on-line data, one encounters two significant challenges. First, as conventions evolve, the underlying substance of their meaning tends to change as well; and such substantive changes confound investigations of social effects. Second, the selection of a convention takes place through the complex interactions of individuals within a community, and contention between the users of competing conventions plays a key role in the convention's evolution. Any analysis must take place in the presence of these two issues. In this work we study a setting in which we can cleanly track the competition among conventions. Our analysis is based on the spread of low-level authoring conventions in the eprint arXiv over 24 years: by tracking the spread of macros and other author-defined conventions, we are able to study conventions that vary even as the underlying meaning remains constant. We find that the interaction among co-authors over time plays a crucial role in the selection of them; the distinction between more and less experienced members of the community, and the distinction between conventions with visible versus invisible effects, are both central to the underlying processes. Through our analysis we make predictions at the population level about the ultimate success of different synonymous conventions over time--and at the individual level about the outcome of "fights" between people over convention choices.Comment: To appear in Proceedings of WWW 2017, data at https://github.com/CornellNLP/Macro

arXiv.org e-Print Archive

Crossref

Climate Informatics

Author: Alexander Francis J.
Banerjee Arindam
Blumenthal M. Benno
Ganguly Auroop R.
Monteleoni Claire
Niculescu-Mizil Alexandru
Schmidt Gavin A.
Smerdon Jason E.
Steinhaeuser Karsten
Tedesco Marco
Tippett Michael
Publication venue
Publication date
Field of study

The impacts of present and potential future climate change will be one of the most important scientific and societal challenges in the 21st century. Given observed changes in temperature, sea ice, and sea level, improving our understanding of the climate system is an international priority. This system is characterized by complex phenomena that are imperfectly observed and even more imperfectly simulated. But with an ever-growing supply of climate data from satellites and environmental sensors, the magnitude of data and climate model output is beginning to overwhelm the relatively simple tools currently used to analyze them. A computational approach will therefore be indispensable for these analysis challenges. This chapter introduces the fledgling research discipline climate informatics: collaborations between climate scientists and machine learning researchers in order to bridge this gap between data and understanding. We hope that the study of climate informatics will accelerate discovery in answering pressing questions in climate science

NASA Technical Reports Server

Inference algorithms for gene networks: a statistical mechanics analysis

Author: A Braunstein
A Pagnani
Alberts B
Baillet-Bechet M Braunstein A Pagnani A Weigt M Zecchina R
Banerjee O El Ghaoui L d’Aspremont A Natsoulis G
Braunstein A
Butte A J
Engel A
Gardner E
Gardner E
Hertz J
Kabashima Y
Kabashima Y
Lee S I
M Weigt
Murphy K Mian S
R Zecchina
Ravikumar P Wainwright M J Lafferty J D
Schmidt M Niculescu-Mizil A Murphy K
Tibshirany R
Tria F Pagnani A Weigt M
Publication venue: 'IOP Publishing'
Publication date: 01/01/2008
Field of study

The inference of gene regulatory networks from high throughput gene expression data is one of the major challenges in systems biology. This paper aims at analysing and comparing two different algorithmic approaches. The first approach uses pairwise correlations between regulated and regulating genes; the second one uses message-passing techniques for inferring activating and inhibiting regulatory interactions. The performance of these two algorithms can be analysed theoretically on well-defined test sets, using tools from the statistical physics of disordered systems like the replica method. We find that the second algorithm outperforms the first one since it takes into account collective effects of multiple regulators

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Towards a consolidation of worldwide journal rankings - A classification using random forests and aggregate rating via data envelopment analysis

Author: A Charnes
A Diskin
A Farhangfar
A Farhangfar
A Hapfelmeier
A Hapfelmeier
A Hashimoto
A Liaw
A Liaw
A Niculescu-Mizil
A P Dempster
A-W Harzing
Acb Tse
B D Ripley
B Gonz�lez-Pereira
B Llamazares
B S Frey
Beth Twala
C Conversano
C Kao
C Paul
C Strobl
C Strobl
C Strobl
C Strobl
C T Bergstrom
D R Cutler
D S Felsenthal
D Steinberg
D Zhou
Dre Bancroft
E Hazelkorn
F L Dubois
G B Durrant
G Biau
G D Bruton
G E Halkos
G G Schulze
G W Brier
Grigory Pishchulov
Gtm Hult
H Bostr�m
H F Moed
H Kim
H Morris
H Noguchi
H Peters
H Willmott
Heinz Tuselmann
I A Gheyas
J Bollen
J H Friedman
J L Schafer
J Mingers
J R Meredith
J S Liu
J S Liu
J Wu
J Wu
J Wu
Jac Baum
K J Archer
K S Park
K-S Fam
L Breiman
L Breiman
L Breiman
L Breiman
L Leydesdorff
L Reyzin
L Zhou
M Bordons
M D Steward
M J Jones
M J Osborne
M Oral
M Rowlinson
N Franke
N J Adler
P A Crookes
R Core Team
R E Schapire
R H Green
R J Bauerly
R K Rainer
R Piccarreta
R Whitley
Rudolf R Sinkovics
S Albers
S Benati
S F Nielsen
S Lim
S Mahdi
S Mehrabian
S Theu�l
S Van Buuren
T Clark
T Hastie
T Hothorn
T Hothorn
T Hothorn
T M Therneau
T M Therneau
T-S Lim
Thomson Reuters
W D Cook
W D Cook
W D Cook
W D Cook
W D Cook
W D Cook
W E Stein
W Gl�nzel
W W Cooper
W-Y Loh
W-Y Loh
Y Yohannes
Y-M Wang
Y-M Wang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

AbstractThe question of how to assess research outputs published in journals is now a global concern for academics. Numerous journal ratings and rankings exist, some featuring perceptual and peer-review-based journal ranks, some focusing on objective information related to citations, some using a combination of the two. This research consolidates existing journal rankings into an up-to-date and comprehensive list. Existing approaches to determining journal rankings are significantly advanced with the application of a new classification approach, ‘random forests’, and data envelopment analysis. As a result, a fresh look at a publication׳s place in the global research community is offered. While our approach is applicable to all management and business journals, we specifically exemplify the relative position of ‘operations research, management science, production and operations management’ journals within the broader management field, as well as within their own subject domain

Durham Research Online

Elsevier - Publisher Connector

Crossref

E-space: Manchester Metropolitan University's Research Repository

The University of Manchester - Institutional Repository

Enlighten

Boosted Bayesian network classifiers

Author: A. Nadas
A. Niculescu-Mizil
B. Taskar
C. K. Chow
D. Cutting
D. Grossman
D. Heckerman
D. M. Chickering
E. Bauer
E. Segal
G. F. Cooper
G. Webb
J. Demsar
J. Friedman
J. H. Friedman
J. Pearl
James M. Rehg
L. Rabiner
N. Friedman
P. Langley
R. E. Schapire
R. E. Schapire
R. Kohavi
R. Kohavi
R. O. Duda
T. G. Dietterich
V. Jojic
V. Pavlović
Vladimir Pavlović
W. Lam
Yushi Jing
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Ranked retrieval of segmented nuclei for objective assessment of cancer gene repositioning

Author: A Akhtar
A Hill
A Niculescu-mizil
AJH Mehnert
B Everitt
B Zadrozny
C Lanctot
CS Bjornsson
CS Osborne
D Gerlich
David J Foran
E Soutoglou
F Raimondo
FP Kuhl
G Lin
G Lin
H Zhang
J Schwarz-Finsterle
JJ Roix
JW Shirley
K Kupper
K Nandy
Karen J Meaburn
Kaustav Nandy
KJ Meaburn
KJ Meaburn
L Breiman
LA Parada
M Cremer
M Simonis
M Valente
NL Mahy
O Ronneberger
P Andrey
P Danielsson
P Fraser
PR Gudla
Prabhakar Gudla
S Kozubek
ST Kosak
ST Kosak
Stephen J Lockett
T Cremer
T Hastie
T Misteli
T Sexton
Tom Misteli
V Roukos
William J Cukierski
Y Al-Kofahi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref