Search CORE

52 research outputs found

News Comments: Exploring, Modeling, and Online Prediction

Author: Maarten De Rijke
Manos Tsagkias
Wouter Weerkamp
Publication venue
Publication date: 01/01/2010
Field of study

Abstract. Online news agents provide commenting facilities for their readers to express their opinions or sentiments with regards to news stories. The number of user supplied comments on a news article may be indicative of its importance, interestingness, or impact. We explore the news comments space, and compare the log-normal and the negative binomial distributions for modeling comments from various news agents. These estimated models can be used to normalize raw comment counts and enable comparison across different news sites. We also examine the feasibility of online prediction of the number of comments, based on the volume observed shortly after publication. We report on solid performance for predicting news comment volume in the long run, after short observation. This prediction can be useful for identifying news stories with the potential to “take off, ” and can be used to support front page optimization for news sites.

CiteSeerX

International Migration, Integration and Social Cohesion online publications

Recipient Recommendation in Enterprises using Communication Graphs and Email Content

Author: David Graus
David Van Dijk
Maarten De Rijke
Manos Tsagkias
Wouter Weerkamp
Publication venue
Publication date: 03/04/2020
Field of study

ABSTRACT We address the task of recipient recommendation for emailing in enterprises. We propose an intuitive and elegant way of modeling the task of recipient recommendation, which uses both the communication graph (i.e., who are most closely connected to the sender) and the content of the email. Additionally, the model can incorporate evidence as prior probabilities. Experiments on two enterprise email collections show that our model achieves very high scores, and that it outperforms two variants that use either the communication graph or the content in isolation

CiteSeerX

Blog feed search with a post index

Author: C. Manning
C. Zhai
D. J. C. Mackay
J. He
K. Balog
Krisztian Balog
Maarten de Rijke
Wouter Weerkamp
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

User generated content forms an important domain for mining knowledge. In this paper, we address the task of blog feed search: to find blogs that are principally devoted to a given topic, as opposed to blogs that merely happen to mention the topic in passing. The large number of blogs makes the blogosphere a challenging domain, both in terms of effectiveness and of storage and retrieval efficiency. We examine the effectiveness of an approach to blog feed search that is based on individual posts as indexing units (instead of full blogs). Working in the setting of a probabilistic language modeling approach to information retrieval, we model the blog feed search task by aggregating over a blogger’s posts to collect evidence of relevance to the topic and persistence of interest in the topic. This approach achieves state-of-the-art performance in terms of effectiveness. We then introduce a two-stage model where a pre-selection of candidate blogs is followed by a ranking step. The model integrates aggressive pruning techniques as well as very lean representations of the contents of blog posts, resulting in substantial gains in efficiency while maintaining effectiveness at a very competitive level

Crossref

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Credibility-inspired ranking for blog post retrieval

Author: B. Liu
C. Manning
D. E. Losada
J. He
J. Klewes
M. Chen
M. Metzger
M. Tsagkias
Maarten de Rijke
R. Baeza-Yates
W. Chafe
W. Weerkamp
Wouter Weerkamp
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Exploiting External Collections for Query Expansion

Author: Amati G.
Arguello J.
Balog K.
Cartright M.-A.
Elsas J.
Ernsting B. J.
Fautsch C.
Hawking D.
Java A.
Jijkoun V.
Krisztian Balog
Kwok K. L.
Maarten de Rijke
Macdonald C.
Ounis I.
Ounis I.
Rocchio J.
Weerkamp W.
Westerveld T.
Wouter Weerkamp
Zhang W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Clinical characteristics of women captured by extending the definition of severe postpartum haemorrhage with 'refractoriness to treatment': a cohort study

Author: Adriaanse H.J.
Baas M.I.
Bank C.M.C.
Beek E. van
Bijvank S.
Bloemenkamp KW
Boer K. (Karin)
Bom J.G.
Bremer H.A. (Henk)
Brons J.T.J.
Burggraaff J.M. (Jan)
Ceelie H.
Chon H.
Cikot J.L.M.
Cramer R.A.
de Boer B.A.G.
de Keijzer M.H.
de Mare A.
de Visser H.
De Vooght KMK
de Vries M.J.
de Wet H.
de Wit A.C.
Delemarre F.M.C.
Diris J.H.C.
Doesburg-van Kleffens M.
Duvekot J.J. (Hans)
Engbers P.
Feitsma H.
Fouraux M.A.
Franssen MT
Frasa M.A.M.
Gemund N. (Nicolette) van
Gillissen A.
Groot C.J.M.
Hackeng C.M. (Christian)
Ham D.P. (David) van der
Hanssen M.
Hasaart T.H.M. (Tom)
Hendriks H.A.
Henriquez D.
Henskens Y.M.C.
Hermsen B.B.J.
Hogenboom S.
Hooker A.
Hudig F
Huijssoon A.G. (Annemarie)
Huisjes A.J.M. (Anjoke)
Jonker N.
Kabel P.J.
Keuren JFW
Kleiverda G.
Klinkspoor J.H.
Koehorst S.G.A.
Kok J.B. (Jacques) de
Kok M.O. (Maarten)
Kok R.D.
Koops A.
Kortlandt W. (Wouter)
Langenveld J. (J.)
Leers MPG
Leyte A. (Anja)
Martens G.D.M.
Meekers J.H.
Meir C.A. (Claudia) van
Metz G.C.H. (Godfried)
Michielse E.
Mirani-Oostdijk C.P.
Mostert L.J.
Oostenveld E.
Osmanovic N.
Oudijk M.A. (Martijn)
Papatsonis D.N.M. (Dimitri)
Peters R.H.M.
Ponjee G.A.E. (Gabriëlle)
Pontesilli M.
Porath M. (Martina)
Post M.S.
Pouwels J.G.J.
Prinzen L.
Roelofsen J.M.T.
Rondeel J.J.M.
Salm P.C.M. (Paulien) van der
Scheepers H.C.J. (Hubertina)
Schippers D.H. (Daniela)
Schuitemaker N.W.E. (Nico)
Sikkema J.M. (J. Marko)
Slomp J. (Jennita)
Smit J.W.A. (Jan)
Smith S.M.
Snuif-de Lange Y.S.
Steures P. (Pieternel)
Tax G.H.M.
TeMp O.H.S.G.
Treskes M.
Ulenkate H.
van de Kerkhof D.H.
van den Akker E.S.A.
van den Akker T.
van der Borden D.M.R.
van der Graaf F.
van der Stappen J.W.J.
van der Veen B.S.
van Dooren I.M.A.
van Duijnhoven J.L.P.
van Dunn F.M.
van Gammeren A.J.
van Hulst M.J.W.
van Kampen C.
van Pampus E. C. M.
van Roosmalen J.J.
van Unnik G.A.
Verhagen T.E.M.
Versendaal J. (Johan)
Visschers B.
Visser O. (Oane)
Waard H. (Harm) de
Weerkamp F. (Floor)
Weinans M.J.N. (Martin)
Wijnen M. (Marit)
Wijngaarden W.J. (Wim) van
Woiski M.D. (Mallory)
Zwart J.J. (Joost)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Background: The absence of a uniform and clinically relevant definition of severe postpartum haemorrhage hampers comparative studies and optimization of clinical management. The concept of persistent postpartum haemorrhage, based on refractoriness to initial first-line treatment, was proposed as an alternative to common definitions that are either based on estimations of blood loss or transfused units of packed red blood cells (RBC). We compared characteristics and outcomes of women with severe postpartum haemorrhage captured by these three types of definitions. Methods: In this large retrospective cohort study in 61 hospitals in the Netherlands we included 1391 consecutive women with postpartum haemorrhage who received either ≥4 units of RBC or a multicomponent transfusion. Clinical characteristics and outcomes of women with severe postpartum haemorrhage defined as persistent postpartum haemorrhage were compared to definitions based on estimated blood loss or transfused units of RBC within 24 h following birth. Adverse maternal outcome was a composite of maternal mortality, hysterectomy, arterial embolisation and intensive care unit admission. Results: One thousand two hundred sixty out of 1391 women (90.6%) with postpartum haemorrhage fulfilled the definition of persistent postpartum haemorrhage. The majority, 820/1260 (65.1%), fulfilled this definition within 1 h following birth, compared to 819/1391 (58.7%) applying the definition of ≥1 L blood loss and 37/845 (4.4%) applying the definition of ≥4 units of RBC. The definition persistent postpartum haemorrhage captured 430/471 adverse maternal outcomes (91.3%), compared to 471/471 (100%) for ≥1 L blood loss and 383/471 (81.3%) for ≥4 units of RBC. Persistent postpartum haemorrhage did not capture all adverse outcomes because of missing data on timing of initial, first-line treatment. Conclusion: The definition persistent postpartum haemo

EUR Research Repository

Leiden University Scholary Publications

Erasmus University Digital Repository

Rijke, “A two-stage model for blog feed search

Author: Krisztian Balog
Wouter Weerkamp
Publication venue
Publication date: 01/01/2010
Field of study

ABSTRACT We consider blog feed search: identifying relevant blogs for a given topic. An individual's search behavior often involves a combination of exploratory behavior triggered by salient features of the information objects being examined plus goal-directed in-depth information seeking behavior. We present a two-stage blog feed search model that directly builds on this insight. We first rank blog posts for a given topic, and use their parent blogs as selection of blogs that we rank using a blog-based model

CiteSeerX

A Generative Blog Post Retrieval Model that Uses Query Expansion based on External Collections

Author: Krisztian Balog
Wouter Weerkamp
Publication venue
Publication date: 01/01/2009
Field of study

User generated content is characterized by short, noisy documents, with many spelling errors and unexpected language usage. To bridge the vocabulary gap between the user’s information need and documents in a specific user generated content environment, the blogosphere, we apply a form of query expansion, i.e., adding and reweighing query terms. Since the blogosphere is noisy, query expansion on the collection itself is rarely effective but external, edited collections are more suitable. We propose a generative model for expanding queries using external collections in which dependencies between queries, documents, and expansion documents are explicitly modeled. Different instantiations of our model are discussed and make different (in)dependence assumptions. Results using two external collections (news and Wikipedia) show that external expansion for retrieval of user generated content is effective; besides, conditioning the external collection on the query is very beneficial, and making candidate expansion terms dependent on just the document seems sufficient.

CiteSeerX