Search CORE

61 research outputs found

Feature extraction and selection for Arabic tweets authorship authentication

Author: A Abbasi
A Abbasi
A Pasha
Abdullateef Rabab’ah
AS Altheneyan
CC Aggarwal
E Stamatatos
E Stamatatos
F Mosteller
G Hirst
G Kanaan
GH Dunteman
H Sayoud
HC Chen
I Kononenko
JT Kent
M Hall
M Koppel
Mahmoud Al-Ayyoub
ML Brocardo
Monther Aldwairi
MS Khorsheed
N Cheng
O Vel De
P Juola
P Juola
P Kosmides
RS Baraka
T Helmy
W Deitrick
Yaser Jararweh
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2017
Field of study

© 2017, Springer-Verlag Berlin Heidelberg. In tweet authentication, we are concerned with correctly attributing a tweet to its true author based on its textual content. The more general problem of authenticating long documents has been studied before and the most common approach relies on the intuitive idea that each author has a unique style that can be captured using stylometric features (SF). Inspired by the success of modern automatic document classification problem, some researchers followed the Bag-Of-Words (BOW) approach for authenticating long documents. In this work, we consider both approaches and their application on authenticating tweets, which represent additional challenges due to the limitation in their sizes. We focus on the Arabic language due to its importance and the scarcity of works related on it. We create different sets of features from both approaches and compare the performance of different classifiers using them. We experiment with various feature selection techniques in order to extract the most discriminating features. To the best of our knowledge, this is the first study of its kind to combine these different sets of features for authorship analysis of Arabic tweets. The results show that combining all the feature sets we compute yields the best results

ZU Scholars (Zayed University)

Crossref

Recommended from our members

Validation of a social cohesion theoretical framework: a multiple group SEM strategy

Author: AJ Morin
B Muthén
CT Whelan
D Lockwood
D Rindskopf
DA Kenny
European Union
FF Chen
G Bon Le
G Duhaime
GH Dunteman
Gianmaria Bottoni
HH Noll
HW Marsh
HW Marsh
HW Marsh
HW Marsh
J Chan
JJ Hox
KA Bollen
KA Bollen
KF Widaman
KG Jöreskog
MS Granovetter
MS Jeannotte
MW Browne
MW Browne
MW Browne
NE Friedkin
P Dickes
P Dickes
PM Bentler
PM Bentler
R Berger-Schmitt
R Cudeck
RC MacCallum
RC MacCallum
RD Putnam
RK Merton
Social Health and Family Affairs Committee
T Asparouhov
T Parsons
TA Whittaker
W Meredith
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/04/2017
Field of study

Social cohesion dates back to the end of the nineteenth century. Back then, society experienced epochal transformations, as are also happening nowadays. Whenever there are epochal changes, a social order (cohesion) matter arises. The paper provides a conceptual scheme of social cohesion identifying its constituent dimensions subdivided by three spheres (macro, meso, micro) and two perspectives (objective and subjective). The overarching aim is to test the validity of the operationalization of the social cohesion model provided. Firstly, we conducted an exploratory factor analysis introducing an approach implemented in Mplus named exploratory structural equation modeling that shows several useful characteristics. Afterward, through a structural equation modeling approach, we performed several confirmatory factor analyses adopting a multiple group SEM strategy in order to cross-validate the social cohesion model

City Research Online

Crossref

Hardship financing of healthcare among rural poor in Orissa, India

Author: A Asfaw
A Kochar
A Krishna
A Leive
C Moser
Census of India
Census of India
CN Mock
D Sillers
David M Dror
DH Peters
DM Dror
E Van Doorslaer
Erika Binnendijk
G Flores
GH Dunteman
J Morduch
K Wyss
K Xu
LC Steinhardt
ME Kruk
Ministry of Health and Family Welfare India
National Sample Survey Organization
P Basu
PA Berman
R Duggal
R Sauerborn
R Sauerborn
RK Som
Ruth Koren
S Bonu
S Khun
S Nahar
S Russell
S Vyas
SR Adhikari
W Van Damme
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background This study examines health-related "hardship financing" in order to get better insights on how poor households finance their out-of-pocket healthcare costs. We define hardship financing as having to borrow money with interest or to sell assets to pay out-of-pocket healthcare costs. Methods Using survey data of 5,383 low-income households in Orissa, one of the poorest states of India, we investigate factors influencing the risk of hardship financing with the use of a logistic regression. Results Overall, about 25% of the households (that had any healthcare cost) reported hardship financing during the year preceding the survey. Among households that experienced a hospitalization, this percentage was nearly 40%, but even among households with outpatient or maternity-related care around 25% experienced hardship financing. Hardship financing is explained not merely by the wealth of the household (measured by assets) or how much is spent out-of-pocket on healthcare costs, but also by when the payment occurs, its frequency and its duration (e.g. more severe in cases of chronic illnesses). The location where a household resides remains a major predictor of the likelihood to have hardship financing despite all other household features included in the model. Conclusions Rural poor households are subjected to considerable and protracted financial hardship due to the indirect and longer-term deleterious effects of how they cope with out-of-pocket healthcare costs. The social network that households can access influences exposure to hardship financing. Our findings point to the need to develop a policy solution that would limit that exposure both in quantum and in time. We therefore conclude that policy interventions aiming to ensure health-related financial protection would have to demonstrate that they have reduced the frequency and the volume of hardship financing.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Erasmus University Digital Repository

Lifestyle and diet in relation to risk of type 2 diabetes in Vietnam: a hospital-based case-control study.

Author: A Alhazmi
AA Shitandi
AO Odegaard
C Willi
CT Nguyen
CY Jeon
DV Tran
DV Tran
E Elm Von
EpiData Association
F Naja
GA Colditz
GH Dunteman
H Armenian
H Iso
International Diabetes Federation
JJ Schlesselman
JJ Schlesselman
JJ Wang
JV Higdon
JV Higdon
JW Osborne
K Kusama
L Gordis
L Guariguata
L Radzeviciene
L Radzevičienė
L Radzevičienė
L Wang
M Ding
NC Khan
NDS Le
NM Pham
PS Quoc
R Huxley
RM Dam van
S Greenland
S Greenland
UG Kyle
World Health Organisation
World Health Organization
World Health Organization
WS Yang
Y Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

BACKGROUND: Lifestyle and diet are important determinants of type 2 diabetes (T2D). Their impact on T2D can be evaluated using clinical and epidemiological approaches. Randomised controlled trials are the most rigorous design but expensive to conduct, whereas prospective cohort studies are time-consuming and less powerful for populations with a low incidence of the disease. Case-control studies are considered appropriate in resource-limited settings. A hospital-based case-control study protocol has been developed to investigate the role of lifestyle and dietary factors in T2D aetiology for adults in Vietnam. METHODS: A total of 1100 patients aged 40-65 years (550 T2D cases and 550 controls) will be recruited from a tertiary hospital in Hanoi, the capital city of Vietnam. Cases and controls will be frequency-matched on age (±3 years), gender, and residential location. T2D will be diagnosed according to the 2006 World Health Organisation criteria. Habitual physical activity will be assessed by the Vietnamese version of the International Physical Activity Questionnaire-Short Form. Food and beverage consumption will be ascertained using a Validated Food Frequency Questionnaire, specifically developed for the Vietnamese population. Information on demographic and other personal characteristics will be collected, together with anthropometric and blood pressure measurements. Descriptive statistics and unconditional logistic regression analyses will be performed to examine factors associated with the T2D prevalence. DISCUSSION: The proposed study will elucidate the role of lifestyle and diet in T2D prevalence among Vietnamese adults. Findings concerning pertinent factors will provide epidemiological evidence for the development of focused interventions, and contribute to the formulation of national policies to prevent and control T2D in Vietnam

Crossref

Springer - Publisher Connector

PubMed Central

espace@Curtin

Discrimination in lexical decision.

Author: A Stefanowitsch
AJ Parker
BC Love
C Burgess
C Leys
C Shaoul
CJ Marsolek
CR Oehrn
D Danks
D Norris
D Norris
D Norris
D Norris
DA Balota
DE Knuth
E Beyersmann
F Moscoso del Prado Martín
F Moscoso del Prado Martín
F Rosenblatt
G Recchia
GA Miller
GE Bodner
GE Booij
GH Dunteman
H Kučera
Hedderik van Rijn
J Friedman
J Friedman
J Friedman
J Heister
JA Dunabeitia
JL McClelland
JP Blevins
JS Bowers
JS Burt
K Lund
K Mulder
K Rastle
K Rastle
KY Chan
L Bauer
Laurie Beth Feldman
LB Feldman
LB Feldman
LB Feldman
LG Allan
LH Wurm
M Brysbeart
M Coltheart
M Marelli
M Minsky
M Ramscar
M Ramscar
M Ramscar
M Ramscar
M Ramscar
M Ramscar
M Taft
M Taft
M Taft
M Taft
M Taft
M Taft
MEJ Masson
Michael Ramscar
MM Botvinick
MN Shadlen
MS Vitevitch
MW Harm
NC Ellis
P Milin
P Milin
PC Trimmer
Petar Milin
Peter Hendrix
PH Matthews
R Romo
R Schreuder
R Schreuder
R Schreuder
R. Harald Baayen
RA Rescorla
RA Rescorla
RH Baayen
RH Baayen
RH Baayen
RH Baayen
RH Baayen
RQ Quiroga
RR Miller
S Andrews
S Andrews
S Andrews
S Waydo
SN Wood
T Yarkoni
TK Landauer
TL Griffiths
WJM Levelt
Z Chen
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2017
Field of study

In this study we present a novel set of discrimination-based indicators of language processing derived from Naive Discriminative Learning (ndl) theory. We compare the effectiveness of these new measures with classical lexical-distributional measures-in particular, frequency counts and form similarity measures-to predict lexical decision latencies when a complete morphological segmentation of masked primes is or is not possible. Data derive from a re-analysis of a large subset of decision latencies from the English Lexicon Project, as well as from the results of two new masked priming studies. Results demonstrate the superiority of discrimination-based predictors over lexical-distributional predictors alone, across both the simple and primed lexical decision tasks. Comparable priming after masked corner and cornea type primes, across two experiments, fails to support early obligatory segmentation into morphemes as predicted by the morpho-orthographic account of reading. Results fit well with ndl theory, which, in conformity with Word and Paradigm theory, rejects the morpheme as a relevant unit of analysis. Furthermore, results indicate that readers with greater spelling proficiency and larger vocabularies make better use of orthographic priors and handle lexical competition more efficiently

Crossref

University of Birmingham Research Portal

Directory of Open Access Journals

PubMed Central

White Rose Research Online

Community mobilization, empowerment and HIV prevention among female sex workers in south India

Author: A Amin
A Rahman
A Sen
Andrea K Blanchard
Banadakoppa Manjappa Ramesh
C Campbell
C Campbell
C Evans
D Kerrigan
D Kerrigan
D Swendeman
D Wight
E Bourcier
E Reed
F Cornish
F Cornish
F Cornish
GH Dunteman
Haranahalli Lakkappa Mohan
I Basu
IT Jolliffe
J Busza
J O'Neil
James F Blanchard
JC Kim
JD Tucker
JF Blanchard
JF Blanchard
JM Wojcicki
KM Blankenship
Maryam Shahmanesh
N Kabeer
N Kabeer
N Kabeer
P Pillai
Parinita Bhattacharjee
R Parker
Ravi Prakash
S Asthana
S Moses
S Panchanadeswaran
S Reza-Paul
Shajy Isac
SS Halli
Stephen Moses
T Ghose
V Gurnani
Vandana Gurnani
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Association between methadone dose and concomitant cocaine use in methadone maintenance treatment: a register-based study

Author: (NIH) NIoH
A Fareed
A Fareed
AP Kennedy
AT McLellan
B Meandzija
BKH Gsellhofer
CE Grella
DM Hartel
E Peles
EC Donny
EC Strain
EMCDDA
F Kamal
Gerhard A Wiesbeck
GH Dunteman
GKH Gmel
J Ward
JC Ball
Johannes Strasser
JP Vader
JR Caplehorn
Kenneth M Dürsteler-MacFarland
KM Dursteler-MacFarland
L Borg
L Greenfield
M Farre
M Farrell
M Gossop
M Kidorf
Marc Vogel
Marc Walter
Marcus Baumeister
MJ Bravo
PA DeMaria Jr
R Stohler
RE Chaisson
RS Schottenfeld
S Magura
S Maxwell
S Petitjean
S Petitjean
SB Leavitt
SE Decker
Sylvie A Petitjean
TM Brady
TR Kosten
Urs Gerhard
VP Dole
WHO
X Castells
YI Hser
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Evidence for contribution of common genetic variants within chromosome 8p21.2-8p21.1 to restricted and repetitive behaviors in autism spectrum disorders

Author: A Smallwood
AP Boyle
B Goldman
B Howie
Benjamin Ackerman
BJ Crespi
BN Howie
BP Coe
BS Abrahams
C Bertocchi
C Betancur
C Cheng
C Gillberg
C Lord
C Lord
D Ma
D Spiker
David Saffen
DH Geschwind
DH Geschwind
DH Geschwind
DS Cannon
E Fombonne
ER Martin
G Bellot
G Tang
GD Fischbach
GH Dunteman
H Hagberg
HG Kim
HM Ozgen
Hui Gao
I Bieche
JC Lambert
JI Tracy
JK Lowe
JL Stone
JM Berg
JM Silverman
K Papanikolaou
K Wang
K Whalley
K Xia
KS Lam
LA Weiss
M Elsabbagh
M Lewis
M Zhu
MD Fallin
ML Cuccaro
N Lacy de
N Ludemann
N Ohkawa
O Delaneau
OJ Veatch
P Szatmari
P Szatmari
PA Holmans
Q Yang
R Anney
R Anney
R Tabares-Seisdedos
S Numata
S Ribich
SL Bishop
SL Bishop
SR Leekam
SS Jeste
T Koide
TE Galesloot
Team RDC
V Hus
VW Hu
VW Hu
W Huang da
W Huang da
Wei Guo
WP Mandy
X Wang
X Zhou
XQ Liu
Y Benjamini
Y Shao
Y Taniyama
Yin Yao Shugart
YS Kim
Yu Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Selecting DEA specifications and ranking units via PCA

Author: Arabie P
C Mar Molinero
C Serrano Cinca
Dunteman GH
Norman M
Schiffman JF
Serrano Cinca C
Vargas SC
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A DEA Analysis of Risk, Cost, and Revenues in Insurance

Author: Boj E
C Mar-Molinero
Dunteman GH
Guillén M
I Contreras
Lemaire J
M M Segovia-Gonzalez
Neter J
Pujol M
Schiffman SS
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2009
Field of study

Insurance companies have to take risk and cost into account when pricing car insurance policies that cover the risk of private use of cars. In this paper we use data from 80 000 car insurance policies in order to assess, once risk and cost have been taken into account, the combinations of risk that generate the highest returns for the company under existing pricing practices. We use data envelopment analysis (DEA) and frame the study within an analysis of experiments context. The results of DEA are interpreted in a multivariate statistical analysis context using factor analysis, and property fitting techniques. The impact of risk factors in the efficiency is explored by means of regression analysis with dummy variables. There are consequences for the pricing policy of the company

Crossref

Kent Academic Repository