Search CORE

159 research outputs found

Predicting self‐declared movie watching behavior using Facebook data and information‐fusion sensitivity analysis

Author: Arias M.
Berk R. A.
Biau G.
Bogaert M.
Culp M.
Demšar J.
Dietterich T. G.
El Assady M.
Freund Y.
Freund Y.
Gupta A.
Hernandez‐Orallo J.
Jain V.
Langley P.
Liaw A.
Liu Y.
Mishne G.
Pham X. H.
Prinzie A.
Rish I.
Said A.
Van den Poel D.
Publication venue: 'Wiley'
Publication date: 23/07/2019
Field of study

The main purpose of this paper is to evaluate the feasibility of predicting whether yes or no a Facebook user has self-reported to have watched a given movie genre. Therefore, we apply a data analytical framework that (1) builds and evaluates several predictive models explaining self-declared movie watching behavior, and (2) provides insight into the importance of the predictors and their relationship with self-reported movie watching behavior. For the first outcome, we benchmark several algorithms (logistic regression, random forest, adaptive boosting, rotation forest, and naive Bayes) and evaluate their performance using the area under the receiver operating characteristic curve. For the second outcome, we evaluate variable importance and build partial dependence plots using information-fusion sensitivity analysis for different movie genres. To gather the data, we developed a custom native Facebook app. We resampled our dataset to make it representative of the general Facebook population with respect to age and gender. The results indicate that adaptive boosting outperforms all other algorithms. Time- and frequency-based variables related to media (movies, videos, and music) consumption constitute the list of top variables. To the best of our knowledge, this study is the first to fit predictive models of self-reported movie watching behavior and provide insights into the relationships that govern these models. Our models can be used as a decision tool for movie producers to target potential movie-watchers and market their movies more efficiently

Crossref

Ghent University Academic Bibliography

Edinburgh Research Explorer

Active learning and search on low-rank matrices

Author: Boutilier C.
Eriksson A.
Garnett R.
Garnett R.
Hofmann T.
Jin R.
Murphy R. F.
Neal R. M.
Rish I.
Salakhutdinov R.
Srebro N.
Yu K.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2013
Field of study

Collaborative prediction is a powerful technique, useful in domains from recommender systems to guiding the scien-tific discovery process. Low-rank matrix factorization is one of the most powerful tools for collaborative prediction. This work presents a general approach for active collabora-tive prediction with the Probabilistic Matrix Factorization model. Using variational approximations or Markov chain Monte Carlo sampling to estimate the posterior distribution over models, we can choose query points to maximize our un-derstanding of the model, to best predict unknown elements of the data matrix, or to find as many “positive ” data points as possible. We evaluate our methods on simulated data, and also show their applicability to movie ratings prediction and the discovery of drug-target interactions

CiteSeerX

Crossref

Using naive bayes to detect spammy names in social networks

Author: Androutsopoulos I.
Benevenuto F.
Bennett P. N.
Cao Q.
Domingos P.
Duda R.
Hovold J.
Kohavi R.
McCallum A.
Metsis V.
Rish I.
Sahami M.
Wang G.
Zhang H.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Observational and genetic associations between cardiorespiratory fitness and cancer:A UK Biobank and international consortia study

Background: The association of fitness with cancer risk is not clear. Methods: We used Cox proportional hazards models to estimate hazard ratios (HRs) and 95% confidence intervals (CIs) for risk of lung, colorectal, endometrial, breast, and prostate cancer in a subset of UK Biobank participants who completed a submaximal fitness test in 2009-12 (N = 72,572). We also investigated relationships using two-sample Mendelian randomisation (MR), odds ratios (ORs) were estimated using the inverse-variance weighted method. Results: After a median of 11 years of follow-up, 4290 cancers of interest were diagnosed. A 3.5 ml O 2⋅min −1⋅kg −1 total-body mass increase in fitness (equivalent to 1 metabolic equivalent of task (MET), approximately 0.5 standard deviation (SD)) was associated with lower risks of endometrial (HR = 0.81, 95% CI: 0.73–0.89), colorectal (0.94, 0.90–0.99), and breast cancer (0.96, 0.92–0.99). In MR analyses, a 0.5 SD increase in genetically predicted O 2⋅min −1⋅kg −1 fat-free mass was associated with a lower risk of breast cancer (OR = 0.92, 95% CI: 0.86–0.98). After adjusting for adiposity, both the observational and genetic associations were attenuated. Discussion: Higher fitness levels may reduce risks of endometrial, colorectal, and breast cancer, though relationships with adiposity are complex and may mediate these relationships. Increasing fitness, including via changes in body composition, may be an effective strategy for cancer prevention.</p

Edinburgh Research Explorer

ОЦЕНКА ЦИТОТОКСИЧНОСТИ ТРИХОТЕЦЕНА FUSARIUM SP. НА ЛИНИЮ РАКА МОЛОЧНОЙ ЖЕЛЕЗЫ IN VITRO

Author: A. Glinushkin P.
A. Nabatov A.
I. Idiyatov I.
L. Valiullin R.
Rin. Mukhammadiev S.
Rish. Mukhammadiev S.
V. Biryulya V.
А. Глинушкин П.
А. Набатов А.
В. Бирюля В.
И. Идиятов И.
Л. Валиуллин Р.
Рин. Мухаммадиев С.
Риш. Мухаммадиев С.
Publication venue: 'Tomsk Cancer Research Institute'
Publication date: 05/01/2020
Field of study

trichothecenes and their derivatives have recently attracted much attention of researchers with respect of their potential role in medicine, including for the treatment of different types of cancer. The purpose of the study was to investigate the cytotoxic effect of Fusarium trichothecene on human breast cancer cells, human skin fibroblasts and embryonic kidney cells in vitro. Material and methods. Based on the Mtt assay, the cytotoxic effect of trichothecene on cell cultures was determined. Evaluation of morphological changes in cell cultures under the influence of trichothecene was performed by light microscopy. Results. Fusarium trichothecene was found to exhibit a dose-dependent toxic effect on cell lines in the range 1 nM to 1000 nM. the most pronounced cytotoxic effect of trichothecene was observed in human breast cancer cell line (IС50 94.72 ± 4.1 нМ). Lower doses of trichothecene led to a change in the size, shape of human breast cancer cells, human skin fibroblasts and embryonic kidney cells, and loss of contact between them and their isolation. the degradation of cell membranes, formation of unformed cell aggregates and fragments were observed in higher doses of trichothecene. Conclusion. Fusarium trichothecen is a biologically active compound and is less toxic to the normal than to the cancer cell lines, therefore, further studies of this agent are needed.В последнее время трихотеценовые соединения и их производные привлекают внимание исследователей в связи с их потенциальной возможностью применения в медицине, в том числе для лечения различных видов рака. Цель исследования – изучение цитотоксического действия трихотецена Fusarium sp. в отношении линий опухолевых клеток рака молочной железы, нормальных клеток фибробластов кожи и почек эмбриона человека in vitro. Материал и методы. С использованием общепринятого метода МТТ-теста проводилось определение цитотоксического действия трихотецена в отношении исследуемых культур клеток. Оценку изменения в морфологии клеток под воздействием трихотецена проводили методом световой микроскопии. Результаты. Было обнаружено, что трихотецен Fusarium sp. в диапазоне концентрации 1–1000 нM проявлял дозозависимое токсическое действие в отношении исследуемых линий клеток. Наиболее выраженное цитотоксическое действие трихотецена наблюдали при его действии на линию опухолевых клеток молочной железы (IС50 94,72 ± 4,1 нМ). Совместная инкубация трихотецена с линиями клеток рака молочной железы, клеток фибробластов кожи и почек эмбриона человека в более низких дозах приводила к изменению размеров, формы клеток, потере контактов между ними и их обособлению. При более высоких дозах трихотецена наблюдалась деградация мембран, образование неоформленных клеточных агрегатов и фрагментов (апоптозных тел). Заключение. Трихотецен Fusarium sp. обладает биологически активным потенциалом и является менее токсичным по отношению к нормальным клеткам человека по сравнению с опухолевыми, поэтому его целесообразно в дальнейшем исследовать как возможного противоопухолевого агента

Siberian journal of oncology / Сибирский онкологический журнал

Multi-Task Learning for Interpretation of Brain Decoding Models

Author: B Afshin-Pour
BR Conroy
DD Cox
DM Groppe
E Maris
E Maris
FJ Valverde-Albacete
H Zou
I Rish
L Grosenick
M Brecht de
M Gerven van
M Yuan
MK Carroll
PM Rasmussen
R Tibshirani
RN Henson
SC Strother
T Zhang
Publication venue: Springer
Publication date
Field of study

Improving the interpretability of multivariate models is of primary interest for many neuroimaging studies. In this study, we present an application of multi-task learning (MTL) to enhance the interpretability of linear classifiers once applied to neuroimaging data. To attain our goal, we propose to divide the data into spatial fractions and define the temporal data of each spatial unit as a task in MTL paradigm. Our result on magnetoencephalography (MEG) data reveals preliminary evidence that, (1) dividing the brain recordings into spatial fractions based on spatial units of data and (2) considering each spatial fraction as a task, are two factors that provide more stability and consequently more interpretability for brain decoding models

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Naive Bayes ant colony optimization for designing high dimensional experiments

Author: Baragona
Berni
Bickel
Blum
Blum
Blum
Blum
Borrotti
Branden
Caschera
D. De Lucrezia
Damborsky
De Jong
Dorigo
Dorigo
Dorigo
Dorigo
Ferrari
Forlin
G. Minervini
Gambardella
Garlapati
Goldberg
Holland
I. Poli
Ji
Jones
Lindsay
Longhi
M. Borrotti
Minervini
Mitchell
Mohsen
Montemanni
Nestl
Pellegrini
Rish
Rosen
Rubinstein
Sahami
Sambo
Shyu
Slanzi
Stützle
Tang
Ullah
Ullah
Yang
Yousef
Zanghellini
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016
Field of study

In a large number of experimental problems, high dimensionality of the search area and economical constraints can severely limit the number of experimental points that can be tested. Within these constraints, classical optimization techniques perform poorly, in particular, when little a priori knowledge is available. In this work we investigate the possibility of combining approaches from statistical modeling and bio-inspired algorithms to effectively explore a huge search space, sampling only a limited number of experimental points. To this purpose, we introduce a novel approach, combining ant colony optimization (ACO) and naive Bayes classifier (NBC) that is, the naive Bayes ant colony optimization (NACO) procedure. We compare NACO with other similar approaches developing a simulation study. We then derive the NACO procedure with the goal to design artificial enzymes with no sequence homology to the extant one. Our final aim is to mimic the natural fold of 200 amino acids 1AGY serine esterase from Fusarium solani

Archivio Ricerca Ca'Foscari

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Combining Asian and European genome-wide association studies of colorectal cancer improves risk prediction across racial and ethnic populations

Author: Ahn Yoon-ok
Arndt Volker
Berndt Sonja I.
Bishop D. Timothy
Brenner Hermann
Brezina Stefanie
Buchanan Daniel D.
Bujanda Luis
Burnett-hartman Andrea
Cai Qiuyin
Campbell Peter T.
Carreras-torres Robert
Casey Graham
Castellví-bel Sergi
Chan Andrew T.
Chang-claude Jenny
Chanock Stephen J.
Chen Zhishan
Conti David V.
Corley Douglas A.
Crosslin David R.
Diez-obrero Virginia
Du Mulong
Dunlop Malcolm G.
D’amato Mauro
Fernandez-rozadilla Ceres
Figueiredo Jane C.
Gignoux Chris
Giles Graham G.
Greenson Joel
Gruber Stephen B.
Gsur Andrea
Gunter Marc J.
Haile Robert W.
Haiman Christopher A.
Hampe Jochen
Hampel Heather
Harrison Tabitha A.
Hauser Elizabeth
Hayes Richard B.
Hoffmeister Michael
Hopper John L.
Houlston Richard S.
Hsu Li
Huyghe Jeroen R.
Iwasaki Motoki
Jarvik Gail P.
Jee Sun Ha
Jenkins Mark A.
Jia Wei-hua
Jiang Shangqing
Jung Keum Ji
Keku Temitope O.
Kim Dong-hyun
Kim Jeongseon
Kim Michelle
Kweon Sun-seog
Küry Sébastien
Lansdorp Vogelaar Iris
Law Philip J.
Le Marchand Loic
Lee Jeffrey K.
Lee Soo Chin
Li Christopher I.
Li Li
Lin Yi
Lindblom Annika
Matejcic Marco
Matsuda Koichi
Matsuo Keitaro
Meester Reinier
Moreno Victor
Murphy Neil
Newcomb Polly A.
Newton Christina
Oh Jae Hwan
Oze Isao
Pai Rish K.
Palmer Julie R.
Pearlman Rachel
Peters Ulrike
Pharoah Paul D. P.
Phipps Amanda I.
Platz Elizabeth A.
Potter John D.
Prentice Ross L.
Qu Conghui
Rennert Gad
Rosenthal Elisabeth A.
Sakoda Lori C.
Sawada Norie
Schmit Stephanie L.
Schneider Jennifer L.
Schoen Robert E.
Schumacher Fredrick R.
Shin Aesun
Shu Xiao-ou
Siegel Erin
Slattery Martha L.
Song Mingyang
Stadler Zsofia K.
Steinfelder Robert S.
Su Yu-ru
Tangen Catherine M.
Thomas Minta
Timofeeva Maria N.
Tomlinson Ian P.
Tsugane Shoichiro
Udaltsova Natalia
Ulrich Cornelia M.
Um Caroline Y.
Vadaparampil Susan
Van Duijnhoven Franzel J. B.
Van Guelpen Bethany
Vandenputtelaar Rosita
Visvanathan Kala
Vodicka Pavel
Vodickova Ludmila
Vymetalkova Veronika
Wang Meilin
Weinstein Stephanie J.
White Emily
Wolk Alicja
Woods Michael O.
Wu Anna H.
Yamaji Taiki
Zauber Ann G.
Zheng Jiayin
Zheng Wei
Publication venue: Springer Science and Business Media LLC
Publication date: 28/11/2023
Field of study

Polygenic risk scores (PRS) have great potential to guide precision colorectal cancer (CRC) prevention by identifying those at higher risk to undertake targeted screening. However, current PRS using European ancestry data have sub-optimal performance in non-European ancestry populations, limiting their utility among these populations. Towards addressing this deficiency, we expand PRS development for CRC by incorporating Asian ancestry data (21,731 cases; 47,444 controls) into European ancestry training datasets (78,473 cases; 107,143 controls). The AUC estimates (95% CI) of PRS are 0.63(0.62-0.64), 0.59(0.57-0.61), 0.62(0.60-0.63), and 0.65(0.63-0.66) in independent datasets including 1681-3651 cases and 8696-115,105 controls of Asian, Black/African American, Latinx/Hispanic, and non-Hispanic White, respectively. They are significantly better than the European-centric PRS in all four major US racial and ethnic groups (p-values < 0.05). Further inclusion of non-European ancestry populations, especially Black/African American and Latinx/Hispanic, is needed to improve the risk prediction and enhance equity in applying PRS in clinical practice

Diposit Digital de la Universitat de Barcelona

A Genetic Locus within the FMN1/GREM1 Gene Region Interacts with Body Mass Index in Colorectal Cancer Risk

Author: Aglago Elom K.
Albanes Demetrius
Arndt Volker
Barry Elizabeth L.
Baurley James W.
Berndt Sonja I.
Bien Stephanie A.
Bishop D. Timothy
Bouras Emmanouil
Brenner Hermann
Buchanan Daniel D.
Budiarto Arif
Campbell Peter T.
Carreras-torres Robert
Casey Graham
Cenggoro Tjeng Wawan
Chan Andrew T.
Chang-claude Jenny
Chen Xuechen
Conti David V.
Devall Matthew
Diez-obrero Virginia
Dimou Niki
Drew David
Evangelou Marina
Figueiredo Jane C.
Gallinger Steven
Gauderman W. James
Giles Graham G.
Gruber Stephen B.
Gsur Andrea
Gunter Marc J.
Hampel Heather
Harlid Sophia
Harrison Tabitha A.
Hidaka Akihisa
Hoffmeister Michael
Hsu Li
Huyghe Jeroen R.
Jenkins Mark A.
Jordahl Kristina
Joshi Amit D.
Kawaguchi Eric S.
Keku Temitope O.
Kim Andre
Kundaje Anshul
Larsson Susanna C.
Lewinger Juan Pablo
Li Li
Lin Yi
Lynch Brigid M.
Mahesworo Bharuno
Mandic Marko
Marchand Loic Le
Moreno Victor
Morrison John
Murphy Neil
Nan Hongmei
Nassir Rami
Newcomb Polly A.
Obón-santacana Mireia
Ogino Shuji
Ose Jennifer
Pai Rish K.
Palmer Julie R.
Papadimitriou Nikos
Pardamean Bens
Peoples Anita R.
Peters Ulrike
Platz Elizabeth A.
Potter John D.
Prentice Ross L.
Qu Conghui
Ren Yu
Rennert Gad
Ruiz-narvaez Edward
Sakoda Lori C.
Scacheri Peter C.
Schmit Stephanie L.
Schoen Robert E.
Shcherbina Anna
Slattery Martha L.
Stern Mariana C.
Su Yu-ru
Tangen Catherine M.
Thibodeau Stephen N.
Thomas Duncan C.
Tian Yu
Tsilidis Konstantinos K.
Ulrich Cornelia M.
Van Duijnhoven Franzel Jb
Van Guelpen Bethany
Visvanathan Kala
Vodicka Pavel
Wang Jun
White Emily
Wolk Alicja
Woods Michael O.
Wu Anna H.
Zemlianskaia Natalia
Publication venue: American Association for Cancer Research (AACR)
Publication date: 18/09/2023
Field of study

Colorectal cancer risk can be impacted by genetic, environmental, and lifestyle factors, including diet and obesity. Geneenvironment interactions (G x E) can provide biological insights into the effects of obesity on colorectal cancer risk. Here, we assessed potential genome-wide G x E interactions between body mass index (BMI) and common SNPs for colorectal cancer risk using data from 36,415 colorectal cancer cases and 48,451 controls from three international colorectal cancer consortia (CCFR, CORECT, and GECCO). The G x E tests included the conventional logistic regression using multiplicative terms (one degree of freedom, 1DF test), the two-step EDGE method, and the joint 3DF test, each of which is powerful for detecting G x E interactions under specific conditions. BMI was associated with higher colorectal cancer risk. The two-step approach revealed a statistically significant GxBMI interaction located within the Formin 1/Gremlin 1 (FMN1/GREM1) gene region (rs58349661). This SNP was also identified by the 3DF test, with a suggestive statistical significance in the 1DF test. Among participants with the CC genotype of rs58349661, overweight and obesity categories were associated with higher colorectal cancer risk, whereas null associations were observed across BMI categories in those with the TT genotype. Using data from three large international consortia, this study discovered a locus in the FMN1/GREM1 gene region that interacts with BMI on the association with colorectal cancer risk. Further studies should examine the potential mechanisms through which this locus modifies the etiologic link between obesity and colorectal cancer

Diposit Digital de la Universitat de Barcelona

Application of a hierarchical enzyme classification method reveals the role of gut microbiome in human metabolism

Author: A Bairoch
A Bairoch
A Bairoch
A Bateman
a McCallum
A Mohammed
AK Arakaki
Akram Mohammed
BL Cantarel
BYM Cheng
C Cortes
Chittibabu Guda
D Desai
D Kelly
D Wilson
DE Almonacid
E de Castro
E Nasibov
EC Webb
F Bäckhed
HB Shen
I Rish
I Shah
IR Sanderson
J Espadaler
J Qin
JH Cummings
JK Nicholson
KC Chou
L Breiman
L Lu
LC Borro
LS Johnson
M Blaut
M Hall
M Kanehisa
M Magrane
M Röttig
MB Jeremy
ME Dumas
P Lepage
PD Cani
PJ Turnbaugh
R Development Core Team
R Duda
R Shi
S Devaraj
S Greenblum
S Schmidt
SS Hung
T Cover
T Sousa
TA Clayton
TD Otto
TM Mitchell
U Syed
U Weingart
V Hooper L
VR Abratt
W Iba
W Li
W Tian
YC Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref