Search CORE

41 research outputs found

When Do Curricula Work in Federated Learning?

Author: Baek Woonjoon
Chen Chen
Kadaveru Sreevatsank
Kungurtsev Vyacheslav
Lin Bill
Shah Mubarak
Vahidian Saeed
Wang Weijia
Publication venue
Publication date: 24/12/2022
Field of study

An oft-cited open problem of federated learning is the existence of data heterogeneity at the clients. One pathway to understanding the drastic accuracy drop in federated learning is by scrutinizing the behavior of the clients' deep models on data with different levels of "difficulty", which has been left unaddressed. In this paper, we investigate a different and rarely studied dimension of FL: ordered learning. Specifically, we aim to investigate how ordered learning principles can contribute to alleviating the heterogeneity effects in FL. We present theoretical analysis and conduct extensive empirical studies on the efficacy of orderings spanning three kinds of learning: curriculum, anti-curriculum, and random curriculum. We find that curriculum learning largely alleviates non-IIDness. Interestingly, the more disparate the data distributions across clients the more they benefit from ordered learning. We provide analysis explaining this phenomenon, specifically indicating how curriculum training appears to make the objective landscape progressively less convex, suggesting fast converging iterations at the beginning of the training procedure. We derive quantitative results of convergence for both convex and nonconvex objectives by modeling the curriculum training on federated devices as local SGD with locally biased stochastic gradients. Also, inspired by ordered learning, we propose a novel client selection technique that benefits from the real-world disparity in the clients. Our proposed approach to client selection has a synergic effect when applied together with ordered learning in FL

arXiv.org e-Print Archive

MimoSA: a system for minimotif annotation

Author: Gryk Michael R
Kadaveru Krishna
Kundeti Vamsi
Meusburger Thomas
Nowling Ronald J
Rajasekaran Sanguthevar
Sargeant David
Schiller Martin R
Vyas Jay
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Minimotifs are short peptide sequences within one protein, which are recognized by other proteins or molecules. While there are now several minimotif databases, they are incomplete. There are reports of many minimotifs in the primary literature, which have yet to be annotated, while entirely novel minimotifs continue to be published on a weekly basis. Our recently proposed function and sequence syntax for minimotifs enables us to build a general tool that will facilitate structured annotation and management of minimotif data from the biomedical literature. Results We have built the MimoSA application for minimotif annotation. The application supports management of the Minimotif Miner database, literature tracking, and annotation of new minimotifs. MimoSA enables the visualization, organization, selection and editing functions of minimotifs and their attributes in the MnM database. For the literature components, Mimosa provides paper status tracking and scoring of papers for annotation through a freely available machine learning approach, which is based on word correlation. The paper scoring algorithm is also available as a separate program, TextMine. Form-driven annotation of minimotif attributes enables entry of new minimotifs into the MnM database. Several supporting features increase the efficiency of annotation. The layered architecture of MimoSA allows for extensibility by separating the functions of paper scoring, minimotif visualization, and database management. MimoSA is readily adaptable to other annotation efforts that manually curate literature into a MySQL database. Conclusions MimoSA is an extensible application that facilitates minimotif annotation and integrates with the Minimotif Miner database. We have built MimoSA as an application that integrates dynamic abstract scoring with a high performance relational model of minimotif syntax. MimoSA's TextMine, an efficient paper-scoring algorithm, can be used to dynamically rank papers with respect to context.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Nevada, Las Vegas Repository

Minimotif miner 2nd release: a database and web system for motif search

Author: Apweiler
Attwood
Balla
de Castro
Falquet
Geer
Goldfarb
Gu
Henikoff
J. Vyas
K. Kadaveru
Kadaveru
Kreegipuu
Letunic
M. R. Gryk
M. R. Schiller
M. W. Maciejewski
Marchler-Bauer
Monnet
N. Rubino
Negro
Obenauer
P. Gradie
Puntervoll
Rawlings
Rigoutsos
S. Balla
S. Rajasekaran
Su
T. Mi
V. Kundeti
Walmsley
Wheeler
Yedidia
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

Minimotif Miner (MnM) consists of a minimotif database and a web-based application that enables prediction of motif-based functions in user-supplied protein queries. We have revised MnM by expanding the database more than 10-fold to approximately 5000 motifs and standardized the motif function definitions. The web-application user interface has been redeveloped with new features including improved navigation, screencast-driven help, support for alias names and expanded SNP analysis. A sample analysis of prion shows how MnM 2 can be used. Weblink: http://mnm.engr.uconn.edu, weblink for version 1 is http://sms.engr.uconn.edu

CiteSeerX

Crossref

PubMed Central

University of Nevada, Las Vegas Repository

Minimotif Miner 3.0: database expansion and significantly improved reduction of false-positive predictions from consensus sequences

Author: A. W. Brooks
Aasland
Arold
Balla
Bodeus
C. A. Ross
C. L. Strong
Coates
Coleman
Coleman
Costa
D. P. Sargeant
Davey
Diella
Evans
Greenberg
Hornbeck
J. C. Merlin
Kadaveru
L. Y. Lee
Lam
Li
Lieber
London
M. R. Gryk
M. R. Schiller
Montecchi-Palazzi
Obenauer
P. Watts
Parthasarathi
Puntervoll
Rajasekaran
S. Deverasetty
S. Rajasekaran
Sargeant
T. J. Bill
T. Mi
V. Rathnayake
Vyas
Xenarios
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Minimotif Miner (MnM available at http://minimotifminer.org or http://mnm.engr.uconn.edu) is an online database for identifying new minimotifs in protein queries. Minimotifs are short contiguous peptide sequences that have a known function in at least one protein. Here we report the third release of the MnM database which has now grown 60-fold to approximately 300 000 minimotifs. Since short minimotifs are by their nature not very complex we also summarize a new set of false-positive filters and linear regression scoring that vastly enhance minimotif prediction accuracy on a test data set. This online database can be used to predict new functions in proteins and causes of disease

Crossref

PubMed Central

University of Nevada, Las Vegas Repository

Prediction of HIV-1 virus-host protein interactions using virus and host sequence motifs

Author: A Mehle
A Sodhi
AL Brass
Aydin Tozeren
BS Ramakrishna
E de Castro
EC Holmes
F Diella
FP Davis
G Dennis Jr
GZ Panos
H Dinkel
H Shelton
H Zhou
J Hemelaar
JF Roeth
JN Brown
JN Tournier
JR Morgan
K Kadaveru
Lyle Ungar
MD Dyer
N Evrard-Todeschi
N Hulo
O Tastan
P Patel
P Puntervoll
Perry Evans
R Byland
R König
R Tonikian
R Truant
RG Ptak
RJ Edwards
S Peri
SH Coleman
SH Tan
T Kuntzen
V Neduva
V Neduva
V Neduva
V Neduva
VR Panz
W Dampier
W Fu
W Lv
William Dampier
Publication venue: BioMed Central
Publication date: 01/05/2009
Field of study

Abstract Background Host protein-protein interaction networks are altered by invading virus proteins, which create new interactions, and modify or destroy others. The resulting network topology favors excessive amounts of virus production in a stressed host cell network. Short linear peptide motifs common to both virus and host provide the basis for host network modification. Methods We focused our host-pathogen study on the binding and competing interactions of HIV-1 and human proteins. We showed that peptide motifs conserved across 70% of HIV-1 subtype B and C samples occurred in similar positions on HIV-1 proteins, and we documented protein domains that interact with these conserved motifs. We predicted which human proteins may be targeted by HIV-1 by taking pairs of human proteins that may interact via a motif conserved in HIV-1 and the corresponding interacting protein domain. Results Our predictions were enriched with host proteins known to interact with HIV-1 proteins ENV, NEF, and TAT (p-value < 4.26E-21). Cellular pathways statistically enriched for our predictions include the T cell receptor signaling, natural killer cell mediated cytotoxicity, cell cycle, and apoptosis pathways. Gene Ontology molecular function level 5 categories enriched with both predicted and confirmed HIV-1 targeted proteins included categories associated with phosphorylation events and adenyl ribonucleotide binding. Conclusion A list of host proteins highly enriched with those targeted by HIV-1 proteins can be obtained by searching for host protein motifs along virus protein sequences. The resulting set of host proteins predicted to be targeted by virus proteins will become more accurate with better annotations of motifs and domains. Nevertheless, our study validates the role of linear binding motifs shared by virus and host proteins as an important part of the crosstalk between virus and host.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ScholarlyCommons@Penn

Host sequence motifs shared by HIV predict response to antiretroviral therapy

Author: A Ertel
A Matsukawa
A Mocroft
A Rambaut
AD Frankel
AE Kel
AL Brass
AL Brass
Aydin Tozeren
B Larder
C Van Lint
D Jacobs
D Li
DM Moore
E De Clercq
F Longo
GM Lucas
H Vermeiren
J Castilla
J Fellay
J Huang
J Mulder
JA Levy
JW Pinney
K Kadaveru
L Nanni
LM Mansky
Lyle Ungar
M Hariharan
M Kanehisa
M Rehmsmeier
M Rosen-Zvi
ME Garber
MH Katz
MK Kuhner
MV Rockman
N Beerenwinkel
N Beerenwinkel
NR Draper
P Puntervoll
Perry Evans
R König
RG Ptak
RH Stauber
RI Connor
RM Biondi
RM Grant
RW Shafer
S Scheer
SF Altschul
SG Deeks
SG Deeks
SY Rhee
V Matys
V Nair
VA Johnson
VA Johnson
VA Johnson
William Dampier
WM Kati
Y He
Y Pommier
Y Pommier
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The HIV viral genome mutates at a high rate and poses a significant long term health risk even in the presence of combination antiretroviral therapy. Current methods for predicting a patient's response to therapy rely on site-directed mutagenesis experiments and <it>in vitro </it>resistance assays. In this bioinformatics study we treat response to antiretroviral therapy as a two-body problem: response to therapy is considered to be a function of both the host and pathogen proteomes. We set out to identify potential responders based on the presence or absence of host protein and DNA motifs on the HIV proteome. Results An alignment of thousands of HIV-1 sequences attested to extensive variation in nucleotide sequence but also showed conservation of eukaryotic short linear motifs on the protein coding regions. The reduction in viral load of patients in the Stanford HIV Drug Resistance Database exhibited a bimodal distribution after 24 weeks of antiretroviral therapy, with 2,000 copies/ml cutoff. Similarly, patients allocated into responder/non-responder categories based on consistent viral load reduction during a 24 week period showed clear separation. In both cases of phenotype identification, a set of features composed of short linear motifs in the reverse transcriptase region of HIV sequence accurately predicted a patient's response to therapy. Motifs that overlap resistance sites were highly predictive of responder identification in single drug regimens but these features lost importance in defining responders in multi-drug therapies. Conclusion HIV sequence mutates in a way that preferentially preserves peptide sequence motifs that are also found in the human proteome. The presence and absence of such motifs at specific regions of the HIV sequence is highly predictive of response to therapy. Some of these predictive motifs overlap with known HIV-1 resistance sites. These motifs are well established in bioinformatics databases and hence do not require identification via <it>in vitro </it>mutation experiments.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Sequence- and Interactome-Based Prediction of Viral Protein Hotspots Targeting Host Proteins: A Case Study for HIV Nef

Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance but is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear crowding the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Methyl-donor depletion of head and neck cancer cells in vitro establishes a less aggressive tumour cell phenotype

Author: A Berndt
A Ly
Abeir Elmogassabi
AJ Lyons
AK Chaturvedi
AR Kreimer
B Hultberg
BC Blount
BJ Altman
C Pelucchi
CJ Piyathilake
CM Ulrich
Craig Murdoch
D Garcia-Crespo
DW Lin
DZ Cao
E Balada
E Nakano
F Friedl
G Bistulfi
G Gatta
G Nikolaidis
GA Kune
HC Hsu
Hilary J. Powers
I Hayashi
J Sun
JE Flatley
JE Flatley
JM Stempak
JS White
K Ghoshal
K Kadaveru
K Nakano
L Graziosi
LF Petersen
LP Deiss
M Okano
MD Niculescu
MM Doucette
MS Jhaveri
N Poomipark
N Titenko-Holland
ND Freedman
PK Ha
PM Kempen van
Q Gao
R Towle
RC Leung
Rosanna Lowe
RW Friesen
S Deghan Manshadi
S Ito
S Mizuno
S Xiao
SE Chang
SJ Duthie
SJ Duthie
SJ Moat
SR Rangan
SS Napier
T Bestor
TK Hoffmann
TP Wang
Vanessa Hearnden
VP Kovacheva
W Shin
WF Scherer
X Xie
Y Lin
YI Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

PURPOSE: DNA methylation plays a fundamental role in the epigenetic control of carcinogenesis and is, in part, influenced by the availability of methyl donors obtained from the diet. In this study, we developed an in-vitro model to investigate whether methyl donor depletion affects the phenotype and gene expression in head and neck squamous cell carcinoma (HNSCC) cells. METHODS: HNSCC cell lines (UD-SCC2 and UPCI-SCC72) were cultured in medium deficient in methionine, folate, and choline or methyl donor complete medium. Cell doubling-time, proliferation, migration, and apoptosis were analysed. The effects of methyl donor depletion on enzymes controlling DNA methylation and the pro-apoptotic factors death-associated protein kinase-1 (DAPK1) and p53 upregulated modulator of apoptosis (PUMA) were examined by quantitative-PCR or immunoblotting. RESULTS: HNSCC cells cultured in methyl donor deplete conditions showed significantly increased cell doubling times, reduced cell proliferation, impaired cell migration, and a dose-dependent increase in apoptosis when compared to cells cultured in complete medium. Methyl donor depletion significantly increased the gene expression of DNMT3a and TET-1, an effect that was reversed upon methyl donor repletion in UD-SCC2 cells. In addition, expression of DAPK1 and PUMA was increased in UD-SCC2 cells cultured in methyl donor deplete compared to complete medium, possibly explaining the observed increase in apoptosis in these cells. CONCLUSION: Taken together, these data show that depleting HNSCC cells of methyl donors reduces the growth and mobility of HNSCC cells, while increasing rates of apoptosis, suggesting that a methyl donor depleted diet may significantly affect the growth of established HNSCC

Crossref

Springer - Publisher Connector

White Rose Research Online

HIV Protein Sequence Hotspots for Crosstalk with Host Hub Proteins

Author: A Greenway
A Henschel
AC Vendel
AK Dunker
Aydin Tozeren
B Ackerson
CL Ruegg
CM Gould
D Ekman
Denis Dupuy
DR Borger
EE Hill
EF Pettersen
F Cardarelli
F Diella
F Meggio
F Meggio
G Baier-Bitterlich
H Jian
H Li
H Wang
HM Berman
HM Craig
J Friborg
JE Dickerson
K Harada
K Kadaveru
K Saksela
KV Prasad
L Deng
M Hiipakka
M Matsubara
M Schindler
MA Dimattia
MA Perez
Mahdi Sarmady
MD Dyer
MR Schaefer
N Arhel
NE Davey
NE Davey
O Haffar
O Tastan
O Tastan
P Abada
P Bayer
P Beauparlant
P Evans
P Evans
RJ Edwards
S Balakrishnan
S Betzi
S Grzesiek
S Sei
SH Tan
SK Srinivas
SS Chen
T Ammosova
T Kino
TH Tahirov
TS Keshava Prasad
V Neduva
V Neduva
W Fu
W Radding
William Dampier
X Yang
X Yang
Y He
Y Liu
Z Nie
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

HIV proteins target host hub proteins for transient binding interactions. The presence of viral proteins in the infected cell results in out-competition of host proteins in their interaction with hub proteins, drastically affecting cell physiology. Functional genomics and interactome datasets can be used to quantify the sequence hotspots on the HIV proteome mediating interactions with host hub proteins. In this study, we used the HIV and human interactome databases to identify HIV targeted host hub proteins and their host binding partners (H2). We developed a high throughput computational procedure utilizing motif discovery algorithms on sets of protein sequences, including sequences of HIV and H2 proteins. We identified as HIV sequence hotspots those linear motifs that are highly conserved on HIV sequences and at the same time have a statistically enriched presence on the sequences of H2 proteins. The HIV protein motifs discovered in this study are expressed by subsets of H2 host proteins potentially outcompeted by HIV proteins. A large subset of these motifs is involved in cleavage, nuclear localization, phosphorylation, and transcription factor binding events. Many such motifs are clustered on an HIV sequence in the form of hotspots. The sequential positions of these hotspots are consistent with the curated literature on phenotype altering residue mutations, as well as with existing binding site data. The hotspot map produced in this study is the first global portrayal of HIV motifs involved in altering the host protein network at highly connected hub nodes

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central