Search CORE

227 research outputs found

Propagation of charged particle waves in a uniform magnetic field

Author: Arnulfo Gonzalez
Christian Bracher
D. J. Griffiths
E. N. Economou
F. I. Dalidchik
F. I. Dalidchik
F. W. J. Olver
I. I. Fabrikant
I. I. Fabrikant
J. F. Nye
L. S. Schulman
M. C. Gutzwiller
M. V. Berry
P. A. Golovinskii
R. Thom
T. Kramer
T. M. Apostol
T. Poston
V. P. Maslov
V. Z. Slonim
V. Z. Slonim
Y. N. Demkov
Y. N. Demkov
Publication venue: 'American Physical Society (APS)'
Publication date: 28/06/2012
Field of study

This paper considers the probability density and current distributions generated by a point-like, isotropic source of monoenergetic charges embedded into a uniform magnetic field environment. Electron sources of this kind have been realized in recent photodetachment microscopy experiments. Unlike the total photocurrent cross section, which is largely understood, the spatial profiles of charge and current emitted by the source display an unexpected hierarchy of complex patterns, even though the distributions, apart from scaling, depend only on a single physical parameter. We examine the electron dynamics both by solving the quantum problem, i. e., finding the energy Green function, and from a semiclassical perspective based on the simple cyclotron orbits followed by the electron. Simulations suggest that the semiclassical method, which involves here interference between an infinite set of paths, faithfully reproduces the features observed in the quantum solution, even in extreme circumstances, and lends itself to an interpretation of some (though not all) of the rich structure exhibited in this simple problem.Comment: 39 pages, 16 figure

arXiv.org e-Print Archive

Crossref

Motif Discovery through Predictive Modeling of Gene Regulation

Author: A. Battle
A.P. Gasch
C.E. Lawrence
E. Segal
E. Segal
E. Wingender
E.M. Conlon
G.Z. Hertz
H.J. Bussemaker
J.D. Hughes
N. Slonim
R.E. Schapire
T. Cover
T.I. Lee
T.L. Bailey
Y. Pilpel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

We present MEDUSA, an integrative method for learning motif models of transcription factor binding sites by incorporating promoter sequence and gene expression data. We use a modern large-margin machine learning approach, based on boosting, to enable feature selection from the high-dimensional search space of candidate binding sequences while avoiding overfitting. At each iteration of the algorithm, MEDUSA builds a motif model whose presence in the promoter region of a gene, coupled with activity of a regulator in an experiment, is predictive of differential expression. In this way, we learn motifs that are functional and predictive of regulatory response rather than motifs that are simply overrepresented in promoter sequences. Moreover, MEDUSA produces a model of the transcriptional control logic that can predict the expression of any gene in the organism, given the sequence of the promoter region of the target gene and the expression state of a set of known or putative transcription factors and signaling molecules. Each motif model is either a

k

-length sequence, a dimer, or a PSSM that is built by agglomerative probabilistic clustering of sequences with similar boosting loss. By applying MEDUSA to a set of environmental stress response expression data in yeast, we learn motifs whose ability to predict differential expression of target genes outperforms motifs from the TRANSFAC dataset and from a previously published candidate set of PSSMs. We also show that MEDUSA retrieves many experimentally confirmed binding sites associated with environmental stress response from the literature.Comment: RECOMB 200

arXiv.org e-Print Archive

CiteSeerX

Crossref

A genetic algorithm for interpretable model extraction from decision tree ensembles

Author: A Assche Van
DK Slonim
H Kargupta
JH Holland
JR Quinlan
L Breiman
L Breiman
RC Barros
TG Dietterich
W-Y Loh
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Models obtained by decision tree induction techniques excel in being interpretable. However, they can be prone to overfitting, which results in a low predictive performance. Ensemble techniques provide a solution to this problem, and are hence able to achieve higher accuracies. However, this comes at a cost of losing the excellent interpretability of the resulting model, making ensemble techniques impractical in applications where decision support, instead of decision making, is crucial. To bridge this gap, we present the genesim algorithm that transforms an ensemble of decision trees into a single decision tree with an enhanced predictive performance while maintaining interpretability by using a genetic algorithm. We compared genesim to prevalent decision tree induction algorithms, ensemble techniques and a similar technique, called ism, using twelve publicly available data sets. The results show that genesim achieves better predictive performance on most of these data sets compared to decision tree induction techniques & ism. The results also show that genesim's predictive performance is in the same order of magnitude as the ensemble techniques. However, the resulting model of genesim outperforms the ensemble techniques regarding interpretability as it has a very low complexity

Crossref

Ghent University Academic Bibliography

Ballistic matter waves with angular momentum: Exact solutions and applications

Author: A. Chikkatur
A. Fetter
A. Lohr
B. Bayman
B. Gottlieb
C. Blondel
C. Blondel
C. Bracher
C. Bracher
C. Nicole
Christian Bracher
D. Butts
E. Luc-Koenig
E. Rowe
E. Weniger
E. Wigner
F. Dalidchik
G. Gountaroulis
G. Möllenstedt
H. Bryant
H. Wong
I. Bloch
I. Fabrikant
I. Fabrikant
I. Fabrikant
I. Fabrikant
J. Abo-Shaeer
J. Tersoff
K. Madison
L. Hostler
L. Hostler
M. Caola
M. Du
M. Du
M. Du
M. Matthews
M.-O. Mewes
Manfred Kleber
N. Gibson
N. Gibson
N. Gibson
N. Manakov
P. Engels
P. Golovinskii
P. Haljan
S. Chakrabarti
S. Horch
T. Kramer
T. Kramer
T.-L. Ho
Tobias Kramer
V. Bakhrakh
V. Dodonov
V. Kondratovich
V. Kondratovich
V. Slonim
Y. Demkov
Y. Japha
Y. Li
Publication venue: 'American Physical Society (APS)'
Publication date: 28/08/2002
Field of study

An alternative description of quantum scattering processes rests on inhomogeneous terms amended to the Schroedinger equation. We detail the structure of sources that give rise to multipole scattering waves of definite angular momentum, and introduce pointlike multipole sources as their limiting case. Partial wave theory is recovered for freely propagating particles. We obtain novel results for ballistic scattering in an external uniform force field, where we provide analytical solutions for both the scattering waves and the integrated particle flux. Our theory directly applies to p-wave photodetachment in an electric field. Furthermore, illustrating the effects of extended sources, we predict some properties of vortex-bearing atom laser beams outcoupled from a rotating Bose-Einstein condensate under the influence of gravity.Comment: 42 pages, 8 figures, extended version including photodetachment and semiclassical theor

arXiv.org e-Print Archive

Crossref

Machine Learning in Automated Text Categorization

Author: ANDROUTSOPOULOS I.
ATTARDI G.
BAKER L.D.
BIEBRICHER P.
CAROPRESO M.F.
CAVNAR W.B.
CHAKRABARTI S.
CLACK C.
CLEVERDON C.
COHEN W. W.
COHEN W. W.
COHEN W.W.
DAGAN I.
DEERWESTER S.
DENOYER L.
DIAZ ESTEBAN A.
DRUCKER H.
DUMAIS S.T.
DUMAIS S.T.
ESCUDERO G.
Fabrizio Sebastiani
FIELD B.
FORSYTH R. S.
FUHR N.
FUHR N.
FUHR N.
FURNKRANZ J.
GALAVOTTI L.
GALE W. A.
GOVERT N.
GRAY W.A.
GUTHRIE L.
HAYES P.J.
HEAPS H.
HERSH W.
HULL D. A.
HULL D. A.
ITTNER D.J.
IWAYAMA M.
IYER R.D.
JOACHIMS T.
JOACHIMS T.
JOACHIMS T.
JOHN G. H.
JUNKER M.
JUNKER M.
KESSLER B.
KIM Y.-H.
KLINKENBERG R.
KNORZ G.
KOLLER D.
LAM S.L.
LAM W.
LAM W.
LANG K.
LARKEY L. S.
LARKEY L. S.
LARKEY L.S.
LEWIS D. D.
LEWIS D. D.
LEWIS D. D.
LEWIS D. D.
LEWIS D.D.
LEWIS D.D.
LEWIS D.D.
LEWIS D.D.
LEWIS D.D.
LI H.
LI Y.H.
LIERE R.
LIM J. H.
MASAND B.
MASAND B.
MCCALLUM A. K.
MCCALLUM A.K.
MLADENIC D.
MLADENIC D.
MOULINIER I.
MOULINIER I.
MYERS K.
NG H.T.
OH H.-J.
PAZIENZA M. T.
RILOFF E.
ROBERTSON S.E.
ROBERTSON S.E.
ROTH D.
RUIZ M.E.
SABLE C.L.
SARACEVIC T.
SCHAPIRE R. E.
SCHUTZE H.
SCHUTZE H.
SCOTT S.
SEBASTIANI F.
SINGHAL A.
SLONIM N.
TAIRA H.
TUMER K.
TZERAS K.
VAN RIJSBERGEN C. J.
WIENER E.D.
YANG Y.
YANG Y.
YANG Y.
YANG Y.
YU K.L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2001
Field of study

The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. The advantages of this approach over the knowledge engineering approach (consisting in the manual definition of a classifier by domain experts) are a very good effectiveness, considerable savings in terms of expert manpower, and straightforward portability to different domains. This survey discusses the main approaches to text categorization that fall within the machine learning paradigm. We will discuss in detail issues pertaining to three different problems, namely document representation, classifier construction, and classifier evaluation.Comment: Accepted for publication on ACM Computing Survey

arXiv.org e-Print Archive

CiteSeerX

Crossref

Evolution of Resistance to Targeted Anti-Cancer Therapies during Continuous and Pulsed Administration Strategies

Author: A Coldman
A Coldman
A Gupta
B Dibrov
C Chiang
C Sawyers
D Lake
D Milton
D Soulieres
D Townsend
Donna K. Slonim
Franziska Michor
G Swan
H Haeno
J Panetta
Jasmine Foo
K Athreya
K Ross
L Norton
L Norton
M Bentires-Alj
M Burgess
M Citron
M Clynes
M Costa
M Dowsett
M Gorre
M Hidalgo
N Komarova
N Komarova
P Hahnfeldt
R Day
R Martin
S Gardner
W Hryniuk
W Pao
Y Iwasa
Y Iwasa
Z Agur
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

The discovery of small molecules targeted to specific oncogenic pathways has revolutionized anti-cancer therapy. However, such therapy often fails due to the evolution of acquired resistance. One long-standing question in clinical cancer research is the identification of optimum therapeutic administration strategies so that the risk of resistance is minimized. In this paper, we investigate optimal drug dosing schedules to prevent, or at least delay, the emergence of resistance. We design and analyze a stochastic mathematical model describing the evolutionary dynamics of a tumor cell population during therapy. We consider drug resistance emerging due to a single (epi)genetic alteration and calculate the probability of resistance arising during specific dosing strategies. We then optimize treatment protocols such that the risk of resistance is minimal while considering drug toxicity and side effects as constraints. Our methodology can be used to identify optimum drug administration schedules to avoid resistance conferred by one (epi)genetic alteration for any cancer and treatment type

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Global Considerations in Hierarchical Clustering Reveal Meaningful Patterns in Data

Author: A Torrente
AK Jain
CF Zorumski
D Boley
D Horn
D Horn
David Horn
G Getz
G Owsianik
H Chipman
J Handl
J Orlowski
JB Kruskal
Ji Zhu
LK Kaczmarek
M Berridge
M Rune
M Steinbach
MB Eisen
Michal Linial
MS Savaresi
N Kaplan
N Slonim
O Alter
O Sasson
P Cimiano
P D'Haeseleer
P Hansen
PJ Planet
Q Ren
R Apweiler
R Cangelosi
R Sharan
R Varshavsky
R Varshavsky
RO Duda
Roy Varshavsky
S Altschul
TK Landauer
TR Golub
Y Benjamini
Y Zhao
Publication venue: Public Library of Science
Publication date: 21/05/2008
Field of study

BACKGROUND: A hierarchy, characterized by tree-like relationships, is a natural method of organizing data in various domains. When considering an unsupervised machine learning routine, such as clustering, a bottom-up hierarchical (BU, agglomerative) algorithm is used as a default and is often the only method applied. METHODOLOGY/PRINCIPAL FINDINGS: We show that hierarchical clustering that involve global considerations, such as top-down (TD, divisive), or glocal (global-local) algorithms are better suited to reveal meaningful patterns in the data. This is demonstrated, by testing the correspondence between the results of several algorithms (TD, glocal and BU) and the correct annotations provided by experts. The correspondence was tested in multiple domains including gene expression experiments, stock trade records and functional protein families. The performance of each of the algorithms is evaluated by statistical criteria that are assigned to clusters (nodes of the hierarchy tree) based on expert-labeled data. Whereas TD algorithms perform better on global patterns, BU algorithms perform well and are advantageous when finer granularity of the data is sought. In addition, a novel TD algorithm that is based on genuine density of the data points is presented and is shown to outperform other divisive and agglomerative methods. Application of the algorithm to more than 500 protein sequences belonging to ion-channels illustrates the potential of the method for inferring overlooked functional annotations. ClustTree, a graphical Matlab toolbox for applying various hierarchical clustering algorithms and testing their quality is made available. CONCLUSIONS: Although currently rarely used, global approaches, in particular, TD or glocal algorithms, should be considered in the exploratory process of clustering. In general, applying unsupervised clustering methods can leverage the quality of manually-created mapping of proteins families. As demonstrated, it can also provide insights in erroneous and missed annotations

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

A novel approach to the clustering of microarray data via nonparametric density estimation

Author: A Azzalini
A Banerjee
B Bolstad
C Fraley
C Fraley
C Kendziorski
CB Barber
D Slonim
D Tritchler
Davide Risso
ES Garrett
G Getz
G Kerr
G Menardi
GJ McLachlan
IM Johnstone
J Friedman
J Li
J Li
JA Hartigan
JD Banfield
M Chiogna
M de Berg
ML Chow
R Bourgon
R Development Core Team
RC Gentleman
Riccardo De Bin
S Dudoit
S Madeira
T Hastie
TR Golub
U Alon
Y Cheng
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Cluster analysis is a crucial tool in several biological and medical studies dealing with microarray data. Such studies pose challenging statistical problems due to dimensionality issues, since the number of variables can be much higher than the number of observations. Results Here, we present a general framework to deal with the clustering of microarray data, based on a three-step procedure: (i) gene filtering; (ii) dimensionality reduction; (iii) clustering of observations in the reduced space. Via a nonparametric model-based clustering approach we obtain promising results both in simulated and real data. Conclusions The proposed algorithm is a simple and effective tool for the clustering of microarray data, in an unsupervised setting.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Archivio istituzionale della ricerca - Università di Padova

Pompe disease diagnosis and management guideline

Author: Alfred Slonim
Amalfitano A
Amalfitano A
An Y
An Y
Angelini C
Anna Maria Martins
Anneser JM
Ansong A
Arad M
Archibald KC
Ausems MG
Ausems MG
Backman E
Barry J Byrne
Bembi B
Berger KI
Bergner M
Besancon AM
Bieri D
Biggar WD
Bodamer OA
Bodamer OA
Bodamer OA
Bohannon RW
Braunsdorf WE
Brouillette RT
Bulkley BH
Carolyn T Spencer
Case LE
Chamoles NA
Chatwin M
Cresawn KO
Cynthia J Tifft
David M Rapoport
David S Millington
Deborah Marsden
Deeksha Bali
DiFiore MT
Ding E
Dreyfus J
Eagle M
Ellis FR
Engel AG
Finder JD
Fowler WM
Fowler WM
Fraites TJ
Franco LM
Gillette PC
Griffin JL
Guyatt GH
Gwen O’grady
Hagemans ML
Haley SM
Hermans MM
Hers HG
Hicks CL
Hirschhorn R
Horner J
Howell RR
Hug G
Hyde SA
Ing RJ
Joanne Mackey
John F Crowley
Jones MA
Kamphoven JH
Keith RA
Kenneth Berger
Keunen RW
Kim DG
Kirk VG
Kishnani PS
Kleijer WJ
Klinge L
Kravitz RM
Kravitz RM
Krechel SW
Krishnamurthy VV
Kushida CA
Laforet P
Laura E Case
Li Y
Lindeke LL
Makos MM
Marc C Patterson
Marc Nicolino
Margolis ML
Marsden D
Martin-Touaux E
Martiniuk F
Mathiowetz V
Mathiowetz V
Matsuoka Y
McCaffery M
McDonald CM
McFarlane HJ
Meikle P
Mellies U
Metzl JD
Michael S Watson
Moufarrej NA
Niizawa G
Oktenli C
Ottenbacher KJ
Park HK
Pauly DF
Personius KE
Pompe JC
Priya S Kishnani
Quinlivan R
R Rodney Howell
Raben N
Raben N
Redline S
Richard M Kravitz
Riou B
Robert D Steiner
Rosenbek JC
Roy L
Slonim AE
Slonim AE
Slonim AE
Steven Downs
Sun B
Sun B
Tardieu C
Umapathysivam K
Van den Hout H
van den Hout HM
Van den Hout JM
Van den Hout JM
Van der Kraan M
Van Hove JL
Varni JW
Vignos PJ
Vignos PJ
Ward K
Ware JE
Watson JG
Wiegand V
Winkel LP
Young SP
Young T
Zaretsky JZ
Zhang H
Publication venue: Lippincott, Williams & Wilkins
Publication date: 01/05/2006
Field of study

ACMG standards and guidelines are designed primarily as an educational resource for physicians and other health care providers to help them provide quality medical genetic services. Adherence to these standards and guidelines does not necessarily ensure a successful medical outcome. These standards and guidelines should not be considered inclusive of all proper procedures and tests or exclusive of other procedures and tests that are reasonably directed to obtaining the same results. in determining the propriety of any specific procedure or test, the geneticist should apply his or her own professional judgment to the specific clinical circumstances presented by the individual patient or specimen. It may be prudent, however, to document in the patient's record the rationale for any significant deviation from these standards and guidelines.Duke Univ, Med Ctr, Durham, NC 27706 USAOregon Hlth Sci Univ, Portland, OR 97201 USANYU, Sch Med, New York, NY USAUniv Florida, Coll Med, Powell Gene Therapy Ctr, Gainesville, FL 32611 USAIndiana Univ, Bloomington, in 47405 USAUniv Miami, Miller Sch Med, Coral Gables, FL 33124 USAHarvard Univ, Childrens Hosp, Sch Med, Cambridge, MA 02138 USAUniversidade Federal de São Paulo, São Paulo, BrazilColumbia Univ, New York, NY 10027 USANYU, Bellevue Hosp, Sch Med, New York, NY USAColumbia Univ, Med Ctr, New York, NY 10027 USAUniversidade Federal de São Paulo, São Paulo, BrazilWeb of Scienc

Crossref

Repositório Institucional UNIFESP

PubMed Central

University of Miami: Scholarship Miami

Pairwise maximum entropy models for studying large biological systems: when they can and when they can't work

Author: A Tang
C Shannon
D Johnson
D Mastronarde
D Ts'o
E Schneidman
E Vargas-Madrazo
F Rieke
H Lancaster
H Lancaster
J Eisenberg
J Nelson
J Oates
J Shlens
J Shlens
K Dill
M Bethge
M Socolich
N Friedman
N Slonim
O Sarmanov
O Sarmanov
Olaf Sporns
Peter E. Latham
R Bahadur
R Wrangham
S Amari
S DeVries
S Kullback
S Lockless
S Nirenberg
S Yu
Sheila Nirenberg
T Cover
V Sessak
W Russ
Y Dan
Yasser Roudi
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 06/11/2008
Field of study

One of the most critical problems we face in the study of biological systems is building accurate statistical descriptions of them. This problem has been particularly challenging because biological systems typically contain large numbers of interacting elements, which precludes the use of standard brute force approaches. Recently, though, several groups have reported that there may be an alternate strategy. The reports show that reliable statistical models can be built without knowledge of all the interactions in a system; instead, pairwise interactions can suffice. These findings, however, are based on the analysis of small subsystems. Here we ask whether the observations will generalize to systems of realistic size, that is, whether pairwise models will provide reliable descriptions of true biological systems. Our results show that, in most cases, they will not. The reason is that there is a crossover in the predictive power of pairwise models: If the size of the subsystem is below the crossover point, then the results have no predictive power for large systems. If the size is above the crossover point, the results do have predictive power. This work thus provides a general framework for determining the extent to which pairwise models can be used to predict the behavior of whole biological systems. Applied to neural data, the size of most systems studied so far is below the crossover point

arXiv.org e-Print Archive

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

UCL Discovery

PubMed Central