Search CORE

23 research outputs found

Boosting Black Box Variational Inference

Author: Dresdner Gideon
Khanna Rajiv
Locatello Francesco
Rätsch Gunnar
Valera Isabel
Publication venue
Publication date: 28/11/2018
Field of study

Approximating a probability density in a tractable manner is a central task in Bayesian statistics. Variational Inference (VI) is a popular technique that achieves tractability by choosing a relatively simple variational family. Borrowing ideas from the classic boosting framework, recent approaches attempt to \emph{boost} VI by replacing the selection of a single density with a greedily constructed mixture of densities. In order to guarantee convergence, previous works impose stringent assumptions that require significant effort for practitioners. Specifically, they require a custom implementation of the greedy step (called the LMO) for every probabilistic model with respect to an unnatural variational family of truncated distributions. Our work fixes these issues with novel theoretical and algorithmic insights. On the theoretical side, we show that boosting VI satisfies a relaxed smoothness assumption which is sufficient for the convergence of the functional Frank-Wolfe (FW) algorithm. Furthermore, we rephrase the LMO problem and propose to maximize the Residual ELBO (RELBO) which replaces the standard ELBO optimization in VI. These theoretical enhancements allow for black box implementation of the boosting subroutine. Finally, we present a stopping criterion drawn from the duality gap in the classic FW analyses and exhaustive experiments to illustrate the usefulness of our theoretical and algorithmic contributions

arXiv.org e-Print Archive

MPG.PuRe

Stochastic Frank-Wolfe for Constrained Finite-Sum Minimization

Author: Dresdner Gideon
Freund Robert M.
Ghaoui Laurent El
Locatello Francesco
Négiar Geoffrey
Pedregosa Fabian
Tsai Alicia
Publication venue
Publication date: 26/06/2020
Field of study

We propose a novel Stochastic Frank-Wolfe (a.k.a. conditional gradient) algorithm for constrained smooth finite-sum minimization with a generalized linear prediction/structure. This class of problems includes empirical risk minimization with sparse, low-rank, or other structured constraints. The proposed method is simple to implement, does not require step-size tuning, and has a constant per-iteration cost that is independent of the dataset size. Furthermore, as a byproduct of the method we obtain a stochastic estimator of the Frank-Wolfe gap that can be used as a stopping criterion. Depending on the setting, the proposed method matches or improves on the best computational guarantees for Stochastic Frank-Wolfe algorithms. Benchmarks on several datasets highlight different regimes in which the proposed method exhibits a faster empirical convergence than related methods. Finally, we provide an implementation of all considered methods in an open-source package.Comment: To appear in the Proceedings of the 37th International Conference on Machine Learning, 2020. Main text: 9 pages, 1 figure. Fixes previously found erro

arXiv.org e-Print Archive

ACE: A fast, skillful learned global atmospheric model for climate prediction

Author: Bonev Boris
Brenowitz Noah D.
Bretherton Christopher S.
Clark Spencer K.
Dresdner Gideon
Duncan James
Henn Brian
Kashinath Karthik
McGibbon Jeremy
Peters Matthew E.
Pritchard Michael S.
Watt-Meyer Oliver
Publication venue
Publication date: 06/12/2023
Field of study

Existing ML-based atmospheric models are not suitable for climate prediction, which requires long-term stability and physical consistency. We present ACE (AI2 Climate Emulator), a 200M-parameter, autoregressive machine learning emulator of an existing comprehensive 100-km resolution global atmospheric model. The formulation of ACE allows evaluation of physical laws such as the conservation of mass and moisture. The emulator is stable for 100 years, nearly conserves column moisture without explicit constraints and faithfully reproduces the reference model's climate, outperforming a challenging baseline on over 90% of tracked variables. ACE requires nearly 100x less wall clock time and is 100x more energy efficient than the reference model using typically available resources. Without fine-tuning, ACE can stably generalize to a previously unseen historical sea surface temperature dataset.Comment: Accepted at Tackling Climate Change with Machine Learning: workshop at NeurIPS 202

arXiv.org e-Print Archive

Recommended from our members

Comprehensive molecular characterization of gastric adenocarcinoma

Author: Abdel-Misih Raafat
Ajani Jaffer
Akbani Rehan
Albert Monique
Alexopoulou Iakovina
Ally Adrian
Alonso Shelley
Askoy B. Arman
Ayala Brenda
Balasundaram Miruna
Bartlett John
Bass Adam J.
Baylin Stephen B.
Beer David G.
Belyaev Smitry
Bennett Joseph
Benz Christopher
Bernard Brady
Beroukhim Rameen
Birol Inanc
Black Aaron D.
Bootwalla Moiz S.
Boussioutas Alex
Bowen Jay
Bowlby Reanne
Bristow Christopher A.
Brooks Denise
Brown Jennifer
Brzezinski Jakub
Burton Robert
Butterfield Yaron S. N.
Camargo M. Constanza
Carlsen Rebecca
Carney Julie Ann
Carter Scott L.
Cheong Jae-Ho
Cherniack Andrew
Cherniack Andrew D.
Chin Lynda
Cho Eunjung
Cho Juok
Chu Andy
Chu Justin
Chuah Eric
Chudamani Sudha
Chun Hye-Jung E.
Cibulskis Kristian
Ciriello Giovanni
Clarke Amanda
Crain Daniel
Curely Erin
Curley Erin
Curtis Christina
Davidsen Tanja
Demchok John A.
Dhalla Noreen
Dhir Rajiv
DiCara Daniel
Ding Li
Dolzhansky Oleg
Dresdner Gideon
Eley Greg
Engel Jay
Fedosenko Konstantin
Fisher Sheila
Frazer Scott
Gabriel Stacey B.
Gao Jianjiong
Gardner Johanna
Garman Katherine
Gastier-Foster Julie M.
Gehlenborg Nils
Getz Gad
Gross Benjamin
Guin Ranabir
Gulley Margaret
Hadjipanayis Angela
Haussler David
Heiman David I.
Helsel Carmen
Herman James G.
Hinoue Toshinori
Holt Robert A.
Hutter Carolyn M.
Iacocca Mary
Ibbs Matthew
Iype Lisa
Jacobsen Anders
Janjigian Yelena Y.
Jensen Mark A.
Jones Steven J.M.
Jung Joonil
Kasaian Katayoon
Kelsen David P.
Kemkes Ariane
Kim Hark K.
Kim Jaegil
Kim Jihun
Kim Sang-Bae
Korski Konstanty
Kramer Roger W.
Kreisberg Richard
Kucherlapati Raju
Kwon Sun-Young
Kycler Witold
Ladanyi Marc
Lai Phillip H.
Laird Peter W.
Lander Eric S.
Landreneau Rodney
Lau Kevin
Lawrence Michael S.
Lee Darlene
Lee Jae-Hyuk
Lee Ju-Seog
Lee Semin
Lee William
Leiserson Mark D. M.
Leporowska Ewa
Leraas Kristen M.
Li Haiyan A.
Lichtenberg Tara M.
Lichtenstein Lee
Lim Emilia
Lin Pei
Ling Shiyun
Liu Jia
Liu Wenbin
Liu Yingchun
Lu Yiling
Luketich James
Ma Yussanne
Mackiewicz Andrzej
Mahadeshwar Harshad S.
Mallery David
Manikhas Georgy
Marra Marco A.
Mayo Michael
McAllister Cynthia
McCall Shannon J.
McLellan Michael
Meyerson Matthew
Miller Michael
Mills Shaw Kenna R.
Mills Gordon
Mills Gordon B.
Moore Richard A.
Morris Scott
Mungall Andrew J.
Mungall Karen L.
Murawa Dawid
Murawa Pawel
Murray Bradley A.
Ng Sam
Ng Santa Cruz Sam
Nip Ka Ming
Niu Beifang
Noble Michael S.
Odze Robert
Ojesina Akinyemi I.
Pantazi Angeliki
Parfenov Michael
Park Do-Youn
Park Peter J.
Park Young S.
Paulauskis Joseph
Pedamallu Chandra
Pedamallu Chandra Sekhar
Pennathur Arjun
Penny Robert
Piazuelo M. Blanca
Pihl Todd
Potapova Olga
Protopopov Alexei
Rabeno Brenda
Rabkin Charles S.
Raman Rohini
Ramirez Nilsa C.
Ramirez Ricardo
Rao Arvind
Raphael Benjamin J.
Rathmell W. Kimryn
Ren Xiaojia
Reynolds Sheila M.
Robertson A. Gordon
Rosenberg Mara
Rovira Hector
Sakai Ryo
Saksena Gordon
Sander Chris
Santoso Netty
Schein Jacqueline E.
Schneider Barbara G.
Schultz Nikolaus
Schumacher Steven E.
Seidman Jonathan
Senbabaoglu Yasin
Seth Sahil
Shelton Candace
Shelton Troy
Shen Hui
Shen Ronglai
Sherman Mark
Sheth Margi
Shmulevich Ilya
Sinha Rileen
Sipahimalani Payal
Sofia Heidi J.
Song Xingzhi
Sougnez Carrie
Spychała Arkadiusz
Stojanov Petar
Stuart Josh M.
Suchorska Wiktoria M.
Sumer S. Onur
Sun Yichao
Tabak Barbara
Tabler Teresa R.
Tam Angela
Tang Jiabin
Tang Laura
Tarnuzzer Roy
Tasman Natalie
Tatka Honorata
Taylor Barry S.
Taylor-Weiner Amaro
Teresiak Marek
Thiessen Nina
Thorsson Vesteinn
Thorsson Vésteinn
Triche Timothy
Van Den Berg David J.
Verhaak Roeland G.W.
Voet Doug
Voronina Olga
Walton Jessica
Wan Yunhu
Wang Zhining
Weaver Stephanie
Weinhold Nils
Weinstein John N.
Weisenberger Daniel J.
Willis Joseph E.
Wise Lisa
Wiznerowicz Maciej
Wu Hsin-Ta
Xi Ruibin
Xu Andrew W.
Yang Da
Yang Liming
Yang Lixing
Zack Travis I.
Zenklusen Jean Claude
Zhang Hailei
Zhang Jianhua
Zhang Wei
Zmuda Erik
Zou Lihua
ŁaŸniak Radoslaw
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2014
Field of study

Gastric cancer is a leading cause of cancer deaths, but analysis of its molecular and clinical characteristics has been complicated by histological and aetiological heterogeneity. Here we describe a comprehensive molecular evaluation of 295 primary gastric adenocarcinomas as part of The Cancer Genome Atlas (TCGA) project. We propose a molecular classification dividing gastric cancer into four subtypes: tumours positive for Epstein–Barr virus, which display recurrent PIK3CA mutations, extreme DNA hypermethylation, and amplification of JAK2, CD274 (also known as PD-L1) and PDCD1LG2 (also knownasPD-L2); microsatellite unstable tumours, which show elevated mutation rates, including mutations of genes encoding targetable oncogenic signalling proteins; genomically stable tumours, which are enriched for the diffuse histological variant and mutations of RHOA or fusions involving RHO-family GTPase-activating proteins; and tumours with chromosomal instability, which show marked aneuploidy and focal amplification of receptor tyrosine kinases. Identification of these subtypes provides a roadmap for patient stratification and trials of targeted therapies

Harvard University - DASH

Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin

Author: Abbott Rachel
Abbott Scott
Akbani Rehan
Aksoy B. Arman
Aldape Kenneth
Ally Adrian
Amin Samirkumar
Anastassiou Dimitris
Auman J. Todd
Baggerly Keith A.
Balasundaram Miruna
Balu Saianand
Baylin Stephen B.
Benz Christopher C.
Benz Stephen C.
Berman Benjamin P.
Bernard Brady
Bhatt Ami S.
Birol Inanc
Black Aaron D.
Bodenheimer Tom
Bootwalla Moiz S.
Bowen Jay
Bressler Ryan
Bristow Christopher A.
Brooks Angela N.
Broom Bradley
Buda Elizabeth
Burton Robert
Butterfield Yaron S.N.
Byers Lauren A.
Carlin Daniel
Carter Scott L.
Casasent Tod D.
Chang Kyle
Chanock Stephen
Chen Zhong
Cherniack Andrew D.
Chin Lynda
Cho Dong Yeon
Cho Juok
Chu Andy
Chuah Eric
Chun Hye Jung E.
Cibulskis Kristian
Ciriello Giovanni
Cleland James
Cline Melisssa
Collisson Eric A.
Craft Brian
Creighton Chad J.
Danilova Ludmila
Davidsen Tanja
Davis Caleb
Dees Nathan D.
Delehaunty Kim
Demchok John A.
Dhalla Noreen
DiCara Daniel
Ding Li
Dinh Huyen
Dobson Jason R.
Dodda Deepti
Doddapaneni Harsha Vardhan
Donehower Lawrence
Dooling David J.
Dresdner Gideon
Drummond Jennifer
Eakin Andrea
Edgerton Mary
Eldred Jim M.
Eley Greg
Ellrott Kyle
Fan Cheng
Fei Suzanne
Felau Ina
Frazer Scott
Freeman Samuel S.
Frick Jessica
Fronick Catrina C.
Fulton Lucinda L.
Fulton Robert
Gabriel Stacey B.
Gao Jianjiong
Gastier-Foster Julie M.
Gehlenborg Nils
George Myra
Getz Gad
Gibbs Richard
Goldman Mary
Gonzalez-Perez Abel
Gross Benjamin
Guin Ranabir
Gunaratne Preethi
Hadjipanayis Angela
Hamilton Mark P.
Hamilton Stanley R.
Han Leng
Han Yi
Harper Hollie A.
Haseley Psalm
Haussler David
Hayes D. Neil
Heiman David I.
Helman Elena
Helsel Carmen
Herbrich Shelley M.
Herman James G.
Hinoue Toshinori
Hirst Carrie
Hirst Martin
Hoadley Katherine A.
Holt Robert A.
Hoyle Alan P.
Iype Lisa
Jacobsen Anders
Jeffreys Stuart R.
Jensen Mark A.
Jones Corbin D.
Jones Steven J.M.
Ju Zhenlin
Jung Joonil
Kahles Andre
Kahn Ari
Kalicki-Veizer Joelle
Kalra Divya
Kanchi Krishna Latha
Kandoth Cyriac
Kane David W.
Kim Hoon
Kim Jaegil
Knijnenburg Theo
Koboldt Daniel C.
Kovar Christie
Kramer Roger
Kreisberg Richard
Kucherlapati Raju
Ladanyi Marc
Laird Peter W.
Lander Eric S.
Larson David E.
Lawrence Michael S.
Lee Darlene
Lee Eunjung
Lee Semin
Lee William
Lehmann Kjong Van
Leinonen Kalle
Leiserson Max D.M.
Leraas Kristen M.
Lerner Seth
Levine Douglas A.
Lewis Lora
Ley Timothy J.
Li Haiyan I.
Li Jun
Li Wei
Liang Han
Lichtenberg Tara M.
Lin Jake
Lin Ling
Lin Pei
Liu Wenbin
Liu Yingchun
Liu Yuexin
Lopez-Bigas Nuria
Lorenzi Philip L.
Lu Charles
Lu Yiling
Luquette Lovelace J.
Ma Singer
Magrini Vincent J.
Mahadeshwar Harshad S.
Mardis Elaine R.
Margolin Adam A.
Marra Marco A.
Mayo Michael
McAllister Cynthia
McGuire Sean E.
McLellan Michael D.
McMichael Joshua F.
Melott James
Meng Shaowu
Meyerson Matthew
Mieczkowski Piotr A.
Miller Christopher A.
Miller Martin L.
Miller Michael
Mills Gordon B.
Moore Richard A.
Morgan Margaret
Morton Donna
Mose Lisle E.
Mungall Andrew J.
Muzny Donna
Ng Sam
Nguyen Lam
Niu Beifang
Noble Michael S.
Noushmehr Houtan
O'Laughlin Michelle
Ojesina Akinyemi I.
Omberg Larsson
Ozenberger Brad
Pantazi Angeliki
Parfenov Michael
Park Peter J.
Parker Joel S.
Paull Evan
Pedamallu Chandra Sekhar
Perou Charles M.
Pihl Todd
Pohl Craig
Pot David
Protopopov Alexei
Przytycka Teresa
Radenbaugh Amie
Ramirez Nilsa C.
Ramirez Ricardo
Raphael Benjamin J.
Reid Jeffrey
Ren Xiaojia
Reva Boris
Reynolds Sheila M.
Rhie Suhn K.
Roach Jeffrey
Robertson A. Gordon
Rovira Hector
Ryan Michael
Rätsch Gunnar
Saksena Gordon
Salama Sofie
Sander Chris
Santoso Netty
Schein Jacqueline E.
Schmidt Heather
Schultz Nikolaus
Schumacher Steven E.
Seidman Jonathan
Senbabaoglu Yasin
Seth Sahil
Sharpe Samantha
Shen Hui
Shen Ronglai
Sheth Margi
Shi Yan
Shmulevich Ilya
Silva Grace O.
Simons Janae V.
Sinha Rileen
Sipahimalani Payal
Smith Scott M.
Sofia Heidi J.
Sokolov Artem
Soloway Mathew G.
Song Xingzhi
Sougnez Carrie
Spellman Paul
Staudt Louis
Stewart Chip
Stojanov Petar
Stuart Joshua M.
Su Xiaoping
Sumer S. Onur
Sun Yichao
Swatloski Teresa
Tabak Barbara
Tam Angela
Tamborero David
Tan Donghui
Tang Jiabin
Tarnuzzer Roy
Taylor Barry S.
Thiessen Nina
Thorsson Vesteinn
Triche Timothy
Uzunangelov Vladislav
Van Den Berg David J.
Van Waes Carter
Van't Veer Laura J.
Vandin Fabio
Varhol Richard J.
Vaske Charles J.
Veluvolu Umadevi
Verhaak Roeland
Voet Doug
Walker Jason
Wallis John W.
Waltman Peter
Wan Yunhu
Wang Min
Wang Wenyi
Wang Zhining
Waring Scot
Weinhold Nils
Weinstein John N.
Weisenberger Daniel J.
Wendl Michael C.
Wheeler David
Wilkerson Matthew D.
Wilson Richard K.
Wise Lisa
Wolf Denise M.
Wong Andrew
Wu Chang Jiun
Wu Chia Chin
Wu Hsin Ta
Wu Junyuan
Wylie Todd
Xi Liu
Xi Ruibin
Xia Zheng
Xu Andrew W.
Yang Da
Yang Liming
Yang Lixing
Yang Tai Hsien Ou
Yang Yang
Yao Jun
Yao Rong
Yau Christina
Ye Kai
Yoshihara Kosuke
Yuan Yuan
Yung Alfred K.
Zack Travis
Zeng Dong
Zenklusen Jean Claude
Zhang Hailei
Zhang Jianhua
Zhang Jiashan
Zhang Nianxiang
Zhang Qunyuan
Zhang Wei
Zhao Wei
Zheng Siyuan
Zhu Jing
Zmuda Erik
Zou Lihua
Publication venue
Publication date: 01/01/2014
Field of study

Recent genomic analyses of pathologically-defined tumor types identify “within-a-tissue” disease subtypes. However, the extent to which genomic signatures are shared across tissues is still unclear. We performed an integrative analysis using five genome-wide platforms and one proteomic platform on 3,527 specimens from 12 cancer types, revealing a unified classification into 11 major subtypes. Five subtypes were nearly identical to their tissue-of-origin counterparts, but several distinct cancer types were found to converge into common subtypes. Lung squamous, head & neck, and a subset of bladder cancers coalesced into one subtype typified by TP53 alterations, TP63 amplifications, and high expression of immune and proliferation pathway genes. Of note, bladder cancers split into three pan-cancer subtypes. The multi-platform classification, while correlated with tissue-of-origin, provides independent information for predicting clinical outcomes. All datasets are available for data-mining from a unified resource to support further biological discoveries and insights into novel therapeutic strategies

Carolina Digital Repository

Black-Box Machine Learning Algorithms for Scientific Tasks With Complex Prior Knowledge

Author: Dresdner Gideon
Publication venue: ETH Zurich
Publication date: 01/01/2022
Field of study

Repository for Publications and Research Data

ai2cm/ace: 2023.12.0

Author: Gideon Dresdner
Oliver Watt-Meyer
Publication venue: Zenodo
Publication date: 05/01/2024
Field of study

<p>Inference code for model described in https://arxiv.org/abs/2310.02074</p&gt

ZENODO

Sparse Gaussian Processes on Discrete Domains

Author: Dresdner Gideon
Fortuin Vincent
Rätsch Gunnar
Strathmann Heiko
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021
Field of study

Kernel methods on discrete domains have shown great promise for many challenging data types, for instance, biological sequence data and molecular structure data. Scalable kernel methods like Support Vector Machines may offer good predictive performances but do not intrinsically provide uncertainty estimates. In contrast, probabilistic kernel methods like Gaussian Processes offer uncertainty estimates in addition to good predictive performance but fall short in terms of scalability. While the scalability of Gaussian processes can be improved using sparse inducing point approximations, the selection of these inducing points remains challenging. We explore different techniques for selecting inducing points on discrete domains, including greedy selection, determinantal point processes, and simulated annealing. We find that simulated annealing, which can select inducing points that are not in the training set, can perform competitively with support vector machines and full Gaussian processes on synthetic data, as well as on challenging real-world DNA sequence data.ISSN:2169-353

Repository for Publications and Research Data

Boosting Variational Inference With Locally Adaptive Step-Sizes

Author: Dresdner Gideon
Locatello Francesco
Pedregosa Fabian
Rätsch Gunnar
Shekhar Saurav
Publication venue: Cornell University
Publication date: 19/05/2021
Field of study

Variational Inference makes a trade-off between the capacity of the variational family and the tractability of finding an approximate posterior distribution. Instead, Boosting Variational Inference allows practitioners to obtain increasingly good posterior approximations by spending more compute. The main obstacle to widespread adoption of Boosting Variational Inference is the amount of resources necessary to improve over a strong Variational Inference baseline. In our work, we trace this limitation back to the global curvature of the KL-divergence. We characterize how the global curvature impacts time and memory consumption, address the problem with the notion of local curvature, and provide a novel approximate backtracking algorithm for estimating local curvature. We give new theoretical convergence rates for our algorithms and provide experimental validation on synthetic and real-world datasets

Repository for Publications and Research Data

Neighborhood Contrastive Learning Applied to Online Patient Monitoring

Author: Dresdner Gideon
Hüser Matthias
Locatello Francesco
Rätsch Gunnar
Yèche Hugo
Publication venue: PMLR
Publication date: 01/01/2021
Field of study

Intensive care units (ICU) are increasingly looking towards machine learning for methods to provide online monitoring of critically ill patients. In machine learning, online monitoring is often formulated as a supervised learning problem. Recently, contrastive learning approaches have demonstrated promising improvements over competitive supervised benchmarks. These methods rely on well-understood data augmentation techniques developed for image data which do not apply to online monitoring. In this work, we overcome this limitation by supplementing time-series data augmentation techniques with a novel contrastive learning objective which we call neighborhood contrastive learning (NCL). Our objective explicitly groups together contiguous time segments from each patient while maintaining state-specific information. Our experiments demonstrate a marked improvement over existing work applying contrastive methods to medical time-series.ISSN:2640-349

Repository for Publications and Research Data