Decentralized Training of Foundation Models in Heterogeneous Environments
Training foundation models, such as GPT-3 and PaLM, can be extremely
expensive, often involving tens of thousands of GPUs running continuously for
months. These models are typically trained in specialized clusters featuring
fast, homogeneous interconnects and using carefully designed software systems
that support both data parallelism and model/pipeline parallelism. Such
dedicated clusters can be costly and difficult to obtain. Can we instead
leverage the much greater amount of decentralized, heterogeneous, and
lower-bandwidth interconnected compute? Previous works examining the
heterogeneous, decentralized setting focus on relatively small models that can
be trained in a purely data parallel manner. State-of-the-art schemes for model
parallel foundation model training, such as Megatron and DeepSpeed, only consider the
homogeneous data center setting. In this paper, we present the first study of
training large foundation models with model parallelism in a decentralized
regime over a heterogeneous network. Our key technical contribution is a
scheduling algorithm that allocates different computational "tasklets" in the
training of foundation models to a group of decentralized GPU devices connected
by a slow heterogeneous network. We provide a formal cost model and further
propose an efficient evolutionary algorithm to find the optimal allocation
strategy. We conduct extensive experiments that represent different scenarios
for learning over geo-distributed devices simulated using real-world network
measurements. In the most extreme case, across 8 different cities spanning 3
continents, our approach is 4.8× faster than prior state-of-the-art training
systems (Megatron).
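
The abstract's key technical pieces, a cost model over tasklet placements and an evolutionary search for a good allocation, can be illustrated with a small sketch. The Python snippet below is an assumption-laden toy, not the authors' system: the compute speeds, bandwidth matrix, activation size, and the max-load-plus-communication cost function are placeholder choices made purely for illustration.

```python
# Minimal sketch (not the paper's implementation): an evolutionary search that
# assigns pipeline-stage "tasklets" to GPU devices so as to minimize a toy cost
# combining per-device compute load and communication between adjacent tasklets.
# All numeric values (speeds, bandwidths, sizes) are illustrative assumptions.
import random

NUM_TASKLETS = 8          # e.g. pipeline stages of the model
NUM_DEVICES = 8           # decentralized GPUs
random.seed(0)

# Assumed per-device throughput (tasklets/sec) and pairwise bandwidth (GB/s).
compute_speed = [random.uniform(0.5, 2.0) for _ in range(NUM_DEVICES)]
bandwidth = [[random.uniform(0.05, 10.0) if i != j else float("inf")
              for j in range(NUM_DEVICES)] for i in range(NUM_DEVICES)]
activation_gb = 1.0       # assumed activation size passed between adjacent tasklets

def cost(assignment):
    """Toy cost: slowest device's compute load plus cross-device communication."""
    load = [0.0] * NUM_DEVICES
    for dev in assignment:
        load[dev] += 1.0 / compute_speed[dev]
    comm = sum(activation_gb / bandwidth[assignment[t]][assignment[t + 1]]
               for t in range(NUM_TASKLETS - 1)
               if assignment[t] != assignment[t + 1])
    return max(load) + comm

def evolve(pop_size=64, generations=200, mutation_rate=0.2):
    """Simple evolutionary search over tasklet -> device assignments."""
    population = [[random.randrange(NUM_DEVICES) for _ in range(NUM_TASKLETS)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=cost)
        parents = population[: pop_size // 4]          # keep the fittest quarter
        children = []
        while len(children) < pop_size - len(parents):
            a, b = random.sample(parents, 2)
            cut = random.randrange(1, NUM_TASKLETS)
            child = a[:cut] + b[cut:]                   # one-point crossover
            if random.random() < mutation_rate:         # point mutation
                child[random.randrange(NUM_TASKLETS)] = random.randrange(NUM_DEVICES)
            children.append(child)
        population = parents + children
    best = min(population, key=cost)
    return best, cost(best)

if __name__ == "__main__":
    best, best_cost = evolve()
    print("best tasklet->device assignment:", best, "cost:", round(best_cost, 3))
```

In the paper's setting the cost model accounts for both data-parallel and pipeline-parallel communication over measured heterogeneous links, and the search space is far larger; this sketch only illustrates the general shape of the allocation problem the evolutionary algorithm optimizes.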