11 research outputs found

Automatically Scaling Multi-Tenant Machine Learning

Author: Fiedel Noah
Olston Christopher
Ross Steven J.
Publication venue: Technical Disclosure Commons
Publication date: 12/12/2017

Generally, the present disclosure is directed to optimizing use of computing resources in a system. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to predict task allocation for a job serving a plurality of machine-learned models based on current system state and queries per second (QPS) data for the plurality of models. Alternatively, the tasks can be allocated according to one or more rules (e.g., a new task is allocated to a job until the compute usage for the job falls below a scaling threshold). Thus, the systems and methods of the present disclosure are able to efficiently serve a mix of high-QPS and low-QPS machine-learned models at low latency with minimal waste of compute resources (e.g., CPU, GPU, TPU, etc.) and memory (e.g., RAM)
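
The rule-based alternative mentioned in this abstract (keep allocating tasks to a job until its compute usage falls below a scaling threshold) could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name, its parameters, and the utilization formula (demand divided by total task capacity) are hypothetical and are not taken from the disclosure.

```python
def tasks_needed(compute_demand, capacity_per_task, scaling_threshold, current_tasks=1):
    """Return the task count for a job under a simple threshold rule.

    Assumption (not from the disclosure): utilization is modeled as
    compute_demand / (tasks * capacity_per_task), and tasks are only
    added, never removed.
    """
    if capacity_per_task <= 0:
        raise ValueError("capacity_per_task must be positive")
    if not 0 < scaling_threshold <= 1:
        raise ValueError("scaling_threshold must be in (0, 1]")

    tasks = max(1, current_tasks)
    # Allocate one more task while utilization is at or above the threshold.
    while compute_demand / (tasks * capacity_per_task) >= scaling_threshold:
        tasks += 1
    return tasks


if __name__ == "__main__":
    # A job needing 10 units of compute, with tasks that each provide 4 units,
    # scaled until utilization drops below 80%.
    print(tasks_needed(10.0, 4.0, 0.8))
```

A learned allocator, as the abstract describes, would replace the fixed threshold with a prediction based on system state and per-model QPS; the sketch covers only the rule-based case.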

Technical Disclosure Commons

Elastic multi-resolution model-serving to compute inferences

Author: Beutel Alexander
Chi Ed H.
Fiedel Noah
Olston Christopher
Publication venue: Technical Disclosure Commons
Publication date: 15/09/2017

Machine-learning models are consuming an increasing fraction of the world's computing resources. The cost of computing inferences with some machine-learning models is extremely high. Provisioning computing resources for peak performance, e.g., high availability and quality of service, entails the creation of headroom for traffic spikes (increases in demand) and preparing for the possibility of outages (decreases in capacity). Executing computer applications that utilize machine-learning models, also known as machine-learned models, can require significant capital and operational expenses. This disclosure describes techniques to optimize use of computing resources for a machine-learning model. Multi-resolution models and/or models with recurrence are utilized. These models can compute inferences to varying degrees of quality (resolution). The multi-resolution models are served in an elastic manner such that a model of a resolution that fits both the available computing resources and is utilized to compute inferences

Technical Disclosure Commons

Understanding HTML with Large Language Models

Author: Chowdhery Aakanksha
Faust Aleksandra
Fiedel Noah
Gur Izzeddin
Huang Austin
Miao Yingjie
Nachum Ofir
Narang Sharan
Safdari Mustafa
Publication date: 08/10/2022

Large language models (LLMs) have shown exceptional performance on a variety of natural language tasks. Yet, their capabilities for HTML understanding -- i.e., parsing the raw HTML of a webpage, with applications to automation of web-based tasks, crawling, and browser-assisted retrieval -- have not been fully explored. We contribute HTML understanding models (fine-tuned LLMs) and an in-depth analysis of their capabilities under three tasks: (i) Semantic Classification of HTML elements, (ii) Description Generation for HTML inputs, and (iii) Autonomous Web Navigation of HTML pages. While previous work has developed dedicated architectures and training procedures for HTML understanding, we show that LLMs pretrained on standard natural language corpora transfer remarkably well to HTML understanding tasks. For instance, fine-tuned LLMs are 12% more accurate at semantic classification compared to models trained exclusively on the task dataset. Moreover, when fine-tuned on data from the MiniWoB benchmark, LLMs successfully complete 50% more tasks using 192x less data compared to the previous best supervised model. Out of the LLMs we evaluate, we show evidence that T5-based models are ideal due to their bidirectional encoder-decoder architecture. To promote further research on LLMs for HTML understanding, we create and open-source a large-scale HTML dataset distilled and auto-labeled from CommonCrawl

arXiv.org e-Print Archive

PaLM: Scaling Language Modeling with Pathways

Author: Agrawal Shivani
Austin Jacob
Barham Paul
Barnes Parker
Bosma Maarten
Bradbury James
Catasta Michele
Child Rewon
Chowdhery Aakanksha
Chung Hyung Won
Dai Andrew M.
Dean Jeff
Dev Sunipa
Devlin Jacob
Diaz Mark
Dohan David
Du Nan
Duke Toju
Eck Douglas
Fedus Liam
Fiedel Noah
Firat Orhan
Garcia Xavier
Gehrmann Sebastian
Ghemawat Sanjay
Gur-Ari Guy
Hutchinson Ben
Ippolito Daphne
Isard Michael
Lee Katherine
Levskaya Anselm
Lewkowycz Aitor
Lim Hyeontaek
Luan David
Maynez Joshua
Meier-Hellstern Kathy
Michalewski Henryk
Mishra Gaurav
Misra Vedant
Moreira Erica
Narang Sharan
Omernick Mark
Pellat Marie
Petrov Slav
Pillai Thanumalayan Sankaranarayana
Polozov Oleksandr
Pope Reiner
Prabhakaran Vinodkumar
Rao Abhishek
Reif Emily
Roberts Adam
Robinson Kevin
Saeta Brennan
Schuh Parker
Sepassi Ryan
Shazeer Noam
Shi Kensen
Spiridonov Alexander
Sutton Charles
Tay Yi
Tsvyashchenko Sasha
Wang Xuezhi
Wei Jason
Yin Pengcheng
Zhou Denny
Zhou Zongwei
Zoph Barret
Publication date: 19/04/2022

Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language model, which we call Pathways Language Model PaLM. We trained PaLM on 6144 TPU v4 chips using Pathways, a new ML system which enables highly efficient training across multiple TPU Pods. We demonstrate continued benefits of scaling by achieving state-of-the-art few-shot learning results on hundreds of language understanding and generation benchmarks. On a number of these tasks, PaLM 540B achieves breakthrough performance, outperforming the finetuned state-of-the-art on a suite of multi-step reasoning tasks, and outperforming average human performance on the recently released BIG-bench benchmark. A significant number of BIG-bench tasks showed discontinuous improvements from model scale, meaning that performance steeply increased as we scaled to our largest model. PaLM also has strong capabilities in multilingual tasks and source code generation, which we demonstrate on a wide array of benchmarks. We additionally provide a comprehensive analysis on bias and toxicity, and study the extent of training data memorization with respect to model scale. Finally, we discuss the ethical considerations related to large language models and discuss potential mitigation strategies

arXiv.org e-Print Archive

TALM: Tool Augmented Language Models

Author: Fiedel Noah
Parisi Aaron
Zhao Yao
Publication date: 24/05/2022

Transformer based language models (LMs) demonstrate increasing performance with scale across a wide variety of tasks. Scale alone however cannot enable models to solve tasks that require access to ephemeral, changing, or private data that was unavailable at training time. Many useful tasks may also benefit from LMs being able to access APIs that read or modify state. In this work, we present Tool Augmented Language Models (TALM), combining a text-only approach to augment language models with non-differentiable tools, and an iterative "self-play" technique to bootstrap performance starting from few tool demonstrations. TALM exhibits strong performance on both a knowledge-heavy QA task and a reasoning oriented math task with simple tools. At a given model scale, TALM significantly outperforms non-augmented LMs. We further demonstrate that TALM successfully performs out-of-distribution inferences on both QA and math tasks, where non-augmented LMs fail. Our results suggest that Tool Augmented Language Models are a promising direction to enrich LMs' capabilities, with less dependence on scale

arXiv.org e-Print Archive

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

Author: Abid Abubakar
Agarwal Akshat
Agha Omar
Alabi Jesujoba
Ali Tariq
Alipoormolabashi Pegah
Aminnaseri Moin
Anand Sajant
Andreassen Anders
Arakawa Riku
Argueta Cedrick
Arnaud Melody
Asaadi Shima
Ashcraft Courtney
Askell Amanda
Bahri Yasaman
Bai Yuntao
Baitemirova Medina Orduna
Balis John U.
Banjade Rabin
Bansal Mohit
Baral Chitta
Barnes Elizabeth
Barnes Richard
Baturan Marco
Belinkov Yonatan
Berant Jonathan
Betz Gregor
Bevilacqua Michele
Biderman Stella
Bischoff Sebastian
Bogar Hayden
Bojanowski Bartłomiej
Bosma Maarten
Bosscher Jelle
Boudeman Joseph
Bowman Samuel R.
Brown Adam R.
Burden John
Buzan Dilyar
Cain Mike
Callison-Burch Chris
Cameron Nicholas
Casares Pablo Antonio Moreno
Casey Sean
Chang Ernie
Chang Peter
Chang Trenton
Chen Angelica
Chen Danqi
Chen Derek
Chen Qinlang
Chen Yifu
Chi Ethan A.
Chi Nathan
Chi Ryan
Chiafullo Kristen
Choi Yejin
Chollet Francois
Chu Eric
Chua Joyce
Cohen Michael
Colón Luis Oliveros
Constant Noah
Contreras-Ochando Lidia
Cubuk Ekin Dogus
Dai Andrew
Datta Debajyoti
Debnath
Deckers Niklas
Dehaene Stanislas
Delgado Ramón Risco
Demberg Vera
Desbordes Théo
Dhole Kaustubh D.
Diao Cameron
Dillavou Sam
Divic Stefan
Dohan David
Doiron Nick
Donoway Elizabeth
Doshi Parth
Dour Cameron
Drakard David
Dsouza Amanda
Dugan Liam
Dyer Ethan
Eckersley Peter
Efrat Avia
Ekmekci Berk
Elbaghdadi Omar
Emelin Denis
Engel Jesse
Erdem Aykut
Erdem Erkut
Ermon Stefano
Evans Owain
Farooqi Maheen
Faruqui Manaal
Fedus William
Fiedel Noah
Fisac Jaime Fernández
Fisch Adam
Frank Robert
Freeman Daniel
Frohberg Jörg
Fung Pascale
Gabriel Raefer
Galijasevic Hana
Ganguli Deep
Gao Leo
Garbacea Cristina
Garg Rhythm
Garrette Dan
Garriga-Alonso Adrià
Gehrmann Sebastian
Geissinger Jack
Gerstenberg Tobias
Geva Mor
Ghazarian Sarik
Gheini Mozhdeh
Gholamidavoodi Arash
Ghosh Sayan
Gilboa Dar
Gimpel Kevin
Giulianelli Mario
González Daniel Moseguí
Gopalakrishnan Karthik
Gottardi Anna
Gruetter Samuel
Gu Michael
Gu Shixiang Shane
Gupta Aditya
Gupta Animesh
Gur-Ari Guy
Habacker Rahel
Hagen Matthias
Hagerman Eleanor
Hajishirzi Hannaneh
Hamdan Shadi
Han Sanghyun
Hao Yiding
Happé Francesca
Hashimoto Tatsu
Hatwar Sriharsha
He Luheng
Hedayatnia Behnam
Hendrycks Dan
Hernandez Danny
Hernandez-Orallo Jose
Herrick Austin
Hilton Jacob
Hoeve Maartje ter
Hou Yu
Hou Yufang
Howald Blake
Htut Phu Mon
Hupkes Dieuwke
Hussain Aman
Hwang Pinyu
Ignatyeva Katerina
Inden Benjamin
Ippolito Daphne
Ivanitskiy Michael
Iyer Anantharaman S.
Iyer Niveditha S.
Jacobs Rowan
Jaimovitch-López Gonzalo
Jerzak Ethan
Jiang Angela
Jones Joseph
Jumelet Jaap
Jurgens David
Kale Mihir
Kanclerz Kamil
Kaplan Jared
Karakaş Ayla
Kernion Jackson
Keskar Nitish Shirish
Khashabi Daniel
Khot Tushar
Kilman Dan
Kim Ethan
Kim Hannah
Kim Jeremy
Kiritchenko Svetlana
Kirubarajan Arun
Kleyko Denis
Kluska Agnieszka
Kocoń Jan
Kocurek Alexander W.
Koppel James
Kornev Timofei
Krakover Neta Gur-Ari
Krauth Karl
Kruszewski Germán
Kwatra Sanjeev
La Andrew
Lakretz Yair
Lam Emma
Lam Lucas
Lampinen Andrew
Leavitt Matthew L.
LeBras Ronan
Lee Dong-Ho
Lee Jaehoon
Lee Nayeon
Lee Ryan
Lee Soo-Hwan
Levy Daniel
Levy Omer
Lewis Martha
Lewkowycz Aitor
Li Tao
Liang Paul Pu
Liang Percy
Liao Peiyuan
Lin Bill Yuchen
Lin Stephanie
Linzen Tal
Liu Rosanne
Livescu Karen
Loe Bao Sheng
Lyu Qing
Madotto Andrea
Makini Sneha Priscilla
Manning Christopher D.
Manyasi Eunice Engefu
Marelli Marco
Mariani Giorgio
Markert Katja
Marsh Jennifer
Martínez-Plumed Fernando
Maru Marco
Mathewson Kory
Mazeika Mantas
McDonell Kyle
McElrath Melvin
Mehta Harsh
Mei Qiaozhu
Melo Gerard de
Melzi Simone
Menezes Arul
Meng Chenlin
Metz Luke
Miller John
Millière Raphaël
Misherghi Summer
Mishra Gaurav
Mishra Swaroop
Misra Diganta
Misra Vedant
Miłkowski Piotr
Mohammad Saif M.
Mollo Dimitri Coelho
Morency Louis-Philippe
Moschella Luca
Muennighoff Niklas
Mukund Varma T
Mullokandov Asher
Nangia Nikita
Neeraj Trishala
Neyshabur Behnam
Ng Ian
Nie Allen
Nkinyili Tiberius
Noble Isaac
Noble Lucy
Norelli Antonio
Novak Roman
Novikova Jekaterina
Nyamai Victoria
Oli Priti
Omondi Kevin
Pachchigar Shubh
Padmakumar Vishakh
Parascandolo Giambattista
Parrish Alicia
Patil Piyush
Pavlick Ellie
Peng Nanyun
Perszyk Danielle
Pezeshkpour Pouya
Phan Thomas
Phang Jason
Piantadosi Steven T.
Potthast Martin
Potts Christopher
Power Alethea
Prabhu Vinay Uday
Prasad Stephen
Qin Lianhui
Quintana Maria Jose Ramírez
Radom Jarema
Raffel Colin
Rahane Ameet
Ramasesh Vinay
Ramirez Cindy
Ramírez César Ferri
Rao Abhishek
Rashkin Hannah
Rastogi Abhinav
Rathkopf Charles
Raunak Vikas
Ray Alex
Raymaekers Robbe
Reddy Siva
Ren Xiang
Reynolds Laria
Richardson Kyle
Rivera Clara E.
Roberts B. Ryan
Roberts Nicholas
Rodola Emanuele
Rong Frieda
Roth Dan
Rothschild Theodore
Rous Sarah A.
Rozen Jos
Rudolph Rachel Etta
Rule Joshua S.
Sabharwal Ashish
Sadeghi Sepideh
Safaya Ali
Salakhutdinov Ruslan
Santilli Andrea
Santoro Adam
Sap Maarten
Saunders William
Saurous Rif A.
Schick Timo
Schmidt Ludwig
Schoenholz Samuel S.
Schubert Mátyás
Schuster Sebastian
Schuster Tal
Schütze Hinrich
Segal Elad
Seid Zachary
Shaham Uri
Shakeri Siamak
Shen Xudong
Shevlin Henry
Shi Sherry
Shieber Stuart M.
Shkaruta Ksenia
Shleifer Sam
Shoeb Abu Awal Md
Shridhar Kumar
Shultz Tyler
Shutova Ekaterina
Shyamolima
Siar Fatemeh
Sikand Rohan
Sileo Damien
Simon James B.
Singh Chandan
Singh Shikhar
Siro Clemencia
Sitelew Roman
Slone Ambrose
Sohl-Dickstein Jascha
Song Jiaming
Song Yangqiu
Srikumar Vivek
Srivastava Aarohi
Srivastava Shashank
Starritt Michael
Stein Benno
Stinson Catherine
Stovall Ryan
Strube Michael
Stuhlmüller Andreas
Suzgun Mirac
Swędrowski Michał
Taal Jeroen
Tabassum Arfa
Tam Derek
Tang Eric
Tang Jillian
Tazarv Ali
Teehan Ryan
Telleen-Lawton Timothy
Tenenbaum Joshua B.
Thompson Jana
Thormeyer Simon
Tiwari Mo
Tolkiehn Marie
Tong Xiaoyu
Torene Spencer
Toshniwal Shubham
Tunduny Titus
Upadhyay Shyam
Venkatesh Anu
Vicol Paul
Voigt Christian
Vossen Wout
Vuong Anh
Waites Chris
Wang Gloria
Wang Tianle
Wang Zijian
Wang Zijie J.
Wang Zirui
Warstadt Alex
Waweru Joan
Wei Jason
Wen Nuan
Winata Genta Indra
Wiseman Sam
Wong Hugh Mee
Wu Chiyu
Wu Te-Lin
Wu Xinyi
Wu Ziyi
Xia Fanyue
Xiang Alice
Xu Jiacheng
Xu Mimee
Yaghoobzadeh Yadollah
Yakura Hiromu
Yang Diyi
Yang Rylan
Yang Yichi
Yasunaga Michihiro
Yee Michael A.
Yosinski Jason
Yu Tao
Yuret Deniz
Zhang Hongming
Zhang Li
Zhang Oliver
Zhang Rui
Zhang William
Zhao Xinran
Zhao Zhuoye
Zheltonozhskii Evgenii
Zheng James
Zhou Sharon
Zoph Barret
Zou Andy
Zou James
Özyurt Batuhan
Şenel Lütfi Kerem
Publication date: 09/06/2022

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 442 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting