Search CORE

3 research outputs found

Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Generated Text

Author: Callison-Burch Chris
Dugan Liam
Ippolito Daphne
Kirubarajan Arun
Shi Sherry
Publication venue
Publication date: 24/12/2022
Field of study

As text generated by large language models proliferates, it becomes vital to understand how humans engage with such text, and whether or not they are able to detect when the text they are reading did not originate with a human writer. Prior work on human detection of generated text focuses on the case where an entire passage is either human-written or machine-generated. In this paper, we study a more realistic setting where text begins as human-written and transitions to being generated by state-of-the-art neural language models. We show that, while annotators often struggle at this task, there is substantial variance in annotator skill and that given proper incentives, annotators can improve at this task over time. Furthermore, we conduct a detailed comparison study and analyze how a variety of variables (model size, decoding strategy, fine-tuning, prompt genre, etc.) affect human detection performance. Finally, we collect error annotations from our participants and use them to show that certain textual genres influence models to make different types of errors and that certain sentence-level features correlate highly with annotator selection. We release the RoFT dataset: a collection of over 21,000 human annotations paired with error classifications to encourage future work in human detection and evaluation of generated text.Comment: AAAI 2023 Long Paper. Code is available at https://github.com/liamdugan/human-detectio

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

Author: Abid Abubakar
Agarwal Akshat
Agha Omar
Alabi Jesujoba
Ali Tariq
Alipoormolabashi Pegah
Aminnaseri Moin
Anand Sajant
Andreassen Anders
Arakawa Riku
Argueta Cedrick
Arnaud Melody
Asaadi Shima
Ashcraft Courtney
Askell Amanda
Bahri Yasaman
Bai Yuntao
Baitemirova Medina Orduna
Balis John U.
Banjade Rabin
Bansal Mohit
Baral Chitta
Barnes Elizabeth
Barnes Richard
Baturan Marco
Belinkov Yonatan
Berant Jonathan
Betz Gregor
Bevilacqua Michele
Biderman Stella
Bischoff Sebastian
Bogar Hayden
Bojanowski Bartłomiej
Bosma Maarten
Bosscher Jelle
Boudeman Joseph
Bowman Samuel R.
Brown Adam R.
Burden John
Buzan Dilyar
Cain Mike
Callison-Burch Chris
Cameron Nicholas
Casares Pablo Antonio Moreno
Casey Sean
Chang Ernie
Chang Peter
Chang Trenton
Chen Angelica
Chen Danqi
Chen Derek
Chen Qinlang
Chen Yifu
Chi Ethan A.
Chi Nathan
Chi Ryan
Chiafullo Kristen
Choi Yejin
Chollet Francois
Chu Eric
Chua Joyce
Cohen Michael
Colón Luis Oliveros
Constant Noah
Contreras-Ochando Lidia
Cubuk Ekin Dogus
Dai Andrew
Datta Debajyoti
Debnath
Deckers Niklas
Dehaene Stanislas
Delgado Ramón Risco
Demberg Vera
Desbordes Théo
Dhole Kaustubh D.
Diao Cameron
Dillavou Sam
Divic Stefan
Dohan David
Doiron Nick
Donoway Elizabeth
Doshi Parth
Dour Cameron
Drakard David
Dsouza Amanda
Dugan Liam
Dyer Ethan
Eckersley Peter
Efrat Avia
Ekmekci Berk
Elbaghdadi Omar
Emelin Denis
Engel Jesse
Erdem Aykut
Erdem Erkut
Ermon Stefano
Evans Owain
Farooqi Maheen
Faruqui Manaal
Fedus William
Fiedel Noah
Fisac Jaime Fernández
Fisch Adam
Frank Robert
Freeman Daniel
Frohberg Jörg
Fung Pascale
Gabriel Raefer
Galijasevic Hana
Ganguli Deep
Gao Leo
Garbacea Cristina
Garg Rhythm
Garrette Dan
Garriga-Alonso Adrià
Gehrmann Sebastian
Geissinger Jack
Gerstenberg Tobias
Geva Mor
Ghazarian Sarik
Gheini Mozhdeh
Gholamidavoodi Arash
Ghosh Sayan
Gilboa Dar
Gimpel Kevin
Giulianelli Mario
González Daniel Moseguí
Gopalakrishnan Karthik
Gottardi Anna
Gruetter Samuel
Gu Michael
Gu Shixiang Shane
Gupta Aditya
Gupta Animesh
Gur-Ari Guy
Habacker Rahel
Hagen Matthias
Hagerman Eleanor
Hajishirzi Hannaneh
Hamdan Shadi
Han Sanghyun
Hao Yiding
Happé Francesca
Hashimoto Tatsu
Hatwar Sriharsha
He Luheng
Hedayatnia Behnam
Hendrycks Dan
Hernandez Danny
Hernandez-Orallo Jose
Herrick Austin
Hilton Jacob
Hoeve Maartje ter
Hou Yu
Hou Yufang
Howald Blake
Htut Phu Mon
Hupkes Dieuwke
Hussain Aman
Hwang Pinyu
Ignatyeva Katerina
Inden Benjamin
Ippolito Daphne
Ivanitskiy Michael
Iyer Anantharaman S.
Iyer Niveditha S.
Jacobs Rowan
Jaimovitch-López Gonzalo
Jerzak Ethan
Jiang Angela
Jones Joseph
Jumelet Jaap
Jurgens David
Kale Mihir
Kanclerz Kamil
Kaplan Jared
Karakaş Ayla
Kernion Jackson
Keskar Nitish Shirish
Khashabi Daniel
Khot Tushar
Kilman Dan
Kim Ethan
Kim Hannah
Kim Jeremy
Kiritchenko Svetlana
Kirubarajan Arun
Kleyko Denis
Kluska Agnieszka
Kocoń Jan
Kocurek Alexander W.
Koppel James
Kornev Timofei
Krakover Neta Gur-Ari
Krauth Karl
Kruszewski Germán
Kwatra Sanjeev
La Andrew
Lakretz Yair
Lam Emma
Lam Lucas
Lampinen Andrew
Leavitt Matthew L.
LeBras Ronan
Lee Dong-Ho
Lee Jaehoon
Lee Nayeon
Lee Ryan
Lee Soo-Hwan
Levy Daniel
Levy Omer
Lewis Martha
Lewkowycz Aitor
Li Tao
Liang Paul Pu
Liang Percy
Liao Peiyuan
Lin Bill Yuchen
Lin Stephanie
Linzen Tal
Liu Rosanne
Livescu Karen
Loe Bao Sheng
Lyu Qing
Madotto Andrea
Makini Sneha Priscilla
Manning Christopher D.
Manyasi Eunice Engefu
Marelli Marco
Mariani Giorgio
Markert Katja
Marsh Jennifer
Martínez-Plumed Fernando
Maru Marco
Mathewson Kory
Mazeika Mantas
McDonell Kyle
McElrath Melvin
Mehta Harsh
Mei Qiaozhu
Melo Gerard de
Melzi Simone
Menezes Arul
Meng Chenlin
Metz Luke
Miller John
Millière Raphaël
Misherghi Summer
Mishra Gaurav
Mishra Swaroop
Misra Diganta
Misra Vedant
Miłkowski Piotr
Mohammad Saif M.
Mollo Dimitri Coelho
Morency Louis-Philippe
Moschella Luca
Muennighoff Niklas
Mukund Varma T
Mullokandov Asher
Nangia Nikita
Neeraj Trishala
Neyshabur Behnam
Ng Ian
Nie Allen
Nkinyili Tiberius
Noble Isaac
Noble Lucy
Norelli Antonio
Novak Roman
Novikova Jekaterina
Nyamai Victoria
Oli Priti
Omondi Kevin
Pachchigar Shubh
Padmakumar Vishakh
Parascandolo Giambattista
Parrish Alicia
Patil Piyush
Pavlick Ellie
Peng Nanyun
Perszyk Danielle
Pezeshkpour Pouya
Phan Thomas
Phang Jason
Piantadosi Steven T.
Potthast Martin
Potts Christopher
Power Alethea
Prabhu Vinay Uday
Prasad Stephen
Qin Lianhui
Quintana Maria Jose Ramírez
Radom Jarema
Raffel Colin
Rahane Ameet
Ramasesh Vinay
Ramirez Cindy
Ramírez César Ferri
Rao Abhishek
Rashkin Hannah
Rastogi Abhinav
Rathkopf Charles
Raunak Vikas
Ray Alex
Raymaekers Robbe
Reddy Siva
Ren Xiang
Reynolds Laria
Richardson Kyle
Rivera Clara E.
Roberts B. Ryan
Roberts Nicholas
Rodola Emanuele
Rong Frieda
Roth Dan
Rothschild Theodore
Rous Sarah A.
Rozen Jos
Rudolph Rachel Etta
Rule Joshua S.
Sabharwal Ashish
Sadeghi Sepideh
Safaya Ali
Salakhutdinov Ruslan
Santilli Andrea
Santoro Adam
Sap Maarten
Saunders William
Saurous Rif A.
Schick Timo
Schmidt Ludwig
Schoenholz Samuel S.
Schubert Mátyás
Schuster Sebastian
Schuster Tal
Schütze Hinrich
Segal Elad
Seid Zachary
Shaham Uri
Shakeri Siamak
Shen Xudong
Shevlin Henry
Shi Sherry
Shieber Stuart M.
Shkaruta Ksenia
Shleifer Sam
Shoeb Abu Awal Md
Shridhar Kumar
Shultz Tyler
Shutova Ekaterina
Shyamolima
Siar Fatemeh
Sikand Rohan
Sileo Damien
Simon James B.
Singh Chandan
Singh Shikhar
Siro Clemencia
Sitelew Roman
Slone Ambrose
Sohl-Dickstein Jascha
Song Jiaming
Song Yangqiu
Srikumar Vivek
Srivastava Aarohi
Srivastava Shashank
Starritt Michael
Stein Benno
Stinson Catherine
Stovall Ryan
Strube Michael
Stuhlmüller Andreas
Suzgun Mirac
Swędrowski Michał
Taal Jeroen
Tabassum Arfa
Tam Derek
Tang Eric
Tang Jillian
Tazarv Ali
Teehan Ryan
Telleen-Lawton Timothy
Tenenbaum Joshua B.
Thompson Jana
Thormeyer Simon
Tiwari Mo
Tolkiehn Marie
Tong Xiaoyu
Torene Spencer
Toshniwal Shubham
Tunduny Titus
Upadhyay Shyam
Venkatesh Anu
Vicol Paul
Voigt Christian
Vossen Wout
Vuong Anh
Waites Chris
Wang Gloria
Wang Tianle
Wang Zijian
Wang Zijie J.
Wang Zirui
Warstadt Alex
Waweru Joan
Wei Jason
Wen Nuan
Winata Genta Indra
Wiseman Sam
Wong Hugh Mee
Wu Chiyu
Wu Te-Lin
Wu Xinyi
Wu Ziyi
Xia Fanyue
Xiang Alice
Xu Jiacheng
Xu Mimee
Yaghoobzadeh Yadollah
Yakura Hiromu
Yang Diyi
Yang Rylan
Yang Yichi
Yasunaga Michihiro
Yee Michael A.
Yosinski Jason
Yu Tao
Yuret Deniz
Zhang Hongming
Zhang Li
Zhang Oliver
Zhang Rui
Zhang William
Zhao Xinran
Zhao Zhuoye
Zheltonozhskii Evgenii
Zheng James
Zhou Sharon
Zoph Barret
Zou Andy
Zou James
Özyurt Batuhan
Şenel Lütfi Kerem
Publication venue
Publication date: 09/06/2022
Field of study

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 442 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting