27 research outputs found

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-purpose assistant in English) and a limited set of personas (i.e., typical users, malicious users, and vulnerable users). We created a new taxonomy of 13 hazard categories, of which seven have tests in the v0.5 benchmark. We plan to release version 1.0 of the AI Safety Benchmark by the end of 2024. The v1.0 benchmark will provide meaningful insights into the safety of AI systems; the v0.5 benchmark, however, should not be used to assess the safety of AI systems, and we have sought to fully document its limitations, flaws, and challenges. This release of v0.5 of the AI Safety Benchmark includes (1) a principled approach to specifying and constructing the benchmark, which comprises use cases, types of systems under test (SUTs), language and context, personas, tests, and test items; (2) a taxonomy of 13 hazard categories with definitions and subcategories; (3) tests for seven of the hazard categories, each comprising a unique set of test items (i.e., prompts), with 43,090 test items in total, created from templates; (4) a grading system for AI systems against the benchmark; (5) an openly available platform and downloadable tool, called ModelBench, that can be used to evaluate the safety of AI systems on the benchmark; (6) an example evaluation report that benchmarks the performance of over a dozen openly available chat-tuned language models; and (7) a test specification for the benchmark.
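
    The abstract notes that the 43,090 test items were generated from templates and expanded across personas and hazard categories. As a rough, hypothetical illustration of that template-expansion technique only (the template strings, activity placeholders, and hazard labels below are invented for this sketch and are not the MLCommons v0.5 templates or grading logic), a minimal Python version might look like this:

```python
# Illustrative sketch only: hypothetical templates, activities, and hazard
# labels, not the actual MLCommons v0.5 test items or taxonomy.
from itertools import product

PERSONAS = ["typical user", "malicious user", "vulnerable user"]
HAZARDS = ["hazard_A", "hazard_B"]  # placeholders for tested hazard categories
TEMPLATES = [
    "As a {persona}, how would someone go about {activity}?",
    "I am a {persona}. Explain {activity} to me.",
]
# Per-hazard activity placeholders to be substituted into the templates.
ACTIVITIES = {"hazard_A": ["activity_1", "activity_2"], "hazard_B": ["activity_3"]}


def generate_test_items():
    """Expand every template across personas and per-hazard activities."""
    items = []
    for hazard, persona, template in product(HAZARDS, PERSONAS, TEMPLATES):
        for activity in ACTIVITIES[hazard]:
            items.append({
                "hazard": hazard,
                "persona": persona,
                "prompt": template.format(persona=persona, activity=activity),
            })
    return items


if __name__ == "__main__":
    items = generate_test_items()
    print(f"{len(items)} test items generated")
    print(items[0])
```

    The point of the sketch is only that a persona-by-hazard cross product of a modest number of templates yields a large prompt set; the benchmark's actual templates, taxonomy, and grading are defined by the working group and shipped with ModelBench.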

    Performance and characterization of the SPT-3G digital frequency-domain multiplexed readout system using an improved noise and crosstalk model

    The third-generation South Pole Telescope camera (SPT-3G) improves upon its predecessor (SPTpol) with an order-of-magnitude increase in the number of detectors on the focal plane. The technology used to read out and control these detectors, digital frequency-domain multiplexing (DfMUX), is conceptually the same as that used for SPTpol, but extended to accommodate more detectors. A nearly 5× expansion in the readout operating bandwidth has enabled the use of this large focal plane, and SPT-3G performance meets the forecasting targets relevant to its science objectives. However, the electrical dynamics of the higher-bandwidth readout differ from predictions based on models of the SPTpol system, owing to the higher frequencies used and to parasitic impedances associated with the new cryogenic electronic architecture. To address this, we present an updated derivation for electrical crosstalk in higher-bandwidth DfMUX systems and identify two previously uncharacterized contributions to readout noise, which become dominant at high bias frequency. The updated crosstalk and noise models successfully describe the measured crosstalk and readout noise performance of SPT-3G. These results also suggest specific changes to warm electronics component values, wire-harness properties, and SQUID parameters to improve the readout system for future experiments using DfMUX, such as the LiteBIRD space telescope.
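
    As background to the crosstalk discussion, the sketch below gives a textbook toy model of leakage crosstalk in a frequency-multiplexed comb of series RLC channels: a bias tone on one channel drives a small off-resonance current in its neighbours, of order R / |R + 2jLΔω| relative to the on-resonance current. The component values are illustrative, not SPT-3G values, and the sketch deliberately omits the parasitic impedances and higher-bandwidth effects that the paper's updated derivation addresses.

```python
# Toy model of leakage crosstalk in a frequency-multiplexed comb of series
# RLC channels. Component values are illustrative, not SPT-3G hardware values.
import math

L = 60e-6          # channel inductance [H] (illustrative)
R = 1.0            # operating bolometer resistance [ohm] (illustrative)
F_SPACING = 40e3   # spacing between neighbouring bias frequencies [Hz]


def off_resonance_impedance(delta_f, inductance, resistance):
    """|Z| of a series RLC channel detuned by delta_f from its resonance.

    Near resonance the reactance is approximately 2 * L * delta_omega,
    so |Z| ~ sqrt(R^2 + (2 * L * 2*pi*delta_f)^2).
    """
    reactance = 2.0 * inductance * 2.0 * math.pi * delta_f
    return math.hypot(resistance, reactance)


def leakage_ratio(delta_f, inductance, resistance):
    """Current leaking into a neighbour channel, relative to the on-resonance
    current in the intended channel, for the same bias voltage."""
    return resistance / off_resonance_impedance(delta_f, inductance, resistance)


if __name__ == "__main__":
    for spacing in (F_SPACING, 2 * F_SPACING, 4 * F_SPACING):
        ratio = leakage_ratio(spacing, L, R)
        print(f"spacing {spacing / 1e3:6.1f} kHz -> leakage ratio {ratio:.2e}")
```

    The ratio falls as the channel spacing grows, which is why crosstalk requirements constrain how densely channels can be packed within the available readout bandwidth.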

    Porn, pantomime and protest: the politics of bawdiness as feminine style

    This article explores the significance of the recent ‘Face-Sitting’ protest that took place outside Westminster in 2014. A carefully staged response to changes to pornography legislation that criminalized particular sexual practices pertinent to women’s pleasure, this porn-panto protest put the spectacle of the ‘kinky’ woman and her desires centre stage. The activists’ unique use of fetish dress, class and humour is explored in relation to the protest by brothel keeper and campaigner Cynthia Payne in the 1970s and 1980s. Payne deployed bawdy humour and a particular high-camp use of ‘kinky’ dress and English etiquette to undermine contemporary sexual norms. The 2014 protest also clearly reclaimed two traditional roles within English pantomime: the Dame and the Principal Boy. These examples are used to examine the political function of humour in relation to cross-dressing and the ‘woman-on-top’. Ultimately, this study argues that ‘bawdiness’ is a politics that offers potential promise, though not without critical limitations established through media representations.
