11 research outputs found

Zero-shot audio captioning with audio-language model guidance and audio context keywords

Author: Akata Zeynep
Fauth Stefan
Koepke A. Sophia
Salewski Leonard
Publication date: 14/11/2023

Zero-shot audio captioning aims at automatically generating descriptive textual captions for audio content without prior training for this task. Different from speech recognition which translates audio content that contains spoken language into text, audio captioning is commonly concerned with ambient sounds, or sounds produced by a human performing an action. Inspired by zero-shot image captioning methods, we propose ZerAuCap, a novel framework for summarising such general audio signals in a text caption without requiring task-specific training. In particular, our framework exploits a pre-trained large language model (LLM) for generating the text which is guided by a pre-trained audio-language model to produce captions that describe the audio content. Additionally, we use audio context keywords that prompt the language model to generate text that is broadly relevant to sounds. Our proposed framework achieves state-of-the-art results in zero-shot audio captioning on the AudioCaps and Clotho datasets. Our code is available at https://github.com/ExplainableML/ZerAuCap.

Comment: NeurIPS 2023 - Machine Learning for Audio Workshop (Oral)

arXiv.org e-Print Archive

CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations

Author: Akata Zeynep
Koepke A. Sophia
Lensch Hendrik P. A.
Salewski Leonard
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022

Providing explanations in the context of Visual Question Answering (VQA) presents a fundamental problem in machine learning. To obtain detailed insights into the process of generating natural language explanations for VQA, we introduce the large-scale CLEVR-X dataset that extends the CLEVR dataset with natural language explanations. For each image-question pair in the CLEVR dataset, CLEVR-X contains multiple structured textual explanations which are derived from the original scene graphs. By construction, the CLEVR-X explanations are correct and describe the reasoning and visual information that is necessary to answer a given question. We conducted a user study to confirm that the ground-truth explanations in our proposed dataset are indeed complete and relevant. We present baseline results for generating natural language explanations in the context of VQA using two state-of-the-art frameworks on the CLEVR-X dataset. Furthermore, we provide a detailed analysis of the explanation generation quality for different question and answer types. Additionally, we study the influence of using different numbers of ground-truth explanations on the convergence of natural language generation (NLG) metrics. The CLEVR-X dataset is publicly available at https://explainableml.github.io/CLEVR-X/.

arXiv.org e-Print Archive

MPG.PuRe

In-Context Impersonation Reveals Large Language Models' Strengths and Biases

Author: Akata Zeynep
Alaniz Stephan
Rio-Torto Isabel
Salewski Leonard
Schulz Eric
Publication date: 24/05/2023

In everyday conversations, humans can take on different roles and adapt their vocabulary to their chosen roles. We explore whether LLMs can take on, that is, impersonate, different roles when they generate text in-context. We ask LLMs to assume different personas before solving vision and language tasks. We do this by prefixing the prompt with a persona that is associated either with a social identity or domain expertise. In a multi-armed bandit task, we find that LLMs pretending to be children of different ages recover human-like developmental stages of exploration. In a language-based reasoning task, we find that LLMs impersonating domain experts perform better than LLMs impersonating non-domain experts. Finally, we test whether LLMs' impersonations are complementary to visual information when describing different categories. We find that impersonation can improve performance: an LLM prompted to be a bird expert describes birds better than one prompted to be a car expert. However, impersonation can also uncover LLMs' biases: an LLM prompted to be a man describes cars better than one prompted to be a woman. These findings demonstrate that LLMs are capable of taking on diverse roles and that this in-context impersonation can be used to uncover their hidden strengths and biases.

arXiv.org e-Print Archive

e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks

Author: Akata Zeynep
Salewski Leonard
Publication venue: IEEE
Publication date: 01/01/2021

Publikationsserver der Universität Tübingen

Diverse Video Captioning by Adaptive Spatio-temporal Attention

Author: Lensch Hendrik P. A.
Salewski Leonard
Publication venue: Springer Link
Publication date: 22/09/2022

Publikationsserver der Universität Tübingen

In-Context Impersonation Reveals Large Language Models' Strengths and Biases

Author: Akata Zeynep
Alaniz Stephan
Salewski Leonard
Schulz Eric
Publication venue: arXiv
Publication date: 23/05/2023

Publikationsserver der Universität Tübingen

Zero-Shot Translation of Attention Patterns in VQA Models to Natural Language

Author: Akata Zeynep
Koepke Almut Sophia
Lensch Hendrik P. A.
Salewski Leonard
Publication venue: Springer Nature Switzerland
Publication date: 08/03/2024

Publikationsserver der Universität Tübingen

Zero-Shot Translation of Attention Patterns in VQA Models to Natural Language

Author: Akata Zeynep
Koepke A. Sophia
Lensch Hendrik P. A.
Salewski Leonard
Publication venue: arXiv
Publication date: 08/11/2023

Publikationsserver der Universität Tübingen

DIII-D research advancing the physics basis for optimizing the tokamak approach to fusion energy

Author: Abbate J.
Abe S.
Abrams T.
Adams M.
Adamson B.
Aiba N.
Akiyama Tsuyoshi
Aleynikov P B
Allen E.
Allen S.
Anand H.
Anderson Johan
Andrew Y.
Andrews T.
Appelt D.
Arbon R.
Ashikawa N.
Ashourvan A.
Aslin M.
Asnis Y.
Austin M.
Ayala D.
Bak J.
Bandyopadhyay I.
Banerjee S.
Barada K.
Bardoczi L.
Barr J.
Bass E.
Battaglia D.
Battey A.
Baumgartner W.
Baylor L.
Beckers J.
Beidler M.
Belli E
Berkery J.
Bernard T.
Bertelli N.
Beurskens M. N. A.
Bielajew R.
Bilgili S.
Biswas B.
Blondel S.
Boedo J.A.
Bogatu I.
Boivin R.
Bolzonella T.
Bongard M.
Bonnin X.
Bonoli P.T.
Bonotto M.
Bortolon A.
Bose S.
Bosviel N.
Bouwmans S.
Boyer M.
Boyes W.
Bradley L.
Brambila R.
Brennan D.
Bringuier S.
Brodsky L.
Brookman M.
Brooks J.
Brower D.
Brown G.
Brown W.
Burke M.
Burrell K.
Butler K.
Buttery R.
Bykov I.
Byrne P.
Cacheris A.
Callahan K.
Callen J. D.
Campbell G.
Candy Jeff
Canik J.
Cano Megías P.
Cao N.
Carayannopoulos L.
Carlstrom T.
Carrig W.
Carter T.
Cary W.
Casali L.
Cengher M.
Cespedes Paz G.
Chaban R.
Chan V.
Chapman B.
Char I.
Chattopadhyay A.
Chen J.
Chen J.
Chen J.
Chen M.
Chen R.
Chen X.
Chen X.
Chen Z.
Choi G.
Choi M.
Choi W.
Chousal L.
Chrobak C.
Chrystal C.
Chung Y.
Churchill R.
Cianciosa M.
Clark J.
Clement M.
Coda S.
Cole A.
Collins C.
Conlin W.
Cooper A.
Cordell J.
Coriton B.
Cote T.B.
Cothran J.
Creely A.
Crocker N.
Crowe C.
Crowley B.
Crowley T.
Cruz Zabala D.J.
Cummings D.
Curie M.
Curreli D.
Dal Molin A.
Dannels B.
Dautt-Silva A.
Davda K.
De Tommasi G.
de Vries P.C.
Degrandchamp G.
Degrassie J.
Demers D.
Denk S.S.
Depasquale S.
Deshazer E.
Diallo A.
Diem S.
Dimits A.
Ding R.
Ding S.
Ding W.
Do T.
Doane J.
Dong G.
Donovan D.
Drake J.
Drews W.
Drobny J.
Du H.
Du X.
Duarte V.
Dudt D.
Dunn C.
Duran J.
Dvorak A.
Effenberg F.
Eidietis N. W.
Elder D.
Eldon D.
Ellis R.
Elwasif W.
Ennis D.
Erickson K.
Ernst D.
Fasciana M.
Fedorov D.
Feibush E.
Fenstermacher M.E.
Ferraro N.
Ferreira J.
Ferron J.
Fimognari P.
Finkenthal D.
Fitzpatrick R.
Fox P.
Fox W.
Frassinetti L.
Frerichs H.
Frye H.
Fu Y.
Gage K.
Galdon-Quiroga J.
Gallo A.
Gao Q.
Garcia A.
Garcia-Munoz M.
Garnier D.
Garofalo A. M.
Gattuso A.
Geng D.
Gentle K.
Ghosh D.
Giacomelli L.
Gibson S.
Gilson E.
Giroud C
Glass F.
Glasser A. H.
Glibert D.
Gohil P.
Gomez R.
Gomez S.
Gong X.
Gonzales E.
Goodman A.
Gorelov Y.
Graber V.
Granetz R. S.
Gray T.
Green D.
Greenfield C.
Greenwald M.
Grierson B.
Groebner R.
Grosnickle W.
Groth M.
Grunloh H.
Gu S.
Guo H.
Guo W.
Gupta P.
Guterl J.
Guzman T.
Haar S.
Hager R.
Hahn S.
Halfmoon M.
Hall T.
Hallatschek K.
Halpern Federico
Hammett G.
Han H.
Hansen C.
Hansen E.
Hansink M.
Hanson J.
Hanson M.
Hao G.
Harris A.
Harvey R.
Haskey S.
Hassan E.
Hassanein A.
Hatch D.R.
Hawryluk R.
Hayashi W.
Heidbrink W.W.
Herfindal J.
Hicok J.
Hill D.
Hinson E.
Holcomb C.
Holland C.
Holland L.
Hollmann E M
Hollocombe J.
Holm A.
Holmes I.
Holtrop K.
Honda Mitsuru
Hong R.
Hood R.
Horton A.
Horvath L.
Hosokawa M.
Houshmandyar S.
Howard Nathan T.
Howell E.
Hoyt D.
Hu Q.
Hu W.
Hu Y.
Huang J.
Huang Y.
Hughes J.
Human T.
Humphreys D.
Huynh P.
Hyatt A.
Ibanez C.
Ibarra L.
Icasas R.
Ida K.
Igochine V.
In Y.
Inoue Shizuo
Isayama Akihiko
Izacard O.
Izzo V.
Järvinen A.
Jackson A.
Jacobsen G.
Jalalvand A.
Janhunen J.
Jardin S.
Jarleblad H.
Jeon Y.
Ji H.
Jian X.
Joffrin E.
Johansen A.
Johnson C.
Johnson T.
Jones C.
Joseph I.
Jubas D.
Junge B.
Köhn A.
Kalb W.
Kalling R.
Kamath C.
Kang J.
Kaplan D.
Kaptanoglu A.
Kasdorf S.
Kates-Harbeck J.
Kazantzidis P.-V.
Kellman A.
Kellman D.
Kessel C. E.
Khumthong K.
Kim C.
Kim E.
Kim H.S.
Kim Hyun-Tae
Kim J.
Kim J.
Kim K.
Kim S.
Kimura W.
King J.
King M.
Kinsey J.
Kirk A.
Kiyan B.
Kleiner A.
Klevarova V.
Knapp R.
Knolker M.
Ko W.
Kobayashi T.
Koch E.
Kochan M.
Koel B.
Koepke M.
Kolasinski R.
Kolemen E.
Kostadinova E.
Kostuk M.
Kramer G.J.
Kriete D.
Kripner L.
Kubota S.
Kulchar J.
Kwon K.
La Haye R. J.
Laggner F.
Lan H.
Lantsov R.
Lao L.
Lasa Esquisabel A.
Lasnier C.
Lau C.
Leard B.
Lee C.
Lee J.
Lee J.
Lee M.
Lee M.
Lee R.
Lee S.
Lee Y.
Lehnen M
Leonard A.
Leppink E.
Lesher M.
Lestz J.
Leuer J.
Leuthold N.
Li E.
Li G.
Li J.
Li K.
Li L.
Li X.
Li Y.
Li Z.
Lin D.
Lin Z.
Liu A.
Liu C.
Liu C.
Liu D.
Liu D.
Liu J.
Liu T.
Liu X.
Liu Y.
Liu Yueqiang
Liu Z.
Loarte-Prieto A.
Lodestro L.
Logan N.
Lohr J.
Lombardo B.
Lore J.
Luan Q.
Luce T.
Luda Di Cortemiglia T.
Luhmann N.C.
Lunsford R.
Luo Z.
Lvovskiy A.
Lyons B.
Müller Dirk
Ma X.
Madruga M.
Madsen B.
Maggi C.
Maheshwari K.
Mail A.
Mailloux J.
Maingi R.
Major M.
Makowski M.
Manchanda R.
Marini C.
Marinoni A.
Maris A.
Markovič T.
Marrelli L.
Martin E.
Mateja J.
Matsunaga G.
Maurizio R.
Mauzey D.
Mauzey P.
McArdle G.
McClenaghan J.
McCollam K.
McDevitt C.
McKay K.
McKee G.
McLean A.
Mehta V.
Meier E.
Menard J.
Meneghini O.
Merlo G.
Messer S.
Meyer W.
Michael C.
Michoski C.
Milne P.
Minet G.
Misleh A.
Mitrishkin Y.
Moeller C.
Montes K.
Morales M.
Mordijck S.
Moreau D.
Morosohk S.
Morris P.
Morton L.
Moser A.
Moyer R.
Moynihan C.
Mrazkova T.
Munaretto S.
Munoz Burgos J.
Murphy C.
Murphy K.
Muscatello C.
Myers C.
Nagy A.
Nandipati G.
Navarro M.
Nave F.
Navratil G.
Nazikian R.
Neff A.
Neilson G.
Neiser T.
Neiswanger W.
Nelson A.
Nelson D.
Nespoli F.
Nguyen L.
Nguyen R.
Nguyen X.
Nichols J.
Nocente M.
Nogami S.
Noraky S.
Norausky N.
Nornberg M.
Nygren R.
Odstrcil T.
Ogas D.
Ogorman T.
Ohdachi Satoshi
Ohtani Yoshiaki
Okabayashi M.
Okamoto M.
Olavson L.
Olofsson E.
Omullane M.
Oneill R.
Orlov D.
Orvis W.
Osborne T.
Pace D.
Paganini Canal G.
Pajares Martinez A.
Palacios L.
Pan C.
Pan Q.
Pandit R.
Pandya M.
Pankin A. Y.
Park J.
Park J. M.
Park Y.
Parker S.
Parks P.
Parsons M.
Patel B.
Pawley C.
Paz-Soldan C.
Peebles W.
Pelton S.
Perillo R.
Petty C.
Peysson Y
Pierce D.
Pigarov A.
Pigatto L.
Piglowski D.
Pinches S. D.
Pinsker R.
Piovesan P.
Piper N.
Pironti A.
Pitts R.
Pizzo J.
Plank U.
Podesta M.
Poli E.
Poli F.
Ponce D.
Popovic
Porkolab M.
Porter Grace C.E.
Powers C.
Powers S.
Prater R.
Pratt Q.
Pusztai Istvan
Qian J.
Qin X.
Ra O.
Rafiq T
Raines T.
Raman R.
Rauch J.
Raymond A.
Rea C.
Reich M.
Reiman A.
Reinhold S.
Reinke M.
Reksoatmodjo R.
Ren J.
Ren Q.
Ren Y.
Rensink M.
Renteria J.
Rhodes T. L.
Rice J.
Roberts R.
Robinson J.
Rodriguez-Fernandez P.
Rognlien T.
Rosenthal A.
Rosiello S.
Rost J.
Roveto J.
Rowan W.
Rozenblat R.
Ruane J.
Rudakov D.
Ruiz Ruiz J.
Rupani R.
Saarelma S.
Sabbagh S. A.
Sachdev J.
Saenz J.
Saib S.
Salewski M.
Salmi A.
Sammuli B.
Samuell C.
Sandorfi A.
Sang C.
Sarff J.
Sauter O.
Schaubel K.
Schmitz L.
Schmitz O.
Schneider J.
Schroeder P.
Schultz K.
Schuster E.
Schwartz J.
Sciortino F.
Scotti F.
Scoville J. T.
Seltzman A.
Seol S.
Sfiligoi I.
Shafer M.
Sharapov S. E.
Shen H.
Sheng Z.
Shepard T.
Shi S.
Shibata Y.
Shin G.
Shiraki D.
Shousha R.
Si H.
Simmerling P.
Sinclair G.
Sinha J.
Sinha P.
Sips G.
Sizyuk T.
Skinner C.
Sladkomedova A.
Slendebroek T.
Slief J.
Smirnov R.
Smith D.
Smith J.
Smith S.
Snipes J A
Snoep G.
Snyder A.
Snyder P.
Solano E.
Solomon W.
Song J.
Sontag A. C.
Soukhanovskii V.
Spendlove J.
Spong D.
Squire J.
Srinivasan C.
Stacey W.
Staebler G.
Stagner L.
Stange T.
Stangeby P.
Stefan R.
Stemprok R.
Stephan D.
Stillerman J.
Stoltzfus-Dueck T.
Stonecipher W.
Storment S.
Strait E. J.
Su D.
Sugiyama L.
Sun A.
Sun P.
Sun Y.
Sun Z.
Sundstrom D.
Sung C.
Sungcoco J.
Suttrop W.
Suzuki T.T.
Suzuki Y.
Svyatkovskiy A.
Swee C.
Sweeney R.
Sweetnam C.
Szepesi G.
Takechi M.
Tala T.
Tanaka K.
Tang S.
Tang X.
Tao R.
Tao Y.
Taussig D.
Taylor T.
Teixeira K.
Teo K.
Theodorsen A.
Thomas D.
Thome K. E.
Thorman A.
Thornton A.J.
Ti A.
Tillack M.
Timchenko Natalia
Tinguely R. A.
Tompkins R.
Tooker J.
Torrezan De Sousa A.
Trevisan G.L.
Tripathi S.
Trujillo Ochoa A.
Truong D.
Tsui C.K.
Turco F.
Turnbull A.
Umansky M. V.
Unterberg E.
Vaezi P.
Vail P.
Valdez J.
Valkis W.
Van Compernolle B.
Van Galen J.
Van Kampen R.
Van Zeeland M.A.
Verdoolaege Geert
Vianello N.
Victor B.
Viezzer E.
Vincena S.
Guttenfelder W.
Wade M.
Waelbroeck F.
Wai J.
Wakatsuki Takuma
Walker M.
Wallace G.M.
Waltz R.
Wampler W.
Wang G.
Wang H.
Wang H.
Wang H.
Wang L.
Wang Y.
Wang Y.
Wang Z.
Wang Z.
Ward S.
Watkins J.
Watkins M.
Wehner W.
Wei Y.
Weiland M.
Weisberg D.
Welander A.
White A.E.
White R.
Wiesen S.
Wilcox R.
Wilks T.
Willensdorfer M.
Wilson H.R.
Wingen A.
Wolde M.
Wolff M.
Woller K.
Wolz A.
Wong H.
Woodruff S.
Wu M.
Wu Y.
Wukitch S.
Wurden G.
Xiao W.
Xie R.
Xing Z.
Xu C.
Xu G.
Xu X.
Yan Z.
Yang S.
Yang X.
Yokoyama T.
Yoneda R.
Yoshida Maiko
You K.
Younkin T.
Yu G.
Yu J.
Yu M.
Yuan Q.
Zaidenberg L.
Zakharov L.
Zamengo A.
Zamperini S.
Zarnstorff M.
Zeger E.
Zeller K.
Zeng L.
Zerbini M.
Zhang B.
Zhang J.
Zhang J.
Zhang L.
Zhang R.
Zhang X.
Zhao B.
Zhao L.
Zheng L.
Zheng Y.
Zhu B.
Zhu J.
Zhu Y.
Zhu Y.
Zsutty M.
Zuin M.
Publication venue: 'IOP Publishing'
Publication date: 01/01/2022

Funding Information: This material is based upon work supported by the US Department of Energy, Office of Science, Office of Fusion Energy Sciences, using the DIII-D National Fusion Facility, a DOE Office of Science user facility, under Awards DE-FC02-04ER54698 and DE-AC52-07NA27344. Publisher Copyright: © 2022 IAEA, Vienna.

DIII-D physics research addresses critical challenges for the operation of ITER and the next generation of fusion energy devices. This is done through a focus on innovations to provide solutions for high performance long pulse operation, coupled with fundamental plasma physics understanding and model validation, to drive scenario development by integrating high performance core and boundary plasmas. Substantial increases in off-axis current drive efficiency from an innovative top launch system for EC power, and in pressure broadening for Alfvén eigenmode control from a co-/counter-I_p steerable off-axis neutral beam, all improve the prospects for optimization of future long pulse/steady state high performance tokamak operation. Fundamental studies into the modes that drive the evolution of the pedestal pressure profile and electron vs ion heat flux validate predictive models of pedestal recovery after ELMs. Understanding the physics mechanisms of ELM control and density pumpout by 3D magnetic perturbation fields leads to confident predictions for ITER and future devices. Validated modeling of high-Z shattered pellet injection for disruption mitigation, runaway electron dissipation, and techniques for disruption prediction and avoidance including machine learning, give confidence in handling disruptivity for future devices. For the non-nuclear phase of ITER, two actuators are identified to lower the L-H threshold power in hydrogen plasmas.
With this physics understanding and suite of capabilities, a high poloidal beta optimized-core scenario with an internal transport barrier that projects nearly to Q = 10 in ITER at ∼8 MA was coupled to a detached divertor, and a near super H-mode optimized-pedestal scenario with co-I_p beam injection was coupled to a radiative divertor. The hybrid core scenario was achieved directly, without the need for anomalous current diffusion, using off-axis current drive actuators. Also, a controller to assess proximity to stability limits and regulate β_N in the ITER baseline scenario, based on plasma response to probing 3D fields, was demonstrated. Finally, innovative tokamak operation using a negative triangularity shape showed many attractive features for future pilot plant operation.

Peer reviewed

University of Liverpool Repository

Archivio della ricerca - Università degli studi di Napoli Federico II

Pure OAI Repository

Ghent University Academic Bibliography

Aaltodoc Publication Archive

Chalmers Research

Coventry University Pure Portal

White Rose Research Online

MPG.PuRe

Online Research Database In Technology