Search CORE

56 research outputs found

Gender bias and stereotypes in Large Language Models

Author: Dockum Rikker
Kotek Hadas
Sun David Q.
Publication venue
Publication date: 28/08/2023
Field of study

Large Language Models (LLMs) have made substantial progress in the past several months, shattering state-of-the-art benchmarks in many domains. This paper investigates LLMs' behavior with respect to gender stereotypes, a known issue for prior models. We use a simple paradigm to test the presence of gender bias, building on but differing from WinoBias, a commonly used gender bias dataset, which is likely to be included in the training data of current LLMs. We test four recently published LLMs and demonstrate that they express biased assumptions about men and women's occupations. Our contributions in this paper are as follows: (a) LLMs are 3-6 times more likely to choose an occupation that stereotypically aligns with a person's gender; (b) these choices align with people's perceptions better than with the ground truth as reflected in official job statistics; (c) LLMs in fact amplify the bias beyond what is reflected in perceptions or the ground truth; (d) LLMs ignore crucial ambiguities in sentence structure 95% of the time in our study items, but when explicitly prompted, they recognize the ambiguity; (e) LLMs provide explanations for their choices that are factually inaccurate and likely obscure the true reason behind their predictions. That is, they provide rationalizations of their biased behavior. This highlights a key property of these models: LLMs are trained on imbalanced datasets; as such, even with the recent successes of reinforcement learning with human feedback, they tend to reflect those imbalances back at us. As with other types of societal biases, we suggest that LLMs must be carefully tested to ensure that they treat minoritized individuals and communities equitably.Comment: ACM Collective Intelligenc

arXiv.org e-Print Archive

Gender representation in linguistic example sentences

Author: Babinski Sarah
Dockum Rikker
Geissler Christopher
Kotek Hadas
Publication venue: 'Linguistic Society of America'
Publication date: 23/03/2020
Field of study

Prior studies have shown that example sentences in syntax textbooks systematically under-represent women and perpetuate gender stereotypes (Macaulay & Brice 1994, 1997; Pabst et al. 2018). We examine the articles published over the past 20 years in Language, Linguistic Inquiry, and Natural Language & Linguistic Theory, and find striking similarities to this prior work. Among our findings, we show a stark imbalance of male (N=10807) to female (N=5019) arguments, and that male-gendered arguments are more likely to be subjects, and female arguments non-subjects. We show that female-gendered arguments are less likely to be referred to using pronouns and are more likely to be referred to using a kinship term, whereas male-gendered arguments are more likely to have occupations and to perpetrate violence. We show that this pattern has remained stable, with very little change, over the course of the twenty years that we examine, leading up to the present day. We conclude with a brief discussion of possible remedies and suggestions for improvement

Proceedings Published by the LSA (Linguistic Society of America)

Who speaks for us?:Lessons from the Pinker letter

Author: Anonymous
Dockum Rikker
Dow Michael
Kastner Itamar
Kotek Hadas
Publication venue
Publication date: 01/05/2021
Field of study

Edinburgh Research Explorer

Pama-Nyungan grandparent systems change with grandchildren, but not cross-cousin terms or social norms

Author: Bowern Claire
Dockum Rikker
Jordan Fiona M
Sheard Catherine
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 05/06/2020
Field of study

Kinship is a fundamental and universal aspect of the structure of human society. The kinship category of ‘grandparents’ is socially salient, due to grandparents’ investment in the care of the grandchildren as well as to older generations’ control of wealth and cultural knowledge, but the evolutionary dynamics of grandparent terms has yet to be studied in a phylogenetically explicit context. Here, we present the first phylogenetic comparative study of grandparent terms by investigating 134 languages in Pama-Nyungan, an Australian family of hunter-gatherer languages. We infer that proto-Pama-Nyungan had, with high certainty, four separate terms for grandparents. This state then shifted into either a two-term system that distinguishes the genders of the grandparents or a three-term system that merges the ‘parallel’ grandparents, which could then transition into a different three-term system that merges the ‘cross’ grandparents. We find no support for the co-evolution of these systems with either community marriage organisation or post-marital residence. We find some evidence for the correlation of grandparent and grandchild terms, but no support for the correlation of grandparent and cross-cousin terms, suggesting that grandparents and grandchildren potentially form a single lexical category but that the entire kinship system does not necessarily change synchronously

Aberdeen University Research

PubMed Central

Explore Bristol Research

Text-Speech Alignment: A Robin Hood Approach for Endangered Languages

Author: Babinski Sarah
Bowern Claire
Craft Hunter
Dockum Rikker
Fergus Anelisa
Goldenberg Dolly
Publication venue: EliScholar – A Digital Platform for Scholarly Publishing at Yale
Publication date: 17/01/2019
Field of study

Forced alignment automatically aligns audio recordings of spoken language with transcripts at the level of individual sounds, greatly reducing the time required to prepare data for linguistic analysis. However, existing algorithms are mostly trained on a few well-documented languages. We test the performance of three algorithms against manually aligned data on data from a highly endangered language. At least some tasks, unsupervised alignment (either based on English or trained from a small corpus) is sufficiently reliable for it to be used on legacy data for low-resource languages. Descriptive phonetic work on vowel inventories and prosody can be accurately captured by automatic alignment with minimal training data. Underutilized legacy data exist for many endangered languages. This creates both a need and an opportunity to leverage new technology

Yale University

Determination of 24 primary aromatic amines in aqueous food simulants by combining solid phase extraction and salting-out assisted liquid?liquid extraction with liquid chromatography tandem mass spectrometry

Author: Bodai Zsolt
Eke Zsuzsanna
Hegedus Janos
Jakab Peter Pal
Kirchkeszner Csaba
Nyiri Zoltan
Petrovics Noemi
Rikker Tamas
Szabo Balint Samuel
Publication venue: 'Elsevier BV'
Publication date: 01/01/2021
Field of study

Carcinogenic primary aromatic amines (PAAs) can be released from improperly manufactured food packaging materials. The limit for the sum of PAAs is set to 10 ?gkg- 1 in Commission Regulation No. 10/2011 (FCM Regulation). However, a lower individual limit, 2 ?gkg- 1 has been recently introduced for the carcinogenic PAAs in Commission Regulation No. 2020/1245. As the majority of the previously published methods are no longer compliant with the current regulation, a UHPLC-MS/MS method was developed to enable food packaging compliance testing for PAAs not only from 3% (w/v) acetic acid, but also from 10% (v/v) ethanol food simulant. Since the latest amendment of the FCM Regulation refers to the list of the 22 restricted PAAs of EU Regulation No. 1907/2006, these PAAs were selected as target compounds along with aniline and p-toluidine, the most common impurities of azo colorants and isocyanates. An enrichment factor of 20 could be achieved combining solid phase extraction with salting-out assisted liquid?liquid extraction. The method was successfully validated and applied on real samples. Limit of quantitation (LOQ) and limit of detection (LOD) values were 0.15 ?gL-1 and 0.05 ?gL-1 for both food simulants, respectively; except for 2,4-diaminotoluene, aniline and 4,4?-oxydianiline. However, even these compounds had lower LOD values than the new individual limit of 2 ?gkg- 1. Cumulative LOD values for both food simulants (1.6 ?gL-1 and 1.5 ?gL-1 for 3% (w/v) acetic acid and 10% (v/v) ethanol, respectively) were lower than the 10 ?gkg- 1 specified in the FCM Regulation. Accuracy values were between 70 and 118% for both food simulants for the majority of PAAs. Both within-day and between-day precision values were below 20%. This method proved to be suitable for daily routine analysis enabling compliance testing of food packaging materials according to the latest regulations. The method was successfully applied for the analysis of plastic kitchenware samples

ELTE Digital Institutional Repository (EDIT)

Efficient Targeted Next Generation Sequencing-Based Workflow for Differential Diagnosis of Alport-Related Disorders

Author: Bereczki Csaba
Endreffy Emőke
Haszon Ibolya
Iványi Béla
Kalmár Tibor
Kovács Gábor
Maróti Zoltán
Ondrik Zoltán
Rikker Csaba
Sinkó Mária
Túri Sándor
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

A Robin Hood approach to forced alignment: English-trained algorithms and their use on Australian languages

Author: Babinski Sarah
Bowern Claire
Craft J. Hunter
Dockum Rikker
Fergus Anelisa
Goldenberg Dolly
Publication venue: 'Linguistic Society of America'
Publication date: 15/03/2019
Field of study

Forced alignment automatically aligns audio recordings of spoken language with transcripts at the segment level, greatly reducing the time required to prepare data for phonetic analysis. However, existing algorithms are mostly trained on a few well-documented languages. We test the performance of three algorithms against manually aligned data. For at least some tasks, unsupervised alignment (either based on English or trained from a small corpus) is sufficiently reliable for it to be used on legacy data for low-resource languages. Descriptive phonetic work on vowel inventories and prosody can be accurately captured by automatic alignment with minimal training data. Consonants provided significantly more challenges for forced alignment

Proceedings Published by the LSA (Linguistic Society of America)

Micromechanical Properties of Injection-Molded Starch–Wood Particle Composites

Author: Abukurah A.
Acharya M.
Agarwal A.
Ahmad S.
Al-Bander H.
Al-Saghir F.
Alas E.
Albertazzi A.
Alcázar de La Ossa J.
Alfred H.
Almeida F.
Altobelli V.
Alvarez Sandoval E.
Amatya A.
Anashkin V.
Andrusev A.
Andrés Ribes E.
Ang K.
Anger M.
Aranda Verástegui F.
Araújo S.
Arbeit L.
Arfeen S.
Arias I.
Arif F.
Arkossy O.
Arrieta J.
Arruda J.
Ashfaq A.
Assefi A.
Auricchio M.
Avella F.
Ayodeji O.
Azer M.
Baer H.
Balkarova O.
Bannister K.
Bastos M.
Batista P.
Batlle D.
Becker B.
Belledonne M.
Bentkowski W.
Beresan H.
Bernardo M.
Berta K.
Bhandari K.
Bhatia D.
Bidas K.
Billiouw J.
Block G.
Block Geoffrey A.
Blumenthal S.
Bochicchio-Ricardelli T.
Bolasco P.
Borbas B.
Boulechfar H.
Bouman K.
Brancaccio D.
Brandi L.
Braun J.
Braun M.
Bregman R.
Brink H.
Brosnahan G.
Brown C.
Brown E.
Brunet P.
Buerkert J.
Burdmann E.
Burnier M.
Cambier-Dwelschauwers P.
Cangiano-Rivera J.
Cannella G.
Capasso G.
Capelli J.
Cappelli G.
Cardeal da Costa J.
Carvalho A.
Carvalho B.
Cases A.
Cavalieri T.
Chaffin M.
Charytan C.
Cheng J.
Chertow Glenn M.
Chew Wong A.
Chidester P.
Choukroun G.
Chow S.
Cleveland W.
Coll Piera E.
Combe C.
Cook M.
Correa-Rotter R.
Correa-Rotter Ricardo
Cottiero R.
Cotton J.
Cournoyer S.
Cozzolino M.
Crouch T.
Cruz-Valdez J.
Csiky B.
Culleton B.
Culpepper M.
Cunha P. D.
Cusumano A.
d'Avila D.
Da Roza G.
Daelemans R.
Dalal S.
Darwish R.
Dasgupta I.
Daugaard H.
Davies M.
Davies S.
de Almeida Romão E.
de Francesco Daher E.
De Meester J.
Deboni L.
Dehmel Bastian
Dejagere T.
Del Valle E.
DelGiorno T.
Dellanna F.
Denu-Ciocca C.
Desmeules S.
Dhaene M.
Dhingra R.
Di Giulio S.
Diamond S.
Dickenmann M.
Diez G.
Disney A.
Dolson G.
Domashenko O.
Donck J.
Donnelly S.
Douthat W.
Dratwa M.
Dreyer P.
Drüeke Tilman B.
Dumler F.
Dunnigan E.
Durlik M.
Durrbach A.
Dykes P.
Eadington D.
Eckardt K.
Eigner M.
El Khatib M.
El Shahawy M.
Elli A.
Endsley J.
Erley C.
Ermolenko V.
Errico R.
Eustace J.
Evaluation of Cinacalcet HCl Therapy to Lower Cardiovascular Events (EVOLVE) Trial Investigators
Evenepoel P.
Fadem S.
Farina M.
Fassett R.
Fekete A.
Felsenfeld A.
Ferenczi S.
Fernandez Lucas M.
Ferrari P.
Ferreira Filho S.
Fessi H.
Fiedler R.
Finkelstein F.
Fischer D.
Floege J.
Flöge Jürgen
Fraenkel M.
Frascà G.
Frazao J.
Friedman E.
Friedman L.
Frischmuth N.
Galicia M.
Galindo-Ramos E.
Gallart C.
Gandhi K.
Garcia V.
García F.
García N.
Garrote N.
Gehr T.
Gesualdo L.
Gillies A.
Gladish R.
Goldberger S.
Golden J.
Goldfarb D.
Goldman J.
Goldsmith D.
Gonzalez R.
González M.
Goodman William G.
Gordan P.
Graf H.
Grandaliano G.
Gross M.
Gupta A.
Gupta A.
Gupta B.
Gura V.
Gurevich A.
Gurevich K.
Guzman-Rivera J.
Hagen E.
Halligan R.
Hannedouche T.
Haque M.
Hawley C.
Hazzan A.
Henriquez M.
Herman T.
Hermida O.
Herzog Charles A.
Hoerl W.
Holzer H.
Horn S.
Hunt N.
Hura C.
Husserl F.
Hutchison A.
Hutchison B.
Hübel E.
Israelit A.
Jacobson S.
Jadoul M.
Jamal A.
James G.
Jensen G.
Jensen P.
Jindal K.
Joly D.
Jones B.
Jones E.
Jorgetti V.
Jose M.
Juncos L.
Kalra P.
Kant K.
Kapatkin K.
Kark A.
Karunakaran S.
Kazup Erdelyine S.
Keightley G.
Kerr P.
Ketteler M.
Khadikova N.
Khan M.
Khokhar S.
Khrustalev O.
Klauser R.
Kleinman L.
Klingberg M.
Klinger M.
Kohli R.
Kolmakova E.
Komandenko M.
Kopelman R.
Kopyt N.
Kovarik J.
Kramar R.
Kshirsagar A.
Ksiazek A.
Kubo Yumi
Kulcsar I.
Kumar J.
Kumar N.
Kunzendorf U.
Kwan J.
Ladanyi E.
Lafalla A.
Lai L.
Lang P.
Langham R.
Laski M.
Laville M.
Lea J.
LeBlanc M.
Lee M.
Lef L.
Lew S.
Lhotta K.
Liebl R.
Lien Y.
Light P.
Linares B.
Lionet A.
Liss K.
Lobo J.
Locatelli F.
Lok C.
London G.
London Gerard M.
Lonergan M.
Lopez N.
Lorica V.
Losito A.
Lugon J.
Luiz Gross J.
Lund U.
Lynn R.
Macario F.
Machado D.
Mackie J.
MacLaurin J.
MacRae J.
Maes B.
Mahaffey Kenneth W.
Major L.
Malberti F.
Malireddi K.
Marchetta N.
Marczewski K.
Marin I.
Martin P.
Martinez Saye J.
Martinez C.
Martín de Francisco Á.
Martín-Malo A.
Martínez García J.
Matalon A.
Mathew M.
Matuszkiewicz-Rowinska J.
Mayer G.
McCarthy J.
McClellan W.
McConnell K.
McCrary R.
Mehrotra R.
Meier P.
Menahem S.
Messa P.
Mezei I.
Michael B.
Michaud M.
Middleton J.
Minasian R.
Minga T.
Mingardi G.
Mittleman J.
Mittman N.
Moe Sharon M.
Montambault P.
Montenegro J.
Moriero E.
Morse S.
Mount P.
Mousson C.
Moustafa M.
Moyses Neto M.
Moysés R.
Muirhead N.
Murphy S.
Murray B.
Murthyr B.
Muszytowski M.
Mysliwiec M.
Nader P.
Najun Zarazaga C.
Nammour T.
Navarro J.
Neiva Coelho S.
Neumayer H.
Newman G.
Neyer U.
Nissenson A.
Noble S.
Nosrati S.
Nowicki M.
Ntoso K.
Oberbauer R.
Olgaard K.
Oliveira I.
Ondei P.
Ong S.
Ortalda V.
Ortiz J.
Osanloo E.
Ostrowski M.
Padmanabhan N.
Pahl M.
Pai P.
Parfrey Patrick S.
Passauer J.
Patak R.
Pecoits Filho R.
Pedagogos E.
Pedrosa A.
Peeters J.
Pellegrino B.
Pergola P.
Perlin D.
Pertosa G.
Petraglia G.
Peña J.
Peñalba N.
Picollo de Oliveira J.
Pitone J.
Plumb T.
Pogue V.
Polack D.
Pollock C.
Pons J.
Poole C.
Prados M.
Pritchard N.
Rabbat C.
Raff A.
Raguram P.
Rahim F.
Raja R.
Rambausek M.
Ramirez J.
Ramirez M.
Ramirez N.
Ranjit U.
Rano M.
Rastogi A.
Rattensberger D.
Reddan D.
Reddy M.
Reichel H.
Rekhi A.
Renders L.
Ricciardi B.
Riegel W.
Rieu P.
Rigolosi R.
Rikker C.
Rizk D.
Robertson J.
Roe S.
Roger S.
Rogers T.
Romero R.
Roppolo M.
Rosa Diez G.
Rosansky S.
Rozhinskaya L.
Rubin J.
Rubinstein S.
Rudolf G.
Rump L.
Rutkowski B.
Ryckelynck P.
Sabto J.
Saklayen M.
Saldanha Thome F.
Salgado N.
Sampaio Lacativa P.
Samuels L.
Santiago C.
Santos J.
Sapir D.
Sarkar S.
Schena F.
Schiller-Moran B.
Schmidt R.
Schonefeld M.
Schwenger V.
Schwertfeger E.
Scott D.
Sebastian Diaz M.
Sedor J.
Sekkarie M.
Shahmir E.
Shapiro W.
Sharon Z.
Shilo V.
Sholer C.
Shostka G.
Sidhu P.
Silver M.
Simon P.
Skinner F.
Smak Gregoor P.
Smirnov A.
Smith M.
Soler Amigó J.
Solis M.
Soman S.
Soroka S.
Specter R.
Sperto Baptista M.
Spiegel D.
Sprague S.
Stafford C.
Stahl R.
Staroselsky K.
Stefoni S.
Stegman M.
Steinberg S.
Sterner G.
Sterrett J.
Stolear J.
Stolina Maria
Stratton J.
Streja D.
Strokov A.
Stroumza P.
Ståhl A.
Sugihara J.
Sulowicz W.
Suranyi M.
Swift P.
Switalski M.
Szabo T.
Szegedi J.
Talaulikar G.
Taparia B.
Tareen N.
Teixeira Araújo M.
Teredesai P.
Thakur V.
Tharpe D.
Thomas A.
Thompson N.
Tielemans C.
Timofeev M.
Timokhovskaya G.
Tobe S.
Tolkan S.
Tomson C.
Topf J.
Torres Zamora M.
Torres M.
Touchard G.
Tsang A.
Tuazon J.
Tucker K.
Uehlinger D.
Urena Torres P.
Valtuille R.
van den Dorpel M.
van der Sande F.
Van Kuijk W.
Vanholder R.
Vanwalleghem J.
Vasconcellos L.
Vasilevsky M.
Vela C.
Vermeij C.
Verrelli M.
Villa G.
Vital Flores M.
Volgina G.
von Albertini B.
Voßkühler A.
Wagner G.
Walker R.
Warling X.
Warnock D.
Weigert A.
Weinberg M.
Weise W.
Weiss L.
West C.
Wheeler D.
Wheeler David C.
Widerhorn A.
Wiecek A.
Wijeyesinghe E.
Wikström B.
Wilkie M.
Williams M.
Wizemann V.
Wong G.
Wooldridge T.
Woredekal Y.
Yaqoob M.
Youell T.
Yu L.
Zacharias J.
Zager P.
Zakar G.
Zantvoort F.
Zaoui P.
Zehnder D.
Zeig S.
Zemtchenkov A.
Zimmerman D.
Zoccali C.
Publication venue
Publication date: 01/01/2006
Field of study

The micromechanical properties of injection molded starch–wood particle composites were investigated as a function of particle content and humidity conditions. The composite materials were characterized by scanning electron microscopy and X-ray diffraction methods. The microhardness of the composites was shown to increase notably with the concentration of the wood particles. In addition,creep behavior under the indenter and temperature dependence were evaluated in terms of the independent contribution of the starch matrix and the wood microparticles to the hardness value. The influence of drying time on the density and weight uptake of the injection-molded composites was highlighted. The results revealed the role of the mechanism of water evaporation, showing that the dependence of water uptake and temperature was greater for the starch–wood composites than for the pure starch sample. Experiments performed during the drying process at 70°C indicated that the wood in the starch composites did not prevent water loss from the samples.Peer reviewe

Archivio istituzionale della ricerca - Università di Bari

Publikationsserver der RWTH Aachen University

Archivio Istituzionale della Ricerca- Università degli Studi di Foggia

Digital.CSIC

Archivio Istituzionale della Ricerca - Università degli Studi della Campania "Luigi Vanvitelli"