14 research outputs found

Investigating Trade-offs For Fair Machine Learning Systems

Author: Hort Max
Publication venue: UCL (University College London)
Publication date: 28/01/2023

Fairness in software systems aims to provide algorithms that operate in a nondiscriminatory manner with respect to protected attributes such as gender, race, or age. Fairness is a crucial non-functional property of data-driven Machine Learning systems. Several approaches (i.e., bias mitigation methods) have been proposed in the literature to reduce the bias of Machine Learning systems. However, bias reduction often comes hand in hand with performance deterioration. Therefore, this thesis addresses the trade-offs that practitioners face when debiasing Machine Learning systems. First, we perform a literature review to investigate the current state of the art for debiasing Machine Learning systems. This includes an overview of existing debiasing techniques and how they are evaluated (e.g., how bias is measured). As a second contribution, we propose a benchmarking approach that allows for an evaluation and comparison of bias mitigation methods and their trade-offs (i.e., how much performance is sacrificed for improving fairness). Afterwards, we propose a debiasing method ourselves, which modifies already trained Machine Learning models with the goal of improving both their fairness and their accuracy. Moreover, this thesis addresses the challenge of how to deal with fairness with regard to age. This question is answered with an empirical evaluation on real-world datasets.

UCL Discovery

Privileged and Unprivileged Groups: An Empirical Study on the Impact of the Age Attribute on Fairness

Author: Hort Max
Sarro Federica
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/05/2022

UCL Discovery

An Empirical Study on the Fairness of Pre-trained Word Embeddings

Author: Hort Max
Sarro Federica
Sesari Emeralda
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 15/07/2022

Pre-trained word embedding models are easily distributed and applied, as they relieve users of the effort of training models themselves. With widely distributed models, it is important to ensure that they do not exhibit undesired behaviour, such as biases against population groups. For this purpose, we carry out an empirical study evaluating the bias of 15 publicly available, pre-trained word embedding models based on three training algorithms (GloVe, word2vec, and fastText) with regard to four bias metrics (WEAT, SEMBIAS, DIRECT BIAS, and ECT). The choice of word embedding models and bias metrics is motivated by a literature survey of 37 publications that quantified bias in pre-trained word embeddings. Our results indicate that fastText is the least biased model (in 8 out of 12 cases) and that small vector lengths lead to a higher bias.

UCL Discovery

Py2Cy: A Genetic Improvement Tool To Speed Up Python

Author: Hort Max
Sarro Federica
Zhong James
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/07/2022

Due to its ease of use and wide range of custom libraries, Python has quickly gained popularity and is used by a wide range of developers all over the world. While Python allows for fast writing of source code, the resulting programs are slow to execute when compared to programs written in other programming languages like C. One of the reasons for its slow execution time is the dynamic typing of variables. Cython is an extension to Python, which can achieve execution speed-ups by compiler optimization. One possibility for improvements is the use of static typing, which can be added to Python scripts by developers. To alleviate the need for manual effort, we create Py2Cy, a Genetic Improvement tool for automatically converting Python scripts to statically typed Cython scripts. To show the feasibility of improving runtime with Py2Cy, we optimize a Python script for generating Fibonacci numbers. The results show that Py2Cy is able to speed up the execution time by up to a factor of 18.

UCL Discovery

The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification

Author: Grishina Anastasiia
Hort Max
Moonen Leon
Publication date: 11/09/2023

The use of modern Natural Language Processing (NLP) techniques has been shown to be beneficial for software engineering tasks, such as vulnerability detection and type inference. However, training deep NLP models requires significant computational resources. This paper explores techniques that aim at achieving the best usage of resources and available information in these models. We propose a generic approach, EarlyBIRD, to build composite representations of code from the early layers of a pre-trained transformer model. We empirically investigate the viability of this approach on the CodeBERT model by comparing the performance of 12 strategies for creating composite representations with the standard practice of only using the last encoder layer. Our evaluation on four datasets shows that several early layer combinations yield better performance on defect detection, and some combinations improve multi-class classification. More specifically, we obtain a +2 average improvement of detection accuracy on Devign with only 3 out of 12 layers of CodeBERT and a 3.3x speed-up of fine-tuning. These findings show that early layers can be used to obtain better results using the same resources, as well as to reduce resource usage during fine-tuning and inference.
Comment: The content in this pre-print is the same as in the CRC accepted for publication in the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023).

arXiv.org e-Print Archive

Fairness Testing: A Comprehensive Survey and Analysis of Trends

Author: Chen Zhenpeng
Harman Mark
Hort Max
Sarro Federica
Zhang Jie M.
Publication date: 19/07/2023

Unfair behaviors of Machine Learning (ML) software have garnered increasing attention and concern among software engineers. To tackle this issue, extensive research has been dedicated to conducting fairness testing of ML software, and this paper offers a comprehensive survey of existing studies in this field. We collect 100 papers and organize them based on the testing workflow (i.e., how to test) and testing components (i.e., what to test). Furthermore, we analyze the research focus, trends, and promising directions in the realm of fairness testing. We also identify widely adopted datasets and open-source tools for fairness testing.

arXiv.org e-Print Archive

Enhanced Fairness Testing via Generating Effective Initial Individual Discriminatory Instances

Author: Hort Max
Lin Qingwei
Ma Minghua
Sarro Federica
Tian Zhao
Zhang Dongmei
Zhang Hongyu
Publication date: 17/09/2022

Fairness testing aims at mitigating unintended discrimination in the decision-making process of data-driven AI systems. Individual discrimination may occur when an AI model makes different decisions for two distinct individuals who are distinguishable solely according to protected attributes, such as age and race. Such instances reveal biased AI behaviour, and are called Individual Discriminatory Instances (IDIs). In this paper, we propose an approach for the selection of the initial seeds to generate IDIs for fairness testing. Previous studies mainly used random initial seeds to this end. However, this phase is crucial, as these seeds are the basis of the follow-up IDI generation. We dub our proposed seed selection approach I&D. It generates a large number of initial IDIs exhibiting a great diversity, aiming at improving the overall performance of fairness testing. Our empirical study reveals that I&D is able to produce a larger number of IDIs with respect to four state-of-the-art seed generation approaches, generating 1.68X more IDIs on average. Moreover, we compare the use of I&D to train machine learning models and find that using I&D reduces the number of remaining IDIs by 29% when compared to the state-of-the-art, thus indicating that I&D is effective for improving model fairness.
Comment: 19 pages, 7 figures
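
As a reader's illustration of the IDI definition in this abstract, the following minimal Python sketch checks whether changing only a protected attribute flips a model's decision. It is not the paper's I&D seed-selection approach; the function name `find_discriminatory_counterpart`, the toy models, and the example attribute values are all hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Iterable, Optional

Instance = Dict[str, int]


def find_discriminatory_counterpart(
    model: Callable[[Instance], int],
    instance: Instance,
    protected: str,
    values: Iterable[int],
) -> Optional[Instance]:
    """Return a copy of `instance` that differs only in `protected`
    and receives a different decision, or None if no such copy exists."""
    base_decision = model(instance)
    for value in values:
        if value == instance[protected]:
            continue  # same individual, nothing to compare
        variant = {**instance, protected: value}
        if model(variant) != base_decision:
            return variant  # `instance` is an IDI for this model
    return None


# Hypothetical toy models: one uses the protected attribute, one does not.
def biased_model(x: Instance) -> int:
    return int(x["income"] > 50 and x["age"] < 40)


def fair_model(x: Instance) -> int:
    return int(x["income"] > 50)


seed = {"age": 30, "income": 60}
biased_hit = find_discriminatory_counterpart(biased_model, seed, "age", [30, 50])
fair_hit = find_discriminatory_counterpart(fair_model, seed, "age", [30, 50])
```

Here `biased_hit` is the variant with only `age` changed, while `fair_hit` is `None`, since the second toy model ignores the protected attribute.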

arXiv.org e-Print Archive

25th annual computational neuroscience meeting: CNS-2016

Author: Abbott L.F.
Abeysuriya Romesh G.
Aertsen Ad
Agnes Everton J.
Ahamed Tosif
Ahmadabadi Majid Nili
Ahn Sora
Aihara Kazuyuki
Andreassen Ole A.
Ardestani Mohammad Hovaidi
Arroyo David
Aton Sara J.
Babichev Andrey
Bachmann Claudia
Badel Laurent
Baek Hyeon-Man
Baek JeongHun
Baek Kwangyeol
Bahuguna Jyotika
Bak Ji Hyun
Baker Chris I.
Bakker Rembrandt
Balaguer‑Ballester Emili
Bard G.
Barnett William H.
Baroni Fabiano
Basnayake Kanishka
Baysal Velt
Bennett Matthew R.
Bernard Christophe
Berry Hugues
Beuth Frederick
Bezgin Gleb
Bill Johannes
Birgolias Justas
Blackwell Justin
Bohnenkamp Lisa
Bojak Ingo
Borisyuk Roman
Bos Hannah
Bradley Samual P.
Breakspear Michael
Breitwieser Oliver
Briaire Jeroen J.
Briggman Kevin L
Brinkman Braden A.
Brown John
Brown Ritchie E.
Brunel Nicolas
Buhry Laure
Buice Michael
Burkitt Anthony N.
Burton Shawn D.
Buttler Simone
Bytschok Ilja
Cantarelli Matteo
Chakravarthy V.Srinivasa
Chan Ho Ka
Chapman Phillip D.
Chatzikalymniou Alexandra Pierri
Chavane Frédéric
Chen Liang
Chen Weiliang
Cheung Chung Ching
Chhabria Karishma
Chintaluri Chaitanya
Choe Yoonsuck
Choi Hannah
Choi Hansol
Choi Ilhwan
Choi Jee Hyun
Choi Woochul
Choi Yun Seo
Choung Oh‑hyeon
Chung SueYeon
Clarke Eric F.
Clements Katie
Cloherty Shaun L.
Clopath Claudia
Cocchi Luca
Cohen Yale E.
Cook Mark
Crook Sharon M.
Cserpán Dorottya
Culmone Viviana
Dabaghian Yuri
Dale Anders M.
Daly Kevin C.
Dasgupta Sakyasingha
Davey Neil
Davison Andrew
de Weerd Peter
Deco Gustavo
Demkó László
Demutz Harald
Denk Cornelia
Destexhe Alain
Devor Anna
DeVuti Justin
Diamond Alan
Diesmann Markus
Dillen Kim
Doya Kenji
Dragoi Valentin
Draguljić Daniel
Drew Jordan
Drysdale Peter M.
Duarte Renato
Dura‑Bernal Salvador
Edwards Andy
Einevoll Gaute T.
Elices Irene
Elnevoll Gaute T.
Ernst Udo A.
Esler Timothy B.
Esposito Elric
Faraji Mohammad Java
Fedorov Leonid A.
Fenk Lisa M.
Ferguson Katie
Ferrario Andrea
Filipovi Marko
Fink Christian G.
Fink Gereon R.
Fishman Yonatan I.
Fornito Alex
Forrow Csaba
Fouquet Coralie
Frangou Sophia
Freestone Dean R.
Frijns Johan H. M.
Fulcher Ben D.
Fung Felix
Gajic N. Alex Cayco
Gallimore Andrew R.
Gallinaro Júlia
Gerkin Richard C.
Gerstner Wulfram
Giaffar Hamza
Giese Martin
Giese Martin A.
Gilson Matthieu
Gips Bart
Gleeson Padraig
Gliske Stephen V.
Glomb Katharina
Goetze Felix
Goldsworthy Mitchell R.
Gollo Leonardo L.
Goncharenko Julia
Goodarzinic Abdorreza
Graham Bruce P.
Grayden David B.
Grewe Jan
Hadrava Michal
Hagen Espen
Halnes Geir
Hamade Khaldoun
Hamker Fred H.
Han Hio-Been
Han Seung Kee
Hansen Mads
Harper Zachary J.
He Hu
Helias Moritz
Hermann Christoph S.
Hilgetag Claus‑Christian
Hines Michael L.
Hlinka Jaroslav
Hof Patrick R.
Holman Katherine A.
Hong Sungho
Hordacre Brenton
Howard Jr. James H.
Huang Guang-Bin
Huang Haiping
Huerta Ramon
Huh Dongsung
Hutt Axel
Hwang Dong‑Uk
Hwang Eunjin
Hye Jr. Eoon
Iannella Nicolangelo
Ibbotson Michael R.
Ionta Silvlo
Ishii Shin
Issa Fadi A.
Iyer Ramakrishnan
Jacobs Heidi
Jang Hyun Jae
Jang Jaeson
Jensen Ole
Jeong Jaeseung
Jeong Jaesung
Jeong Yong
Jirsa Viktor K.
Jo Sumin
Joo Pangyu
Josić Kresimir
Ju Huiwen
Jun Eunji
Jun Sang Beom
Jung Nam
Jung Woo-Sung
Jung Younginha
Kahng B.
Kale Penelope J.
Kalkman Randy K.
Kameneva Tatiana
Kang Jiyoung
Karoly Philippa J.
Kasumi Ohta
Kavalali Enge T.
Kawato Mitsuo
Kazama Hokto
Kedziora David J.
Kekona Tyler
Keller Daniel
Kennedy Henry
Kepple Daniel
Kerr Cliff C.
Kerr Robert R.
Kilpatrick Zachary P.
Kim Ammo J.
Kim Bowon
Kim Chang Sub
Kim DaeEun
Kim Hojeong
Kim Hoon-Hee
Kim Hyoungkyu
Kim Jae Kyoung
Kim Jimin
Kim Jinseop
Kim Juhee
Kim Minjung
Kim Seongkyun
Kim Su Hyun
Kim Sung-Phil
Kim Tae
Kim Taegyo
Kim Won Sup
Kim Youngsoo
Kiser Seth A.
Klanner Felix
Kleberg Florence I.
Klingbeil Guido
Knösche Thomas
Koren Veronika
Kotaleski Jeanette Hellgren
Koulakov Alex
Kralik Jerald D.
Kringelbach Morten L.
Kruscha Alexandra
Kuhlmann Levin
Kukolja Juraj
Kumar Arvind
Kundu Prantik
Kunze Tim
Kuravi Pradeep
Kwag Jeehyun
Kwon Jaehyung
Lai Pik‑Yin
Lakatos Peter
Latorre Roberto
Leahy Will
Lee Changju
Lee Chungho
Lee Dan D.
Lee Do-won
Lee Heonsoo
Lee Hyang Jung
Lee Hyang Woon
Lee Hyeonsu
Lee Jae Woo
Lee Jaejin
Lee Jeungmin
Lee Joonwon
Lee Jung H.
Lee Sang Wan
Lee Sang-Hun
Lee Seungjun
Lee Soohyun
Lee Sue-Hyun
Lee Tae Ho
Lee Won Hee
Lee Yong‑il
Lefebvre Baptiste
Lefebvre Jérémie
Leleu Timothée
Leng Luziwei
Levi Rafael
Levina Anna
Levy Brandon A.
Li Luozheng
Liang Guangsheng
Lidner Benjamin
Liedtke Joscha
Lim Daeseob
Lim Sewoong
Lin Xiahoan
Linder Benjamin
Lines Glenn T.
Lizler Joseph T.
Lochmann Timm
Lowet Eric
Luebke Jennifer
Lytton William W.
Lyu Cheng
Ma Hailin
Maeng Seung Eu
Malmon Gabby
Mandall Alekhya
Maouene M.
Marcelli Angelo
Marin Boris
Markin Sergey
Markram Henry
Marre Olivier
Marsalek Petr
Marsat Gary
Martel Roman
Marucci Lucia
Maturana Matias I.
McCarley Robert W.
McDonnell Mark D.
McKenna James T.
McLauchlan Campbell
Meffin Hamish
Mehta Hima
Meier Karlheinz
Meijas Jorge F.
Mellen Nick
Memmeshei Raol-Martin
Menzies Rosemary J.
Merriosn-Hort Robert
Metzner Christoph
Mi Yuanyuan
Mihalas Stefan
Miller Thomas
Moezzi Bahar
Molkov Yaroslav I.
Moon Jangsup
Moon Seok-hun
Morris Laurel S.
Morrison Abigail
Mosqueiro Thiago S
Mu Shang
Muler Eilif
Muralidharan Vignesh
Murray John D.
Murray Micha M.
Mäki‑Marttunen Tuomo
Neymotin Samuel
Neymotin Samuel A.
Niry Mohammad
Nishikawa Isao
Nolte Max
Nowotny Thomas
Oba Shigeyuki
Obermayer Klaus
Ognjanovski Nicolette
Ouyang Guang
Ozer Mahmut
Paik Se-Bum
Paik Se‑Bum
Palmer S.E.
Palva Matias J.
Paninski Liam
Pariz Aref
Park Chang-hyun
Park Choongseok
Park Hae‑Jeong
Park Ji Sung
Park Memming
Park Sang-Min
Park Sol
Parsi Shervin S.
Parziale Antonio
Pasupathy Anitha
Perotti Luca
Peterson Andre
Petkoski Spase
Petrovici Mihai A.
Petterson Klas H.
Philips Ryan T.
Phillips Ryan S.
Pillow Jonathan
Pittà Maurizio De
Plogmacher Lukas
Podlaski William
Pollonini Luca
Ponce‑Alvarez Adrián
Popp Pamela Osborn
Preuschoff Kerstin
Priesemann Viola
Priyadharsini B. Praga
Psarrou Maria
Quang Le Anh
Quintana Adrian
Ramsey Julia
Ranjan Rajnish
Rankin James
Rasch Malte J.
Rasuli Nader
Ratnadurai‑Giridharan Shivakeshavan
Reig Ramon
Reimann Michael W.
Rennle Chris J.
Reyes Amy
Richter René
Ridding Michael C.
Rieke Fred
Rinberg Dima
Rinzel John
Ritter Petra
Roach James P.
Robb Daniel T.
Roberts Mark J.
Robinson Peter A.
Rodriguez Francisco B.
Rotter Stefan
Rubchinsky Leonid L.
Rubinov Mikail
Rumbell Timothy
Rupp André
Rybak Ilya A.
Ryu Juhyoung
Sadeh Sadra
Saggio Maria L.
Sander Leonard M.
Sanger Terence D.
Sanz-Leon Paula
Sanz‑Leon Paula
Saska Daniel
Schaworonkow Natalie
Schemmel Johannes
Scheutz Matthias
Schiff Steven J.
Schilstra Maria
Schilstra Marla
Schmidt Maximilian
Schmidt Robert
Schottdorf Manual
Schutter Erik De
Schwikard Achim
Seeholzer Alexander
Seidenstein Alexandra
Sejnowski Terrence J.
Sekulić Vladisla
Senatore Rosa
Senk Johanna
Seo Sat Byul
Seung H. Sebastian
Sharpee Tatyana O.
Shea Steven
Shea-Brown Eric
Shea‑Brown Eric
Shen Kelly
Shiau LieJune
Shimazaki Hideaki
Shin Hee‑sup
Shin In-Seob
Shivkumar Sabyasach
Shlizerman Eli
Shomali Safura Rashid
Siep Silvan F.
Silberberg Gilad
Silver Angus
Silver R. Angus
Skiker K.
Skilling Quinton M.
Skinner Frances K.
Smit Daniel
Smith Brian
Smith Jeffrey
Soh Jaehyun
Soman Karthik
Somogyvári Zoltán
Sompolinsky Haim
Song Min
Song Min-Ho
Song Youngjo
Soundry Daniel
Sourina Olga
Spampinato Giulia Lia Beatrice
Spiegler Andreas
Spinney Richard E.
Sprecher Simon
Stacey William C.
Stephens Greg
Stern Merav
Steuber Volker
Steyn-Ross D. Alistair
Steyn-Ross Moira L.
Stimberg Marcel
Strube‑Bloss Martin F.
Stöckel David
Su Jianzhong
Sun Haoqi
Sweeney Yann
Tabas Alejandro
Tahayori Bahman
Takashima Akira
Tam Nicoladie D.
Tamagnini Francesco
Tang Rongxiang
Tang Yi-Yuan
Teka Wondimu
Tetzlaff Tom
Tezuka Taro
Toporikova Natalia
Torres Joaquin J.
Toyoizumi Taro
Tran Patricia H. P.
Trembleau Alain
Triesch Jochen
Trisch Jochen
Tsaneva‑Atanasova Krasimira
Tsuchimoto Yoshiko
Tuomo Maki-Martun
Tveito Aslak
Valizadeh Alireza
van Albada Sacha J
van Albada Sacha J.
van der Eerden Jan
Varona Pablo
Veale Richard
Viriyopase Atthaphon
Vitay Julien
Vogels Rufin
Vogels Tim
Vogels Tim P.
Vogt Simon M.
Voon Valerie
Voronenko Sergej O.
Vuust Peter
Vörös János
Wallentin Mikkel
Wang Dahui
Wang Jisung
Wang Sheng-Ju
Wang Yuzhe
Warburton Julia M.
Weaver Christina M.
Wegener Detlef
Weidel Philipp
Welzig Charles M.
Werdt Stephen Van
Wibral Michael
Wickens Jeffery R.
Widmer Yves
Witek Maria A. G.
Witting Jens
Wolf Fred
Wong Michael
Wu Si
Wu Sl
Wójcik Daniel K.
Xu Zhiheng
Yamada Yasnori
Yamamura Yorkio
Yang Huei-Fang
Yang Xu
Yeon Ji Won
Yger Pierre
Yilmaz Ergin
Yoo Minsu
Yoon Sangsup
Yoshimoto Junichiro
Young-Ah Rho
Yu Suin
Zaho Yuan
Zamora Criseida
Zaptocky Martin
Zhang Mingsha
Zhang Wenhao
Zhao Chang
Zhao Xiaochen
Zhao Xuelong
Zhou Changsong
Zochowski Michal
Zochowski Michal R.
Zouridakis George
Zurowski Bartosz
Publication venue: BMC
Publication date: 01/01/2016

The same neuron may play different functional roles in the neural circuits to which it belongs. For example, neurons in the Tritonia pedal ganglia may participate in variable phases of the swim motor rhythms [1]. While such neuronal functional variability is likely to play a major role in the delivery of the functionality of neural systems, it is difficult to study in most nervous systems. We work on the pyloric rhythm network of the crustacean stomatogastric ganglion (STG) [2]. Typically, network models of the STG treat neurons of the same functional type as a single model neuron (e.g. PD neurons), assuming the same conductance parameters for these neurons and implying their synchronous firing [3, 4]. However, simultaneous recording of PD neurons shows differences between the timings of spikes of these neurons. This may indicate functional variability of these neurons. Here we modelled separately the two PD neurons of the STG in a multi-neuron model of the pyloric network. Our neuron models comply with known correlations between conductance parameters of ionic currents. Our results reproduce the experimental finding of increasing spike time distance between spikes originating from the two model PD neurons during their synchronised burst phase. The PD neuron with the larger calcium conductance generates its spikes before the other PD neuron. Larger potassium conductance values in the follower neuron imply longer delays between spikes, see Fig. 17. Neuromodulators change the conductance parameters of neurons and maintain the ratios of these parameters [5]. Our results show that such changes may shift the individual contribution of the two PD neurons to the PD-phase of the pyloric rhythm, altering their functionality within this rhythm. Our work paves the way towards an accessible experimental and computational framework for the analysis of the mechanisms and impact of functional variability of neurons within the neural circuits to which they belong.

HAL AMU

ScholarWorks@UNIST

Juelich Shared Electronic Resources

Central Archive at the University of Reading

Crossref

IUPUIScholarWorks

Springer - Publisher Connector

Harvard University - DASH

Heidelberger Dokumentenserver

PubMed Central

Archivio della Ricerca - Università di Salerno

Apollo (Cambridge)

Repository@Napier

University of Hertfordshire Research Archive

DSpace at Rice University

Deep Blue Documents at the University of Michigan

Multi-objective Search for Gender-fair and Semantically Correct Word Embeddings

Author: Hort Max
Moussa Rebecca
Sarro Federica
Publication venue: 'Elsevier BV'
Publication date: 31/01/2023

UCL Discovery

Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey

Author: Chen Zhenpeng
Harman Mark
Hort Max
Sarro Federica
Zhang Jie M
Publication date: 01/01/2023

This paper provides a comprehensive survey of bias mitigation methods for achieving fairness in Machine Learning (ML) models. We collect a total of 341 publications concerning bias mitigation for ML classifiers. These methods can be distinguished based on their intervention procedure (i.e., pre-processing, in-processing, post-processing) and the technique they apply. We investigate how existing bias mitigation methods are evaluated in the literature. In particular, we consider datasets, metrics and benchmarking. Based on the gathered insights (e.g., What is the most popular fairness metric? How many datasets are used for evaluating bias mitigation methods?), we hope to support practitioners in making informed choices when developing and evaluating new bias mitigation methods.

UCL Discovery