2,345 research outputs found

    A continuous information gain measure to find the most discriminatory problems for AI benchmarking

    This paper introduces an information-theoretic method for selecting a subset of problems which gives the most information about a group of problem-solving algorithms. The method was tested on the games in the General Video Game AI (GVGAI) framework, allowing us to identify a smaller set of games that still gives a large amount of information about the abilities of different game-playing agents. This approach can be used to make agent testing more efficient: we can achieve almost as good discriminatory accuracy when testing on only a handful of games as when testing on more than a hundred, the latter of which is often computationally infeasible. Furthermore, the method can be extended to study the dimensions of the effective variance in game design between these games, allowing us to identify which games differentiate between agents in the most complementary ways.
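A rough sketch of the underlying idea: rank games by how well agent outcomes on them split the agent pool. The outcome matrix and all names below are hypothetical, and binary outcome entropy is a simplification — the paper's measure is a continuous information gain, not this.

```python
import math

# Hypothetical outcome matrix: rows = agents, columns = games,
# entries = 1 if the agent beats the game, 0 otherwise.
outcomes = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [0, 1, 1, 1],
    [0, 0, 1, 1],
]

def entropy(p):
    """Binary entropy in bits; zero for a degenerate split."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def game_information(outcomes, g):
    """Entropy of agent outcomes on game g: high when the game
    splits the agent pool evenly, zero when all agents tie."""
    col = [row[g] for row in outcomes]
    return entropy(sum(col) / len(col))

# Rank games by how much they discriminate between agents;
# the last game (won by everyone) carries no information.
ranked = sorted(range(len(outcomes[0])),
                key=lambda g: game_information(outcomes, g),
                reverse=True)
```

A greedy selection loop over `ranked` would then yield the small, highly discriminatory benchmark subset the abstract describes.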

    A Performance-Explainability-Fairness Framework For Benchmarking ML Models

    Machine learning (ML) models have achieved remarkable success in various applications; however, ensuring their robustness and fairness remains a critical challenge. In this research, we present a comprehensive framework designed to evaluate and benchmark ML models through the lenses of performance, explainability, and fairness. This framework addresses the increasing need for a holistic assessment of ML models, considering not only their predictive power but also their interpretability and equitable deployment. The proposed framework leverages a multi-faceted evaluation approach, integrating performance metrics with explainability and fairness assessments. Performance evaluation incorporates standard measures such as accuracy, precision, and recall, but extends to the overall balanced error rate and the overall area under the receiver operating characteristic (ROC) curve (AUC) to capture model behavior across different performance aspects. Explainability assessment employs state-of-the-art techniques to quantify the interpretability of model decisions, ensuring that model behavior can be understood and trusted by stakeholders. The fairness evaluation examines model predictions in terms of demographic parity and equalized odds, thereby addressing concerns of bias and discrimination in the deployment of ML systems. To demonstrate the practical utility of the framework, we apply it to a diverse set of ML algorithms across various functional domains, including finance, criminology, education, and healthcare prediction. The results showcase the importance of a balanced evaluation approach, revealing trade-offs between performance, explainability, and fairness that can inform model selection and deployment decisions. Furthermore, we provide insights into the analysis of trade-offs in selecting the appropriate model for use cases where performance, interpretability, and fairness are important.
In summary, the Performance-Explainability-Fairness Framework offers a unified methodology for evaluating and benchmarking ML models, enabling practitioners and researchers to make informed decisions about model suitability and ensuring responsible and equitable AI deployment. We believe that this framework represents a crucial step towards building trustworthy and accountable ML systems in an era where AI plays an increasingly prominent role in decision-making processes.
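Two of the metrics named above, the balanced error rate and demographic parity, can be illustrated with a minimal sketch. The data and helper names below are invented for illustration and are not taken from the framework itself.

```python
# Hypothetical evaluation set: labels, model predictions, and a
# binary protected attribute (e.g. two demographic groups).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
group  = [0, 0, 0, 0, 1, 1, 1, 1]

def rate(pred, cond):
    """Fraction of positive predictions among rows where cond holds."""
    sel = [p for p, c in zip(pred, cond) if c]
    return sum(sel) / len(sel) if sel else 0.0

def balanced_error_rate(y_true, y_pred):
    # Mean of the false-negative and false-positive rates, so that
    # class imbalance does not mask errors on the minority class.
    fnr = 1 - rate(y_pred, [t == 1 for t in y_true])
    fpr = rate(y_pred, [t == 0 for t in y_true])
    return (fnr + fpr) / 2

def demographic_parity_diff(y_pred, group):
    # |P(pred = 1 | group 0) - P(pred = 1 | group 1)|: zero means
    # both groups receive positive predictions at the same rate.
    return abs(rate(y_pred, [g == 0 for g in group])
               - rate(y_pred, [g == 1 for g in group]))
```

Equalized odds can be checked the same way by computing the parity difference separately among true positives and true negatives.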

    Ensemble decision systems for general video game playing

    Ensemble Decision Systems offer a unique form of decision making that allows a collection of algorithms to reason together about a problem. Each individual algorithm has its own inherent strengths and weaknesses, and it is often difficult to overcome the weaknesses while retaining the strengths. Instead of altering the properties of the algorithm, the Ensemble Decision System augments its performance with other algorithms that have complementing strengths. This work outlines different options for building an Ensemble Decision System and provides an analysis of its performance compared to the individual components of the system, with interesting results showing an increase in the generality of the algorithms without significantly impeding performance.
    Comment: 8 pages, accepted at COG201
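One simple way such an ensemble can combine its components is plurality voting over proposed actions. The agents, state representation, and action names below are invented for illustration; the paper itself explores several combination options.

```python
from collections import Counter

# Hypothetical component agents: each maps a game state to an action,
# and each has a different inherent bias (strength).
def aggressive_agent(state):
    return "attack" if state["enemy_near"] else "move"

def cautious_agent(state):
    return "flee" if state["low_health"] else "move"

def greedy_agent(state):
    return "collect" if state["item_near"] else "move"

def ensemble_decide(agents, state):
    """Plurality vote: each component proposes an action and the
    most popular proposal wins (ties broken by proposal order)."""
    votes = Counter(agent(state) for agent in agents)
    return votes.most_common(1)[0][0]

agents = [aggressive_agent, cautious_agent, greedy_agent]
state = {"enemy_near": False, "low_health": False, "item_near": True}
# aggressive -> "move", cautious -> "move", greedy -> "collect",
# so the ensemble settles on "move".
```

Because no single component dominates, the ensemble inherits each agent's specialty only when the others do not object, which is one route to the increased generality the abstract reports.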

    Outcomes for youth work : coming of age or master’s bidding?

    Providing evidence in youth work is a current and important debate. Modern youth work has, at least to some degree, recognised the need to produce practice information, through its various guises, with limited success as requirements and terminology have continually changed. In Scotland, the current demands for youth work to “prove” itself are made through a performance management system that promotes outcome-based practice. There are some difficulties with this position because outcome-based practice lacks methodological rigour, is aligned with national governmental commitments and does not adequately capture the impact of youth work practice. This paper argues that youth workers need to develop both a theoretical and a methodological approach to data collection and management, which is in keeping with practice values, captures the voice of the young person and enhances youth work practice. Youth work should not be used as a mechanism to deliver the government’s policies but be liberated from centralist control to become a “free practice”, so that some of the perennial problems, such as democratic disillusionment, partly caused by this “performance management industry”, can be effectively dealt with. The generation of evidence for youth work should enable it to freely investigate and capture its impact, within the practice, based on the learning that has taken place and the articulation of the learners’ voice, with the most appropriate form of data presentation.

    Investigating Trade-offs For Fair Machine Learning Systems

    Fairness in software systems aims to provide algorithms that operate in a nondiscriminatory manner with respect to protected attributes such as gender, race, or age. Ensuring fairness is a crucial non-functional property of data-driven Machine Learning systems. Several approaches (i.e., bias mitigation methods) have been proposed in the literature to reduce the bias of Machine Learning systems. However, this often comes hand in hand with performance deterioration. This thesis therefore addresses the trade-offs that practitioners face when debiasing Machine Learning systems. First, we perform a literature review to investigate the current state of the art for debiasing Machine Learning systems. This includes an overview of existing debiasing techniques and how they are evaluated (e.g., how bias is measured). As a second contribution, we propose a benchmarking approach that allows for an evaluation and comparison of bias mitigation methods and their trade-offs (i.e., how much performance is sacrificed for improving fairness). Afterwards, we propose a debiasing method ourselves, which modifies already trained Machine Learning models with the goal of improving both their fairness and accuracy. Moreover, this thesis addresses the challenge of how to deal with fairness with regard to age. This question is answered with an empirical evaluation on real-world datasets.
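The kind of trade-off such a benchmark quantifies can be sketched with one common post-processing idea: adjust per-group decision thresholds on an already trained model's scores and measure the accuracy given up for a smaller demographic-parity gap. The data and threshold values below are invented, and this is an illustrative technique rather than the thesis's own method.

```python
# Hypothetical scores from an already trained model, true labels,
# and a binary protected attribute.
scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.55, 0.4, 0.2]
y_true = [1,   1,   1,   0,   1,   0,    0,   0]
group  = [0,   0,   0,   0,   1,   1,    1,   1]

def predict(scores, group, thr):
    """Threshold each score with its group's decision threshold."""
    return [1 if s >= thr[g] else 0 for s, g in zip(scores, group)]

def accuracy(pred, y):
    return sum(p == t for p, t in zip(pred, y)) / len(y)

def parity_gap(pred, group):
    """Absolute gap in positive-prediction rates between groups."""
    g0 = [p for p, g in zip(pred, group) if g == 0]
    g1 = [p for p, g in zip(pred, group) if g == 1]
    return abs(sum(g0) / len(g0) - sum(g1) / len(g1))

# A single global threshold versus group-specific thresholds chosen
# to equalise positive-prediction rates.
baseline = predict(scores, group, {0: 0.5, 1: 0.5})
debiased = predict(scores, group, {0: 0.75, 1: 0.5})
```

Here the baseline is more accurate but exhibits a parity gap, while the adjusted thresholds close the gap at a measurable accuracy cost: exactly the kind of sacrifice a benchmarking framework should report.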

    Dissecting Deep Language Models: The Explainability and Bias Perspective

    The abstract is in the attachment.

    Handbook for SDG-Aligned Food Companies: Four Pillar Framework Standards

    The world food system is in crisis. Outright hunger, unhealthy diets and malnutrition occur in parallel with food losses and waste. Farming families in poor countries suffer from extreme poverty. And food production is environmentally unsustainable and increasingly vulnerable to extreme weather events caused by climate change. A historic change of direction is needed to bring about a new era of food system sustainability. Our work aims to help companies, investors and other stakeholders move towards a more sustainable food system that is aligned with the Sustainable Development Goals. Transforming the world food system to achieve sustainability in all its dimensions is a major challenge. Achieving the Sustainable Development Goals will require managing major changes to the global food system responsibly, involving hundreds of millions of farmers and their families, global supply chains, thousands of food producing companies, diverse food production systems and local ecologies, food processing, and a great diversity of food traditions and cultures. Food companies are engaged in food production, trade, processing, and consumer sales around the world. While they have distinct roles “from farm to fork,” they all share the same responsibility: to be part of the global transformation towards food system sustainability. For more on CCSI and SDSN’s work on corporate alignment with the Sustainable Development Goals, see our framework defining SDG-aligned business practices in the energy sector.

    Nature of the learning algorithms for feedforward neural networks

    The neural network model (NN), composed of relatively simple computing elements operating in parallel, offers an attractive and versatile framework for exploring a variety of learning structures and processes for intelligent systems. Due to the amount of research developed in the area, many types of networks have been defined. The one of interest here is the multi-layer perceptron, as it is one of the simplest and is considered a powerful representation tool whose complete potential has not been adequately exploited and whose limitations have yet to be specified in a formal and coherent framework. This dissertation addresses the theory of generalisation performance and architecture selection for the multi-layer perceptron; a subsidiary aim is to compare and integrate this model with existing data analysis techniques and to exploit its potential by combining it with certain constructs from computational geometry, creating a reliable, coherent network design process which conforms to the characteristics of a generative learning algorithm, i.e. one including mechanisms for manipulating the connections and/or units that comprise the architecture in addition to the procedure for updating the weights of the connections. This means that it is unnecessary to provide an initial network as input to the complete training process.
After discussing in general terms the motivation for this study, the multi-layer perceptron model is introduced and reviewed, along with the relevant supervised training algorithm, i.e. backpropagation. More particularly, it is argued that a network developed employing this model can in general be trained and designed in a much better way by extracting more information about the domains of interest through the application of certain geometric constructs in a preprocessing stage, specifically by generating the Voronoi Diagram and Delaunay Triangulation [Okabe et al. 92] of the set of points comprising the training set. Once a final architecture which performs appropriately on it has been obtained, Principal Component Analysis [Jolliffe 86] is applied to the outputs produced by the units in the network's hidden layer to eliminate the redundant dimensions of this space.
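The final PCA step, pruning redundant hidden-layer dimensions, might look like the sketch below. The activations are synthetic (one hidden unit is an exact linear combination of the other two), and the 99% explained-variance cutoff is a hypothetical stand-in for whatever criterion the dissertation actually uses.

```python
import numpy as np

# Synthetic hidden-layer activations for a trained MLP:
# rows = training examples, columns = hidden units.
rng = np.random.default_rng(0)
base = rng.normal(size=(200, 2))                   # two genuine factors
hidden = np.column_stack([base[:, 0],              # unit 1
                          base[:, 1],              # unit 2
                          base[:, 0] + base[:, 1]])  # redundant unit

# PCA via eigen-decomposition of the activation covariance matrix.
centred = hidden - hidden.mean(axis=0)
cov = np.cov(centred, rowvar=False)
eigvals = np.linalg.eigvalsh(cov)[::-1]            # descending order

# Number of components needed to explain 99% of the variance:
# the remaining hidden dimensions are redundant and can be pruned.
explained = np.cumsum(eigvals) / eigvals.sum()
effective_dim = int(np.searchsorted(explained, 0.99) + 1)
```

Because the third unit adds no new variance, the covariance matrix has rank two and `effective_dim` comes out as 2, flagging one prunable hidden dimension.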